Download Complete Package

High-fidelity synthetic dataset with privacy guarantees

Download ZIP (9.3 MB)

Dataset Overview

High-fidelity synthetic longitudinal dataset modeling bipolar disorder with mixed features (ICD-10: F31.6x). Generated using CTGAN with differential privacy guarantees for ML/AI research, clinical decision support development, and educational purposes.

800
Patients
5,550
Observations
35
Variables
Note: This is fully synthetic data - no real patients were used. Suitable for research and education only.

Reports & Documentation

Professional PDF reports demonstrating data quality and methodology:

Sample Data Preview

Evaluate data structure before downloading the full package:

Privacy Guarantees

k-Anonymity (k=12)
Diff. Privacy (ε<0.8)
MIA Resistant (0.52)

Key Features

Clinical Scales

  • YMRS (Mania): Mean 29.6
  • HAM-D (Depression): Mean 18.1
  • GAF (Function): Mean 44.6

Unique Features

  • Identity Crisis (38.8%)
  • Sleep Aversion (65.1%)
  • Stimulant Misuse (39.2%)
  • Polypharmacy (27.9%)

Data Formats

FormatUse Case
CSVUniversal
ParquetPython/R
SQLiteSQL queries
FHIR R4Healthcare
CDISC ODMClinical trials
Stata DTAStatistical
REDCapResearch

License

CC BY-NC 4.0

Attribution-NonCommercial

For commercial licensing, contact us at contact@mentaldata.io