Home·Curated Catalog·Medical Imaging
🏥 Curated Catalog · Medical Imaging

MIMIC-IV

Deidentified ICU and ED records for 315K patients from BIDMC — credentialed access.

LQS 93 · platinum ⚠ Research-only 454K hospital admissions 67 GB CSV · Parquet Released 2020
Browse commercial Medical Imaging → Visit original source ↗
Source: physionet.org · maintained by MIT Lab for Computational Physiology
454K
hospital admissions
67 GB
Size on disk
93
LQS · platinum
2020
First released

About this dataset

MIMIC-IV (Medical Information Mart for Intensive Care) from MIT Lab for Computational Physiology contains deidentified health records for 315K patients with 454K hospital admissions at Beth Israel Deaconess Medical Center. Includes vitals, labs, meds, notes, radiology reports, and ICU events. Requires PhysioNet credentialed access + HIPAA training.

Formats
CSV · Parquet

LabelSets Quality Score

LQS is our 7-dimension quality score, computed from the dataset's published statistics. See methodology →

93
out of 100
platinum tier

High-quality dataset across most dimensions

Composite score computed from the 7 dimensions below: completeness, uniqueness, validation health, size adequacy, format compliance, label density, and class balance.

Completeness 96
Published by maintainer: 96% completeness across annotated fields.
Uniqueness 93
Exact-hash deduplication documented by maintainer.
Validation 93
Ground-truth is the official record itself (filings, medical charts, etc.).
Size adequacy 94
454,000 items — exceeds 20,000 adequacy target for Medical Imaging.
Format compliance 95
Industry-standard format — drop-in compatible with mainstream tooling.
Label density 100
Derived 660.79 labels/item from label_count/item_count ratio.
Class balance 75
Moderate class skew — realistic production distribution.

What it's used for

Common tasks and benchmarks where MIMIC-IV is the default or competitive choice.

Sample statistics

What's actually in the dataset — from the maintainer's published stats.

315K patients, 454K admissions, 73K ICU stays, 425K ED visits. Includes 2.3B chart events, labs, meds, notes, radiology reports.

License

MIMIC-IV is distributed under PhysioNet Credentialed Health Data License. This is a third-party public dataset; LabelSets indexes and scores it but does not host or redistribute the data. Always verify current license terms with the maintainer before commercial use.

Heads up: this dataset's license restricts commercial use. If you need medical imaging data for production, check LabelSets' paid datasets below — every listing has an explicit commercial license.

Need commercial-licensed Medical Imaging data?

LabelSets sellers offer paid medical imaging datasets with what public datasets often can't give you:

Browse paid Medical Imaging → Sell your dataset

Similar public datasets

Other entries in the Medical Imaging catalog.

Frequently Asked Questions

MIMIC-IV is distributed under PhysioNet Credentialed Health Data License, which restricts commercial use. For a commercially-licensed alternative in medical imaging, see LabelSets' paid datasets.
MIMIC-IV contains 454,000 hospital admissions. 315K patients, 454K admissions, 73K ICU stays, 425K ED visits. Includes 2.3B chart events, labs, meds, notes, radiology reports.
MIMIC-IV is maintained by MIT Lab for Computational Physiology and is available at https://physionet.org/content/mimiciv/. LabelSets indexes and scores this dataset for discoverability but does not redistribute it.
LQS is a 7-dimension quality score (completeness, uniqueness, validation, size adequacy, format compliance, label density, class balance) computed from the dataset's published statistics. Composite scores map to tiers: platinum (≥90), gold (≥75), silver (≥60), bronze (<60). Read the full methodology.