Truveta Data

Complete, timely, and clean regulatory-grade EHR data

EHR data has been hard to use for research because it is fragmented, inaccessible, and unstructured

Truveta Data comes from a growing collective of 30 health systems across the country. We use the Truveta Language Model to ingest and clean billions of daily EHR data points for research.

120M+

patients and growing

5B+

clinical notes

188K+

unique medical devices

8+

years patient history

Explore complete, patient-level data

EHR data is linked with SDOH, mortality, and claims data for a complete view into the patient journey.

Stringent data quality standards are met across all data elements, including:

Clinical notes
Imaging data
Mother-child data
Lab tests and results
Biometric data
Genomic biomarkers
Device data (at UDI level)
Immunizations
Care settings
Biopsy reports
Medication dosage data
Pharmacy fill data
Diagnosis and procedure data
And more

The clinical depth of Truveta Data enables the use of highly-specific inclusion/exclusion criteria.

Diagram detailing inclusion and exclusion criteria applied to a heart failure population in Truveta Data. The criteria encompass medication use, length of stay, outpatient encounters, laboratory results, comorbidities, and device use. The visual representation offers a comprehensive overview of the selection criteria, aiding in understanding the parameters used to define the study population.

Case study

Comparing the safety of novel pulmonary embolism devices

Real-world data is essential for filling knowledge gaps on drugs and devices. To assess the real-world safety of its EKOS device versus a competitor device, Boston Scientific enlisted independent researchers to compare major bleeding event risks using Truveta Data.

Case study

Comparing the safety of novel pulmonary embolism devices

Real-world data is essential for filling knowledge gaps on drugs and devices. To assess the real-world safety of its EKOS device versus a competitor device, Boston Scientific enlisted independent researchers to compare major bleeding event risks using Truveta Data.

Access notes, images, and mother-child data integrated with EHR data

Unlock meaningful data from clinical notes

Understand complete clinical context and pursue novel research with more than 5 billion notes.

Truveta receives all clinical notes generated during a patient’s care. This includes progress notes, nursing evaluations, procedure/operative reports, referral notes, discharge summaries, and more.

L

Stage of illness

L

Treatment, reason for change in treatment regimen

L

Treatment not considered due to patient preference

L

Genomic variants

L

Specific staging information across recurrence staging, clinical staging, and pathology staging

Learn from millions of medical images integrated with complete EHR data

Truveta Data includes medical images across all modalities, including MRI, CT, X-ray, ultrasound, mammogram, PET, and nuclear medicine, searchable by modality and protocol.

Learn more millions of medical images alongside complete EHR data

Truveta Data includes medical images across all modalities, including MRI, CT, X-ray, ultrasound, mammogram, PET, and nuclear medicine, searchable by modality and protocol.

Study maternal and child health with the largest mother-child EHR dataset

Access longitudinal EHR data for more than 1 million mother-child pairs, enabling research from pregnancy through the first 5 years of the child’s life.

Study medication safety, the connection between maternal health and childhood conditions, and more, with immunizations and medications during pregnancy, delivery outcomes and complications, labs at time of birth, and more.

Truveta announces the largest and most complete mother-child electronic health record (EHR) dataset for scientifically rigorous research for mothers and their children.

Research-ready for regulatory-grade

Aligned with FDA guidance, Truveta established rigorous standards of data quality and provenance and audit-ready processes, procedures, and controls to support organizations in meeting the most stringent regulatory requirements.

Truveta Data generates real-world evidence that can help accelerate therapy approval and adoption.

Daily-updated data cleaned with AI

Billions of data points cleaned with unmatched accuracy

Truveta Language Model, a large-language, multi-modal AI model, transforms billions of data points with industry-leading normalization. Data is carefully de-identified with the highest standards of security and privacy protection.

“We are thrilled to partner with Truveta to use their unprecedented access to EHR data for real-world data research and advance our understanding of epilepsy and seizure disorders. Through Truveta, we can uncover insights into patient care and outcomes that we’ve never seen before at this scale and breadth of data, including seizure frequency in clinical notes. Together, we believe we can drive meaningful improvements in the lives of patients living with epilepsy.”

Sean Stern

Director Health Economics Outcomes Research (HEOR), SK Life Science Inc.

Explore whitepapers

Truveta Language Model

Data quality

Data security

Patient privacy

Every Truveta study can be a health equity study

Truveta Data is linked with SDOH data from LexisNexis Risk Solutions including attributes on education, housing, transportation, and social risk.

Education

Type of education
Highest individual eduation
Highest household education

Transportation

Number of vehicles registered to the household
Number of vehicles owned by the household
Number of members in household

Housing stability

Type of dwelling
Time at current address

Social risk categories

Banking experience
Professional license type