MOHD Data
Data Production Plan
We designed our data production to enable comparisons across three major axes:
'Omics measurement: we plan to measure multiple types of multi-omics data in the same biospecimens which will enable systematic comparisons of molecular measurements.
Time: we plan to collect the same molecular measures across multiple timepoints enabling longitudinal studies
Disease state: each Disease Study Site (DSS) will enroll at least 300 participants (200 cases and 100 controls) enabling disease-centric and cross-disease analysis
In total, we plan on generating over 20 thousand molecular experiments spanning thousands of individuals, the majority of which have diverse genetic ancestries.
The MOHD consortium was designed to maximize opportunities for data harmonization and minimize technical biases such as batch effects.
All multi-omic data will be produced by one site, the 'Omics Production Center (OPC). Disease study sites (DSSs) will follow uniform, standardized collection protocols to send biospecimens to the OPC.
All data will be collected and process by one site, the Data Analysis and Coordination Center (DACC). The DACC will work closely with DSSs to collect participant metadata such as clinical measurements and SDoH. The DACC will also obtain raw multi-omics directly from the OPC and will process this data through uniform processing pipelines.
Data Workflow
Data Production Progress
Enrollment is scheduled to begin in Fall 2024. Check back soon for data production updates.
Accessing MOHD Data
All data generated by the MOHD consortium has been consented for General Research Use. There will be two ways to access MOHD data (1) the MOHD Data Portal and (2) the MOHD Workspace on AnVIL.