MOHD Data

Data Production Plan

We designed our data production to enable comparisons across three major axes:

'Omics measurement: we plan to measure multiple types of multi-omics data in the same biospecimens which will enable systematic comparisons of molecular measurements.
Time: we plan to collect the same molecular measures across multiple timepoints enabling longitudinal studies
Disease state: each Disease Study Site (DSS) will enroll at least 300 participants (200 cases and 100 controls) enabling disease-centric and cross-disease analysis

In total, we plan on generating over 20 thousand molecular experiments spanning thousands of individuals.

The MOHD consortium was designed to maximize opportunities for data harmonization and minimize technical biases such as batch effects.

All multi-omic data will be produced by one site, the 'Omics Production Center (OPC). Disease study sites (DSSs) will follow uniform, standardized collection protocols to send biospecimens to the OPC.

All data will be collected and processed by one site, the Data Analysis and Coordination Center (DACC). The DACC will work closely with DSSs to collect participant metadata such as clinical measurements. The DACC will also obtain raw multi-omics directly from the OPC and will process this data through uniform processing pipelines.

Data Workflow

Data Production Progress

Enrollment is scheduled to begin in Winter 2025. Check back soon for data production updates.

Accessing MOHD Data

All data generated by the MOHD consortium has been consented for General Research Use. There will be two ways to access MOHD data (1) the MOHD Data Portal and (2) the MOHD Workspace on AnVIL.

Page updated

Google Sites

Report abuse