MOHD Data

Data Production Plan

We designed our data production to enable comparisons across three major axes.

In total, we plan to generate more than 20,000 molecular experiments spanning thousands of individuals, the majority of whom have diverse genetic ancestries.

The MOHD consortium was designed to maximize opportunities for data harmonization and minimize technical biases such as batch effects.

All multi-omic data will be produced by one site, the 'Omics Production Center (OPC). Disease study sites (DSSs) will follow uniform, standardized collection protocols to send biospecimens to the OPC.

All data will be collected and processed by one site, the Data Analysis and Coordination Center (DACC). The DACC will work closely with DSSs to collect participant metadata, such as clinical measurements and social determinants of health (SDoH). The DACC will also obtain raw multi-omic data directly from the OPC and process it through uniform pipelines.

Data Workflow

Data Production Progress

Enrollment is scheduled to begin in Fall 2024. Check back soon for data production updates.

Accessing MOHD Data

All data generated by the MOHD consortium has been consented for General Research Use. There will be two ways to access MOHD data: (1) the MOHD Data Portal and (2) the MOHD Workspace on AnVIL.