I keep being pointed to the H3Africa catalogue. What does it actually contain, and what is it for?
The H3Africa Data & Biospecimens Catalogue at
https://catalog.h3africa.org is the consortium's central index
of:
- Datasets — genotyping, sequencing, phenotype, and other
data generated by H3Africa studies. - Biospecimens — sample collections (DNA, plasma, serum,
etc.) curated by H3Africa biorepositories.
What it gives you:
- Open discovery. Browsing the catalogue does not require
approval — you can search by disease area, country, sample
type, data type, etc., and read each study's description. - A single point of contact per dataset. Each entry lists
the contributing study's Data Access Committee (DAC) and
biorepository contact. - A standardised data-access request workflow. Once you find
a relevant dataset, you submit a structured request through
the catalogue itself rather than emailing individual studies.
What it does not give you:
- Direct downloads. Almost all H3Africa data is controlled
access — discovery is open, but obtaining the data requires
DAC approval and a Data Access Agreement (DAA). - Summary allele frequencies — for those, use AGVD
(https://agvd.afrigen-d.org) which publishes per-region
allele frequencies without an access request. - Reference panels for imputation — those are accessed
through the AfriGen-D Imputation Service at
https://fedimpute.afrigen-d.org, not the catalogue.
For the access workflow itself, see the next article.