I keep seeing AGVD referenced in AfriGen-D materials and in Beacon variant pages. What is it, what data does it contain, and what can I use it for?
AGVD — the African Genome Variation Database — is the
AfriGen-D resource that publishes allele frequencies of
human genetic variants across African populations, broken
down by sub-region.
Live at https://agvd.afrigen-d.org.
What it is:
- An aggregated, summary-level resource. AGVD does not host
raw sequencing data or individual-level genotypes — it
publishes minor allele frequencies (MAF) and related summary
statistics computed from underlying African cohorts. - Regional stratification: African allele frequencies are
reported per sub-region (Central, Eastern, Northern, Southern,
and Western Africa) where sample sizes allow. - Open access for read / lookup. No data access committee
approval is needed to query AGVD.
What it's for:
- Looking up the "right" allele frequency for a variant in a
specific African population, when the global gnomAD figure
isn't representative. - Powering variant-frequency callouts on the AfriGen-D Beacon
(https://beacon.afrigen-d.org) variant detail pages, which
cross-link out to AGVD. - Reference data for clinical interpretation, study power
calculations, and population genetics analyses where African
ancestry needs to be considered properly.
For raw genotype or whole-genome sequencing data, see the
related KB article on getting raw WGS data — AGVD is not the
right resource for that.