Ag3.0 (Ag1000G phase 3): Anopheles gambiae data resource

Released on 4 Nov 2021.


This data release includes copy number variant (CNV) calls, genome-wide single nucleotide polymorphism (SNP) calls and haplotypes, as well as sample metadata and sequence read alignments, from whole-genome sequencing of 2,784 wild-caught mosquitoes collected from 19 countries in sub-Saharan Africa and 297 mosquitoes comprising parents and progeny of 15 lab crosses. Three mosquito species are represented: Anopheles gambiae, Anopheles coluzzii and Anopheles arabiensis. The data were generated by the Ag1000G project, which is part of the MalariaGEN vector observatory, and can also be analysed together with data from the Anopheles gambiae genomic surveillance project.

Data sets

Data availability

The CNV calls, SNP calls, haplotypes, sample metadata and sequence read alignments in this release are available to download from public archives and can also be accessed in Google Cloud. For more information about downloading data, please see the data download guide. For more information about accessing data in the cloud, please see the cloud access guide.
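As a rough illustration of cloud access, the sketch below tallies samples by country and by taxon. It assumes the third-party `malariagen_data` Python package, which this page does not name, and assumes that the sample metadata table has `country` and `taxon` columns. The helper names `open_ag3`, `count_by` and `summarise_release` are hypothetical. Depending on the environment, reading the cloud data may require Google Cloud authentication; the cloud access guide remains the authoritative reference.

```python
from collections import Counter


def open_ag3():
    """Open the Ag3 data resource in the cloud.

    Assumes the third-party `malariagen_data` package is installed and that
    the environment is permitted to read the release's cloud storage.
    """
    import malariagen_data  # imported lazily so count_by works without it

    return malariagen_data.Ag3()


def count_by(records, field):
    """Tally an iterable of metadata records (dicts) by one field."""
    return Counter(record[field] for record in records)


def summarise_release(release="3.0"):
    """Count samples per country and per taxon (needs network access).

    The column names "country" and "taxon" are assumptions about the
    sample metadata table.
    """
    ag3 = open_ag3()
    df = ag3.sample_metadata(sample_sets=release)  # pandas DataFrame
    records = df.to_dict("records")
    return count_by(records, "country"), count_by(records, "taxon")
```

Calling `summarise_release()` returns two `Counter` objects. The tallying step is kept separate from the data loading so that it can be checked on a small hand-made list of records without network access.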

Contributing studies

Mosquito specimens sequenced in this data release were contributed by 26 studies. For more information about the researchers and studies that contributed these specimens, including collection methods and contact information, please see the contributing studies document.

Ag3.0 Sample Locations

The Ag3.0 data release contains 2,784 wild-caught mosquito specimens collected from 19 countries, as well as 297 specimens comprising parents and progeny of 15 lab crosses.

Ag3.0 Terms of Use

The Data Producers (the Consortium and its Contributing Investigators) will release the Project data prior to publication, in the expectation that they will be valuable for many researchers. In keeping with Fort Lauderdale principles, Data Users may use the data for their own studies, but are expected to allow the Consortium and its Contributing Investigators to make the first presentations and to publish the first papers with global analyses of the data. Researchers who have questions about whether they may make presentations or submit papers using Project data, or whether to include the Anopheles gambiae 1000 Genomes Consortium as an author, may contact Martin Donnelly (M.J.Donnelly@liverpool.ac.uk). Please see Ag1000G Terms of Use for the full terms.

Ag3.0 (Ag1000G phase 3) SNP data release

Further details on genome-wide single nucleotide polymorphism (SNP) calling methods can be found here.

Access the data via the user guide.

Ag3.0 (Ag1000G phase 3) CNV data release

Further details on genome-wide copy number variant (CNV) calls can be found here.

Ag3.0 (Ag1000G phase 3) haplotypes data release

Further details on genome-wide phased haplotypes can be found here.

Ag3.0 (Ag1000G phase 3) data access

Please visit the release page on the vector data guide to find out more about how to access the data.