Ag1000G phase 3 SNP data release

Project: Ag1000G

Released on 11 Feb 2021

This data release includes sample metadata, sequence read alignments and genome-wide single nucleotide polymorphism (SNP) calls from whole-genome sequencing of 2,784 wild-caught mosquitoes collected from 19 countries in sub-Saharan Africa, and 297 mosquitoes comprising parents and progeny of 15 lab crosses. Three mosquito species are represented: Anopheles gambiae, Anopheles coluzzii and Anopheles arabiensis. All mosquitoes were sequenced using Illumina technology by the Wellcome Sanger Institute Parasites and Microbes programme.

Data availability

The sample metadata, sequence read alignments and SNP calls in this release are available to download from public archives and can also be accessed directly in Google Cloud. For more information about downloading data, please see the data download guide. For more information about accessing data in the cloud, please see the cloud access guide.

Contributing studies

Mosquito specimens sequenced in this data release were contributed by 26 studies. For more information about the contributing researchers and studies, including collection methods and contact information, please see the contributing studies document.

Sequencing and variant calling methods

A complete description of the methods used for sequencing and variant calling will be provided in a future publication. In the interim, a brief description of the methods is provided in the SNP calling methods document.