Ag1000G phase 3 haplotypes data release

Project: Ag1000G

Released on 4 Nov 2021

This data release includes phased haplotypes for 2,784 wild-caught mosquitoes collected from 19 countries in sub-Saharan Africa. These haplotypes can be analysed directly or used as haplotype reference panels to improve phasing of other samples. Three mosquito species are represented: Anopheles gambiae, Anopheles coluzzii and Anopheles arabiensis. All mosquitoes were sequenced using Illumina technology by the Wellcome Sanger Institute Parasites and Microbes programme.

Data availability

This data release comprises phased haplotypes at biallelic SNP sites. These data are hosted in Google Cloud. For more information about downloading data, please see the haplotypes section in the Ag3 data download guide. For more information about accessing data in the cloud, please see the haplotypes section in the Ag3 cloud access guide.

Contributing studies

Mosquito specimens sequenced in this data release were contributed by 26 studies. For more information about the researchers and studies who contributed these specimens, collection methods and contact information, please see the Ag1000G partner studies.

Haplotype phasing methods

Haplotypes were phased using a combination of read-backed phasing and statistical phasing. Further information about the phasing methods and links to the pipeline implementation are available from the haplotype phasing section in the Ag3 methods.