This data release includes whole genome sequences from 540 An. dirus mosquitoes collected from different sites across Bangladesh, Cambodia and Thailand. These were collected during field studies led by Dr. Brandy St. Laurent in collaboration with the National Center for Parasitology, Entomology, and Malaria Control (CNM) Cambodia and the NIH NIAID Laboratory of Malaria and Vector Research. These collections were done between 2017 and 2020. Multiple Anopheles species were collected in each of these studies, including the An. dirus. s.l. specimens that have been included in this study.

All mosquitoes were sequenced using Illumina technology with 150bp long reads by the Wellcome Sanger Institute Parasite and Microbes Programme.

Data sets

Adir1.0 Contributing Studies

1276 - Anopheles dirus vector surveillance in Bangladesh 

1277 - Anopheles dirus vector surveillance in Cambodia 

1278- Anopheles dirus vector surveillance in Thailand

Adir1.0 Terms of Use

Data from this project will be made publicly available before journal publication. Unless otherwise stated, analyses of project data are ongoing and publications are in preparation by project partners, and it is not permitted to use project data for publication (including any type of communication with the general public) without prior permission from the originating partner studies.

Although malaria is generally an endemic rather than an epidemic disease, and the focus of this project is on surveillance of disease vectors rather than pathogens, our data terms of use build on MalariaGEN's approach to data sharing, and adopt norms which have been established for rapid sharing of pathogen genomic data during disease outbreaks. The primary rationale for this approach is that malaria remains a public health emergency, where ethically appropriate and rapid sharing of genomic surveillance data can help to detect and respond to biological threats such as new forms of insecticide resistance, and to adapt malaria vector control strategies to different settings and changing circumstances.

If you have any questions regarding these terms of use, please contact support@malariagen.net.

Adir1.0 Data Availability

This data release includes sample metadata, whole genome sequence data, and genome-wide SNP calls from whole genome sequencing of 540 wild-caught mosquitoes collected from Bangladesh, Cambodia and Thailand. These individuals are all An. dirus s.l. These data are hosted in Google Cloud. For more information about accessing data in the cloud, please see the cloud data access guide.

Adir1.0 Whole genome sequencing and variant calling

The methods for sequencing and variant calling closely follow Ag1000G. 150bp sequence reads were aligned to the An. dirus reference genome AdirusWRAIR2.