TY - JOUR
T1 - Thousands of Qatari genomes inform human migration history and improve imputation of Arab haplotypes
AU - Qatar Genome Program Research Consortium
AU - Qatar Genome Project Management
AU - Biobank and Sample Preparation
AU - Sequencing and Genotyping group
AU - Applied Bioinformatics Core
AU - Data Management and Computing Infrastructure group
AU - Consortium Lead Principal Investigators
AU - Razali, Rozaimi Mohamad
AU - Rodriguez-Flores, Juan
AU - Ghorbani, Mohammadmersad
AU - Naeem, Haroon
AU - Aamer, Waleed
AU - Aliyev, Elbay
AU - Jubran, Ali
AU - Ismail, Said I.
AU - Al-Muftah, Wadha
AU - Badji, Radja
AU - Mbarek, Hamdi
AU - Darwish, Dima
AU - Fadl, Tasnim
AU - Yasin, Heba
AU - Ennaifar, Maryem
AU - Abdellatif, Rania
AU - Alkuwari, Fatima
AU - Alvi, Muhammad
AU - Al-Sarraj, Yasser
AU - Saad, Chadi
AU - Althani, Asmaa
AU - Fethnou, Eleni
AU - Qafoud, Fatima
AU - Alkhayat, Eiman
AU - Afifi, Nahla
AU - Tomei, Sara
AU - Liu, Wei
AU - Lorenz, Stephan
AU - Syed, Najeeb
AU - Almabrazi, Hakeem
AU - Vempalli, Fazulur Rehaman
AU - Temanni, Ramzi
AU - Saqri, Tariq Abu
AU - Khatib, Mohammedhusen
AU - Hamza, Mehshad
AU - Zaid, Tariq Abu
AU - El Khouly, Ahmed
AU - Pathare, Tushar
AU - Poolat, Shafeeq
AU - Al-Ali, Rashid
AU - Albagha, Omar
AU - Al-Khodor, Souhaila
AU - Alshafai, Mashael
AU - Badii, Ramin
AU - Chouchane, Lotfi
AU - Estivill, Xavier
AU - Fakhro, Khalid A.
AU - Mokrab, Younes
AU - Puthen, Jithesh V.
AU - Suhre, Karsten
N1 - Publisher Copyright:
© 2021, The Author(s).
PY - 2021/12
Y1 - 2021/12
N2 - Arab populations are largely understudied, notably their genetic structure and history. Here we present an in-depth analysis of 6,218 whole genomes from Qatar, revealing extensive diversity as well as genetic ancestries representing the main founding Arab genealogical lineages of Qahtanite (Peninsular Arabs) and Adnanite (General Arabs and West Eurasian Arabs). We find that Peninsular Arabs are the closest relatives of ancient hunter-gatherers and Neolithic farmers from the Levant, and that founder Arab populations experienced multiple splitting events 12–20 kya, consistent with the aridification of Arabia and farming in the Levant, giving rise to settler and nomadic communities. In terms of recent genetic flow, we show that these ancestries contributed significantly to European, South Asian as well as South American populations, likely as a result of Islamic expansion over the past 1400 years. Notably, we characterize a large cohort of men with the ChrY J1a2b haplogroup (n = 1,491), identifying 29 unique sub-haplogroups. Finally, we leverage genotype novelty to build a reference panel of 12,432 haplotypes, demonstrating improved genotype imputation for both rare and common alleles in Arabs and the wider Middle East.
AB - Arab populations are largely understudied, notably their genetic structure and history. Here we present an in-depth analysis of 6,218 whole genomes from Qatar, revealing extensive diversity as well as genetic ancestries representing the main founding Arab genealogical lineages of Qahtanite (Peninsular Arabs) and Adnanite (General Arabs and West Eurasian Arabs). We find that Peninsular Arabs are the closest relatives of ancient hunter-gatherers and Neolithic farmers from the Levant, and that founder Arab populations experienced multiple splitting events 12–20 kya, consistent with the aridification of Arabia and farming in the Levant, giving rise to settler and nomadic communities. In terms of recent genetic flow, we show that these ancestries contributed significantly to European, South Asian as well as South American populations, likely as a result of Islamic expansion over the past 1400 years. Notably, we characterize a large cohort of men with the ChrY J1a2b haplogroup (n = 1,491), identifying 29 unique sub-haplogroups. Finally, we leverage genotype novelty to build a reference panel of 12,432 haplotypes, demonstrating improved genotype imputation for both rare and common alleles in Arabs and the wider Middle East.
UR - http://www.scopus.com/inward/record.url?scp=85118396665&partnerID=8YFLogxK
U2 - 10.1038/s41467-021-25287-y
DO - 10.1038/s41467-021-25287-y
M3 - Article
C2 - 34642339
AN - SCOPUS:85118396665
SN - 2041-1723
VL - 12
JO - Nature Communications
JF - Nature Communications
IS - 1
M1 - 5929
ER -