Pre-historical In Mesolithic Europe, R1a is characteristic of
Eastern Hunter-Gatherers (EHGs). A male EHG of the
Veretye culture buried at
Peschanitsa near
Lake Lacha in
Arkhangelsk Oblast, Russia c. 10,700 BCE was found to be a carrier of the paternal haplogroup R1a5-YP1301 and the maternal haplogroup
U4a. A male, named PES001, from
Peschanitsa in
northwestern Russia was found to carry R1a5, and dates to at least 10,600 years ago. A Mesolithic male from
Karelia c. 8,800 BCE to 7950 BCE has been found to be carrying haplogroup R1a. A
Mesolithic male buried at
Deriivka c. 7000 BCE to 6700 BCE carried the paternal haplogroup R1a and the maternal
U5a2a. Another male from Karelia from c. 5,500 to 5,000 BC, who was considered an EHG, carried haplogroup R1a. A male from the
Comb Ceramic culture in
Kudruküla c. 5,900 BCE to 3,800 BCE has been determined to be a carrier of R1a and the maternal
U2e1. According to archaeologist David Anthony, the paternal
R1a-Z93 was found at the Oskol river near a no longer existing kolkhoz "Alexandria",
Ukraine c. 4000 BCE, "the earliest known sample to show the genetic adaptation to lactase persistence (13910-T)." R1a has been found in the
Corded Ware culture, in which it is predominant. Examined males of the Bronze Age
Fatyanovo culture belong entirely to R1a, specifically subclade R1a-Z93. Haplogroup R1a has later been found in ancient fossils associated with the
Urnfield culture; as well as the burial of the remains of the
Sintashta,
Andronovo, the
Pazyryk,
Tagar,
Tashtyk, and
Srubnaya cultures, the inhabitants of ancient
Tanais, in the
Tarim mummies, and the aristocracy of
Xiongnu. The skeletal remains of a father and his two sons, from an archaeological site discovered in 2005 near Eulau (in
Saxony-Anhalt,
Germany) and dated to about 2600 BCE, tested positive for the Y-SNP marker SRY10831.2. The
Ysearch number for the Eulau remains is 2C46S. The ancestral clade was thus present in Europe at least 4600 years ago, in association with one site of the widespread
Corded Ware culture.
Europe In Europe, the R1a1a sub-clade is primarily characteristic of Balto-Slavic populations, with two exceptions: southern Slavs and northern Russians. The highest frequency of R1a1a in Europe is observed in
Sorbs (63%), a
West Slavic ethnic group, followed by
Hungarians (60%). Other groups with significant R1a1a, ranging from 27% to up to 58%, include
Czechs,
Poles,
Slovenians,
Slovaks,
Moldovans,
Belarusians,
Rusyns,
Ukrainians, and
Russians. R1a frequency decreases in northeastern Russian populations down to 20%–30%, in contrast to central-southern Russia, where its frequency is twice as high. In the
Baltics, R1a1a frequencies decrease from Lithuania (45%) to Estonia (around 30%). There is also a significant presence in peoples of
Germanic descent, with highest levels in
Norway,
Sweden and
Iceland, where between 20 and 30% of men are in R1a1a.
Vikings and
Normans may have also carried the R1a1a lineage further out, accounting for at least part of the small presence in the
British Isles, the
Canary Islands, and
Sicily. Haplogroup R1a1a averages between 10 and 30% in
Germans, with a peak in
Rostock at 31.3%. R1a1a is found at a very low frequency among
Dutch people (3.7%) and is virtually absent in
Danes. In Southern Europe R1a1a is not common, but significant levels have been found in pockets, such as in the
Pas Valley in Northern
Spain, areas of
Venice, and
Calabria in
Italy. The
Balkans shows wide variation between areas with significant levels of R1a1a, for example 36–39% in
Slovenia, 27–34% in
Croatia, and over 30% in
Greek Macedonia, but less than 10% in
Albania,
Kosovo and parts of
Greece south of Olympus gorge. R1a is virtually composed only of the Z284 subclade in
Scandinavia. In Slovenia, the main subclade is Z282 (Z280 and M458), although the Z284 subclade was found in one sample of a Slovenian. There is a negligible representation of Z93 in
Turkey, 12,1% The Russian geneticist Oleg Balanovsky speculated that there is a predominance of the assimilated pre-Slavic substrate in the genetics of East and West Slavic populations, according to him the common genetic structure which contrasts East Slavs and Balts from other populations may suggest the explanation that the pre-Slavic substrate of the East and West Slavs consisted most significantly of Baltic-speakers, which at one point predated the Slavs in the cultures of the
Eurasian steppe according to archaeological and toponymic references.
Asia Central Asia found R1a1a in 64% of a sample of the
Tajiks of
Tajikistan and 63% of a sample of the
Kyrgyz of
Kyrgyzstan. found R1a1a-M17 in 26.0% (53/204) of a set of samples from
Afghanistan, including 60% (3/5) of a sample of
Nuristanis, 51.0% (25/49) of a sample of Pashtuns, 30.4% (17/56) of a sample of Tajiks, 17.6% (3/17) of a sample of Uzbeks, 6.7% (4/60) of a sample of Hazaras, and in the only sampled Turkmen individual. found R1a1a-M198/M17 in 56.3% (49/87) of a pair of samples of Pashtuns from Afghanistan (including 20/34 or 58.8% of a sample of Pashtuns from
Baghlan and 29/53 or 54.7% of a sample of Pashtuns from
Kunduz), 29.1% (37/127) of a pool of samples of Uzbeks from Afghanistan (including 28/94 or 29.8% of a sample of Uzbeks from
Jawzjan, 8/28 or 28.6% of a sample of Uzbeks from
Sar-e Pol, and 1/5 or 20% of a sample of Uzbeks from
Balkh), 27.5% (39/142) of a pool of samples of Tajiks from Afghanistan (including 22/54 or 40.7% of a sample of Tajiks from
Balkh, 9/35 or 25.7% of a sample of Tajiks from
Takhar, 4/16 or 25.0% of a sample of Tajiks from
Samangan, and 4/37 or 10.8% of a sample of Tajiks from
Badakhshan), 16.2% (12/74) of a sample of
Turkmens from
Jawzjan, and 9.1% (7/77) of a pair of samples of
Hazara from Afghanistan (including 7/69 or 10.1% of a sample of Hazara from
Bamiyan and 0/8 or 0% of a sample of Hazara from
Balkh). found R1a1-SRY10831.2 in 30.0% (12/40) of a sample of Tajiks from Tajikistan. found R1a-M198 in 6.03% (78/1294) of a set of samples of
Kazakhs from
Kazakhstan. R1a-M198 was observed with greater than average frequency in the study's samples of the following Kazakh tribes: 13/41 = 31.7% of a sample of Suan, 8/29 = 27.6% of a sample of Oshaqty, 6/30 = 20.0% of a sample of Qozha, 4/29 = 13.8% of a sample of Qypshaq, 1/8 = 12.5% of a sample of Tore, 9/86 = 10.5% of a sample of Jetyru, 4/50 = 8.0% of a sample of Argyn, 1/13 = 7.7% of a sample of Shanyshqyly, 8/122 = 6.6% of a sample of Alimuly, 3/46 = 6.5% of a sample of Alban. R1a-M198 also was observed in 5/42 = 11.9% of a sample of Kazakhs of unreported tribal affiliation.
South Asia In South Asia, R1a1a has often been observed in a number of demographic groups. South Asian populations have the highest
STR diversity within R1a1a, and subsequent older
TMRCA datings. In
India, high frequencies of this haplogroup is observed in
West Bengal Brahmins (72%) in the east,
Bhanushali (67%) and
Gujarat Lohanas (60%) in the west,
Uttar Pradesh Brahmins (68%),
Punjab/Haryana Khatris (67%) in the north, and
Karnataka Medars (39%) in the south. It has also been found in several
South Indian Dravidian-speaking
Adivasis including the
Chenchu (26%) of
Andhra Pradesh and
Kota of
Andhra Pradesh (22.58%) and the
Kallar of
Tamil Nadu suggesting that R1a1a is widespread in Tribal Southern Indians. Besides these, studies show high percentages in regionally diverse groups such as
Manipuris (50%) to the extreme North East and among
Punjabis (47%) to the extreme North West. In
Pakistan it is found at 80% among
Yusufzai tribe of
Pashtuns (51%) from
Swat District, 71% among the
Mohanna community in
Sindh province to the south and 46% among the
Baltis of
Gilgit-Baltistan to the north. Among the
Sinhalese of
Sri Lanka, 23% were found to be R1a1a (R-SRY1532) positive. Hindus of
Chitwan District in the
Terai region
Nepal show it at 69%. Among
Syed Y6 most common.
East Asia The frequency of R1a1a is comparatively low among some
Turkic-speaking groups like
Yakuts, yet levels are higher (19 to 28%) in certain Turkic or
Mongolic-speaking groups of
Northwestern China, such as the
Bonan,
Dongxiang,
Salar, and
Uyghurs. A Chinese paper published in 2018 found R1a-Z94 in 38.5% (15/39) of a sample of Keriyalik Uyghurs from Darya Boyi / Darya Boye Village,
Yutian County, Xinjiang (于田县达里雅布依乡), R1a-Z93 in 28.9% (22/76) of a sample of
Dolan Uyghurs from Horiqol township,
Awat County, Xinjiang (阿瓦提县乌鲁却勒镇), and R1a-Z93 in 6.3% (4/64) of a sample of Loplik Uyghurs from Karquga / Qarchugha Village,
Yuli County, Xinjiang (尉犁县喀尔曲尕乡). R1a(xZ93) was observed only in one of 76 Dolan Uyghurs. Note that Darya Boyi Village is located in a remote oasis formed by the
Keriya River in the
Taklamakan Desert. A 2011 Y-DNA study found Y-dna R1a1 in 10% of a sample of southern
Hui people from Yunnan, 1.6% of a sample of
Tibetan people from
Tibet (
Tibet Autonomous Region), 1.6% of a sample of
Xibe people from Xinjiang, 3.2% of a sample of northern Hui from
Ningxia, 9.4% of a sample of Hazak (
Kazakhs) from Xinjiang, and rates of 24.0%, 22.2%, 35.2%, 29.2% in 4 different samples of Uyghurs from Xinjiang, 9.1% in a sample of
Mongols from
Inner Mongolia. A different subclade of R1 was also found in 1.5% of a sample of northern Hui from Ningxia. in the same study there were no cases of R1a detected at all in 6 samples of Han Chinese in
Yunnan, 1 sample of Han in
Guangxi, 5 samples of Han in
Guizhou, 2 samples of Han in
Guangdong, 2 samples of Han in
Fujian, 2 samples of Han in
Zhejiang, 1 sample of Han in
Shanghai, 1 samples of Han in
Jiangxi, 2 samples of Han in
Hunan, 1 sample of Han in
Hubei, 2 samples of Han in
Sichuan, 1 sample of Han in
Chongqing, 3 samples of Han in
Shandong, 5 samples of Han in
Gansu, 3 samples of Han in
Jilin and 2 samples of Han in
Heilongjiang. 40% of Salars, 45.2% of
Tajiks of Xinjiang, 54.3% of Dongxiang, 60.6% of
Tatars and 68.9% of
Kyrgyz in Xinjiang in northwestern China tested in one sample had R1a1-M17. Bao'an (Bonan) had the most haplogroup diversity of 0.8946±0.0305 while the other ethnic minorities in northwestern China had a high haplogroup diversity like Central Asians, of 0.7602±0.0546. In Eastern
Siberia, R1a1a is found among certain indigenous ethnic groups including
Kamchatkans and
Chukotkans, and peaking in
Itel'man at 22%.
Southeast Asia Y-haplogroups R1a-M420 and R2-M479 are found in
Ede (8.3% and 4.2%) and
Giarai (3.7% and 3.7%) peoples in
Vietnam. The
Cham additionally have haplogroups R-M17 (13.6%) and
R-M124 (3.4%). R1a1a1b2a2a (R-Z2123) and R1a1 are found in
Khmer peoples from
Thailand (3.4%) and
Cambodia (7.2%) respectively. Haplogroup R1a1a1b2a1b (R-Y6) is also found among
Kuy peoples (5%). According to Changmai et al. (2022), these haplogroup frequencies originate from South Asians, who left a cultural and genetic legacy in Southeast Asia since the first millennium CE.
West Asia R1a1a has been found in various forms, in most parts of
Western Asia, in widely varying concentrations, from almost no presence in areas such as
Jordan, to much higher levels in parts of
Kuwait and
Iran. The Shimar (
Shammar) Bedouin tribe in
Kuwait show the highest frequency in the Middle East at 43%. , noted that in the western part of the country, Iranians show low R1a1a levels, while males of eastern parts of Iran carried up to 35% R1a1a. found R1a1a in approximately 20% of Iranian males from the cities of
Tehran and
Isfahan. in a study of
Iran, noted much higher frequencies in the south than the north. A newer study has found 20.3% R-M17* among
Kurdish samples which were taken in the
Kurdistan Province in western Iran, 19% among
Azerbaijanis in
West Azerbaijan, 9.7% among Mazandaranis in North Iran in the province of
Mazandaran, 9.4% among Gilaks in province of
Gilan, 12.8% among Persian and 17.6% among Zoroastrians in
Yazd, 18.2% among Persians in
Isfahan, 20.3% among Persians in
Khorasan, 16.7% Afro-Iranians, 18.4%
Qeshmi "Gheshmi", 21.4% among Persian Bandari people in
Hormozgan and 25% among the
Baloch people in
Sistan and Baluchestan Province. found haplogroup R1a in 9.68% (18/186) of a set of samples from Iran, though with a large variance ranging from 0% (0/18) in a sample of Iranians from
Tehran to 25% (5/20) in a sample of Iranians from Khorasan and 27% (3/11) in a sample of Iranians of unknown provenance. All Iranian R1a individuals carried the M198 and M17 mutations except one individual in a sample of Iranians from Gilan (
n=27), who was reported to belong to R1a-SRY1532.2(xM198, M17). found R1a1-SRY10831.2 in 20.8% (16/77) of a sample of
Persians collected in the provinces of
Khorasan and
Kerman in eastern Iran, but they did not find any member of this haplogroup in a sample of 25
Kurds collected in the province of
Kermanshah in western Iran. Further to the north of these Western Asian regions on the other hand, R1a1a levels start to increase in the
Caucasus, once again in an uneven way. Several populations studied have shown no sign of R1a1a, while highest levels so far discovered in the region appears to belong to speakers of the
Karachay-Balkar language among whom about one quarter of men tested so far are in haplogroup R1a1a. ==Historic naming of R1a==