search this blog

Sunday, April 17, 2016

Estimating Basal Eurasian ancestry?


Basal Eurasians (BE) are a hypothetical ghost population that apparently split from other Eurasians no later than 45,000 years ago. If they actually existed, they had a significant impact on the ancestry of early Neolithic farmers, and thus all present-day West Eurasians.

Testing ancestry proportions from ghost populations isn't easy. However, Haak et al. 2015 made use of an f4 equation that seemingly gave an accurate estimate of BE admixture in LBK farmer Stuttgart: f4(Stuttgart,Loschbour;Onge,MA1)/f4(Mbuti,MA1;Onge,Loschbour) = 44%. The other LBK farmers scored an average of 40% BE, which also made sense.

Unfortunately, this equation doesn't appear to work too well for Caucasus Hunter-Gatherers (CHG) Kotias and Satsurblia. They both score around 25% BE, which, as far as I can see, seems way too low. Perhaps using MA1 in the equation is messing things up because CHG harbor significant MA1-related ancestry?

I tinkered around with Haak's equation and came up with this: f4(X,Iberia_Mesolithic;Dai,Karelia_HG)/f4(Mbuti,Karelia_HG;Dai,Iberia_Mesolithic). The results look solid, at least in relative terms (see image below). But is the equation actually valid?

My main worry is using both Iberia Mesolithic and Karelia HG. They share a lot of drift, much more than Loschbour and MA1. Also, even though both Dai and Onge belong to the so called Eastern non-African (ENA) clade, they're quite distinct, with Dai a lot less basal in the context of ENA diversity. Any thoughts? Suggestions?


Update 04/18/2016: Interestingly, my f4 equation essentially fails for most post-Neolithic Europeans, particularly those with relatively high ratios of Karelia HG-related ancestry. For instance, Yamnaya Kalmykia scores just 2.9% BE, which can't be right. Yamnaya Samara shows -2.2%, which is obviously wrong.

But I tried several combinations of reference samples and found that by replacing Karelia HG with Hungary HG and Dai with Ust-Ishim I was able to obtain coherent results for a wider range of groups, including Yamnaya.


To be honest, I still don't know what the hell I'm testing here exactly. The results appear to reflect the existence of two components within West Eurasia; one representing ancient hunter-gatherers from Europe and probably surrounding areas of the Near East, and another closely related to present-day Near Eastern populations. The latter might well be a signal of the so called Basal Eurasians, or perhaps a number of as yet unsampled meta populations from the ancient Near East?

35 comments:

Chad said...

I agree that there are a lot of questions here. Look at Ust_Ishim, according to these three ways of doing things. I've found that having a stronger drift with ENA, in pop A, can really inflate BE according to this. I'm not sure what we're really finding here, or if we can say there is one simple Basal Eurasian. Another thing about these below, is that West and East Eurasians share significant drift after Ust_Ishim. All but Kostenki.

Ust_Ishim Iberia_HG Dai Karelia_HG : Mbuti Karelia_HG Dai Iberia_HG 1.248710 0.063351 19.711

Ust_Ishim Loschbour Onge MA1 : Mbuti MA1 Onge Loschbour 1.171282 0.128513 9.114

Ust_Ishim Loschbour Onge Kostenki : Mbuti Kostenki Onge Loschbour 0.877311 0.077901 11.262

Chad said...

I'm not sure if anyone with any ENA can have reliable results. It makes this "BE" signal go up. Put an East Asian in there, and you see them get 196% Basal Eurasian. West Asians having recent ENA will get inflated results too.

Davidski said...

Certainly, the equation I posted won't work well, or none at all really, for any group with significant East Asian, South Asian and/or Sub-Saharan ancestry.

It only seems to be viable for basically fully West Eurasian farmer/hunter mixture populations.

Alberto said...

It looks difficult to find f4 ratios that would work well for all the populations, though those figures are probably not far off.

I do wonder if under the assumption of the estimates of BE for EEFs (which should be the most reliable), someone could calculate the theoretical coordinates for a ghost Basal Eurasian, for example from the latest PCA 9 (or any other of the datasheets). Maybe Matt knows a possible way to do that using PAST 3 , and while it would still be a very theoretical approach, it could be interesting for calculating BE admixture in "difficult" populations (with ENA, SSA,...) and making some interesting models.

Ryukendo K said...
This comment has been removed by the author.
Krefter said...

Basal Eurasian may have never existed. The argument used by Laz 2013 that it did exist was that Loschbour and MA1 have the same relationship to East Asians, despite being so separated in space and time. However, the same could be said about CHG and EEF.

So, which is it? East Asian admixture in Kostinki/MA1/WHG/EHG or Basal Eurasian in CHG/EEF? Is there any way to know which one? Basal Eurasian makes the most sense, but I'm hesitant to say it's that simple.

Alberto said...

I made a ghost Basal Eurasian just based on Stuttgart being 56% Loschbour and 44% Basal_Eurasian (for the latest PCA data with 9 dimensions posted by Davidski).

It's just for a quick test to see how this could work. With Stuttgart it works as expected:

Stuttgart:LBK380
"Loschbour:Loschbour" 56
"Basal_Eurasian:Ghost1" 44
"Bichon:Bichon" 0
"Hungary_HG:I1507" 0
"Iberia_Mesolithic:I0585" 0
"Karelia_HG:I0061" 0
"MA1:MA1" 0
"Dai:HGDP01308" 0
"Yoruba:HGDP00920" 0
Distance: 0

With others it's harder to say how good it might be:

Anatolia_Neolithic:I0707
"Loschbour:Loschbour" 51.35
"Basal_Eurasian:Ghost1" 48.1
"Karelia_HG:I0061" 0.55
"Bichon:Bichon" 0
"Hungary_HG:I1507" 0
"Iberia_Mesolithic:I0585" 0
"Kostenki14_UP:Kostenki14" 0
"MA1:MA1" 0
"Dai:HGDP01308" 0
"Paniya:PNYD1" 0
"Yoruba:HGDP00920" 0
distance=0.009987

Druze:HGDP00559
"Basal_Eurasian:Ghost1" 42.3
"Iberia_Mesolithic:I0585" 31.55
"MA1:MA1" 23.2
"Yoruba:HGDP00920" 2.2
"Dai:HGDP01308" 0.75
"Bichon:Bichon" 0
"Loschbour:Loschbour" 0
"Hungary_HG:I1507" 0
"Karelia_HG:I0061" 0
distance=0.036596

Kotias is complicated:

Kotias:KK1
"MA1:MA1" 52.55
"Basal_Eurasian:Ghost1" 30.6
"Iberia_Mesolithic:I0585" 15.45
"Yoruba:HGDP00920" 1.4
"Bichon:Bichon" 0
"Loschbour:Loschbour" 0
"Hungary_HG:I1507" 0
"Karelia_HG:I0061" 0
"Dai:HGDP01308" 0
distance=0.101346

Adding Kostenki14:

Kotias:KK1
"MA1:MA1" 38.55
"Kostenki14_UP:Kostenki14" 28.05
"Basal_Eurasian:Ghost1" 25.75
"Hungary_HG:I1507" 7.65
"Bichon:Bichon" 0
"Loschbour:Loschbour" 0
"Iberia_Mesolithic:I0585" 0
"Karelia_HG:I0061" 0
"Dai:HGDP01308" 0
"Yoruba:HGDP00920" 0
distance=0.098612

Adding Paniya:

Kotias:KK1
"Karelia_HG:I0061" 34.45
"Basal_Eurasian:Ghost1" 30.2
"Paniya:PNYD1" 25.7
"Hungary_HG:I1507" 8.9
"Yoruba:HGDP00920" 0.75
"Bichon:Bichon" 0
"Loschbour:Loschbour" 0
"Iberia_Mesolithic:I0585" 0
"Kostenki14_UP:Kostenki14" 0
"MA1:MA1" 0
"Dai:HGDP01308" 0
distance=0.084395

Not sure what to think. Maybe with a ghost based on more samples from HGs and EEFs it could be interesting.

Here is the data for this ghost:

Basal_Eurasian:Ghost1,0.01995,-0.3607192,0.046804818,0.004713,0.03119,-0.08396363,0.1048888,-0.02823418,0.34029281

Chad said...

Ryu,

I'm not sure about that formula, or what it's capturing.

Either way, it's going negative.

result: Stuttgart Onge MA1 Mbuti : Chimp Ust_Ishim MA1 Mbuti -0.032068 0.014474 -2.216
result: Stuttgart Onge Loschbour Mbuti : Chimp Ust_Ishim Loschbour Mbuti -0.197678 0.014160 -13.960
result: Anatolia_Neolithic Onge MA1 Mbuti : Chimp Ust_Ishim MA1 Mbuti -0.037357 0.010465 -3.570
result: Anatolia_Neolithic Onge Loschbour Mbuti : Chimp Ust_Ishim Loschbour Mbuti -0.173997 0.010316 -16.867
result: Kotias Onge MA1 Mbuti : Chimp Ust_Ishim MA1 Mbuti -0.055225 0.013913 -3.969
result: Kotias Onge Loschbour Mbuti : Chimp Ust_Ishim Loschbour Mbuti -0.104456 0.013704 -7.622
result: Stuttgart Ust_Ishim MA1 Mbuti : Chimp Ust_Ishim MA1 Mbuti -0.061164 0.017688 -3.458
result: Stuttgart Ust_Ishim Loschbour Mbuti : Chimp Ust_Ishim Loschbour Mbuti -0.208354 0.018985 -10.975
result: Anatolia_Neolithic Ust_Ishim MA1 Mbuti : Chimp Ust_Ishim MA1 Mbuti -0.065635 0.014418 -4.552
result: Anatolia_Neolithic Ust_Ishim Loschbour Mbuti : Chimp Ust_Ishim Loschbour Mbuti -0.185258 0.015859 -11.682
result: Kotias Ust_Ishim MA1 Mbuti : Chimp Ust_Ishim MA1 Mbuti -0.088006 0.017845 -4.932
result: Kotias Ust_Ishim Loschbour Mbuti : Chimp Ust_Ishim Loschbour Mbuti -0.117491 0.017352 -6.771
result: Stuttgart Ust_Ishim Karelia_HG Mbuti : Chimp Ust_Ishim Karelia_HG Mbuti -0.179750 0.018753 -9.585
result: Anatolia_Neolithic Ust_Ishim Karelia_HG Mbuti : Chimp Ust_Ishim Karelia_HG Mbuti -0.173216 0.015347 -11.286
result: Kotias Ust_Ishim Karelia_HG Mbuti : Chimp Ust_Ishim Karelia_HG Mbuti -0.172484 0.018718 -9.215

Ryukendo K said...
This comment has been removed by the author.
Chad said...

The results are still the same. Go to Haak et al. (2015). There is a section on the f4-ratio for Basal Eurasian. See if you can come up with something else.

Ryukendo K said...
This comment has been removed by the author.
Chad said...

The results are still infeasible. You have to have the West Eurasian and East Eurasian pops flipped. I've run all those. Kostenki seems to be okay, from between Ust_Ishim and West Eurasians. I'm not sure about using EHG, due to reasons you know from our talks. It would only complicate things. Another thing I've noticed from these formulas is that MA1 comes out about 40% BE, which is odd, considering he is basically equidistant from Ust_Ishim as WHG. We may need a whole new formula.

I still don't think we can completely throw out ENA into WHG, less into EEF, with some SSA, and toss out BE. Kostenki is equally distant from Ust_Ishim as WHG and ENA, yet further from ENA.

It could be that the Near East and farmers have some Kostenki-like ancestry, plus minor SSA.

Davidski said...

Interestingly, the equation I posted totally fails for Yamnaya. It must be because of too much shared drift with Karelia HG.

Yamnaya_Kalmykia Iberia_Mesolithic Dai Karelia_HG : Mbuti Karelia_HG Dai Iberia_Mesolithic 0.029006 0.052593 0.552

Yamnaya_Samara Iberia_Mesolithic Dai Karelia_HG : Mbuti Karelia_HG Dai Iberia_Mesolithic -0.022191 0.050312 -0.441

Chad said...

Kostenki gives them 42%, Karelia 29%, and Kotias 49%, but MA1 40%. BE may be an illusion. No matter the sample, someone won't make sense. I can't come up with an alternative yet.

Davidski said...

I'm just looking for a test that can estimate Basal Eurasian proportions in ancient farmers, ancient Near Easterners and also modern Near Easterners who don't have too much Sub-Saharan and South/East Asian ancestry.

All of these groups have very similar ancestry, and I think that's why my equation works. Although it'd be nice to find an alternative to Karelia HG just in case potential shared ancestry between CHG and Karelia HG is underestimating the Basal score in the CHGs.

I wouldn't worry about anyone else. There's just so much a single f4 test can analyze. It can't cover all the bases, especially for more complicated mixtures.

Kostenki14 works OK in place of Karelia HG, but the estimates look a little low across the board IMO.

Chad said...

Here is a few ways to look at it. This is supposed to be capturing ancestry not descended from Ust_Ishim, however, it seems this and the inverse are actually only capturing what is descended from Ust_Ishim, rather than deeper. Although, we see other HG and MN get lower amounts, which seems to imply that Ust_Ishim is BE, but not the same as what farmers have.

result: Ust_Ishim Loschbour Onge MA1 : Mbuti MA1 Onge Loschbour 1.171282 0.128513 9.114
result: Ust_Ishim Loschbour Onge Kostenki : Mbuti Kostenki Onge Loschbour 0.877311 0.077901 11.262
result: Ust_Ishim Iberia_HG Dai Karelia_HG : Mbuti Karelia_HG Dai Iberia_HG 1.248710 0.063351 19.711
result: Ust_Ishim Onge Loschbour Han : Mbuti Han Loschbour Onge 1.101166 0.106606 10.329

Seinundzeit said...

Very interesting questions, and the output seems very clean/solid, assuming that "Basal Eurasian" is real.

Although, I'm not too sure about this anymore, considering that K14 doesn't have any ancestry basal to Ust-Ishim (I recall a set of stats which demonstrated this), yet is much more distant from ENA compared to ANE/EHG/WHG. Like Chad stated, just having substantial ENA ancestry in West Eurasian hunter gatherers, with much less in ENF/CHG, and perhaps some African ancestry in ENF/CHG, probably makes more sense considering the data we have post-Laziridis et al.

I guess we'll just have to wait and see how things play out with more aDNA.

Anyway, Alberto's experiment seems to work great with EEF and SHG:

LBK_EN
55.70% Loschbour
43.95% Basal_Eurasian:Ghost1
0.25% Karelia_HG
0.10% Yoruba

Starcevo_EN
53.65% Loschbour
46.35% Basal_Eurasian:Ghost1

Motala_HG
55.8% Loschbour
31.3% Samara_HG
11.9% Karelia_HG
1.0% Basal_Eurasian:Ghost1

This is pretty cool. If we assume the Basal Eurasian/Crown Eurasian scenario, ENF/CHG are the mixed ones, so it's great to see how they stack up with unadmixed populations (again, unadmixed as per the Basal Eurasian/Crown Eurasian idea). These were done with the same reference populations as the EEF/SHG ones:

Lithuanian
54.3% Loschbour
30.4% Srubnaya_outlier:I0354
15.3% Basal_Eurasian:Ghost1

Albanian
43.50% Loschbour
31.20% Basal_Eurasian:Ghost1
20.95% Srubnaya_outlier:I0354
4.35% Paniya

Lebanese_Christian
43.35% Basal_Eurasian:Ghost1
23.00% Loschbour
21.90% Srubnaya_outlier:I0354
9.30% Paniya
2.45% Yoruba

Georgian
35.35% Basal_Eurasian:Ghost1
32.25% Srubnaya_outlier:I0354
20.25% Loschbour
12.15% Paniya

Kurdish
34.15 Basal_Eurasian:Ghost1
33.60% Srubnaya_outlier:I0354
16.20% Loschbour
15.45% Paniya
0.60% Yoruba

Kalash
50.80% Srubnaya_outlier:I0354
33.20% Paniya
14.50% Basal_Eurasian:Ghost1
1.35% Loschbour
0.15% Yoruba

Kotias
47.60% Srubnaya_outlier:I0354
26.30% Basal_Eurasian:Ghost1
18.10% Paniya
6.45% Loschbour
1.55% Yoruba

I don't think we really know what the Paniya percentages represent in the Balkans, West Asia, the Caucasus, South Central Asia, and in CHG. Also, Srubnaya_outlier is basically an EHG/ANE sample, which is why I used it.

A side note, but this estimate of BEA for Kotias matches what David found using the Haak et al. equation.

Chad said...

result: Iberia_HG Ju_hoan_North Stuttgart Yoruba : Iberia_HG Ju_hoan_North Loschbour Yoruba 0.664114 0.010259 64.733
result: Iberia_HG Ju_hoan_North Iberia_EN Yoruba : Iberia_HG Ju_hoan_North Loschbour Yoruba 0.691873 0.009134 75.745
result: Iberia_HG Ju_hoan_North Anatolia_Neolithic Yoruba : Iberia_HG Ju_hoan_North Loschbour Yoruba 0.662611 0.007468 88.724
result: Iberia_HG Ju_hoan_North Kotias Yoruba : Iberia_HG Ju_hoan_North Loschbour Yoruba 0.613168 0.010476 58.531

Chad said...

Paniyas and Srubnaya have BE, if it's real. So it really clouds the picture.

Davidski said...

Really not sure if it's possible to test BE with PCA/nMonte.

Anyway, I updated the post with new results.

FrankN said...

I always had problems with the Basal/Crown Eurasian concept. I feel it is (a) oversimplifying things and (b) possibly labelling groups the wrong way, i.e. "Crown" being the more ancient and widespread, and "basal" the later and more geographically restricted group.

Let's run through our sketchy knowledge of paleolithic population movements and genetic admix:

1. Hominids: Neandertal as "basal" (all Eurasian), Denisova as a "crown", mostly SEA population.
Quite early, ca. 100 kya, Neandertals have differentiated into a West and a East Eurasian line. The latter, represented by the Altai Neandertal, received early AMH genetic inflow, which European Neandertal samples are lacking. So far, we lack comprehensive information to which extent these two Neandertal clades may have differently acted on West/ East Eurasian populations, but I am confident that S. Pääbo's team at Leipzig is working on it.

2. Neandertal admixture shows regionally differentiated patterns, see
http://www.sciencedirect.com/science/article/pii/S0002929715004863
"Of the three putatively introgressed core haplotypes, III and IV are most similar to the Altai Neandertal genome (..). Core haplotype III is present in all non-African populations. (..) Core haplotype IV is restricted to specific Asian groups."
So, on top of Denisova, we have another "crown" East Eurasian component reflected by specific Altai Neandertal gene flow.

3. yDNA: yDNA K, or K2, estimated to have split off around 45 kya, appears to well represent, in Dave's words, "a hypothetical ghost population that apparently split from other Eurasians no later than 45,000 years ago" . In fact, most West Eurasian yDNA hgs (G/H/I/J) come from upstream that split, East Eurasian ones from downstream. The exception, of course, is yDNA R.
Irritating here is the tree phylogeny, since Australian K-M256 and Papuan M and S are found downstream of K, yet should in principle represent more ancient, "basal" populations than the West Eurasian G/H/I/J. To put it differently: Typical West Eurasian yDNA is indeed "basal" to the yDNA tree, East Eurasian and Ocean yDNA more downstream ("crown"), but we have little indication that those "basal Eurasians" in pre-historic times ever settled more than West Eurasia. Apparently, there is still much we have to learn about how the second "out of africa" wave some 45-50 kya ago worked. In any case, I suppose that "basal" vs. "crown" Eurasian is closely linked to yDNA pre- vs. post K(2).

4. mtDNA: Here, the case is less clear-cut. The M-N split appears to predate the emergence of "Basal Eurasian" at around 45kya. More fitting is mtDNA U, which dominates West Eurasian UP mtDNA. Intriguing here is the case of U2, which is predominantly South Asian, but includes West Eurasian U2e. South Asian U2 thus may represent an Upper Paleolithic West Eurasian migration into South Asia; note in this context also Kostenki's mtDNA U2.
Possibly, U2 is related to the "Paniya" component showing up in MA1 and EHG.

In summary, we still have a lot to learn about Paleolithic population movement. If a sizeable "common Eurasian" layer has survived at all seeams doubtful. Rather, we may be dealing with a quite early east-west split. The eastern side would be represented by Denisova, Altai Neandertal, yDNA K(2), mtDNA M, and Ust-Ishim; the western side by European Neandertals, yDNA GHIJ, mtDNA U (plus possibly others, e.g. HV), and Kostenki. Already during the UP, there should have been several cross-overs.

Matt said...

Hi All,

Can anyone explain to me why the ratio is

f4(X,Iberia_Mesolithic;Dai,Karelia_HG)/f4(Mbuti,Karelia_HG;Dai,Iberia_Mesolithic)

rather than

f4(X,Iberia_Mesolithic;Dai,Karelia_HG)/f4(Mbuti,Iberia_Mesolithic;Dai,Karelia_HG)?

(modeled on the same in Haak with Loschbour rather than Iberia and MA-1 rather than Karelia).

Also wouldn't

f4(X,Iberia_Mesolithic;Ust_Ishim,Hungary_HG)/f4(Mbuti,Hungary_HG;Ust_Ishim,Iberia_Mesolithic)

give more basal results for Yamnaya simply because

f4(X,Iberia_Mesolithic;Ust_Ishim,Hungary_HG) is going to be more similar to f4(Mbuti,Hungary_HG;Ust_Ishim,Iberia_Mesolithic) for them*, because they don't have much / any WHG admixture?

(the opposite problem seems to have been attacking the ratio using Karelia). Doesn't seem like it can be valid for EHG+Basal Mixtures. Even for Anatolian farmers, they are in theory WHG+UHG+Basal.

* As f4(X,Iberia_Mesolithic;Ust_Ishim,Hungary_HG) approaches equal to f4(Mbuti,Hungary_HG;Ust_Ishim,Iberia_Mesolithic) the ratio of the two approaches 1.

@ Alberto: Yes, like Ryu is mentioning, if you have a set of stable BE estimates for populations that you know to be simple two way mixes of BE and WHG (at least more than 3 populations, including WHG with 0), then you can do a regression equation in PAST to predict a 100% BE zombie in the PCA space. Then use it the zombie data through nMonte for more complex populations. I think you've done something like that?

One thing I would say about whatever Basal Eurasian construct is produced by this method is, the definition of Basal Eurasian is a population that is equally related to WHG, EHG and ENA. So it should meet this definition, of being equally distant, or is not really Basal Eurasian. However I'm not sure how you would test this, as greater and lesser drift in WHG and ENA branches would interfere with measuring this.

@ Ryu, apologies for not replying to your post in the other thread. I'm not on Anthrogenica, unfortunately. Did you and Alberto resolve what it was you were looking for?

@ Krefter, well observed and stated re: CHG and EEF. Also note Kostenki and EEF and CHG also have the same relationship to East Asians (but not to Ust Ishim or Oase1!).

Nathan Paul said...

FrankN.

Good summary. Lot of times I think its all political or ethnic fervor. otherwise how can yDNA GH becomes west?

Davidski said...

Matt,

As you point out, the ratio in the first test above is...

Stuttgart Iberia_Mesolithic Dai Karelia_HG Mbuti Karelia_HG Dai Iberia_Mesolithic

That's because, unless I'm losing my grip on reality, the ratio in the Haak supp info on page 79 is...

Stuttgart Loschbour Onge MA1 Mbuti MA1 Onge Loschbour

But the ratio you proposed, which puts Karelia_HG in a different place in the second part of the equation, does seem to work for groups that don't have significant Karelia HG admix.

Iranian_Jew Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.516748 0.028777 17.957
Druze Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.502651 0.026115 19.248
Armenian Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.501704 0.028977 17.314
Georgian_Jew Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.478387 0.029196 16.385
Cypriot Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.454031 0.029802 15.235
Georgian Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.453603 0.030366 14.938
Satsurblia Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.442465 0.048693 9.087
Kotias Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.438867 0.044729 9.812
Stuttgart Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.434508 0.04474 9.712
Anatolia_Neolithic Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.405771 0.031206 13.003
Sardinian Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.385938 0.029733 12.98
LBK_EN Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.382318 0.032203 11.872
Iberia_EN Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.375653 0.034828 10.786
Iceman_MN Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.354782 0.045951 7.721
Remedello_BA Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.344965 0.04524 7.625
Hungary_EN Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.340293 0.033698 10.098
Esperstedt_MN Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.339333 0.050863 6.672
Hungary_CA Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.313537 0.049338 6.355
Iberia_Chalcolithic Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.294134 0.035349 8.321
Iberia_MN Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.282007 0.037217 7.577
Hungary_BA Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.174216 0.04601 3.786
Unetice_EBA Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.130044 0.039846 3.264
Bell_Beaker_Germany Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.125737 0.039357 3.195
Andronovo Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.08975 0.043884 2.045
Corded_Ware_Germany Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.041574 0.043202 0.962
Afanasievo Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.03894 0.049053 0.794
Srubnaya Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.038541 0.042581 0.905
Yamnaya_Kalmykia Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.026593 0.047872 0.555
Poltavka Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.007984 0.047997 0.166
Yamnaya_Samara Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG -0.020148 0.045976 -0.438
Samara_Eneolithic Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG -0.363544 0.070432 -5.162

Onur Dincer said...

@Frank

2. Neandertal admixture shows regionally differentiated patterns, see
http://www.sciencedirect.com/science/article/pii/S0002929715004863
"Of the three putatively introgressed core haplotypes, III and IV are most similar to the Altai Neandertal genome (..). Core haplotype III is present in all non-African populations. (..) Core haplotype IV is restricted to specific Asian groups."
So, on top of Denisova, we have another "crown" East Eurasian component reflected by specific Altai Neandertal gene flow.

...

In summary, we still have a lot to learn about Paleolithic population movement. If a sizeable "common Eurasian" layer has survived at all seeams doubtful. Rather, we may be dealing with a quite early east-west split. The eastern side would be represented by Denisova, Altai Neandertal, yDNA K(2), mtDNA M, and Ust-Ishim; the western side by European Neandertals, yDNA GHIJ, mtDNA U (plus possibly others, e.g. HV), and Kostenki. Already during the UP, there should have been several cross-overs.


The Dannemann et al. 2015 paper you linked to only uses the Altai Neanderthal genome as a Neanderthal genome, so you cannot make conclusions specific to the Altai Neanderthal based on that paper. Indeed, the Kuhlwilm et al. 2016 paper (the very paper that found the 100,000 yo modern human introgression into the Altai Neanderthal) also found that the Neanderthal introgression into all non-African modern humans (and not just the West Eurasian ones) was from western Neanderthals, and not from eastern Neanderthals such as the Altai Neanderthal:

"When we refine our estimates of gene flow by adding the chromosome 21 sequences of the European Neanderthals to our genome-wide data, G-PhoCS infers significant rates of gene flow from Neanderthals into modern humans outside Africa only for El Sidrón and Vindija Neanderthals (0.3–2.6%) (Fig. 3a), suggesting that Neanderthals from Europe are more closely related than the Altai Neanderthal to the population that interbred with modern humans outside Africa 47,000–65,000 years ago."

Seinundzeit said...

Chad,

Good points, although this Srubnaya_outlier is actually an EHG/ANE sample, quite distinct from other Srubnaya samples. For example:

(Chimp, MA1, Srubnaya, Srubnaya_Outlier) d=0.0498
(Chimp, MA1, Yamnaya_Samara, Srubnaya_Outlier) d=0.0321
(Chimp, MA1, Samara_Eneolithic, Srubnaya_Outlier) d=0.0011
(Chimp, MA1, Karelia_HG, Srubnaya_Outlier) d= -0.0133

Or one can examine the PCA David posted, in which the Srubnaya_Outlier clusters near EHG, at quite a distance from other Srubnaya samples.

And when it comes to the Paniya, assuming Basal Eurasian is real (which is a very complex/interesting discussion in itself), they are probably (at most) around 5% BEA. So the Paniya percentages (from the Balkans to western Pakistan) probably reflect (at least in part) some sort of Crown Eurasian ancestry (for which we don't have any ancient samples). But who really knows. As always, we need aDNA.

Odd detail, despite the presence of MA1, Samara hunter, Karelia hunter, and Afontova Gora (in the PCA nMonte fits I tried), modern populations from Lithunia to India prefer Srubnaya_outlier, while ancient samples do often tend towards the Karelia/Samara hunters. Not sure what that could mean, if anything.

But as David noted, perhaps the PCA-based approach is wholly inadequate in this case, although it does work great with EEF and SHG.

Alberto said...

@Matt

Yes, I basically did that to create the Basal Eurasian ghost, but manually and with only one sample. It seems to work ok for Europe and the Levant (Basal Eurasian peaking at around 52% in BedouinB), but it's more complicated when it comes to CHG-rich populations (Kotias itself being some 30% BE, Satsurblia a bit less). Maybe moving the position of this ghost to the "east" a bit would work better.

Re: distances of BE to WHG, EHG and East Asian, the theory is that they should be equidistant. But I'm not sure how feasible is that on a PCA. Using BE as the only source population in nMonte (as you did before with 4mix) we can calculate the total distance for any given target population (using the Ncycles=1 parameter saves a lot of time):

Loschbour <-> Basal_Eurasian:Ghost1 = 0.620881
Karelia_HG <-> Basal_Eurasian:Ghost1 = 0.626815
Dai <-> Basal_Eurasian:Ghost1 = 0.753895
MA1 <-> Basal_Eurasian:Ghost1 = 0.605049

So clearly Dai look quite a bit further away from BE than the HGs are. Checking with Ust-Ishim to see if it's equidistant from East and West Eurasian:

Loschbour <-> Ust_Ishim = 0.400614
Dai <-> Ust_Ishim = 0.355974

Also not exactly, though I think D-stats do show that Ust-Ishim is slightly closer to East Asian (IIRC). Interestingly, checking the distance for Dai to Western populations:

Loschbour <-> Dai = 0.592068
MA1 <-> Dai = 0.482459
Anatolia_Neolithic <-> Dai = 0.594729
Kotias <-> Dai = 0.526993

So there is something strange there in Loschbour being too far from Dai relatively to the others. I checked with the D-stats based datasheet to see if the distances there were more correct (I used the last one provided by Davidski with Scythian as a row):

Western_HG <-> Dai = 0.350144
MA1 <-> Dai = 0.244707
Anatolia_Neolithic <-> Dai = 0.335807
Caucasus_HG <-> Dai = 0.308551

A similar pattern, though even a bit more strange to see Loschbour being even further away from Dai than Anatolia_Neolithic is.

Grey said...

pure guess but just in case it nudges anything

1) Personally I've always thought the idea of the Neanderthals being wiped out in one big sweep was unlikely. If they were particularly cold adapted then at the very least some in the high altitudes of high latitudes should have survived longer imo and in the most extreme case (north Himalayas?) potentially a lot longer. In which case the time since last 50/50 mixture may have been substantially different between the mid-east and northern mountain regions like the Himalayas.

2) If highly divergent DNA like Neanderthal is generally selected against for some reason (except the particularly good bits) from the time of mixture and at the same rate then if (1) is true then the population that had Neanderthal mixture latest would still have more over time than the earliest (even if declining all the time) until a minimum percentage was reached (which may not have happened yet).

3) I'm not a mathematician (probably obviously) but I've messed around with various equations as part of modding AI for games so I know equations can throw up odd effects with unusual situations you'd hadn't thought of.

so

4) how do the various software packages deal with small quantities of highly divergent DNA?

I'm wondering if BE is maybe an artifact revolving around how the software deals with populations whose Neanderthal ancestry was least recent relative to those in whom it's more recent.

Gioiello said...

"To be honest, I still don't know what the hell I'm testing here exactly. The results appear to reflect the existence of two components within West Eurasia; one representing ancient hunter-gatherers from Europe and probably surrounding areas of the Near East, and another closely related to present-day Near Eastern populations. The latter might well be a signal of the so called Basal Eurasians, or perhaps a number of as yet unsampled meta populations from the ancient Near East?" (Davidski)
"Another thing about these below, is that West and East Eurasians share significant drift after Ust_Ishim" (Chad Rohlfsen).
"I'm wondering if BE is maybe an artifact revolving around how the software deals with populations whose Neanderthal ancestry was least recent relative to those in whom it's more recent" (Grey).
...........................................
After Copernicus arrived.

Anonymous said...

It's impossible to find a commonality between groups in terms of things that are defined as the differences between groups.

Twilight said...

It appears that the Antillean Culture came to an end due to an abrupt change in stone technology; courtesy of the succeeding Kebaran Culture. Courtesy of Eupedia's hypothesis; N1,N2,X & W=Basal Eurasian and the fact that those mtdna haplogroups were dominate in Mesolithic farmers, how likely could it be that the Basal Eurasian gene came from the Antillian Culture?

The fact that basal Eurasian is highest in Neolithic Middleeasterners could be a sign that Basal Eurasians were assimilated there.

Twilight said...

Hey Davidski, just thought of something. I was looking at your Basal Eurasian K7 D-Stats and saw that La-Branda hasn't been compared yet to Basal Eurasians. La-Branda in Gedmatch=F999915. It's just a wild guess but I'm wondering if Mesolithic Europeans/Gravettian Culture somehow absorbed some Basal Eurasian before heading to Europe. :)

Davidski said...

La_Brana1 has been thoroughly tested for Basal Eurasian and it doesn't have any. But Villabruna might have a little bit.

Twilight said...

Wish you the best of luck :).
Ultimately it's gonna come to the fact that as far as prehistoric archeology is concerned, you are way ahead of your time. Burials of Modern Homo Sapiens from Upper Paleolithic; or Paleolithic for that matter are simply hard to come by. Therefore you simply do not have enough Paleolithic archeological remains to DNA test for the soul purpose of finding Mr and Miss Basal. But despite our lack of Paleolithic skeletons, we do have our very own Ydna/Mtdna; these two DNA clusters can trace our direct Paternal and direct Maternal ancestors to Paleolithic times.

https://en.m.wikipedia.org/wiki/List_of_human_evolution_fossils

Is there any way to Download Raw Mtdna/Ydna and test them on the Basal-Eurasian K7? If none of the Ydna;A-T or Mtdna;A-Z have any sign of Basal Eurasian Influence then it might be safe to say that Basal-Eurasian is fluke-dna.

Mark Moore (Moderator) said...

I thought that Lazardias et al disputed the idea that Kostenki 14 had Basal Eurasian. At least the same BE that the EEF had. It does not make sense that Kostenki 14 was heavy on Neanderthal genes and yet one of the hallmarks of BE is that it is inversely correlated to such admixture.

Do the tests make more sense if you just use the more recent samples (<15K)? If the tests using only the more recent samples makes sense but the tests with the older ones don't maybe its because the younger and older are not both the same BE.