The genetic prehistory of domesticated cattle from their origin to the spread across Europe
BMC Genetics volume 16, Article number: 54 (2015)
Cattle domestication started in the 9th millennium BCE in Southwest Asia. Domesticated cattle were then introduced into Europe during the Neolithic transition. However, the scarcity of palaeogenetic data from the first European domesticated cattle still inhibits the accurate reconstruction of their early demography. In this study, mitochondrial DNA from 193 ancient and 597 modern domesticated cattle (Bos taurus) from sites across Europe, Western Anatolia and Iran was analysed to provide insight into the Neolithic dispersal process and the role of the local European aurochs population during cattle domestication.
Using descriptive summary statistics and serial coalescent simulations paired with approximate Bayesian computation we find: (i) decreasing genetic diversity in a southeast to northwest direction, (ii) strong correlation of genetic and geographical distances, (iii) an estimated effective size of the Near Eastern female founder population of 81, (iv) that the expansion of cattle from the Near East and Anatolia into Europe does not appear to constitute a significant bottleneck, and (v) that there is evidence for gene-flow between the Near Eastern/Anatolian and European cattle populations in the early phases of the European Neolithic, but that it is restricted after 5,000 BCE.
The most plausible scenario to explain these results is a single and regionally restricted domestication process of cattle in the Near East with subsequent migration into Europe during the Neolithic transition without significant maternal interbreeding with the endogenous wild stock. Evidence for gene-flow between cattle populations from Southwestern Asia and Europe during the earlier phases of the European Neolithic points towards intercontinental trade connections between Neolithic farmers.
The transition from foraging to producing economies, also called Neolithisation, was a major turning-point in human prehistory. The process of Neolithisation started in a region spanning from the Zagros Mountains to Central Anatolia and from Palestine to the plains beyond the East Taurus Mountains [1,2]. It was characterized by the successive appearance of sedentism (12th-10th millennia BCE), plant cultivation (mid-10th millennium), animal husbandry (mid-9th millennium) and pottery (early 7th millennium) [3,4]. Elements of the Neolithic lifestyle expanded into Western Anatolia in the early 7th millennium [5-7], while the earliest signs for Neolithic settlements on the European continent are found in present-day Greece around 6,400 BCE. The subsequent Neolithic spread across the rest of Europe followed at least two main routes: one leading across Southeastern Europe, and the second via the Western Mediterranean [9-11]. The extent to which this expansion of a new culture and economy was driven by the migration of people has been debated for decades [12-17]. An early study on human ancient DNA emphasized the role of inward migration at the beginning of the Neolithic period in Central Europe, a view that is supported by more recent palaeogenomic studies [19,20].
As animal husbandry was an important part of the foundation of the new agricultural lifestyle, remains of domesticated animals can serve as a good proxy for the Neolithic spatial expansion and the presence and activity of farmers in newly populated areas. In recent years, genetic and palaeogenetic studies have increasingly converged on a Southwest Asian origin for the four Neolithic domesticated animals: cattle, sheep, goats, and pigs. For Near Eastern taurine cattle (Bos taurus), a recent coalescent-based analysis using ancient Iranian samples suggested a severe Near Eastern domestication bottleneck, with an estimated effective size of just 80 female founders. However, comprehensive data sets of ancient cattle DNA from other areas are so far restricted to Central and Western Europe, for example from Bollongino et al. Thus, detailed and continent-wide evaluation of the early spatiotemporal demography of Bos taurus has so far been hindered by the lack of data from the key bridging areas of the Neolithic, namely Anatolia, the Balkans, and the Western Mediterranean.
In this study we greatly extend a previous coalescent-based demographic model, based on 15 ancient Iranian and 27 modern Near Eastern and Anatolian cattle mitochondrial DNA (mtDNA) sequences, in terms of sample size, and geographic and temporal range . To be able to investigate the early population history and migratory patterns of taurine cattle in detail, the present model is now conditioned on a larger ancient (n = 193, including the Iranian samples) and modern (n = 597) mtDNA dataset that widely covers the area of the Neolithic westwards expansion from the 7th millennium BCE onwards. The focus of the study is on the time period when cattle were first introduced to Europe, thereby allowing us to address the following questions: i) Is the scenario of a single and severe domestication bottleneck in the Near East still supported when adding the much larger dataset from western Anatolia and Europe? ii) Did cattle reach Europe in a single dispersal process or is there evidence for multiple introductions or continuous gene-flow between regions? iii) How much of the genetic diversity from the Near East was introduced to the European continent? iv) Did the spread of cattle coincide with the spread of the Neolithic culture? and v) Are there any signs of admixture with female aurochs along the expansion route of domestic cattle?
150 samples of prehistoric domestic cattle from 24 archaeological sites were taken to analyse a 434 bp long mitochondrial d-loop fragment (for detailed information on the archaeological sites and sample age see Additional file 1). The majority of investigated individuals (113) come from Western Anatolia and Southeastern Europe, i.e. a region defined as an “interim zone”, pointing to its bridging position between the “Neolithic core zone” and its European fringe.
A further 22 samples from Southern France and Southern Italy represent the first domesticated cattle to reach Europe on the “Mediterranean route” of the Neolithic expansion.
Additionally, new prehistoric samples from Germany (4), Northern and Western France (11) and Syria (1), plus 80 previously published sequences mainly from Central and Western Europe and Iran were used for population genetic analyses ([23-28] and GenBank: KC172647 - KC172649). A total of 597 modern d-loop sequences of 240 bp length were collected from previously published studies [29,30]. They each provide representative sets of sequences that match the European, Anatolian and Near Eastern study area of the present paper, thereby also covering areas which are underrepresented in the aDNA dataset, e.g. Italy and the Iberian Peninsula. For a complete list of GenBank accession numbers of previously published sequences see Additional file 2.
Ancient DNA work and sequencing
All samples were processed in the ancient DNA facilities at the Institute of Anthropology, Mainz University (Germany), under strict rules for contamination prevention as described in Bramanti et al. . Those include strict separation of pre-PCR and post-PCR labs, protective clothes, regular cleaning of surfaces and equipment with detergent and bleach, and UV-irradiation of rooms, laboratory hoods, and equipment. Bone samples were UV-irradiated for 45 min from two sides. The surface was mechanically removed using a sandblaster (P-G 400, Harnisch & Rieth) or rotary saw (Electer Emax IH-300, MAFRA). Bone cubes of about 0.3 cm side length were again UV-irradiated for 45 min from two sides. Samples were pulverized using a mixer mill (MM200, Retsch). Generally, aliquots of 0.5 g bone powder were incubated on a rocking shaker at 37°C in a decalcification and digestion solution containing 2.5 ml EDTA (0.5 M, pH8; Ambion®/Applied Biosystems), 250 μl N-Laurylsarcosine (0.5 %; Merck) and 30 μl Proteinase K (18 U; Roche). DNA extraction was performed using phenol-chloroform-isoamylalcohol (25:24:1; Roth). DNA was washed and concentrated using 50 kDa Centricons or 50 kDa 15 ml Amicons (Millipore). At least two independent extractions per sample were performed. Extraction blank controls were processed during each extraction. Additionally, the cleanness of the grinding jars was tested by extracting hydroxylapatite that was pulverized under the same conditions as the bone samples.
Amplification of 434 bp of the HVSI (positions 15914–9 according to reference sequence V00654) was generally conducted using a PCR primer set consisting of 6 primer pairs as in Bollongino et al. (BosU1/L1-BosU6/L6) with slight modifications.
PCR reactions were usually performed with 2.5 U AmpliTaq Gold® (Applied Biosystems), 1x PCR Gold Buffer (Applied Biosystems), 2 mM MgCl2 (Applied Biosystems), 0.2 mM dNTPs (Qiagen), 0.4 μg/μl BSA (Roche), 0.2 μM primer (Biospring) and HPLC-H2O (Acros Organics). Initial activation at 90°C for 6 min was followed by 50 cycles of denaturation (40 sec at 94°C), annealing (40 sec at 52-60°C), and elongation (40 sec at 72°C) in a Mastercycler gradient (Eppendorf). Blank controls were processed during each PCR. At least three independent PCRs from two different extracts were performed. Samples were sequenced on an ABI PRISM™ 3130 Genetic Analyzer (Applied Biosystems) using POP-6™ polymer (Applied Biosystems).
Sequences were analysed using the programs SeqMan™ and MegAlign™ (DNASTAR Lasergene® 7.1 and 8). At least three sequences obtained from independent PCRs from two independent DNA extractions per sample per primer pair were usually used to create a majority rule consensus sequence. For further details on ancient DNA work and sequencing including deviations from the general laboratory procedure described see Additional file 3.
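The majority-rule consensus step can be illustrated with a minimal Python sketch. It is not the SeqMan/MegAlign procedure used here; the function name, the treatment of gaps and the strict-majority threshold are assumptions made for illustration only.

```python
from collections import Counter


def majority_consensus(reads, ambiguous="N"):
    """Column-wise majority-rule consensus of equal-length, pre-aligned reads."""
    if not reads:
        raise ValueError("need at least one read")
    length = len(reads[0])
    if any(len(read) != length for read in reads):
        raise ValueError("reads must be aligned to the same length")
    consensus = []
    for column in zip(*reads):
        # Assumption: gaps and missing calls do not vote.
        votes = Counter(base for base in column if base not in "-N?")
        if not votes:
            consensus.append(ambiguous)
            continue
        top_base, top_count = votes.most_common(1)[0]
        # Accept a base only if more than half of the informative reads agree.
        if top_count * 2 > sum(votes.values()):
            consensus.append(top_base)
        else:
            consensus.append(ambiguous)
    return "".join(consensus)
```

With three replicate reads per fragment, a single deviating read (for example from post-mortem damage) is outvoted, whereas a one-to-one tie is flagged as ambiguous rather than resolved arbitrarily.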
Descriptive and summary statistics
All new and previously published ancient DNA sequences were subdivided into the following geographical groups: Iran/Syria, Western Anatolia, Southeastern Europe, Southeastern Central Europe, Italy, Southern France, Central/Western Europe, and Spain. These groups were further subdivided into chronological subgroups reflecting up to four different Neolithic and post-Neolithic periods per region (see Additional file 4 for detailed information on the groupings). Modern sequences were grouped according to their country of origin (also see Additional file 2).
For statistical analyses, all ancient sequences were cut to a 399 bp fragment to match the fragment sizes of previously published ancient sequences (positions 15,914-16,312 according to reference sequence GenBank V00654). Haplotype diversity, mean number of pairwise differences, Tajima’s D, Fu’s Fs and population pairwise FST were calculated using Arlequin. P values are based on 10,000 random permutations. The level of missing data allowed was adjusted in order to include all nucleotide positions even if there were gaps in some ancient sequences. Besides that, default values were used.
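As an illustration of the two diversity measures, the following Python sketch computes haplotype diversity with the standard unbiased estimator, n/(n-1) × (1 − Σp²), and the mean number of pairwise differences as the average Hamming distance over all sequence pairs. It is a simplified stand-in, not Arlequin's implementation: it assumes complete, equal-length sequences and ignores missing data, and the function names are hypothetical.

```python
from collections import Counter
from itertools import combinations


def haplotype_diversity(seqs):
    """Unbiased haplotype diversity: n/(n-1) * (1 - sum of squared frequencies)."""
    n = len(seqs)
    if n < 2:
        raise ValueError("need at least two sequences")
    freqs = [count / n for count in Counter(seqs).values()]
    return n / (n - 1) * (1 - sum(f * f for f in freqs))


def mean_pairwise_differences(seqs):
    """Average number of differing sites over all pairs (no missing data assumed)."""
    pairs = list(combinations(seqs, 2))
    if not pairs:
        raise ValueError("need at least two sequences")
    total = sum(sum(a != b for a, b in zip(s, t)) for s, t in pairs)
    return total / len(pairs)
```

A sample in which every individual carries the same haplotype gives 0 for both measures, which is the situation behind a 0.00/0.00 entry in a diversity table.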
The MDS (multidimensional scaling) plot is based on FST values calculated using Reynolds’ genetic distances and running 10,000 permutations in Arlequin. The MDS plot was created in R 2.14.2 using the packages MASS, plotrix and shape.
Correlation between genetic and geographical distances among defined populations/groups was assessed by a Mantel test under 9,999 random permutations using GENALEX 6.4. The Mantel test is based on FST values calculated using Reynolds’ genetic distances and running 10,000 permutations in Arlequin. Geographical coordinates were determined by eye as the centre of appropriate countries per group for the modern samples and the centre of all archaeological sites per group for the ancient samples.
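The logic of a Mantel test can be sketched in a few lines of Python. This is a generic one-tailed permutation version, not the GENALEX implementation: the function names, the seed and the choice of a one-tailed p-value are assumptions, and both inputs are taken to be symmetric square distance matrices.

```python
import random
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def upper_triangle(matrix, order):
    """Off-diagonal entries above the diagonal, with rows/columns relabelled by `order`."""
    n = len(order)
    return [matrix[order[i]][order[j]] for i in range(n) for j in range(i + 1, n)]


def mantel(genetic, geographic, permutations=9999, seed=0):
    """Return (r_obs, one-tailed p) for the matrix correlation."""
    identity = list(range(len(genetic)))
    geo = upper_triangle(geographic, identity)
    r_obs = pearson(geo, upper_triangle(genetic, identity))
    rng = random.Random(seed)
    hits = 0
    for _ in range(permutations):
        perm = identity[:]
        rng.shuffle(perm)
        # Permute population labels of one matrix, keeping rows and columns together.
        if pearson(geo, upper_triangle(genetic, perm)) >= r_obs:
            hits += 1
    return r_obs, (hits + 1) / (permutations + 1)
```

Labels are shuffled jointly for rows and columns because the pairwise distances are not independent observations; an ordinary correlation test on the flattened matrices would overstate significance.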
Coalescent simulations were performed using Bayes Serial SimCoal, by extending the model previously described in Bollongino et al. Similarly, we assume an intergeneration time of 6 years, an ancestral Near Eastern wild aurochs female effective population size of 45,000 and, again, a single domestication process of parameterized size N_D at time 8,500 years BCE (i.e. 1,750 generations BP). Following the domestication bottleneck, this Near Eastern population grows exponentially to a modern Near Eastern and Anatolian effective population size N_NE of 1,007,170 (see SI Bollongino et al.). At 6,400 years BCE (i.e. 1,400 generations BP) a proportion of the population P is allowed to migrate to form a new and separate European population, which then grows exponentially to a modern European effective population size N_E of 7,942,392 (additional simulations which allow both N_NE and N_E to vary by an order of magnitude are described further in Additional file 5). From the split time until 5,000 years BCE migration between the two populations is allowed at per generation rate M_E (‘early migration’), after which it is changed to rate M_L (‘late migration’). Prior values for N_D are drawn uniformly from the range 1 – 1,000, P uniformly from the range 0 – 1, and both migration parameters uniformly from the range 0 – 0.01. The mutation rate is fixed at 45% per million years, the posterior modal value previously estimated by Bollongino et al.
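The date conversions and prior ranges in this model description can be checked with a short Python sketch. Two points are assumptions rather than statements from the text: the reference year for "BP" is taken as 2000 CE because that reproduces the quoted generation counts, and the founder size is drawn as an integer. The dictionary keys are illustrative labels for the founder size, the migrating proportion and the early and late migration rates.

```python
import random

GENERATION_YEARS = 6   # intergeneration time stated in the text
PRESENT_CE = 2000      # assumption: reference year that reproduces the quoted values


def bce_to_generations_bp(year_bce):
    """Convert a calendar date (years BCE) into generations before present."""
    return (year_bce + PRESENT_CE) / GENERATION_YEARS


def draw_priors(rng):
    """One draw from the uniform priors described in the text."""
    return {
        "N_D": rng.randint(1, 1000),    # founder size (integer draw is an assumption)
        "P": rng.uniform(0.0, 1.0),     # proportion founding the European population
        "M_E": rng.uniform(0.0, 0.01),  # migration rate, split until 5,000 BCE
        "M_L": rng.uniform(0.0, 0.01),  # migration rate after 5,000 BCE
    }


domestication = bce_to_generations_bp(8500)  # 1750.0 generations BP
split = bce_to_generations_bp(6400)          # 1400.0 generations BP
```

Under the same convention, the switch from early to late migration at 5,000 BCE falls at roughly 1,167 generations BP.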
We used the above-mentioned 597 previously published modern sequences of 240 bp length (positions 16,023-16,262 according to reference sequence GenBank V00654) and cut the ancient sequences accordingly. The resulting 790 sequences were grouped into 4 sample groups: ancient Near Eastern and Anatolian (n = 24), ancient European (n = 169), modern Near Eastern and Anatolian (n = 100) and modern European (n = 497). We calculated 5 within- and 2 between-sample summary statistics (total = 32, also see Additional file 5 for details), and used approximate Bayesian computation (ABC) to estimate parameter values.
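The stated total of 32 summary statistics is consistent with computing each within-sample statistic once per group and each between-sample statistic once per pair of groups. The Python sketch below checks that arithmetic; the per-group and per-pair interpretation is an assumption (the details are in Additional file 5), and the group labels are shorthand.

```python
from itertools import combinations

# Sample sizes of the four groups as given in the text.
groups = {
    "ancient Near East/Anatolia": 24,
    "ancient Europe": 169,
    "modern Near East/Anatolia": 100,
    "modern Europe": 497,
}

n_within = 5 * len(groups)                          # 5 statistics per group
n_between = 2 * len(list(combinations(groups, 2)))  # 2 statistics per pair of groups
total_statistics = n_within + n_between
total_sequences = sum(groups.values())
```

Four groups give six pairs, so 20 within-sample and 12 between-sample values add up to 32, and the group sizes sum to the 790 sequences used.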
Out of 150 newly analysed bones and teeth from prehistoric domesticated cattle, 113 yielded replicable and highly reliable mitochondrial HVR1 sequences, constituting a success rate of 75.3%. The sequences have been deposited in GenBank [GenBank: KF307209 to KF307322]. None of the blank controls contained amplifiable amounts of bovine DNA (for further detailed discussion of the validity of the ancient DNA data see Additional file 6). The successfully analysed samples come from Bosnia-Herzegovina (3 of 5), Bulgaria (52 of 68), France (15 of 19), Germany (4 of 4), Italy (5 of 14), Romania (15 of 16), Syria (1 of 1), and Turkey (18 of 23).
Using the nomenclature of Achilli et al. , all sequences belong exclusively to lineages from haplogroups that have previously been defined in present-day European domesticated cattle, namely T3 (n = 70), Q (n = 33), T2 (n = 6) and T, T5 or T1’2’3 (n = 4). None of them belongs to a specific mtDNA motif referred to as haplogroup P that is dominating in the indigenous aurochs population of Europe [26,27,42]. It is of note that the high frequency of haplogroup Q in ancient Southeastern Europe (between 50% and 29% in 5,500-5,000 BCE and 2,700-2,200 BCE, respectively) does not match present-day haplogroup distributions of taurine cattle from Europe, and particularly from the same area (combined frequency for T and Q in present-day Balkan and Greece: 1.5 - 2% ). It is also markedly higher than in all other ancient European groups (e.g. only 4% in Central/Western Europe (5,400-4,400 BCE)). See Additional file 4 for detail on haplogroup composition and frequency of haplogroup Q across the 13 spatiotemporal groups.
There are 35 different mitochondrial lineages in the 193 prehistoric individuals, eight of which occur more than once in the dataset. Non-unique haplotypes (H) were named according to their haplogroup and numbered consecutively (H1-H8). Additional files 4 and 7 provide a detailed overview on the distribution of haplogroups and shared and unique haplotypes across the 13 spatiotemporal groups. Only haplotypes called T3_H1, Q_H4, and T2_H7 occur more than twice in the dataset (114, 37, and 5 times, respectively), with T3_H1 also being predominant in present-day taurine cattle. Haplotype T3_H1 occurs in all of the ancient 13 spatiotemporal groups, Q_H4 in all except Spain 2,700-1,600 BCE and Southern France 5,500-4,500 BCE. It is of note that Q_H5, T_H6, and T2_H7 are restricted to the geographical groups of Iran/Syria and Southeastern Europe (Q_H5 in Iran/Syria 4,000-1,400 BCE and Southeastern Europe 6,200-5,500 BCE; T_H6 in Iran 7,000-5,000 BCE and Southeastern Europe 2,700-2,200 BCE; T2_H7 in Iran 7,000-5,000 BCE and Southeastern Europe 5,500-5,000 BCE).
Genetic distances between cattle populations
The MDS plot (Figure 1) reveals a pattern that separates three geographical groups: the four Southeastern European groups cluster with the one from Western Anatolia, the two groups from Iran/Syria lie close to each other, and so do the two from Central/Western Europe. However, Southern France and Central/Western Europe are isolated from all other groups and from each other.
Subgroups comprising only the earliest Neolithic cattle of each geographical group were used to further evaluate the influence of sample age and geographical location on genetic distances. Figure 2 maps significant pairwise FST values between the resulting eight groups. The greatest genetic distances can be observed between Iran 7,000-5,000 BCE and the groups from Central/Western Europe 5,400-4,400 BCE and Southern France 5,500-4,500 BCE with values of 0.47 and 0.40, respectively. These groups also show the greatest geographical distances. The second highest FST values occur between Southeastern Europe 6,200-5,500 BCE and Central/Western Europe 5,400-4,400 BCE and between Southeastern Europe 6,200-5,500 BCE and Southern France 5,500-4,500 BCE (0.27 and 0.29 respectively). In comparison, the genetic distance between Iran 7,000-5,000 BCE and Southeastern Central Europe 5,100-4,000 BCE is – despite greater geographical distance - slightly lower (0.23). The FST between Iran 7,000-5,000 BCE and Southeastern Europe 6,200-5,500 BCE is even smaller (0.17). The geographically adjacent groups of Southeastern Europe 6,200-5,500 BCE and 5,500-5,000 BCE and Southeastern Central Europe 5,100-4,000 BCE reveal a distance as high as 0.16 and 0.10, respectively.
A Mantel Test on the basis of Reynolds’ FST resulted in a strong positive correlation (Rxy: 0.75, P-value 0.001) between geographical and genetic distance among the eight groups. Approximately 56% of the variation can be explained by geographical distance (R2 = 0.56). There is a weaker correlation of genetic and geographical distances in modern samples (Rxy: 0.54, P-value 0.002). Here, only 29% of the variation can be explained by geographical distance (R2 = 0.29). Complete population pairwise FST matrices can be found in Additional file 8.
Measurements of molecular diversity (Ĥ, π), Tajima’s D and Fu’s Fs
The estimates of haplotype diversity (Ĥ), the mean number of pairwise differences (π), Tajima’s D, and Fu’s Fs are given in Table 1.
Haplotype diversity clearly decreases in a southeast to northwest direction with Iran 7,000-5,000 BCE (0.96) at the high end, and Southern France 5,500-4,500 BCE (0.00) and Central/Western Europe 5,400-4,400 BCE (0.22) at the low end. The haplotype diversity of the earliest domesticated cattle on the European continent in Southeastern Europe 6,200-5,500 BCE (0.62) is much lower than in Iran, and comparable to Western Anatolia 6,400-5,700 BCE (0.64), but higher than in the geographically close European group of Southeastern Central Europe 5,100-4,000 BCE (0.52). Again, diversity in Central/Western Europe 5,400-4,400 BCE is substantially lower (0.22). Following the northern Mediterranean coast, the values also drop sequentially from Western Anatolia 6,400-5,700 BCE (0.64) to Italy 6,000-5,500 BCE (0.40) to Southern France 5,500-4,500 BCE (0.00).
Haplotype diversity estimates increase with time in the two regions where samples from two Neolithic periods are available: from 0.22 to 0.50 in Central/Western Europe and from 0.62 to 0.78 in Southeastern Europe. In Southeastern Europe, haplotype diversity remains the same in the subsequent Chalcolithic group 5,000-4,000 BCE (0.78), but increases again during the Bronze Age 2,700-2,200 BCE (0.81). Similar patterns are observed when considering the mean numbers of pairwise differences. The Neolithic subgroups also show a tendency of decreasing values with distance from Iran. Regionally, the values increase with time; in Central/Western Europe from 0.45 to 0.91 and in Southeastern Europe from 1.26 to 1.79 to 2.11. In the youngest Southeastern European group (2,700-2,200 BCE) the value drops again (0.54). In Central/Western Europe, where both diversity indices increase with time, Tajima’s D is also significantly negative (Fu’s Fs only in the younger group). This is not the case for Southeastern Europe. Diversity estimates of the 597 modern cattle sequences only show a slight tendency towards an east to west gradient for both haplotype diversity, and the mean number of pairwise differences. Tajima’s D and Fu’s Fs are mostly significantly negative. All diversity estimates and graphical visualisations of chronological and geographical diversity trends can be found in the Additional file 9.
We performed 5 million coalescent simulations under the demographic model described above, and used a tolerance proportion of 0.1%, meaning that we retained the 5,000 best parameter sets. Figure 3 shows the joint posterior density of parameters N_D and P (marginal to the remaining two), with the joint mode found at N_D = 81 and P = 0.73. The marginal modal value for N_D was 92 (95% credible interval: 29 – 783). Marginal densities for the two migration parameters M_E and M_L are given in Figure 4. While it is not possible to infer much from the relatively uninformative posterior for M_E (top, mode 0.00694; 95% CI: 0.00033 – 0.00974), we are able to say that migration between the Near East and Europe (M_L) appears to have been greatly reduced, essentially to zero, in the period after 5,000 years BCE (bottom, mode 0.00022; 95% CI: 0.00001 – 0.00946). Further simulations were performed in order to test the sensitivity of these parameter estimations to our assumed fixed values of N_NE and N_E. Increasing or decreasing both N_NE and N_E by an order of magnitude produced estimates that did not significantly differ from those given above (see Additional file 5 for details).
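The retention step described here, keeping the fraction of simulations whose summary statistics lie closest to the observed ones, is the core of rejection-based ABC. The Python sketch below shows that step on a toy problem. It is not the Bayes Serial SimCoal pipeline: the simulator, the prior and the unscaled Euclidean distance are hypothetical stand-ins.

```python
import random
from math import sqrt


def abc_rejection(observed, simulate, draw_prior, n_sims, tolerance, seed=0):
    """Keep the parameter draws whose simulated statistics lie closest to the data."""
    rng = random.Random(seed)
    scored = []
    for _ in range(n_sims):
        theta = draw_prior(rng)
        stats = simulate(theta, rng)
        # Plain Euclidean distance; real analyses usually rescale each statistic first.
        dist = sqrt(sum((s - o) ** 2 for s, o in zip(stats, observed)))
        scored.append((dist, theta))
    scored.sort(key=lambda pair: pair[0])
    n_keep = max(1, round(n_sims * tolerance))
    return [theta for _, theta in scored[:n_keep]]


# Toy stand-ins (hypothetical): one parameter with a uniform prior, one noisy statistic.
def toy_prior(rng):
    return rng.uniform(0.0, 1.0)


def toy_simulate(theta, rng):
    return [theta + rng.gauss(0.0, 0.05)]


accepted = abc_rejection([0.3], toy_simulate, toy_prior, n_sims=2000, tolerance=0.01)
```

The retained draws approximate the posterior; with 5 million simulations and a tolerance of 0.1%, the same rule keeps 5,000 parameter sets, from which modes and credible intervals are then read off.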
The domestication process of taurine cattle
Using an ancient (n = 193) and modern mtDNA dataset (n = 597) of domesticated cattle from the Near East, Anatolia and Europe for coalescent simulation and approximate Bayesian computation, we inferred a (joint) posterior mode of 81 female founder individuals at the beginning of the domestication process. This result is consistent with the previous estimate based on 15 ancient Iranian and 27 modern Near Eastern and Anatolian cattle , and demonstrates that this initial finding of a very strong Near Eastern bottleneck is robust even with a greatly expanded continent-wide data set, and is not biased by theoretically possible subsequent introgression from aurochs populations outside the Near East. It can therefore be concluded that domestic cattle indeed have a discrete and rather localised origin, very likely in Southeastern Anatolia and the Near East, a view that is consistent with a huge body of archaeozoological evidence from the 9th millennium BCE [44-46].
Subsequent to the first domestication phase, the ancient DNA data, together with archaeological evidence, point to an intermittent expansion scenario. Expanding from Southeastern Anatolia, cattle reached Western Anatolia and the Aegean not before 7,000 BCE. From here, they spread simultaneously across Southeastern Europe and along the Mediterranean coast into Central, Northern, Southwestern, and Western Europe. In essence, the observed strong correlation between genetic and geographical distances together with decreasing genetic diversity roughly in a southeast to northwest and southwest direction is consistent with the idea of serial dilution of diversity by a series of recurring founder events. The oldest (Neolithic) groups with the greatest geographical distance from each other, namely Iran and Central/Western Europe and Southern France, show the highest FST values (0.47 and 0.4, respectively). Smaller genetic distances are observed between more adjacent areas, such as between Iran and Western Anatolia and between Iran and Southeastern Europe (0.11 and 0.17, respectively). Other statistics are equally consistent with the serial dilution model: Neolithic cattle from Iran yield the highest value for haplotype diversity in the whole dataset (0.96). Haplotype diversity consistently decreases along the proposed two main Neolithisation routes, with the lowest values in remote areas, i.e. in Neolithic Central/Western Europe and Southern France (0.22 and 0.00, respectively), while intermediate values are observed in between.
Alternative scenarios of secondary domestications or traceable female gene-flow from wild aurochs in Europe have been discussed several times in the literature [29,47-51]. The arguments are mainly based on scarce findings of the mtDNA haplogroup P, predominant in European aurochs, in the domesticated stock [49,50] on the one hand, and the presence of mtDNA lineages in pre-Neolithic Italian aurochs that resemble those of the imported domesticated animals [29,48] on the other, thereby impeding the detection of introgression by mere comparison of haplogroup composition. However, realistic expectations under such models would also include i) a larger inferred founder population due to introgressions of diverse aurochs lineages and ii) significant deviations from the serial dilution of genetic diversity model. Neither has been observed in, or can be inferred from, the data presented here. Detection of potential introgression of Italian aurochs through time deserves further attention, e.g. by expanding the existing dataset to encompass finds from diverse archaeological sites and later chronological phases. However, the existing dataset from the rest of Europe suggests that introgressions of local genes into the imported domestic cattle populations are rare and geographically restricted exceptions, or come from male aurochs. Separate independent domestication(s) of European aurochs can be excluded with near certainty.
The strict separation of domestic cattle from their wild European relatives is very different to what can be observed in other animals. For example, pigs were imported to Europe in a similar way to cattle, but after a few centuries all their mitochondrial lineages were replaced through admixture with local wild boar [52,53].
The first domesticated cattle in Europe
The summary statistical patterns described here may be partly biased by the fact that the analysed data come from heterochronous and spatially diverse samples. Therefore, we used coalescent simulations to estimate the key parameters of taurine cattle population history upon their arrival in Europe in a realistic evolutionary demographic framework.
Our model suggests that a high proportion (73%) of domesticated cattle in Anatolia and the Near East may have migrated into Europe. This indicates that the expansion into Europe was a far less severe bottleneck than assumed, and that much of the variation present in the original Anatolian/Near Eastern population survived in initial European cattle populations. Consistent with this, the Western Anatolian and Southeastern European sample groups constitute a cluster in the MDS plot (Figure 1). However, Southern France and Central/Western Europe instead are clearly separated, very likely reflecting genetic diversification along the two main Neolithisation routes. It is noteworthy that the data from Western Anatolia, Southern Italy, and Southern France come from very few sites with less than ten samples each (eight, five, and eight, respectively) and therefore have to be evaluated cautiously. However, the drastic decline in haplotype diversity and mean number of pairwise differences from 0.64/0.80 and 0.40/0.79 in Western Anatolia and Italy to 0.00/0.00 in Southern France is a good fit to a scenario of only few individuals being transported by boat to the Northwestern Mediterranean coast [3,55]. Low diversity estimates are also congruent with the fact that cattle did not play a major role in the domesticated faunal spectrum of Neolithic economies from Mediterranean Europe (Impressa and Cardial), in contrast to Neolithic Cultures in Central Europe, where domesticated cattle were generally well represented [56,57].
Tracing the spread of cattle through the European mainland, there are patterns in the data that point to significant demographic changes connected to the expansion of the Neolithic culture from Southeastern to Central/Western Europe. The genetic distance between Southeastern Europe (6,200-5,500 BCE) and Central/Western Europe (5,400-4,400 BCE) is unexpectedly high (0.27). To put this high value into context: the FST values between Iran (7,000-5,000 BCE), Southeastern Europe (6,200-5,500 BCE) and Italy (6,000-5,500 BCE) are much lower (0.17 and 0.15, respectively) despite larger geographic distances. A good indicator for this massive demographic change is that the frequency of the mitochondrial Q-lineage drops from 50% in Southeastern Europe (6,200-5,500 BCE) to 4% in Central/Western Europe (5,400-4,400 BCE). Haplotype diversity decreases drastically from 0.62 to 0.22. There are several additional lines of evidence that point to the region between Southeastern Europe and Central Europe as a kind of core area where the Neolithic idea was re-consolidated: i) From archaeology: The Linearbandkeramik culture (LBK, Linear Pottery culture) developed here and spread rapidly over Central Europe starting around 5,600 BCE; ii) From palaeogenetics: A migration of farmers from Southeastern to Central Europe has been inferred using ancient mtDNA; iii) From gene-culture coevolutionary modelling: Spatially-explicit computer simulations of the spread of an allele associated with lactase persistence in humans (i.e. the ability to digest milk sugar as an adult) point to this area as the place where positive selection started affecting the frequency of this allele in dairying cultures. We therefore suggest that the observed substantial loss of genetic diversity and the increasing genetic distance in prehistoric cattle are the result of a significant founder event along with the spread of the LBK. It probably coincides with a major wave of human migration and is followed by a period of intensified cattle breeding resulting in a rising importance of dairying. This picture becomes even more comprehensive when we look at how patterns change after the early Neolithic.
After the arrival
Cattle herding becomes more and more important with the onset of the LBK. A few centuries later, cattle bones constitute up to 70% of all domesticated animal bones in faunal assemblages in Central Europe, a value that stays roughly the same for most of the subsequent millennia with some regional fluctuations [56,57]. Accordingly, significantly negative Tajima’s D and Fu’s Fs values in Neolithic Central/Western Europe and in the majority of the modern sample groups point to extended periods of population growth (see Table 1 and Additional file 9).
Interestingly, there is no indication for population growth in Southeastern Europe. The observed diachronic increase in haplotype diversity in the Southeastern European sample groups appears in tandem with new, previously absent mitochondrial haplotypes (also see Additional file 7). It is of note that two of these new haplotypes (T_H6 and T2_H7) are present here and in the Iranian Neolithic sample but not elsewhere.
According to our demographic modelling, migration between Anatolia/the Near East and Europe was greatly reduced, essentially to zero, in the period after 5,000 BCE. We should expect accurate estimation of the level of early migration between 6,400 and 5,000 BCE to be difficult, as it is somewhat confounded by the proportion of cattle, P, moving to Europe at the time of the split (indeed the two parameters are very slightly negatively correlated; degree r = -0.07, p = 0.0002). However, there is clear support for at least some level of migration during this early period, as the estimated modal migration rate is clearly greater than 0.
We therefore suggest that a probable underlying scenario for our observations is one of continuous gene-flow into Europe following the initial colonization at around 6,400 BCE. This scenario also fits in with archaeological evidence for accelerated westward acculturation occurring in the first half of the 6th millennium BCE [6,60]. This early phase was followed by almost total isolation between the European and Anatolian/Near Eastern cattle populations after 5,000 BCE.
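The logic behind such parameter estimates can be illustrated with a rejection-sampling form of approximate Bayesian computation. The Python sketch below is a deliberately simplified toy and not the study's model: the serial coalescent simulator is replaced by a binomial stand-in, the parameter `theta` is a generic quantity rather than the migration rate itself, and all function names, the prior and the tolerance are assumptions chosen for illustration.

```python
import random

def simulate_summary(theta, n_obs, rng):
    """Toy simulator: fraction of successes in n_obs Bernoulli(theta) trials."""
    return sum(rng.random() < theta for _ in range(n_obs)) / n_obs

def abc_rejection(observed, n_obs, n_draws, tolerance, seed=1):
    """Keep prior draws whose simulated summary lies within tolerance of the observed one."""
    rng = random.Random(seed)
    accepted = []
    for _ in range(n_draws):
        theta = rng.random()  # draw from a Uniform(0, 1) prior (assumed)
        if abs(simulate_summary(theta, n_obs, rng) - observed) <= tolerance:
            accepted.append(theta)
    return accepted

# Hypothetical observed summary statistic and settings
posterior_sample = abc_rejection(observed=0.3, n_obs=50, n_draws=5000, tolerance=0.025)
posterior_mean = sum(posterior_sample) / len(posterior_sample)
```

The accepted draws approximate the posterior distribution of the parameter. A modal value or credible interval taken from such a sample corresponds, in spirit, to the kind of estimates reported here for the founder population size and the migration rate.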
It is of note that this pattern has changed again in later periods. The pattern of decreasing diversity in the direction of the Neolithic expansion and the correlation of genetic and geographical distances are considerably weaker in modern-day cattle breeds than in the Neolithic. It is not yet clear to what extent human migrations from the East, as postulated for the Bronze Age, influenced the already existing cattle stock in Europe. However, the fading geographical patterns are likely mirroring more recent demographic changes and founder events, such as global trade, exceptional selection pressure on particular high performance breeds and replacement of traditional breeds. Thus, the present study explicitly underlines that ancient demographic and evolutionary processes in selectively bred animals can only be uncovered by using ancient DNA data.
Overall, palaeogenetic together with archaeological and archaeozoological data strongly support the following scenario: taurine cattle were domesticated in a region between Southeastern Anatolia and the Zagros Mountains, Syria and the Lebanon. The domestication process started in the mid-9th millennium BCE, with a small effective number of wild female aurochs (estimated modal value of 81). After 7,000 BCE, domestic cattle populations were transported from the Central Anatolian plateau to Western Anatolia and the Aegean. Much of the original Anatolian and Near Eastern variation (approximately 73%) survived in the first Neolithic cattle that were introduced to Europe around 6,400 BCE. Despite some evidence for subsequent gene-flow with Anatolia and the Near East between 6,400 and 5,000 BCE, most of the initial genetic diversity was lost as cattle spread through Europe along with the Neolithic transition. Via the Mediterranean trajectory, migrating farmers reached, for example, Southern Italy, Northern Africa, the Tyrrhenian Islands, Southern France and the Iberian Peninsula by boat. The low genetic diversity observed in the few genetic data available from these regions points to a significantly low effective population size of cattle arriving in the Western Mediterranean. Along the second trajectory across the European mainland and without major signs of introgression from wild aurochs, cattle finally reached Central, Western (after 5,500 BCE) and Northern Europe (after 4,100 BCE). Here too, much of the genetic diversity was lost during the move, particularly when cattle were brought to Central Europe by LBK farmers.
Gene-flow between Europe and Anatolia and the Near East appears to have been reduced, essentially to 0, after around 5,000 BCE. In modern breeds, however, the genetic effects of the inferred migratory patterns and geographical diversification become far less pronounced, probably due to selective breeding and trade of high performance cows in very recent times.
In summary, the genetic prehistory of domestic cattle seems to consist of a small, localised domestication process, followed by a relatively straightforward series of spasmodic expansion episodes resulting in a serial dilution of genetic diversity from the Near East to Western and Northern Europe. Future genomic multi-locus studies of ancient DNA from prehistoric periods will hopefully add greater detail to this picture, particularly by incorporating the potentially divergent demography of male cattle.
Özdoğan M. The expansion of the neolithic way of life: What we know and what we do not know. In: How did farming reach Europe? Edited by Lichter C, vol. 2. Istanbul: Byzas; 2005. p. 13–27.
Özdoğan M. Archaeological Evidence on the Westward Expansion of Farming Communities from Eastern Anatolia to the Aegean and the Balkans. Curr Anthropol. 2011;52(S4):S415–30.
Vigne J-D. Zooarchaeological aspects of the Neolithic diet transition in the Near East and Europe, and their putative relationships with the Neolithic Demographic Transition. In: Bocquet-Appel J-P, Bar-Yosef O, editors. The Neolithic Demographic Transition and its Consequences. New York: Springer Verlag; 2008. p. 179–205.
Conolly J, Colledge S, Dobney K, Vigne J-D, Peters J, Stopp B, et al. Meta-analysis of zooarchaeological data from SW Asia and SE Europe provides insight into the origins and spread of animal husbandry. J Archaeol Sci. 2011;38(3):538–45.
Özdoğan M. An alternative approach in tracing changes in demographic composition. The westward expansion of the neolithic way of life. In: Bocquet-Appel J-P, Bar-Yosef O, editors. The neolithic demographic transition and its consequences. Berlin: Springer; 2008. p. 139–78.
Düring BS. The prehistory of Asia Minor : from complex hunter-gatherers to early urban societies. Cambridge: Cambridge University Press; 2011.
Çilingiroğlu A, Cevik O, Çilingiroğlu C. Ulucak Höyük: Towards understanding the early farming communities of Middle West Anatolia: Contribution of Ulucak. In: Özdoğan M, Başgelen N, Kuniholm P, editors. The Neolithic in Turkey, Western Turkey. Istanbul: Archaeology & Art Publications; 2012. p. 139–75.
Reingruber A. Die deutschen Ausgrabungen auf der Agrissa-Magula in Thessalien II. Die Agrissa Magula. In: Beiträge zur ur- und frühgeschichtlichen Archäologie des Mittelmeer-Kulturraums, Hauptmann H, editors. Das frühe und das beginnende mittlere Neolithikum im Lichte transägäischer Beziehungen. Bonn: Dr. Rudolf Habelt GmbH; 2008.
Lüning J. Steinzeitliche Bauern in Deutschland – die Landwirtschaft im Neolithikum. Bonn: Dr. Rudolf Habelt GmbH; 2000.
Guilaine J. De la vague à la tombe, La conquête néolithique de la Méditerranée (8000-2000 avant J.-C.). Paris: Le Seuil; 2003.
Tresset A, Vigne J-D. Last hunter-gatherers and first farmers of Europe. C R Biol. 2011;334(3):182–9.
Ammerman AJ, Cavalli-Sforza LL. The Neolithic transition and the genetics of populations in Europe. Princeton, Guildford: Princeton University Press; 1984.
Zvelebil M, Zvelebil KV. Agricultural transition and Indo-European dispersals. Antiquity. 1988;62(236):574–83.
Ammerman AJ. On the Neolithic Transition in Europe - a Comment. Antiquity. 1989;63(238):162–5.
Zvelebil M. On the Transition to Farming in Europe, or What Was Spreading with the Neolithic - a Reply. Antiquity. 1989;63(239):379–83.
Whittle AWR. Europe in the Neolithic : the creation of new worlds. Cambridge: Cambridge University Press; 1996.
Pinhasi R, Thomas MG, Hofreiter M, Currat M, Burger J. The genetic history of Europeans. Trends Genet. 2012;28(10):496–505.
Bramanti B, Thomas MG, Haak W, Unterlaender M, Jores P, Tambets K, et al. Genetic discontinuity between local hunter-gatherers and central Europe's first farmers. Science. 2009;326(5949):137–40.
Skoglund P, Malmstrom H, Raghavan M, Stora J, Hall P, Willerslev E, et al. Origins and genetic legacy of Neolithic farmers and hunter-gatherers in Europe. Science. 2012;336(6080):466–9.
Lazaridis I, Patterson N, Mittnik A, Renaud G, Mallick S, Kirsanow K, et al. Ancient human genomes suggest three ancestral populations for present-day Europeans. Nature. 2014;513(7518):409–13.
Tresset A, Bollongino R, Edwards CJ, Hughes S, Vigne J-D. Early diffusion of domestic bovids in Europe: An indicator for human contact, exchanges and migrations? In: Hombert JM, Errico F, editors. Becoming eloquent, advances in the emergence of language, human cognition, and modern cultures. Amsterdam: John Benjamins Publ. Comp; 2009. p. 69–90.
Larson G, Burger J. A population genetics view of animal domestication. Trends Genet. 2013;29(4):197–205.
Bollongino R, Burger J, Powell A, Mashkour M, Vigne J-D, Thomas MG. Modern Taurine Cattle descended from small number of Near-Eastern founders. Mol Biol Evol. 2012. doi:10.1093/molbev/mss1092.
Bollongino R, Edwards CJ, Alt KW, Burger J, Bradley DG. Early history of European domestic cattle as revealed by ancient DNA. Biol Lett. 2006;2(1):155–9.
Anderung C, Bouwman A, Persson P, Carretero JM, Ortega AI, Elburg R, et al. Prehistoric contacts over the Straits of Gibraltar indicated by genetic analysis of Iberian Bronze Age cattle. Proc Natl Acad Sci U S A. 2005;102(24):8431–5.
Edwards CJ, Bollongino R, Scheu A, Chamberlain A, Tresset A, Vigne J-D, et al. Mitochondrial DNA analysis shows a Near Eastern Neolithic origin for domestic cattle and no indication of domestication of European aurochs. Proc R Soc B Biol Sci. 2007;274(1616):1377–85.
Scheu A, Hartz S, Schmoelcke U, Tresset A, Burger J, Bollongino R. Ancient DNA provides no evidence for independent domestication of cattle in Mesolithic Rosenhof, Northern Germany. J Archaeol Sci. 2008;35(5):1257–64.
Bollongino R, Elsner J, Vigne J-D, Burger J. Y-SNPs do not indicate hybridisation between European aurochs and domestic cattle. PLoS One. 2008;3(10), e3418.
Beja-Pereira A, Caramelli D, Lalueza-Fox C, Vernesi C, Ferrand N, Casoli A, et al. The origin of European cattle: evidence from modern and ancient DNA. Proc Natl Acad Sci U S A. 2006;103(21):8113–8.
Troy CS, MacHugh DE, Bailey JF, Magee DA, Loftus RT, Cunningham P, et al. Genetic evidence for Near-Eastern origins of European cattle. Nature. 2001;410(6832):1088–91.
Excoffier L, Lischer HE. Arlequin suite ver 3.5: a new series of programs to perform population genetics analyses under Linux and Windows. Mol Ecol Resour. 2010;10(3):564–7.
R Development Core Team. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing; 2012. ISBN: 3-900051-07-0. http://www.R-project.org/.
Venables WN, Ripley BD. Modern Applied Statistics with S. 4th ed. New York: Springer Verlag; 2002.
Lemon J. Plotrix: a package in the red light district of R. R-News. 2006;6(4):8–12.
Soetaert K. Shape: Functions for plotting graphical shapes, colors. R package version 1.3.4. 2011. http://CRAN.R-project.org/package=shape.
Mantel N. The detection of disease clustering and a generalized regression approach. Cancer Res. 1967;27:209–20.
Peakall R, Smouse PE. GENALEX 6: genetic analysis in Excel. Population genetic software for teaching and research. Mol Ecol Notes. 2006;6(1):288–95.
Anderson CN, Ramakrishnan U, Chan YL, Hadly EA. Serial SimCoal: a population genetics model for data from multiple populations and points in time. Bioinformatics. 2005;21(8):1733–4.
MacEachern S, Hayes B, McEwan J, Goddard M. An examination of positive selection and changing effective population size in Angus and Holstein cattle populations (Bos taurus) using a high density SNP genotyping platform and the contribution of ancient polymorphism to genomic diversity in Domestic cattle. BMC Genomics. 2009;10:181.
Beaumont MA, Zhang W, Balding DJ. Approximate Bayesian computation in population genetics. Genetics. 2002;162(4):2025–35.
Achilli A, Bonfiglio S, Olivieri A, Malusa A, Pala M, Hooshiar Kashani B, et al. The multifaceted origin of taurine cattle reflected by the mitochondrial genome. PLoS One. 2009;4(6), e5753.
Gravlund P, Aaris-Sorensen K, Hofreiter M, Meyer M, Bollback JP, Noe-Nygaard N. Ancient DNA extracted from Danish aurochs (Bos primigenius): genetic diversity and preservation. Ann Anat. 2012;194(1):103–11.
Lenstra J, Ajmone-Marsan P, Beja-Pereira A, Bollongino R, Bradley D, Colli L, et al. Meta-Analysis of Mitochondrial DNA Reveals Several Population Bottlenecks during Worldwide Migrations of Cattle. Diversity. 2014;6(1):178–87.
Peters J, von den Driesch A, Helmer D. The upper Euphrates-Tigris basin: Cradle of agro-pastoralism? In: Vigne J-D, Helmer D, editors. The first steps of animal domestication New archaeological approaches Proceedings of the 9th ICAZ Conference, Durham 2002. Oxford: Oxbow Books; 2005. p. 96–124.
Helmer D, Gourichon L, Monchot H, Peters J, Segui MS. Identifying early domestic cattle from Pre-Pottery Neolithic sites on the Middle Euphrates using sexual dimorphism. In: Vigne J-D, Peters J, Helmer D, editors. The first steps of animal domestication New archaeological approaches Proceedings of the 9th ICAZ Conference, Durham 2002. Oxford: Oxbow Books; 2005. p. 86–95.
Hongo H, Pearson J, Öksük B, Ilgezdi G. The process of ungulate domestication at Çayönü, Southeastern Turkey: A multidisciplinary approach focusing on Bos sp. and Cervus elaphus. Anthropozoologica. 2009;44:63–73.
Achilli A, Olivieri A, Pellecchia M, Uboldi C, Colli L, Al-Zahery N, et al. Mitochondrial genomes of extinct aurochs survive in domestic cattle. Curr Biol. 2008;18(4):R157–8.
Mona S, Catalano G, Lari M, Larson G, Boscato P, Casoli A, et al. Population dynamic of the extinct European aurochs: genetic evidence of a north-south differentiation pattern and no evidence of post-glacial expansion. BMC Evol Biol. 2010;10:83.
Schibler J, Elsner J, Schlumbaum A. Incorporation of aurochs into a cattle herd in Neolithic Europe: single event or breeding? Sci Rep. 2014;4:5798.
Stock F, Edwards CJ, Bollongino R, Finlay EK, Burger J, Bradley DG. Cytochrome b sequences of ancient cattle and wild ox support phylogenetic complexity in the ancient and modern bovine populations. Anim Genet. 2009;40(5):694–700.
Bonfiglio S, Achilli A, Olivieri A, Negrini R, Colli L, Liotta L, et al. The enigmatic origin of bovine mtDNA haplogroup R: sporadic interbreeding or an independent event of Bos primigenius domestication in Italy? PLoS One. 2010;5(12), e15760.
Ottoni C, Flink LG, Evin A, Geörg C, De Cupere B, Van Neer W, et al. Pig domestication and human-mediated dispersal in western Eurasia revealed through ancient DNA and geometric morphometrics. Mol Biol Evol. 2013;30(4):824–32.
Geörg C. Paläopopulationsgenetik von Schwein und Schaf in Südosteuropa und Transkaukasien. ForschungsCluster1. vol. 9. Rahden/Westf: Verlag Marie Leidorf GmbH; 2013.
Depaulis F, Orlando L, Hanni C. Using Classical Population Genetics Tools with Heterochroneous Data. Time Matters! PLoS One. 2009;4(5), e5541.
Guilaine J, Manen C, Vigne J-D. Pont de Roque-Haute (Portiragnes, Hérault). Nouveaux regards sur la néolithisation de la France méditerranéenne. Toulouse: Archives d’Ecologie Préhistorique; 2007.
Benecke N. Der Mensch und seine Haustiere. Die Geschichte einer jahrtausendealten Beziehung. Stuttgart: Theiss; 1994.
Tresset A, Vigne J-D. La chasse, principal élément structurant la diversité des faunes archéologiques du Néolithique ancien, en Europe tempérée comme en Méditerranéenne: tentative d'interprétation fonctionnelle. In: Arbogast RM, Jeunesse C, Schibler J, editors. Rôle et statut de la chasse dans le Néolithique ancien danubien (5500-4900 av J-C) Actes Premières rencontres danubiennes de Strasbourg, 20-21 nov 96. Rahden/Westf: Marie Leidorf; 2001. p. 129–51.
Pavúk J. Typologische Geschichte der Linearbandkeramik. In: Lüning J, Frirdich C, Zimmermann A, editors. Die Bandkeramik im 21 Jahrhundert: Symposium in der Abtei Brauweiler bei Köln 2002. Rahden/Westf: Verlag Marie Leidorf GmbH; 2005. p. 17–39.
Itan Y, Powell A, Beaumont MA, Burger J, Thomas MG. The origins of lactase persistence in Europe. PLoS Comput Biol. 2009;5(8), e1000491.
Çilingiroğlu C. The appearance of impressed pottery in the Neolithic Aegean and its implications for maritime networks in the Eastern Mediterranean. Tüba-Ar. 2010;13:9–22.
Haak W, Lazaridis I, Patterson N, Rohland N, Mallick S, Llamas B, et al. Massive migration from the steppe was a source for Indo-European languages in Europe. Nature. 2015.
Taberlet P, Coissac E, Pansu J, Pompanon F. Conservation genetics of cattle, sheep, and goats. C R Biol. 2011;334(3):247–54.
This work was funded by German Archaeological Institute, Johannes Gutenberg-University Mainz, and CNRS (Centre national de la recherche scientifique).
The authors would like to thank Adrian Bӑlӑşescu, Cornelia Becker, Daniel Bradley, Altan Çilingiroğlu, Çiler Çilingiroğlu, Giuliano Cremonesi, Keith Dobney, Ceiridwen Edwards, Ralf Gleser, Angela Graefen, Jean Guilaine, Svend Hansen, Daniel Helmer, Robert Hofmann, Raiko Krauß, Marion Lichardus-Itten, Claire Manen, Ingo Motzenbäcker, Mehmet Özdoğan, Jean Roudil, Silviane Scharl, Nils Müller-Scheeßel, Wolfram Schier, Mona Schreiber, Elisabeth Stephan, and Bernhard Weninger for providing bone material and for helpful discussions.
The authors declare that they have no competing interests.
JB, NB, and JDV designed the research, AS and RB performed the experiments, AS, RB, AP, JDV, AT, CC, NB, and JB analyzed and interpreted data, and AS, AP, and JB wrote the paper. All authors read and approved the final version of the paper.
Sample list. Supplemental information on the location of the archaeological sites, dating, sample providers, sequencing results, and GenBank accession numbers of all new sequences.
GenBank accession numbers. GenBank accession numbers and references of all previously published ancient and modern mtDNA sequences used in this study. Modern sequences are grouped geographically by their country of origin.
Ancient DNA methods. Supplemental methodological detail on DNA amplification and DNA sequencing (such as primer sequences and locations, and detailed protocols), haplogroup determination, and establishment of consensus sequences.
Chronological and geographical sample groups. Individual assignment of all ancient sequences used in this study to chronological, cultural and geographical groups, and individual haplogroup and haplotype assignments and polymorphic positions. The relative frequency of haplogroup Q per group is provided with a 95% confidence interval (CI). Shared haplotypes (H) are numbered consecutively. Haplogroup T stands for T, T5 or T1'2'3. Haplogroup assignment according to . Polymorphic positions are given according to the reference sequence GenBank V00654, whereby dots represent bases that match the reference sequence, and asterisks unavailable sequence information.
Coalescent simulations and ABC. Methodological detail on the coalescent-based demographic modelling of the domestication and then spread of cattle into Europe.
Validation of ancient DNA data. Supplemental information on ancient DNA validity criteria (such as blank controls and contamination rate estimation), and explicit discussion of not fully replicated sequences.
Shared haplotypes. Shared haplotypes (H) are named and coloured according to their haplogroup and numbered consecutively. Shades of red: T3; White/light gray: Q; Shades of blue: T, T5 or T1’2’3; Shades of green: T2 (haplogroup definition according to ). The x-axis gives the number of sequences. a) Haplotype distribution across all 193 ancient mtDNA sequences; b) Haplotype distribution across 13 spatiotemporal groups defined by region of origin and age in BCE to the left of each bar.
FST values. a) Ancient pairwise FSTs. b) Ancient pairwise FST P values. c) Ancient Reynolds’ FSTs. d) Modern pairwise FSTs. e) Modern pairwise FST P values. For a)-c): First row/column: IR/S: Iran/Syria; IT: Italy; SECE: Southeastern Central Europe; SEE: Southeastern Europe; SF: Southern France; SP: Spain; WA: Western Anatolia; CWE: Central/Western Europe. Numbers indicate the age of the samples in BCE; numbers in brackets indicate sample size. Significant values at the 0.05 level are highlighted in bold. For d)-e): First row/column: Area/country of origin. Numbers in brackets indicate sample size. Significant values at the 0.05 level are highlighted in bold.
Graphical representation of Table 1 and summary statistics from modern mtDNA sequences. a) Boxplots of ancient haplotype diversity. b) Boxplot of ancient mean numbers of pairwise differences. Estimates are sorted by the magnitude of the oldest available chronological group per geographical group. Markers are colored by chronology as follows: white: (earliest) Neolithic, grey: Middle/Late Neolithic, dashed: Chalcolithic, black: post-Neolithic. X-axis: IR/S: Iran/Syria, IT: Italy, SECE: Southeastern Central Europe, SEE: Southeastern Europe, SF: Southern France, WA: Western Anatolia, and CWE: Central/Western Europe. Numbers indicate the age of the samples in BCE; numbers in brackets indicate sample size. c) Summary statistics of d-loop sequences from 11 geographical groups of modern domesticated cattle. First column: area/country of origin. Ĥ: Haplotype diversity, π: mean number of pairwise differences. Significant Tajima’s D and Fu’s Fs values at the 0.05 level are highlighted in bold. d) Boxplot of modern haplotype diversity. e) Boxplot of modern mean numbers of pairwise differences. The boxplots are sorted by magnitude. X-axis: country/area of origin; numbers in brackets indicate sample size.