Quantcast
Channel: The MacGregor DNA Project
Viewing all 31 articles
Browse latest View live

Article 0

$
0
0

MacGregor DNA Project  - update January 2015


Welcome to the MacGregor DNA Project update for 2014-5 

This year's DNA update will move away from discussing Y chromosome results as I have done in previous years to consider the other tests currently available. This year there have been 49 new members of the project whose names reflect the aliases adopted by members of the clan or sept  names (such as Magee, Gragg, Greer, or Black) associated with the clan, and many of the results published this year were for tests other than the male Y chromosome test (in fact, in total there were over a thousand pieces of genetic  data reported). Most males who join the project will first do the 37 marker Y chromosome test. If they find that they match someone else with 35, 36, or 37 markers being the same, it can be useful for establishing familial connections to increase the number of markers to compare, up to 67 or even 111 instead of just 37. However there are limitations to the use of STR (Short Tandem Repeat) markers, as they are known, in that DNA mutates randomly. As a result some testees find that their Y DNA is very similar to that of another male, but neither person's DNA has mutated very much in 600 years, and as a result what looks like a recent relationship with another person from the same family branch is probably simply an indication of connection with the common ancestor from whom that surname developed as many as 600 years ago. On the other hand, some distinctive mutations - for example in the project all Stirling/MacGregors have '16' at DYS576 whereas the norm for the group is '18'- probably indicate descent from a more recent common ancestor.
      When surnames are grouped together by variant spelling (e.g. Greig, Gregg, Grieg, Grig etc) it quickly becomes clear that there are multiple origins for these names. Some of the results in these groups suggest a connection with the main MacGregor group that predates the actual founder (Gregor) who, it is believed, flourished c1360, rather indicating a strong genetic connection pre- surnames. What also emerges, in some cases, is an indication of contradictory genealogy  - individuals with very different genetic profiles who are supposed to be from the same ancestor.  Such cases suggest that genealogies need further checking for accuracy.
       It has been noticeable this year that there has been an increase in interest in the other DNA tests available as participants seek to learn more about their genetic descent. In order to help with the interpretation of these, or simply to help with choice, this blog update will concentrate on some of the more common: Family Finder (also known as My Origins, Ancestry Painting, All my Ancestry and so on, depending on the company used); MtDNA (mitochondrial DNA); and SNP testing.  I will deal with each in turn.

Family Finder/My Origins/Ancestry Painting

This test is designed to give participants an indication as to the origin(s) of their genetic ancestry. Most geneticists believe that human beings emerged out of Africa perhaps 70,000 to 100,000 years ago, but this test looks for more recent connections than that. For those whose later ancestry is British the test usually indicates a predominance of DNA from the British Isles, with reasonably high proportions coming from Scandinavia and/or the Mediterranean. These can usually be explained by links with Viking invaders in the case of Scandinavia and with emigration from the Spanish ice age refuge, as well as the possible influence of invasion by the Romans and their soldiers after AD43. 
     The results of such tests are often expressed in colours located geographically like this anonymised one from FamilyTreeDNA:


Fig. 1: My Origins geographical plot from FtDNA


All the companies which offer this test use the same principle for presenting results, that is, to show the results as percentages.  Here are the results for the same individual from 23andme.

Fig. 2: Genetic ancestry percentage chart from 23and me


Although the results show some variation (for example in FamilyTreeDNA 3% Middle Eastern against 1.1% Ashkenazi in 23andme) they are essentially the same. 23andme also show these results as Chromosome View concentrating on the past 500 years - and this has been highly rated by users. Here is the same information as Fig. 2 but in the chromosome view.

Fig. 3 Chromosome view of Fig. 2


BritainsDNA [aka ScotlandsDNA] adopt a colour graph approach which is a bit different from the others just mentioned but tends to produce similar overall results. These graphs are obtained by comparing sections of a person's DNA with the same sections from individuals in known populations. There are two versions - Figure 4 is based on a global analysis of the DNA, while Figure 5 refines the European elements of the global report.  Note, however, that these results are not from the same individual as the first three examples.

Fig. 4: BritainsDNA Ancestry Painting - global result


Fig. 5:  BritainsDNA Ancestry Painting - European

And the same company's presentation of chromosome painting:

Fig 6:BritainsDNA  Chromosome Painting



As more data are made available to geneticists, charts and grids such as these will be refined and developed further. FtDNA, for example, uses the results from My Origins to suggest relationships between individuals on the basis of shared DNA segments.

MtDNA - Mitochondrial DNA


Mitochondrial DNA is passed from a mother to her sons and daughters but only the daughters can pass it on afterwards.  What this means for genealogical purposes is that MtDNA identifies those people who are related through the mother's mother's line back through time. However, whereas surnames for males can be tracked back in time for perhaps 600 years or more and related to the passing on of the male Y chromosome, societies tend not to preserve information on maternal genealogy (unless there is a royal connection). Participants are therefore most interested in the possibility of identifying others who have the same genetic grouping or haplotype as they do, or, they are interested in finding out about the origin of their particular haplogroup - for example where did the woman who originated the genetic line live and how many tens of thousands of years ago was she alive. Migration maps have been created to show how migration based on mitochondrial DNA happened and also to show how the different genetic lines developed. One such map can be found here:

http://www.transpacificproject.com/index.php/genetic-research/

There are three levels of testing: HVR (hyper variable region) 1; HVR2; and FGS (Full Genome Sequence). Typically a participant will match a number of others at HVR1 but matches will decrease the more detailed the level of testing. Here is an example of report for haplogroup J1c1b1a:

Fig. 7 mitochondrial (MtDNA) results


I should add that 23andme actually covers the whole genome X, Y and autosomal. It has a lot of medically relevant information as well although recent American legislation has limited the company's ability to offer this.

SNPs - Single Nucleotide Polymorphisms


Perhaps the most interest in DNA in the past year has been generated by the developments in SNP identification.  The following websites might help in the understanding of the significance of SNPs:

https://www.genome.gov/Pages/Education/DNADay/TeachingTools/MakingSNPsMakeSense.html
https://www.youtube.com/watch?v=tJjXpiWKMyA

At the moment, as far as Scotland is concerned, the greatest attention has been directed to using SNPs to distinguish the points at which different clans split off from each other in the early Medieval period. Some of the most important SNPs for this analysis were identified by Jim Wilson of ScotlandsDNA using the Chromo2 chip. FamilyTreeDNA had previously released their product - BigY -which was designed to test for a large number of SNPs but did not include those which were of most significance to Scottish genealogy, though they have since made the most interesting SNPsavailable for purchase as single SNPs. Towards the end of this past year the company known as YSEQ started offering panels (groups) of SNPs including those which identify the Scottish clan groupings.  The most comprehensive genome testing is done by Full Genome Corp - which currently offers comprehensive testing of the Y chromosome for less than 900 dollars US.  This last is the only test currently available (because it is comprehensive) which has the potential to identify SNPs which are more recent and important for dividing up family groups in historic time - that is, since the beginning of surnames.  With this comprehensive Y test comes the possibility of identifying all the SNPs which link specific families together in the centuries since the introduction of surnames.

The most informative website for seeing what has been learned so far about the relationship of clans is the one created by Alex Williamson:

http://www.littlescottishcluster.com/RL21/NGS/Tree.html  and
http://www.littlescottishcluster.com/RL21/NGS/R-L1335.html which is the main group for those of Scottish descent of which the MacGregors are part.

A summary version of the tree is found here:

The key discovery as far as Scots from the Western Highlands is concerned came about through the work of Dr Jim Wilson of Edinburgh University. Through his work using the Chromo2 chip the sequence of SNPs ending with S690 which defined the MacGregor line was identified. Since we believe that S690 was probably found for the first time in Gregor the founder of the clan (who flourished c1360), anyone who thinks they might be a MacGregor can test for this SNP on its own, and this is now much the cheapest starting point in searching for genetic origins (but of most value to those who believe that they currently bear an alias).  As far as MacGregors are concerned, there are intriguing suggestions that a much smaller and less productive group of MacGregors broke off earlier than 1360, and it is speculated that those MacGregors and the McFarlands (now known as 'hidden MacGregors') who have S691 or S695 are part of this group.
     Current thinking is that SNPs occur about every 90 years, and even if this is an underestimate it is clear that all Scottish clans are very strongly related. The next Figure shows an adapted version of the genetic tree identified by Jim Wilson (as presented to the International Clan Gregor Gathering in summer 2014).

Fig. 8: SNP tree as presented by Jim Wilson in July 2014

The sequence of SNPs which arose is therefore, in chronological order:
L1335
L1065
S749
S744
S691
S695
S690  MacGregor (and ?S697)
S696  MacGregor of Glencarnaig (Chief's line) (and ?S698)

According to the current date estimates L1335 could have arisen at c790AD.  We believe that there are some, as yet undiscovered, SNPs between S690 and S696 (and S698) which define the Chief of MacGregor's line only. If S696 happened in say 1420 we would expect to see more clan members having this SNP than appears to be the case.  At any rate it seems likely that S696 dates to within the last 250 years. However do not currently know the status of the SNPs S697 and S698 which seem to be related to S690 and S696 is some way.
     Thanks to the work of Dr Neil McGregor in Australia we are beginning to break down the SNP results to see which particular SNPs define which families. Our aim is to raise money through the Clan Gregor Society in order to pay for full Y chromosome testing on 5 individuals who represent different branches of Clan Gregor. The difficulty then will be in identifying the significant SNPs from the tens of thousands which will be reported! The picture below taken from a recent Excel file shows the speculative tree to date.

Fig. 9: Hypothetical breakdown of MacGregor genetic tree (as at December 2014)


This draft tree is based on the following data which Neil McGregor has derived from a combination of STR (marker) results and SNP results. The number on the left hand side is the Family Tree DNA kit number.

Fig.10 STR data on which Fig. 9 is based



As usual if any individuals wish to see how their results relate to anyone else in the project I am happy to do a comparative chart such as those found on previous blog updates.  Since this opportunity has been available to all since the beginning of the project this offer is particularly directed to those who have tested their Y chromosome in 2014. As usual my email address is richardmcgregor1ATyahoo.co.uk (substituting @ for AT).


MacGregor DNA Project Update January 2016

$
0
0
MacGregor DNA project blog update January 2016

Welcome to the annual blog update of the MacGregor DNA project.  Last year I gave an introduction to the various DNA tests and what they could show. In the past year, while numbers in the project have continued to rise, most people seem to be opting for either the Family Finder test, or for SNP testing, in order to determine where on the human family tree they lie. For this year’s update I have decided to focus in particular on two case studies –  one of which has been supplied by Neil McGregor and Matt McGregor in Australia. I’ll get to these in a moment.

When I wrote about Family Finder last year I didn’t really emphasise the importance of uploading a personal family tree into a personal DNA webpage. If you are familiar with .GEDCOM files this will need no introduction but for the benefit of those unfamiliar with the file format I hope these brief comments will help.

There are numerous propriety software packages on the market for recording family tree information and since, for once, the online community has agreed on a common format for exporting files, it does not matter which is used.  For PC a commonly used program is ‘Family Tree Maker’ and for MAC ‘Reunion’ is often found (though Family Tree Maker for MAC also exists). In the main, all programs do the same thing so it really comes down to personal preference in the way data is entered and displayed as to which one to choose. The two I have mentioned are simply those that I use myself and should not be viewed as a recommendation.

The important thing about a GEDCOM file when used with Family Finder is that it creates a list of surnames which can easily be searched to see if any surname is the same in two, or more, ‘genetic cousins’.

Also new for this year’s update is the mitochondrial grouping for those who have done this test in the MacGregor project. It is very unlikely that exact matches will be revealed by this grouping (unlike male Y chromosome DNA) because MtDNA does not go with surname and it is quite unlikely that two people in what is essentially a male surname-based project  would happen to have the same mother’s mother’s mother etc, although if a close match on the full genome sequence for MtDNA were found, it might suggest a common emigrant ancestor.

The grouping of MtDNA results can be found at: 

As for previous updates I am conscious that it isn’t possible to cover all participants in the project in detail, especially now that there are over 1100 folk taking part. I am happy to make up some comparative charts for project members, much as will be presented below in relation to the Stirling connection. I only request that if you wish me to run a chart that you say the kit numbers or group with which you wish to be compared. I can only run these comparisons for the Y chromosome test – and it does not give useful results if less that 37 markers are compared: in other words, the number of individuals who match within one or two mutations on 12 or 25 markers is so great as to make comparisons meaningless.


The other limitation to note on these generated charts is this: a computer program will only do comparisons on statistical or numerical result similarities, and since DNA mutates randomly close family connections (that is, over the last 500 years) can get obscured by random mutations – some families’ DNA mutates faster than others. Various factors have been cited for increased mutation, such as age of father, place in the sequence of births for a couple, environment, diet, and radiation. The value of such comparisons therefore lies in the clues that it affords to possible connections: when paper genealogy fails then such clues can be very important!

Case Studies 1
The Stirling MacGregors

I was asked to compare the 111 marker results for kit number 13201 – representing the branch of the main MacGregor line who adopted the alias Stirling sometime during the 17thcentury. The story goes that a MacGregor (some say of the Glenstrae family, and some say his forename was Robert, or William or Duncan) was employed in the house of the Stirlings of Keir near Dunblane when the soldiers came looking for the MacGregor. They were met by the lady of the house who declared “there is no-one named MacGregor here” and from that time the MacGregor and his family used the surname Stirling.

While the family tradition is that the Stirlings originated in Glenstrae the evidence shows the first use of the Stirling aliasis a “Willeam McGregour VcCoueill callit to ane toname (alias) Stirling” in the 1611a list [prepared for the Laird of Luss]. This person seems to be the William spoken of in the tradition above regarding the establishment of the Stirling alias. This makes him a William McGregor son of Donald.  The 1611a list (#81) lists him as being of the House of Gregor McAne and hence of the Brackley family. Searching the data would place him in the following tree.

Gregour McAne (b~1520 alive 1595).
   |- Eion Dubh (b~1550 executed 1612). #43 in 1611a list (In Glenurchy)
   |    |- Patrick (b~1570)
   |    |- Alasdair cass (b~1571 executed 1613) #44 in 1611a list (Eion’s son)
   |    |- ? #48 in 1611a list (Eion’s son)
   |    ‘- Donmall (b~1572) (#45 in 1611a list (Eion’s son).
   |        ‘- William (b~1590) #81 in 1611 list.
   ‘- Niall (5 sons)
This family came from the lands in Glenurchy and may have claimed “FROM GLENSTRAE” as distinct from “OF GLENSTRAE”.  [previous 15 Line comment received from Neil McGregor after seeing a draft of the blog].

In order to see whether there were some possible connections with other participants who had also done the 111 Y chromosome marker test I abstracted all the 111 results in the MacGregor Ian Cam subgroup [supposed founder of the clan who died 1390 in Glenorchy/ Glenstrae].  What was known to this point was that every individual who had the surname Stirling and had the genetic profile of a MacGregor had a specific mutation from 17 to 16 at marker 32 – a mutation not borne by any other members of the Ian Cam group. This suggested that this specific mutation had occurred by at least 1700.

Running the marker results through the comparison tool created by Dean McGee (this can be found at http://www.mymcgee.com/tools/yutility111.html) gives a possible “Time to Most Recent Common Ancestor’ result for comparing any two individuals with each other. This is only a rough guide to relative distance between two results and since we expect that the time back to the originator of the clan is probably 600-650 years clearly any figures which exceed this number of years in the grid would be too far back if we accept decent from Ian Cam [or supposed father Gregor]. However, the program is assessing relative distance between individuals on the basis of numbers of mutations, and in that context it gives a helpful indication of possible closeness of different branches of the family.

In the chart the kit number for Stirling is 13621 (click image to expand).

Figure 1: TMRCA for members of the Ian Cam MacGregor group
So, according to this grid, the possible time distance between 13621 and, for example, 2124 – the line of the clan chief – is found by reading along the grid line of 13621 horizontally until it meets the line coming down vertically from 2124. This gives an estimate of 460 years and a split point of c1500 AD. There has been a suggestion that the Stirlings are were originally Glenstrae MacGregors – this estimate would not rule out that being possible. Less likely would be any connections closer than 260 years because we know that there has been a Y chromosome mutation in the Stirling family that must have taken place  before c1700. Also less likely are those time distances which exceed 900 years  separating them from other participants – simply because surnames as such did not exist prior to about 1300 AD.  This suggests that SNP testing (on individual results which show a larger time interval from other results) would be worth pursuing to see whether or not the individual concerned has the MacGregor SNPs S690 or S697. As more SNP results appear it may be that we will have to reconsider the formation of clan name groups as predating the adoption of an identifying surname. That will be a while coming however. [If you wish to see current results google Alex Williamson’s Big Y tree].

It is important to note that not all the individuals in the above grid are called MacGregor. When the Ian Cam group was created individuals were added to the group on the basis of what their STR numerical results suggested (that is, they looked like MacGregor). In the above grid it is therefore significant that those which show the furthest time distance are 381858 Murray, 292892 and 350316 Stewart, 121048 McPherson, 120820, 189492 and 258767 McFarland and 237186 Hunt. Only two of the ‘further out’ results are MacGregor which suggests that even if the Y chromosome  STR results look similar, SNP testing will be necessary to find out if individuals belong to the main surname group or whether they split off earlier. So far the McFarlands have tested S690- but 292892 Stewart is S690+.

If we look at the chart which the above table generates we see that there are some interesting family groupings suggested:

Fig 2: 111 marker Ian Cam group chart


In particular, on the right hand side of the chart we see that many of the kit numbers mentioned in the last paragraph are all grouped together in very close proximity which seems to imply some closer genealogical connection. It would be interesting to know what the terminal SNP for each of these individuals is.

Most of the other branches appear to represent separate families though as I mentioned earlier it’s hard to say anything absolutely since DNA can mutate randomly even within the same branches of a family.  In the chart above it seems like Stirling kit 13621 is more closely related to McGrigor kit 256584. If this truly is a family connection then it must be before 1700 because kit 256584 does not carry the distinctive Stirling mutation at position 32.

It is useful to compare the locations of the Stirling results in the 111 analysis in Fig 2 above with those members mentioned in the following case study which makes a similar claim to Glenstrae origin but using DNA and documentation to make the case.

Case Studies  2
A surviving MacGregor of Glenstrae family?

The second case study is extracted from a longer version which will be published in the Clan Gregor Society Spring 2016 Newsletter, and will subsequently be available to download from this blog. The material has been put together and analysed by Neil McGregor and Matt McGregor in Australia using 111 marker results from Family Tree DNA, Big Y results from Family Tree DNA, and MacGregor results from ScotlandsDNA. [the following text is an abbreviated version of their article].

Analysis of the y-DNA is associated with the known descendants of John McGregor (John of Monzie) who married Ann Stobie on 14 June 1826 in Monzie Parish, Perthshire, Scotland. As the y-DNA only traces the male line we obtained data from three male lines descended from John of Monzie. These sons were Charles John (b 1836 at Glassworks, Alloa), John(b 1838 at Balmain, NSW) and James (b 1840 at Braidwood, NSW) (See figure 1 and tables 1-2 for the lines of the DNA samples and the results of the analysis). The data in tables 1 and 2 show that the family descend from the Clan Gregor hierarchical family and that the DNA from current descendants of John’s line (NRM & AAM) differs from the current descendant of Charles (GCM) line by 2 mutations (one in each from the common ancestor) and James’s line differs by three mutations from the other two. Both John and Charles lines differ from the Clan line by five mutations (389a and 389b appear to be a single mutation not two) and James by six mutations. The Y-DNA pattern of John of Monzie (b1790), the common ancestor, can be predicted and would have differed from the Clan line by four mutations - his Y-DNA data is given in Table 1. The actual dates from the birth of John of Monzie to all the tested subjects is between 150 and 175 years, being 4-5 generations. The calculations are based upon the McDonald mutations rates and using average generation time of 30 years results in a reasonably accurate prediction. James’s data indicates that the family may have a higher mutation rate than is normal but recent research has indicated this may simply be the result of where the mutations occur on the Y-DNA and not an actual increase in mutation rate.

Lines of the Y-DNA samples from John McGrigor (Monzie b 1790).

John McGrigor (Monzie born 1790)
   |- Charles John (1836) (Alloa Scotland) 3rd generation Graeme Chisholm MacGregor
   |- William (No samples acquired from this line)
   |- John (1838 - Balmain NSW) 3rd and 4th generations Neil Roland and Alexander Andrew James
   ‘- James (1840) (Braidwood NSW) 4thgeneration  James Hardy

From these data we can estimate the time to the Clan common ancestor (Table 2) to be ~1350 using the McDonald mutation rate for the normal population. This figure is different if we calculate the mutation rates based upon the higher mutation rate seen within this family’s actual data, the date to common ancestor being between 1380 and 1618 with a median of 1480.

 Table 1. Summary of the Y-DNA 67 marker test for the Clan and the two subjects and that of John (1790).


Table 2. Time to common ancestor (Years) for John 1790 and his three descendants.


Using the Y chromosome mutation rates proposed by Doug McDonald the separation from the Clan modal occurred ~1380, however, as mentioned above, using the individual mutation rates determined for this family’s actual data gives later values. The mutations rates of NRM and GCM give the separation date as ~1480 and for GH ~1618. This date related to GH is far too short to the present and the mutation rate therefore too high to be meaningful over all the generations so only the data based upon NRM and GCM will be used.

Examination of the Clan documented history only places three Clan Gregor family lines in the Monzie/Crieff parishes in the 1600s and they are: Glenstrae, Glenlednock, Roro and a group who used the alias McAra, whom the documents show are actually of the Roro line. These Clan Gregor families have their common ancestors at various dates: Glenlednock have been proposed to have separated from the Glenstrae line in ~1400 and the Roro line prior to that date. However these dates are only speculative. Thus, the DNA evidence supports the possibility that the Bega family [family of NRM who first settled in Bega, Australia] could be derived from any of these three families. We have approximately 18-20 generations between the above tested individuals and Gregor [name father of the clan] born ~1300, which equates to 1 mutation every 3 to 4 generations. Whilst this mutation rate is higher than normal, it actually is found in the Familytreedna.

Firstly, it is noted that the Bega [NRM] family has a unique mutation in the Clan dataset at DYS 389a and DYS389b of 14/31. No other tested subjects have this marker so we cannot be matched using that data. It is likely that this mutation occurred after 1500 and before 1790 and therefore would only be seen in recently related individuals. Analysis of the Clan Gregor Y-DNA database shows several distinct clusters with few mutations and several high mutation clusters. Analysis of these latter groups shows potential correlation between the mutations and certain geographic locations. Two of these appear to be sequential, and they are mutations at DYS576 at 16 and 17 when compared with the general Clan data at 18. Those at 16 are the Stirlings who lived in Dunblane and those at 17 from people who claim to be from the Glengyle family. Both these groups could have descended from the Glengyle although other data suggest that the Stirlings are from the house of Gregor McAne of Brackley and have lived in Glenurchy on the Glenstrae lands. It is very unlikely that the Bega [NRM] family are descended from the families of Roro, Glengyle, Brackley / Ladasach, or McRob. This strongly suggests that the Bega family are descendants of the one remaining family: the Glenstrae line.

Assessment of other genetic matches.

The Bega [NRM]family has a mutation of 9 at 459b. This is shared by 5 other participants in the Clan Gregor database.

Table 3 Analysis of subjects within Clan Gregor who have 9 at 459b.


This seems to be an early mutation in one branch of the family. These kit numbers were assessed using comparative Y-DNA analysis, and table 3 shows the mutations within the group. Table 4 shows the dates to common ancestor for the group and this suggests that the closest common ancestor was with kit number 191228. The date of common ancestor being ~390 years ago or ~1560 using the same mutation rates as for Neil [NRM] and his cousin Graham in the analysis. The oldest known ancestor for kit 191228 was Alexander McGrigor born in 1790, living in Lanarkshire.

Table 4. Time to common ancestor for the members of the analysis.

Kit 94903 has their oldest known ancestor in Lanarkshire in 1790. Kit 119330 has their oldest known ancestor in Buchanan in Stirlingshire, and 121048 has their oldest known ancestor as one James McPherson from Abernethy/Kincardine and that they lived on the lands of the Grants. Kit 120679 did not report a location or ancestor.

The important factor about these different family locations is that they are very widespread across the Highlands and part of the lowlands. Whilst the dates and locations may be related to movement during the “clearances” or post Jacobite wars (1745/46), as appears to be the case with kit 191228, the others cannot be attributed to these events. This suggests that the 9-9 mutation group have an early mutation from a family that is now well scattered – most likely due to the early habitation of the occupation sites, movements during the proscription or some other upheaval. If the mutations occurred early in the family then the low numbers in the database could suggest that many of the 9-9 kindred may have been killed during the proscription hence their low numbers within the Clan database. The known aliases used by some within the 9-9 kindred are Grahamfor kit 191228, McPherson for kit 121048 and Murray which was used by the Bega (NRM kit 16198) family.

McGrigor in Lanarkshire.

Matt McGregor (Kit 191228) respectively is the closest non immediate related DNA sample to the (McGregor Bega Kits) [NRM] and also to McPherson (Kit 121048) suggesting that all three have a high probability of being from the same family and from the discussion above, therefore most likely Glenstrae. Importantly the DNA data suggests that the common ancestor is likely to be Alaster Ruadh MacGregor. (The DNA suggests ‘time to common ancestor’ is 390 - 450 years respectively for all 3 Kits). This suggests that the individuals are likely to have arisen from different sons of that Chief (Alaster Ruadh 1524-1547).

[detail omitted which will be in the full report when published]

Of all the possibilities, when considering the actual family history, traditions and marriage connections, etc., the data at hand suggests Kit 191228 is most likely a descendant of Ewin the Tutor and probably from Kilmannan himself through the son Hugh. A more detailed report on this will be submitted later, in conjunction with the history of Kilmannan and Rob Roy that to date seems not yet to have been posted. In support of this contention Kit 191228 has 4 separate Autosomal DNA matches with known descendants of Kilmannan’s via his female line.

As Kit 191228 McGrigor and Kit 121048 McPherson both match the Bega McGregor [NRM] kits, with the DNA data being supported by the documentary evidence, the conclusion is that the Bega [NRM] McGregors are also Alaster Gald - Glenstrae descendants.

Neil McGregor has summarized this and other data analysed in a revision of the chart published in last year’s blog.

Fig 3: Proposed Clan Gregor descendancy chart as at end 2015



Just to repeat the invitation to request comparisons with other participants stated above: please state the group or individuals with whom you wish to be compared and I will help you interpret the results. Please note that it is only feasible to compare like with like (i.e. 67 markers with 67, 37 with 37). As usual my email address is richardmcgregor1ATyahoo.co.uk (substitute @ for AT).

Richard McGregor January 2016

Can Blog followers please note this message from Google:

In 2011, we announced the retirement of Google Friend Connect for all non-Blogger sites. We made an exception for Blogger to give readers an easy way to follow blogs using a variety of accounts. Yet over time, we’ve seen that most people sign into Friend Connect with a Google Account. So, in an effort to streamline, in the next few weeks we’ll be making some changes that will eventually require readers to have a Google Account to sign into Friend Connect and follow blogs.

As part of this plan, starting the week of January 11, we’ll remove the ability for people with Twitter, Yahoo, Orkut or other OpenId providers to sign in to Google Friend Connect and follow blogs. At the same time, we’ll remove non-Google Account profiles so you may see a decrease in your blog follower count.

We encourage you to tell affected readers (perhaps via a blog post), that if they use a non-Google Account to follow your blog, they need to sign up for a Google Account, and re-follow your blog. With a Google Account, they’ll get blogs added to their Reading List, making it easier for them to see the latest posts and activity of the blogs they follow.

We know how important followers are to all bloggers, but we believe this change will improve the experience for both you and your readers.

Posted by Michael Goddard, Software Engineer

Article 0

$
0
0
MacGregor DNA Project January 2017 update

I used the 2015 update of this blog to explore the various tests available. Since then there has been a noticeable increase in the number of individuals taking the test known as ‘Family Finder’, or something similar, rather than Y chromosome or mtDNA tests. This has probably been as a result of quite aggressive marketing by Ancestry.com [it has a variety of website endings depending on where it is based] in particular. This has promoted the equal use of DNA testing for both males and females and tied it into the submission of family trees which individual testers can use to identify the same family name(s) with others who have tested and submitted their genealogies. What has perhaps been rather glossed over in this is the fact, firstly, that DNA gets ‘lost’ over time – if it didn’t we would have the DNA of billions of ancestors in our bodies, and, secondly, that it is only a tiny portion of our DNA which is currently being examined for genealogical purposes. In relation to the latter, if you took all the DNA out of your approximately 100 trillion cells and stretched it out in a long line, with DNA in one cell being about 2 metres long, it would reach to the moon and back about 8000 times. By comparison, if you did a similar process to your veins, arteries and capillaries they would measure about 100,000 kilometres (or 62,000+ miles) or roughly twice the height, or thereabouts, that aircraft fly above the earth.
     You do not inherit 25% of your autosomal ancestry from each of your 4 grandparents. This is because your autosomal DNA is randomly recombined, and not in equal proportions from each parent, and so the more you go back in time the percentage inherited from people in a particular generation becomes smaller and smaller and therefore the more distant the ancestor is the more difficult it becomes to identify what you received from that person. What then are the chances of that same bit of DNA being preserved from a specific ancestor in yourself and someone else? For example, if your name is, say, Smith, and your male MacGregor ancestor lived 10 generations ago on your mother’s side it is really not feasible with today’s technology to identify that DNA by looking at your DNA today. The tests which are offered by Ancestry, Family Tree DNA etc. only try to identify links to 5/6 generations back. The key thing to remember is that if you and someone else have, say, people called Brown in your trees it does not necessarily mean that you have a recent ancestor in common, or indeed that you have any ancestor called Brown in common at all. For these tests of connection to work properly, you, and the person you are comparing with, need to have as much genealogical information as possible on every ancestral line in your respective trees, going back 5 or 6 generations (as well as a significant shared portion of DNA). This information can be displayed on a fan chart such as this one, which can be viewed at:
Fig 1. fan chart example - Perry

You and the person you are comparing with both have to put all your family information into one of the computer genealogy programs (like Family Tree Maker, Reunion etc.), save it as a GEDCOM file, and upload it to whichever DNA company you have tested autosomal DNA with. The possible links between the two family trees are then highlighted in some way which allows a comparison of ancestry to be made in order to explore if there is a family match on some surname. The fact that there is a match on surname does not necessarily mean that it is the same family, only that there is a surname in common. Clearly the more unusual the surname, the more likely that the match will be with the same family. In my own case the extensive GEDCOM file that I have made found a name match in another equally detailed GEDCOM tree which linked to a common ancestor born towards the end of the 18th century in the Volga German colonies in Russia, although in fact this was just confirmation of a previously suspected connection found by traditional genealogical research (I’ll come back to this later). Clearly autosomal testing COULD be very valuable, despite these caveats, for females who cannot take advantage of the Y chromosome test which only men can do since it relates directly to father’s surname (unless of course a father, brother, or cousin who has the name of interest will do the test on the female’s behalf).
     There is another benefit to autosomal testing in that portions of a person’s DNA can be compared with typical results from other ethnic groups. So, for example, a person might find through this utility (sometimes called My Origins or similar) that they have, for example, Mediterranean, African, or Native American ancestry as part of their genetic makeup. This won’t be a link to a specific person (unless of course this genealogy is already known or at least suspected) but is often of interest when an individual wants to know the answer to the question ‘where did I come from?’. This is explored a bit more later in this blog. What we invariably find is that our most ancient ancestry is incredibly mixed and that our own personal ethnic makeup links back to many different regions and races. The Economist discussing the work of scientist and geneticist Luigi Luca Cavalii-Sforza remarked that his work "challenges the assumption that there are significant genetic differences between human races, and indeed, the idea that 'race' has any useful biological meaning at all".

SNPs
     I want to spend most the rest of this update considering the current and possible future significance of SNP testing. A SNP, or single nucleotide polymorphism, is a DNA ‘event’ which functions a bit like a marker in time. At the moment, I am here talking about SNPs which occur in the male line (and are often relatable to surnames). As far as is known, once an SNP occurs it does not re-mutate backwards. Because of this fact, if we can identify specific SNPs, as a first step, and if we can then date them, even approximately, we find ourselves on the way to constructing family trees, not just in prehistory but, potentially, for SNP events which took place within historic time. More than a decade ago Ken Nordtvedt and Dr John McEwan worked on ways of grouping STR results [that is, results from the standard male Y chromosome test] to show how some numerical patterns were constant within defined groups. This was limited at the time by the fact that only 37 markers were available in STR tests (whereas now one can test 67, 111 or even more). McEwan identified 49 groupings, and the one he called R1bSTR-47 came to be known as ‘Scots’.  This was the DNA profile based on 37 markers that was identified as ‘Scots’ (Fig. 2):



 Fig 2.  The ‘Scots Modal’ and ‘MacGregor Modal’ compared
The lower figure was the modal figure for the MacGregor group who claimed descent from Ian Cam (who is in the record of obituaries written down by Sir James MacGregor Dean of Lismore as having died in 1390). What was exciting at the time was the realisation that with only three mutations different, the MacGregors seemed to be a group who had mutated a little way, but not far, from the Scots group. Since then some commentators and researchers, particularly Alistair Moffat and Jim Wilson in their book The Scots: A Genetic Journeyhave suggested that the MacGregors were actually Picts. Without wanting to go into the arguments for or against this interpretation in any depth I did want to ask: if these were the Picts then who are the Scots [this is an inversion of the way the question used to be asked]? If you look at the various DNA discussion boards you will see that many people disagree with their interpretation. It is nonetheless important to repeat it here because the definition of Pict was based, by Jim Wilson, on his interpretation of the SNP S530 which he found and named when he was attached to ScotlandsDNA [the company now has various versions of the same name such as IrelandsDNA, BritainsDNA and so on]. 
     Following on from this, S530 was found to be equivalent to L1335 [the confirmation came in 2012] and the search was on to find what SNPs were more recent in time than L1335/S530.  Four years ago not enough was known about SNPs to attempt a time estimate as to when they occurred, but since then the group known as YFull have given broad estimates, from their research, of possible dates for each SNP. At this point I am concentrating on just the MacGregor male line but later in this blog I will make reference to another line to demonstrate how SNP testing is currently revolutionising our understanding of genetic/clan genealogy going back into the past beyond the time of written genealogies. What we still lack are more SNPs coming forward into the time of written records although there are some SNPs which are beginning to divide up families into smaller subgroups with the same surname, particularly when associated with STR results.
     At this point I repeat a chart which I first included in the 2015 blog. The chart was derived by Jim Wilson from work which he did in relation to clan DNA origins through the ScotlandsDNA company (Fig. 3).


Fig. 3: The SNP tree of the Scottish clans as at 2014

This helpful map from Wikipedia shows how the clans relate to each other geographically (Fig. 4):



Fig 4.  Scottish_clan_map Wikipedia Commons.png

Taking each of the SNPs from S530/L1335 onwards, YFull have given the following time estimates:

L1335 (also known as [aka]S530) formed 4300ybp (years before present), and time to most recent common ancestor 3600ybp
L1065 (aka CTS11722 or S749) formed 3600ybp, time to most recent common ancestor 1750ybp
S744 formed 1750ybp, time to most recent common ancestor 1750ybp
S691 formed 1750ybp, time to most recent common ancestor 1700ybp
S695 formed 1700ybp, time to most recent common ancestor 1550ybp [or, c.320 AD to c. 470 AD]

It not clear from the above how YFull distinguish the time frames of the later SNPs as they seem to be suggesting that they all arise around the same time, although, other commentators on DNA have speculated that there are approximately 90 years on average between SNP DNA mutations – but –  that assumes a degree of regularity in the occurrence of mutations, which we know is not the case.
     If we work in reverse from S690 [see Fig, 2 above] which is the defining mutation of the MacGregor bloodline, and which, we think, could have arisen as late as 1360 AD and perhaps as early as 1200 AD, then, using the standard SNP sequence going backwards in time, S695 might have occurred as late as 1270 AD and as early as 1110 AD.  Then S691 above S695 as late as 1180 AD or as early as 1020 AD. Ian Cam MacGregor descendants will notice that there is no mention of S697 here. We simply don’t know enough about this SNP or FGC17830 which seem to be typical of the MacGregor main line to say anything definite about them at this point. Indeed, we know little about S701, FGC17831, S703, FGC17832, S27834, BY144, FGC17829, S27835, and BY143 which are currently found as positive ONLY in the SNP results of MacGregor bloodline and in two Buchanan men. (By the way, it is very interesting that almost all other Buchanan men descend from the next level up on the ancestral tree as shown by the results collected and displayed on the excellent site run by Alex Williamson). The section for descendants of SNP L1335 can be found on Williamson’s site at: 

Fig 5. MacGregor/Buchanan section of Alex Williamson’s The Big Tree

If YFull are right in their estimates then it is quite possible that these SNPs [from S701 onwards] fill the gap between c. 500 AD and c. 1300 AD, implying that MacGregors remained a homogenous group from 500 AD onwards BUT this seems highly unlikely, to say the least. Looking at Williamson’s trees it is quite clear that there are many SNPs about which we need more information. Williamson shows the following as the sequence of SNPs from L1335 (and this same sequence is shown as part of the Scots Modal Panel which can be ordered from YSEQ.com):

Top of Form
L1335/S530 > L1065 > Z16325 > S744 > S691 > S695 > S701 > S690



Fig 6: Section of the L691 descendant SNPs from YSEQ

This image, which is taken from the YSEQ website shows just how complex the clans family tree is becoming, and changing as we learn more and more about SNPs. It is clear that the relationships which exist between Scottish clan groups will eventually be refined in such a way that the traditional stories of clan origins will either be confirmed or refuted. So, even though now we cannot say with absolute confidence what the exact sequence of SNPs leading from L1335 to S690 is and cannot assign secure dates for their first appearance it will only be a matter of time before the confidence levels on all this become much clearer. It is remarkable how much has been achieved in just four years.
     For comparison, I attach another small section from Alex Williamson’s Big Tree – in this case for the McLeans (Fig. 5). We can see in this that the family groups are beginning to divide up based on individual SNPs separating family groups. The McLeans, like the MacGregors, have a small number of individuals who are quite separate from the mainline, in having the SNP M222 which often has Irish connections, or other SNPs which seem to indicate adoption of the name by different families during the time of change from patronymics to surnames.


Fig 7: McLean section from Williamson The Big Tree

What are patronymics?
     The use of patronymics versus surnames is not always well understood, so a brief discussion here might be helpful. Years before surnames became common in Scotland an individual might be referred to by his geographical location – so in Clan Gregor we have John of Glenorchy who flourished in the 13thcentury. It has always been believed that this John was one of the first MacGregors, but without a surname we have no real way of knowing if he was indeed a MacGregor – there’s always the possibility that he was an early Campbell. After the middle of the 14th century, and in Britain following the devastation wreaked by the Black Death, not only did individuals move around rather more than before but individuals were less tied by servitude to landowners. In Scotland individuals who managed to acquire some land-holding capability began to use surnames to identify family groupings, whereas, the common people around them would continue to be identified by their family relationships in the formula ‘the son of and grandson of’ (or ‘daughter of’ in the case of females) another male forename. Thus, whereas the most important MacGregor might be known as Patrick MacGregor of Glenstrae, the under tenants, who might, or might not, be distantly related to him, would be known as, for example, Patrick McEwan VicConachie [son of Ewan, grandson of Duncan]. As late as the middle of the 18th century the rental documents of the Menzies estates were still referring to males by their patronymics, but, by the end of the century all tenants had acquired and been identified by fixed surnames. These families might indeed be genetically MacGregors but they might equally be genetically Menzies or Drummond, or they might never have been genetically linked to one of the main families in that area at all. It is because of this variety of means by which names were acquired that clans include individuals who have a variety of genetic origins.

DNA testing companies and their products.
Y chromosome tests:
     It can be very difficult for people new to DNA testing to work out what the best test is for them. The answer depends on what question a person wants answered. If a male wants to know about his surname and its origins, then the Y chromosome test is in practice the only option. Very little is gained however by simply doing 12 or 25 markers because comparisons with other people with the same surname are only effective when comparing 37 or more markers. Many of the programs which help to make comparisons with another individual’s marker scores work best with 67 or 111 markers, and some do not work with less than 67 markers. The advantage to participants of an FtDNA (Family Tree DNA) Y chromosome project is that the company has the largest publicly available database of Y chromosome results for comparison. Surname projects allow direct comparisons with others, but it is possible to keep results private and not visible to the general public, although there is no way to identify an individual testee based on kit number alone.
     One of the results of Y chromosome testing is that an initial assignment of the results to a haplogroup is made. For the majority of Western Europeans this will be R1b-M269 while most others will be I-M253 [originally I1], or I-M223 [originally I2] (both associated with Scandinavia, and originally from the Balkans), and R-M512 (or R1a), whose origins lie in the Steppes. What the numbers indicate is an SNP that defines a specific group (for example, R1b-M269 is Western Atlantic origin). Some companies, however, such as 23andme, still classify individuals by alphabetic lettering. Thus, in my own case 23andme labels my paternal line as R1b1b2a1a2f* ‘a subgroup of R1b1b2’ which is a ‘subgroup of R1b1’. R1b1b2 then described in the accompanying explanation as:

·       Age: 17,000 years
·       Region: Europe
·       Example Populations: Irish, Basques, British, French
·       Highlight: R1b1b2 is the most common haplogroup in western Europe, with distinct branches in specific regions.

Hopefully the earlier part of this blog has shown why this doesn’t say much about recent genetic connections.
     For I1 [I-M253], for example, the 23andme include generalised results for some famous individuals, including Leo Tolstoy and Warren Buffett. It’s unfortunate that some newspapers then choose to interpret such results as ‘meaning’ that a named person is ‘related’ to Warren Buffett (yes they are, but probably cousins at 10,000-20,000 years distance), or, even worse, that a named person is a ‘descendant’ of the historical figure (and that is extremely unlikely unless it happens to be Genghis Khan!!). In this way completely false stories about DNA relationships spread.
    The origins and spread of these haplogroups are shown on the attached map found in several forms on the internet:


Fig 8: Origin and Spread of haplogroups R1a, R1b and I

What about Big Y or Full Genome testing?
Big Y
FtDNA advertise their Big Y as follows:
“Nearly 25,000 known SNPs, placing you deep on the haplotree.
10 Million base-pair coverage - more than any other Y-DNA test on the market.
Find SNPs that may be completely unique to you.
Explore your deep paternal ancestry
Help the community uncover new, undiscovered SNPs.
Use your newly discovered SNPs to help grow the haplotree”.

Whereas FGC (Full Genome Corporation) offer:
The “GenomeGuide, a whole genome test for ancestry purposes, and Y Elite 2.1 a comprehensive test” of a person’s “Y chromosome. Y Elite 2.1 determines those markers (i.e. SNPs and STRs) that are most useful” for a person’s “paternal ancestry”.

Both these tests aim to locate SNPs on a male Y chromosome and may include SNPs classified as ‘private’, meaning that at this point in time they have only been found in a single or very limited number of individuals, and their exact significance to the more general tree or to an individual’s personal family tree has yet to be confirmed.
    It will be clear from the above product descriptions that FGC’s offer is more comprehensive (and they have other versions which probe the Y chromosome even more thoroughly, but cost considerably more). The essential problem lies in identifying which test, if any, gives the most useful information. Some project administrators make suggestions as to which more comprehensive test to take, or, they highlight specific SNPs that an individual might choose, but, these usually build on previous testing rather than being aimed at people starting to look at SNP testing for the first time. A good starting point is to observe what SNPs others in a group have already tested (FtDNA show these as ‘confirmed SNPs’ in green). Individuals who don’t know here to start with SNP testing do need to look for help from a project administrator regarding which SNP(s) to choose. If we take M269 (for group R1b) for example, in many projects in FtDNA this will show in red, meaning the SNP is predicted but unconfirmed. Normally the prediction is correct. If starting from this point probably the best thing to do, short of going straight to one of the two big tests mentioned above, is to have SNP L21 tested for positive or negative. If a person is L21 positive and doesn’t want to go down the line of Big Y or FGC testing then the next step, having looked at any confirmed green entries for SNPs in the sheets of excel data for people lying nearby in the grid, is to go for an L21 SNP Panel either with FtDNA or with YSEQ.com (but using the latter will require a new registration and a new sample, although their pricing is competitive). If SNP testing is done with FtDNA, their results program will usually suggest what the next SNP tests might be. At a certain point in testing it is definitely worth (if only financially) trying an appropriate SNP Panel. For instance, results in the STR Y chromosome tables for a surname project which lean towards L1335 suggest that that would be a good STR Panel to test. Both FtDNA and YSEQ offer L1335 panels as well as individual SNPs (but doing one SNP at a time can get expensive).
     Just to re-emphasise: the advantage of the more comprehensive tests is that ‘private’ SNPs are often identified. Sometimes these are unique to an individual but sometimes they will be found in several individuals and therefore they may well define a discrete family group from within the historic period. However, in order to identify these as belonging to more than one person, other people who seem to be closely related (when looking at the other DNA male line results) need to test for the same ‘private’ SNPs. Many surname groups are working to try to identify these ’private’ SNPs for family groups both to advance genealogical links but also to save participants some money!

Health related issues
Most DNA testing companies do not give reports which include information about health risks. Exceptionally, 23andme have offered health related reports in the past but after difficulties in America with the FDA they suspended these reports, but later reinstated some for the non-American market. These tests do not have a genealogical component and therefore will not be discussed further.

Ethnic mix
As mentioned earlier, FamilytreeDNA, through its MyOrigins report, ScotlandsDNA through Ancestry Painting, Ancestry.com through the AncestryDNA test, and 23andme through Ancestry Composition, all, with some variations in reporting procedures, aim to give an individual a ‘picture’ of his or her ancestral connections with populations around the world. Results naturally vary considerably from almost 100% European to real mixtures of different ancestral backgrounds including American Indian, Far Eastern, African and so on. Ancestry for example says that their DNA test ‘looks at a person's entire genome at over 700,000 locations’ and covers ‘26 ethnic regions’. Ancestry.com claim to have‘more than 2 million people’ in their database and ‘the unique ability to connect with Ancestry’s billions of historical records and millions of family trees’. For further information on these tests and how they report see my 2015 blog on this site.

Discovering distant relatives’
The reference to Ancestry.com in the above paragraph was deliberate. On the one hand the ability to contact other members part of whose DNA is the same as one’s own is clearly attractive. This is exactly what I was referring to in the opening paragraphs of this blog. The difficulty is that Ancestry does not remind you to check that the information you receive from others is actually accurate. Many a false genealogical connection has been made through eagerness to get back as far as possible. What many people do not realise is that the written records on which genealogies are constructed can be missing for some areas of the world. Even in Scotland the records for the counties in the very north are missing for many localities before 1800 and almost universally before 1750. Wars and carelessness, as well as the wide dispersal of the populations in remote locations meant that children might well not ever be baptised, or if they were, it was done whenever the minister happened to be in the locality. However, it was the parish clerk’s job to keep the records, not the minister’s, and the parish clerk might be tending his cattle 20 or more miles away.  The same cautionary statement holds true of FtDNA’s Family Finder in that what appears in an imported GEDCOM file only represents the family researcher’s work and, as with all internet genealogy, needs to be checked for accuracy.



Fig 9: Screen grab from Family Finder proposed matches

In this screen grab from Family Finder, for the sake of privacy and data protection, I have removed the picture details of matches including the email of the individual whose family includes an individual related to my own family. The match is Charlotta Major but she is not an ancestor in my line, but her father Konrad born in 1797 is. This then is not an MtDNA link(and in any case the person who is my match has not tested this, nor, being female, could she test the Y chromosome), it is an autosomal link with a male line which is my mother’s great grandfather. I have, however, been unable to identify any links with the other individuals listed as matches.

Which company then?
As I said earlier the choice of company depends entirely on what question or questions you want answered:


Fig. 10: DNA testing company list

Bottom of Form

-->
I have not drawn out trees based on STR Y chromosome results this year [that is, those that appear in a chart for people in, for example, an R1b group as having a number sequence like 13, 24, 14, 10, 11,14 etc.] since these results are too diverse and complex when making a comparison between surname groups in the project which now has over 1200 participants, and sometimes even too variable within a surname project name subgroup [as for example Greer, Grier, Grierson in the MacGregor Project]. In short, there are now too many people in the project to do comparison charts that would have any real meaning. Also, the amount of detail would be far too great to permit any links to be seen. Because of this I repeat here my usual offer in relation to those who have tested their Y chromosome through STR tests. If you wish me to run a comparison with other participants, then please state the group or individuals with whom you wish to be compared and I will make a personalised graph for you and help you interpret the results. Please note though that it is only feasible to compare like with like (i.e. 67 markers with 67, 37 with 37). As usual my email address is richardmcgregor1ATyahoo.co.uk (substitute @ for AT). Please contact me offline also for advice on SNP test choices. Could members of the Ian Cam MacGregor group [the bloodline group] please note that the terminal SNP for the group is currently S690 and we do not yet have any ‘private’ SNPs to recommend, other than S696 and S698 which seem to be carried only by the Glencarnock line, and may have arisen in the last 250-300 years. Apart from the two known carriers of these SNPs other members of the Ian Cam group who have tested these SNPs have found them to be negative.

Article 0

$
0
0
MacGregor DNA Project  blog update 2018

This year’s blog will be slightly different from those done previously in that I want to spend a portion of it explaining how genetic genealogy helps us understand intra-clan relationships but in order to continue I need to say exactly what a Scottish clan is, as there are some particular misconceptions concerning the exact nature of clan in the 21stcentury.  At the end, I go on to discuss how SNPs are helping to build up a better picture of family groupings within the main line MacGregor family tree after the mid 14th century.

What is a clan in Scottish usage?
Six hundred years ago this question was quite simple to answer. You were associated with a clan if you had been born with the name – in the MacGregors’ case that might be expressed as Gregor, Grigor, MacGregor, McGrigor, McGregor and a whole range of alternative spellings such as, for example, McGreagor (an attempt to render the Gaelic phonetically into English?). At that time too spelling had not been standardised - so one might find Mckgregor, M’gregor and so on. You were also a member of the clan if your name was an accepted variant, such as Grierson, or Grier, Greig/Grieg/Grig etc. These were considered to be shortened or anglised versions of the main clan name. So, Grier-son equals Gregor-son and Grier is the same name without the ‘son’ on the end.
Whether or not these accepted names were genetically related to the main line was not the point, as a clan was a collection of related surnames. Members of the clan recognised as Chief the head of the main line (the Chief of the MacGregors for example), and often, especially in the early days relied on him for protection, or rather, on his ability to pull a ‘federation’ of individuals together to ensure, usually armed, protection, or, as a means of seeking retribution on another group for some offence.
There were others associated with individual clans. People who belonged to individual septs. Sometimes the same name would appear in two or more lists of accepted septs of different clans – such is the case with surname King, among others. Again, some descriptive words used as surnames were understood to have been borne by people associated with the clan, and these surnames are found in many clan lists: Bain (or Ban) or its anglicised equivalent White; Roy meaning Red; Dhu or Dow meaning dark or black, are some examples.
Finally, there were people who answered none of these ‘qualification’ but who lived on the land which was under the Chief’s influence as ‘part-takers’. Grant and Menzies rental documents of the 18th century reveal instances where individuals adopted the name of the local chief where formerly they were called only by their patronymics. A patronymic shows the genealogy of an individual back two generations – so my patronymic would be Richard McEwan VicPeter: there McEwan is not a surname but shows that my father was Ewan (not THAT Ewan). On a rental document you might find John McGregor VicPatrick which means that John’s father was Gregor and his grandfather Patrick: if John were a very poor inhabitant – a cottar – it might be that he had lost the knowledge that he was a genetic MacGregor and so ended up taking the surname Grant. It’s unusual to find this situation among MacGregors because of their turbulent history but it happens in other clans. Paradoxically because the MacGregor name was proscribed [forbidden] for so long [1603-60 and 1693 to 1774] it seems many families held on to the knowledge that they were MacGregors despite having been forced to take other surnames. Some families never changed back to MacGregor when it was finally possible to do so – which is why in the DNA project we see individuals called Drummond, Stirling, Campbell etc who are genetically MacGregors - their ancestors never readopted the name when it was safe to do so.

What is clan in the 21st century?
To some the concept of clan in the present century is an irrelevance. We no longer need the Chief to protect us and we no longer live in defined communities of inter-related individuals offering support to each other in the same way. In the 21st century a clan is made up of people who value the bonds of kinship often promoted by Clan Societies, and they value the, sometimes surprising, connections that clan association brings. At its gathering in 2014 the Clan Gregor Society of Scotlasnd had 11 nationalities represented, some of whom like the Philippine and South African contingents actually shared a recent common ancestor. Members of a clan still recognise the Chief as head of the clan, but, as I have explained, it is likely that relatively few share a common ancestor with him (or her). Most Clan Societies recognise that finding paper evidence of relationships becomes increasingly difficult before 1750 and therefore accept both female line connection (e.g. ‘great grandmother was a MacGregor’, or a King, or a Bain etc), or, that there is a tradition within a family of MacGregor clan connection.

What does genetic genealogy tell us?

1)    That people called by a certain surname are not necessarily descended from a common ancestor, although as we have found in the DNA project approximately 50% within a surname subgroup do share a common ancestor. (I will explain my subgrouping in a moment)
2)    That there are different groups of individuals called, say MacGregor, who share a common ancestor in the fairly recent past but the connection of one group of these MacGregors to each other is likely to lie thousands of years in the past and pre-surnames
3)    That there are multiple origins for surname groups
4)    That clan sept names have varied genetic origins and not just one single origin within the period of the existence of surnames (surnames become more common after the mid 14th century, particularly in England following the Black Death as there was more movement of labour to replace those lost to the plague)

The MacGregor DNA project at this date has 1427 members. In order to make the results easier to navigate I have divided them up into subgroups. The advantage to this is that within a subgroup it is easier to see individuals who are potentially related, in the more recent past, to each other, because they appear in the surname grid near to each other. It also shows up where there are multiple origins for surnames, and is particularly useful where some members of the subgroup are able to indicate an earliest known ancestor. In a good number of cases that ancestor will be shared with other members of the subgroup and some of the group may not have the genealogical information that others have and so find their link to the past. It can also suggest geographical location for the ancestor. The disadvantage is that members of different subgroups in the results grid cannot quickly compare their results with members of other subgroups, but in a Y chromosome study that is less of a problem. In any case everyone has access to a ‘Matches’ tab on their DNA results page and also everyone can use Ysearch and Mitosearch.org.
To further illustrate what can be done I will demonstrate with one subgroup. Regular readers of this blog know that I have offered to make any comparisons between individuals that are wanted. It would be helpful if this could normally be limited to 10-12 results for comparison. Also, please remember that I have to be able to compare like for like – so I cannot compare a 37 result with a 67 – I can only use the first 37 of the latter for the comparison.

Example 1: Greig/Grieg/Gregg etc surname 37 marker result grid

The first example shows all the 37 marker results for this subgroup. No colours are present in this grid to show mutations because we are not trying to compare individuals to one ‘master’ result or even to an ‘average’ or modal result. It can be hard without colours to see how one individual is related to another but in general if results are closer together on the grid they would tend to be more closely related genetically (but just being close on the grid does not necessarilypick up results that are more related than others). That is why we use a graphics program to make comparing results more visual in what is known as a phylogenetic tree. Also the program used picks up similarities across results that might not be immediately obvious to the eye.

Example 2: Greig/Grieg/Gregg etc surname 37 marker phylogenetic tree

The second example therefore shows the chart in graphic form, generated by Splitstree (acknowledgement is given at the bottom of this blog). I have labelled them as the chart is labelled so perhaps the easiest way to make comparisons is to print off both examples and cross-refer between them. In this graphic representation some relationships become more obvious but there also are some surprises. The first, and most important, point to make is that we are seeing at least 11 distinct family groups who each shared a common ancestor many thousands of years ago long before surnames became common. So we see at least 11 different genetic origins for people called Greig, Grigg, Gragg and Gregor. In general, if results are closely clustered together on the graphic then they probably share a recent common ancestor (and by recent I mean since the acquisition of surnames).

1)    There are 7 individuals (kit 20673 is one) who share a common ancestor who could be the William Gregg born in 1616 or his immediate forebears - an early emigrant to the New World. This family have no spelling variations – always Gregg – so may have been literate from the earliest emigrant
2)    There is a group of 3 (kit 214992 is one), who are related and may have a connection to Tipperary in Ireland: these families are Gregg or Gragg
3)    There is another group of 3 (kit 239449 is one) - these connect to a common ancestor but there is no indication of who this might be in the genealogies submitted – these used spellings Grieg, Greig and Gregg
4)    Another group of 4 (kit 158127 is one) who seem to connect to Antrim in Ireland (using Gragg and Gregg)
5)    There is a group of 3 (45360 is one) who connect to Edinburgh and Pathhead (a nearby village)(surnames are Greig and Gregg)
6)    There is a group of individuals who are genetically related and with geographical links to North East Scotland with very different versions of the name: Griggs, Greig and Grigor (9690 is one of these)
7)    A group of 2 (kit 6979 is one) using Gregg (no locations available)

All other individuals appear to belong to separate unrelated families although the distant possible connection between Gregor(y) 476609 and Charles Greig 585177 would be worth further investigation. Robert Gregor 239031 belongs to a completely different genetic haplogroup.

Example 3: Greig/Grieg/Gregg etc surname 67 marker grid

This example [Examples 3 and 4] shows the smaller group of individuals who have tested 67 markers (they are all in the 37 group discussed above). What we are interested in is whether the greater number of markers gives any further information on genetic connections. Since 50% of the sample shown in Example 1 and 2 is now not present, the graphic representation is much sparser.

Example 4: Greig/Grieg/Gregg etc surname 67 marker phylogenetic tree

The problem that is immediately apparent [in Example 4] is that without a larger number of individuals testing to 67 (or more markers) family groups do not break down significantly further.  In Example 4 the only possible confirmation seen is that Kits 7489 Gregg, 214992 Gragg and 9690 Greig may share a common ancestor in the relatively distant past but it is possible that all three share a geographical origin –  North East Scotland as suggested in point 6 above.
                                                            ***
By way of comparison I have used the same processes on the Gregory group who also have diverse origins, but what is particularly interesting with this group - given that membership of a DNA surname testing project is essentially based on random participation - is to see just how many individuals descend from the same ancestor: almost certainly, given that there are forebears in common (for example Gideon Gregory – kits 58711 and 179683) then this group probably had a common emigrant ancestor in the United States.

Example 5: – Gregory surname 37 marker phylogenetic tree
 Apart from that group of related individuals there are only 4 other, much smaller groups, whose individual ancestry lies close to each other – their ‘earliest known ancestor’ as given by each participant,  suggesting a range of possible genetic origins (see the grid on Example 6)

Example 6: Gregory surname 37 marker grid

Dean McGee’s DNA Utility allows an estimate of time to most recent common ancestor.

Example 7: Gregory surname 37 marker Time to Most Recent Common Ancestor grid (partial)

This is only an estimate and the number of years suggested always depends on the confidence level chosen for the program – choosing 100% confidence would give a different result from choosing 10% confidence. In this example from the Gregory charts we see an estimate of the possible time to the shared ancestor for the each individuals in the group with each other person. In order to see a good number of results I have had to remove the labels from the top grid but they can simply be put in by hand - going from left to right on the grid top line in the same order as reading top to bottom. Notice, for example, that comparing the first two individuals ‘Peter R. Gregory’ 275887 and Gregory 37140 suggests that they share a common ancestor 5220 years ago.

All of these analyses benefit greatly – and benefit other genealogists – if testers indicate the name of their earliest known male ancestor with the surname – no matter how recent that might be.

If we now look at the phylogenetic tree created when we use only those kits that have tested to 67 marker level the only real difference is that some of the genetic distances seems to be clearer.  

Example 8 Gregory DNA 67 marker grid
However, this particular program only separates by mutation – so if we look at the Gideon Gregory results again it looks like they come from different lines of the same family, rather than from the common ancestor Gideon. This is a limitation not of the program but of an ability to input into the program that two results come from the same ancestor. We have to remember that programs such as this were not designed primarily for family history but for comparing genetic markers in species and not just the human species. After all, it would be next to impossible to say whether two turtles shared the same great great grandfather …

The development of SNP analysis

For several years now there has been an increasing focus on the testing and analysis of SNPs (single nucleotide polymorphisms). The difference between these and the more commonly tested STRs was given in my 2014 blog (opening paragraphs). To put it simply, SNPs are markers in time: as far as is known if these mutate they stay mutated in subsequent generations. What that means is that once enough SNPs are identified an element of dating can be applied to when the mutation happened. For family historians this fact is becoming hugely important. Dating SNPs to the time before surnames is of limited use to family historians but to have dates, even approximate, from the time after the adoption of surnames means that family surname groups can be split down into smaller and more recent family subgroups.  On the Greig and Gregor grids I show the SNP information which is assigned to each individual in the leftmost column. In most cases this is simply M269, a SNP that happened thousands of years ago. Some individuals have had some SNP testing done but very few people have had the ‘Big Y’ test done which takes results forward in time towards the present day, and identifies SNPs which may have happened between 500 and 800 years ago.

Greig SNPs
In the Greig grid confirmed SNPs are in green. M269 is too early in date to be considered so the only other SNPs to be taken into account are kit 476609 R-L066; 363402 R-FGC10125; 259416 R-U152; 585177 R-Z253; 195430 R-U106; 295321 R-FGC5494; 404866 R-FGC37100; 110496 R-L21; B196295 R-ZP77; and 239031 T-M70

Of these U152, R-Z253, R-U106, R-L21 are well known early SNPs which happened before surnames, sometimes by thousands of years and most have further testing options available to bring the results further forward in time. FGC in the results indicates that the SNP was identified by the Full Genome Corp (as indeed the other letters identify the source lab or individual who identified the SNP in question in the first place). Of the other SNPs:

L1066 is more recent but still before surnames.

is the next SNP in the sequence for some individuals after L1065 [not the same as L1066] which is said to identify the Scots modal group
[see http://www.ytree.net/DisplayTree.php?blockID=160] Since L1065 roughly dates to 1750 years before the present, FGC10125 may have happened before surnames.

FGC5494 is European in origin but again is somewhat earlier than surnames, and has SNPs which descend from it towards the present time

FGC37100 is a descendant, or technically, ‘downstream’ of L151 – that SNP is again an much earlier one and found in England as well as other places.

ZP77 is the same as FGC6562 and is found in concentrations in Ireland and to a lesser degree in Scotland: it also has numerous downstream markers

Finally T-M70 is an very early SNP with a distribution over southern Europe, the Middle East and East Africa. It is comparatively rare among tested individuals [see www.yfull.com/branch-info/T-M70/which dates it to 16,000 years before the present].

Similar discussion on identified SNPs could be done for all the subgroups in the MacGregor DNA project. The Gregory group, for example, has the following identified SNPs (see example 6):
R-S16906; DF21; R-CTS7678; L48; Z343; R-P312; R-L1336; R-BY15955;R-S691

MacGregor DNA – current SNP analysis

The work which Neil McGregor in Australia has been doing in analysing MacGregor SNPs is not concerned with the earlier SNPs. We already knew from Jim Wilson’s work that most MacGregors in the main line group carried the SNPs S690+ and S697+, both probably dating from after 1200AD (though no absolute dating is yet available). In his analyses Neil has begun to break down the test results of MacGregor participants into individual family groups – which the clan has known about for generations and which are referred to in older documents as the ‘houses’ or ‘sleik’ [of] Clan Gregor, the main ‘houses’ being Glenstrae, Roro, Gregor McIan (or Brackley) Dugall Keir and more recently ‘of Glencarnock’ the Chief’s line (Glencarnock is the area they held from the mid 18th century).

Neil’s current identification of family groups is given in Example 9:

Example 9: current predictions of SNPs associated with MacGregor family groups.

Neil’s email to me allowing this to be included in the blog suggests that MacGregors from the main line should seriously now consider doing the BigY DNA test (rather than FGC – Full Genome Corps – with whom we have also undertaken testing). He says:

“The best recommendation is that people get BigY as everybody seems to have between 3 and 8 separate SNPs which will allow them to be separated from everybody [else], other than from their own immediate family or first cousins. Some of them [those who have tested under BigY] appear to have a cluster of SNPs which appear to have mutated together and may represent one mutation. A mutation seems to be as low as once per generation through to once every 4-5 generations – seems related to the number of STR [the marker scores than participants start with] mutations as well.

The clan seems to be divided into two major clusters and this would appear to be early on. The section I am in has at least 3-5 sub-branches as does the other major group. The dividing SNP appears to be BY28714”.

Just to repeat that I can do comparisons of STR results for individuals – comparing with up to 10 to 12 others. I would repeat Neil’s encouragement to do BigY if you can – please ask me for further information if needed [richardmcgregor1ATyahoo.co.uk substituting @ for AT]

I have just had a comment from EMC which is worth repeating here in case folks miss it:
It is important to note that the results Neil has recently shown are also due to FTDNA reprocessing BigY kits under a new genome reference called HG38. Prior, under HG19, many of the SNP's used now were heretofore unknown. This SNP refinement is important.

Charts were constructed using Dee McGee’s Utility at http://www.mymcgee.com/tools/yutility.html?mode=ftdna_mode, using a 75% level of confidence, on Doug MacDonald’s mutation rate, an average of 30 years per generation and with no modal results assigned. The graphic representations of phylogenetic trees are made by Splitstree:

-->
D. H. Huson and D. Bryant, Application of Phylogenetic Networks in Evolutionary Studies, Mol. Biol. Evol., 23(2):254-267, 2006

MacGregor DNA Project Blog 2019

$
0
0

(Note to enlarge images simply click on them, and to dismiss and return to text click the X in the top right-hand corner)

Welcome to my MacGregor Project DNA blog for 2019. This year I am going to focus in part on some of the problems presented by DNA testing for genealogy and, in some cases, suggest not exactly what the solutions might be, but rather possible way(s) forward.  I will be dealing with, for the most part, homogeneous groups from the MacGregor Project: all of these groups contain some MacGregor surname participants, participants with related or sept-based surnames, as well as those whose surnames have no currently known MacGregor connection, other than perhaps by tradition. It is important to understand the difference between a related surname and a sept of a clan and I have explored this issue in last year’s blog, so I won’t repeat it here. As for the processes of DNA testing for genealogy the current Wikipedia entry is very helpful, and it can be accessed at:
     I hope it is not too simplistic to say that one of the key ways in which DNA testing for genealogy will improve in the future is by more people doing higher level tests. This is certainly true for the male Y chromosome (only males have the Y) since the potential of the Big Y test is still relatively unexplored, and to have more participants at this level of testing is essential for the future. Having said that I must here give great credit to all those who are working to develop analyses and interpretations including, for the MacGregor project, Professor Neil McGregor in Australia. If you have already tested Big Y then you should send your results to Neil (neilmcgregorATtpg.com.au, substituting @ for AT) if you are from a MacGregor (any spelling line) or where the DNA evidence suggests that your surname was once MacGregor but changed as a result of Proscription (banning of the name from 1603-60 and 1692-1774). In addition, you should send your results to Alex Williamson for inclusion in The Big Tree (see https://www.ytree.net/). This tree is only for branches under the SNP R-P312 and therefore not applicable to other groups such as the I groups which are connected with the Scandinavian countries (and by implication, Vikings). However, Alex’s tree does deal with results which account for at least 70% of British and Irish DNA, if not more. You can join the discussion of this at http://mcewanjc.org/scotsr1b.htm. You will also find helpful pictorial analyses of population densities at https://www.eupedia.com/genetics/britain_ireland_dna.shtml.
      Those of you most interested in the information on MacGregor SNPs should skip to the end of this article where I discuss what progress has been made (and will be updated later in the year). I want to begin by dealing with two specific groups. In the analyses that follow you will need to have your kit number to hand as I am not labelling the charts with surnames(with two exceptions) but will mention surnames in the text of my analysis. You can easily refer to the surnames for each kit number by looking at the relevant subgroup in the project: www.familytreedna.com/public/macgregor.  The first two groups are what I have termed Viking 1 and Viking 2: these are separate genetic lines generally associated with the area including Denmark (Viking 1) and Norway (Viking 2). I should point out that I have only done analyses at the level of 37, 67, or 111 markers and have had to exclude 12 and 25 markers as these are too general and not specific enough to generate discrete results.

The Viking Y chromosome groups

 In Viking 1 group at the level of 37 marker testing we see a common point of origin (typified by the star shape) in an individual who probably bore the SNP I-M253 (Figure 1):

Fig. 1: Viking 1 group at 37 markers Y chromosome
From that individual there have come many branches, some of which split to show connection to a common ancestor more recent than the one that bore I-253. In general, the closer together and further up a line the splits come suggests more recent shared ancestry – but it does not mean that everyone has the same surname.  Compare this with the 67-marker chart Figure 2), which has less lines since those who only did 37 marker testing have dropped off. 67 markers make links much more obvious especially for splits at the ends of lines.
Fig. 2: Viking 1 group at 67 markers Y chromosome
In the Figure the three results which include kit 101344, 2 are surnamed Skinner and therefore closely related, but one is Young – it looks like one or other surname was adopted quite late.  At this point I should mention that the programmes I use (Dean McGee’s DNA Comparison Utility and Splitstree [details at the end]) group individuals by common characteristics that are not always evident when looking at the number sequences on the Project subgroup grid. In other words, if there is a mutation or two earlier in the number sequence (particularly in the first 25) the project grid at FtDNA will change the position of the results on the grid relative to others who might be more closely related, whereas, the Splitstree charts tend to ignore smaller variations and early changes in number of mutations. As a second example, the group above which includes kit 377721 are all related having the surname White (however spelt). The genealogies for these three kits connect Scotland, NE England and Kent. N73491 has the surname Kellogg associated with Essex (across the river Thames from Kent). If this is accurate then one interpretation is that there were two Viking brothers who went separate ways, one to north Britain one south (or both south then one went north!) – but that really is speculation.
       Remembering that DNA testing is entirely random as to who actually decides to test it is interesting to note that the Viking 2 group (Norway) is less represented than Viking 1. This could be explained by geography, in that the Danes primarily colonised the east side of Britain while the Norse went to the north and west (the Scottish Islands, north Wales, East Ireland) where populations were relatively smaller.When we look at the 37-marker chart for the Norse group (Figure 3) we see that there are some very clear groupings related to families. At the 67-marker level (Figure 4) some of the results have dropped off since participants only did 37 markers. The radiating lines are longer (greater time depth) and what we see may represent two individuals rather than one as the origin in prehistoric times.


Fig. 3: Viking 2 (Norse) group 37 markers Y chromosome
Fig. 4: Viking 2 (Norse) group 67 markers Y chromosome
Interestingly there are the same number of more recent family groupings for less participants than the Viking 1 group. There is a perhaps a suggestion here of geographical shared origin. The group of 4 results which include kit 22659 are consistently named King whereas the group which contains kit 45658 has five different surnames apparently more closely related genetically: McGregor, McLean, McClister, Mills and White.
     There is one question which participants with Scottish ancestry who find they have a Viking Y chromosome ask and that is ‘am I a Viking?’  The answer to this would be yes ONLY if the most part of your DNA could be tracked back to the Scandinavian countries. As it is most Scots whose ancestors stayed in Scotland into the 20thcentury would find that their male ancestors were largely from haplogroup R1b (related to Scots modal) and not I (Viking). It was only that perhaps a thousand years ago that a Viking Y chromosome was ‘inserted’ into the predominantly Celtic clan environment but then the families who descended from him would continue living and reproducing in a Celtic/clan environment. Other parts of the DNA – the autosomal which I discuss further under the Family Finder section – show comparatively few genes which have ‘Viking’ connections. Yes, there are some, but most indigenous Scots’ DNA is ‘Celtic’. In a subsequent blog I will discuss the thorny questions of ‘Who were the Scots? ‘versus ‘Who were the Picts?’ and can we tell the difference as certain authors suggest we can (and they say that some clans are Pictish)? In the end it is a similar question to ‘Am I a Viking?’ but probably needed DNA testing aa thousand years ago1
   
The Irish-Related group (note spelling both Grier and Greer to show there is no distinction)

If we turn now to the group which I have called Irish Related we find another star shape at 37 markers (Figure 5) which is preserved at 67 markers (Figure 6) suggesting descent from one individual (who carried SNP R-M222), although this is less obvious when looking at 111 markers (Figure 7). The group which contains kit 862 is almost exclusively surnamed Greer, however spelt, and this group should also contain kits 333215 and N225557 (I will move these from where they are in the Scottish Irish Grier group in due course).

Fig. 5: Irish Related group 37 markers Y chromosome

Fig. 6: Irish Related group 67 markers Y chromosome

Fig 7. Irish related group 111 markers Y chromosome
I have deliberately included all three Irish Related charts to show that while we can see relationships at the 37-marker level and to a lesser extent 67 (without the 37 only participants), when we reach the number that have tested at 111 markers the relationships detail and closeness disappears – only the group with kit 862 still has 3 related individuals – all the other close matches have disappeared.   We see that lines have lengthened, and this gives us a better understanding of how far back in time the family split happened. If we look again at the group containing 862, it appears like this may represent two brothers or even three brothers diverging at a point: this would need to be checked with genealogies if they exist. I believe this split may have happened in the United States. When we look at the 111 marker group and include a chart which estimates the number of years to the Common Ancestor (using 75% probability on FtDNA’s stated mutation rate in Figure 8) we see that many of these individuals can trace a relationship back in time to after the beginning of the Second Millennium (1000AD or 1918BP – before present) which is interesting given the fact that this represents different surnames it is not a single surname grid.
Fig. 8: Irish Related group – 111 markers Time to Most Recent Common Ancestor
At this point I should mention that there three other distinct and unrelated groups of Grier/Greer/Grierson.  One group of these Greer groups I have termed Scottish Irish could be from Northern Ireland or South West Scotland. The second group has Viking genetic heritage (I have called the group Grier Viking). This latter group descend from 3 or at most 4 separate individuals whereas the Scottish/Irish Griers (Figure 9) descend from more individuals, possibly as many as seven (there could well be more in reality, but it is quite a small test group).
Fig. 9: Grier Scottish Irish group 37 markers Y chromosome
In a case like this, in order to sort out the different family groups we need participants to undertake SNP testing – preferably Big Y (since testing one SNP at a time can easily soon mount up in cost and is a much more hit and miss approach). We do have an example of a SNP which has started to define different family groups, often tied to surname – although as a SNP it dates to before the adoption of surnames in Scotland. SNP L1065 has been called the Scots Modal.

L1065 SNP – the Scots Modal

If we apply the Splitstree chart making program to those who have tested SNPs and who show that they all descend from the individual who had the L1065 SNP, we see again a star shape suggesting a common origin [YFull dates this SNP origin point to approximately 1750 years ago] but there are many separate lines emanating from it with few shared numerical sequences.  This chart (Figure does NOT include any of the MacGregor related participant surnames that have tested L1065, nor any of the other smaller surname groups within the project - it only draws out those identified in the L1065+, S691+ and S695+ MacGregor Project groups.  At the end of this Blog you will find all the surnames in the MacGregor Project who have the L1065+ SNP. My point here is to show just how diverse the descent was from L1065 using just a few surnames, Alex Williamson’s Big Tree shows just how many individuals with that SNP there are, (or rather those who have actually tested – there must be many many more), and how many surnames it led to (including the main MacGregor line). However, the key factor is that it is an SNP which originated in Scotland and most of those who have tested FtDNA’s Big Y or YFull’s DNA testing and now appear on Alex’s tree show a genealogical origin in Scotland. See: https://www.ytree.net/DisplayTree.php?blockID=160.


Fig. 10: L1065 positive results in the MacGregor project (only groups 1065+, S691+ and S695+)
Two short specific cases: White; McGhie

What we see again and again in DNA genealogy is that there are multiple origins for surnames especially those which are descriptive. If we examine the chart (Figure 11) generated by those participants who have the surname White we see multiple origins for the name. The significance of White as related to Clan Gregor is that it was a name associated with the clan (sometimes as an alias) and indeed many other clans. It is the English version of Ban or Bain and indeed there is one participant with that name who is descended from Gregor the name father of the clan. All those with the name White are part of Clan Gregor if they wish to be so the genetic connection is, to a degree, irrelevant.
Fig. 11: Diverse genetic origins of surname White
A different result is seen with the McGhies (however spelt). A large cluster of individuals, almost certainly all from the original emigrant to the United States is found to the right of Figure 12 while there are two other, unrelated and distant families, of McGee and Magee on the left. This chart does not include Magees (however spelt) who are assigned to the Irish related group (and would show as another tangential line on this Figure.  The connection of the MackGehees, who have a tradition of change of name from MacGregor on arrival in America, is not clear. The reference to Iain Dubh (MacGregor) is a supposition.
Fig. 12: McGhie/MacGehee/McGee at 37 markers
I have concentrated on STR data, especially in Figure 11 above because at the moment the best source of SNP analysis is the previously mentioned Alex Williamson The Big Tree and to show it can be used to expand on STR results there is a good example which elaborates two of the McGhie (however spelt) results located on the right-hand group. The relevant Big Tree Block is:  https://www.ytree.net/DisplayTree.php?blockID=16. The two SNP results which relate to the McGhie results are for Thomas_Ma1 and William_M1 in Figure 12 which lie within the large group. Both individuals have the SNP R-BY172925 which is a descendant of SNP Z255, sometimes referred to as the Irish Sea Haplotype from its general geographical distribution. The DNA shows that this split occurred much earlier than that which led to the cots Modal L1065+ having occurred in prehistoric time.

The current problem in SNP testing with respect to families

SNP testing IS the way forward for DNA genealogy but there is a particular issue associated with confidently assigning individual to family groups and especially with the MacGregor ‘houses’ or discrete families (such as Roro etc). The problem resides with SNPs that are ‘no shows’ – in other words have not given a result in the testing. In a recent communication on this Professor Neil McGregor elaborated on the essential problem this brings. When you consider that the Big Y DNA test looks at about 2,000,000 SNPs [out of approximately 10,000,000 in humans] which is allied to an extensive STR analysis then any missing key data [that is, ‘no reads’] is a problem. Rather than summarise I give an edited version of Neil’s email to me:

I certainly can separate every individual within the group. Every person has at least 2-4 unique SNPs, but many have common SNPs with a small sub cluster of people, thereby allocating them to a subfamily.  I only use the SNPs that have high levels of accuracy to do this, but this leaves me with a significant quandary. There are exactly 100 SNPS which have higher levels of no reads which should be capable of subdividing the branches more accurately. Of these ~20 are critical in dividing the clan participants into the final family clusters and sub clusters. I will be contacting familytreedna to request they address this issue in their data analysis.

The second major problem is that if you wish to compare the BIGY data you have to redownload every individual each time [you do it] as familytreedna keep adding SNPs and they also allocate some of the unique SNPs a new name (e.g BY1123252) and they drop the old position number so that prevents one from easily comparing the changes.

The third major problem is working out who has a back mutation of a SNP which tends to make you place them in one group until one sees the whole dataset. They may have one match with two separate individuals but have multiple matches with others who do not match the common one. I personally have one of those in my data. I do not believe I will be able to separate them until the ‘no read’ SNP data can be clarified better.”

When Neil checked his own genetic data using STR and SNP data this is what he found: “My nearest match in the SNP data was allocated as a distant cousin in the STR data 450 years using 111 STR data and 630 years to common ancestor using 37 [marker] data”.
    In summary, any allocations to individual families within a surname subgroup – especially the MacGregors – needs to be provisional until the ‘no read’ data issue is resolved since these currently unavailable SNPs could well change the allocation of an individual from one family subgroup to another.  An update for the MacGregor SNP family groupings will be added here later when Neil is happy that what is published here is as good as it could be at this stage, but, it will be provisionalpending clarifications as outlined above. It is quite likely that we will eventually suggest specific SNPs that individuals should have tested but we are not yet at that point.

The scope of Family Finder (or Ancestry DNA or similar test)

I have been asked several times “why do you not do a surname study grouping for Family Finder?”. The simple answer it simply isn’t feasible. Consider this scenario: in six generations every individual could have 64 different ancestors with 64 different surnames. Take 100 individuals: then you might have to include 64,000 different surnames. Take 1000 individuals and you could have up to 640,000 different surnames.  What you then need is a database that compares results with all the other results in a project and that is exactly what the various companies’ databases do in order to find individuals whose genetic signature is similar in some places to yours. The more similar the DNA the more likely it is that the second person is recently related. Then, each company estimates what the actual relationship might be.  The first problem is – very few people can confidently say who every one of their 64 ancestors was actually called as a surname (especially the earlier females). The second problem is - to make a comparison you need to have uploaded your family tree so surnames can be compared with any other person. It really baffles me that someone would do the Family Finder test and not put in their ancestral names at least. So, you might have a good match, but your match has inserted no family detail at all – how can you compare and estimate the connection? The third problem is – you write to the email address and get no reply: did the person see the mail and choose not to answer; did the mail go into their Junk or Spam folder and they never saw it; is the email address even current and has never been updated?; did the person do the test but has never done any genealogy – it just seemed like a good idea? 
    What I am saying is that Family Finder requires active genealogical and communicative information from all who do that test. The material that others need in order to compare with you can only be there if you input it. You don’t need a Project Admin to do this for you – in fact, only you know your genealogy, a Project Admin almost always has no specific information on your family tree. Finally, even if you find genealogy information inputted always remember that it may be full of assumptions and can have inaccurate family links, which means you have to check the accuracy for yourself. Finally, remember that the predictions that are offered by the testing company on possible relations are exactly that, predictions, not facts.

Mitochondrial DNA

If you have taken an MtDNA test with Family Tree DNA and are in the MacGregor Project, you will find from the Project menu that you can look at the female haplogroup groupings by selecting to view the mitochondrial results. You will notice that I have assigned everyone to their appropriate alphabetical group. However mitochondrial groups have subdivisions, so, haplogroup J, for example, has many subdivisions (not as many, by far, as haplogroup H). If your subgroup is, say, J1a1b1a then you will most closely match someone with the same configuration. You should then look at the column with the DNA results which will show if you have an exact match or if there are some different mutations. Mutations suggest an ancestor further back (to give time for mutations to have taken place) and it may be hard to find the exact common ancestor since very few genealogies have all the female surnames back much further than the mid eighteenth century (if you are lucky!). You may be able to tie back to a geographical location rather than to an individual.  Remember that MtDNA is passed from a mother to her children but only daughters can pass it on. In that sense it works like a bit like the Y chromosome, except that surnames change with every generation and it is thus comparatively rare to be able trace back to an ancestor in the past (though it was done in the case of Richard III – whose bones were dug up in a Leicester car park and which were identified by a mitochondrial descent to the present day).

Finally

As always you can ask me to generate limited charts with your results along with others which look potentially related. If you could keep the number of individual results to about a dozen per query that would be helpful. Also, don’t forget that this is for comparison of STR data not SNPand better results are obtained when comparing larger numbers of markers.

Again, I would repeat Neil’s encouragement to do BigY if you can – please ask me for further information if needed [richardmcgregor1ATyahoo.co.uk substituting @ for AT]

Charts were constructed using Dee McGee’s Utility at 
http://www.mymcgee.com/tools/yutility.html?mode=ftdna_mode, using a 75% level of confidence, on Family Tree DNA’s mutation rate, an average of 30 years per generation and with no modal results assigned (except where indicated). The graphic representations of phylogenetic trees are made by Splitstree:
D. H. Huson and D. Bryant, Application of Phylogenetic Networks in Evolutionary Studies,Mol. Biol. Evol., 23(2):254-267, 2006.

Surname participants in the MacGregor project who have SNP L1065+ – not everyone with the surname will have L1065+: MacGregor; Anderson; Bain; Lawrie; McLachlan; Tuttle; McFarlane/McFarlane; Buchanan; Miller; Laird; McDaniel; Adams; Duncan; Simpson; Stewart; Watkins; Davis; Gregory; King; Moore; Cain; Grier; Jamieson; Murchison; Cameron; Griffin; Napier; Doran; Bissett; Gillis; Ferguson; Allen; McViccar; McAfee; BradyPierce; Kincaid; (Mc)Whannell; Hanby; Brown; More-Gordon; Henderson; Laird; Eunson; Colborne; Peden; Robertson; Scott; Looper.






















-->-->-->-->-->-->-->-->-->

MacGregor Project DNA Blog 2020 (number 20)

$
0
0



Welcome to the MacGregor Project DNA blog for 2020. This year I am concentrating on two particular subjects: 1) the search for the genetic signature of Rob Roy MacGregor and 2) Grier DNA  (including Grierson, Greer, McGreer and various other spellings).

 Note to enlarge images simply click on them, and to dismiss and return to text click the X in the top right-hand corner - back arrow works on a phone). In the analyses that follow you will need to have your kit number to hand as I am not labelling the charts with surnames.

When the DNA project for MacGregors began back in 2001 one of the key objectives was to find out how many individuals still carried surnames that were adopted because of the proscription [outlawing] of the clan name which took place from 1603-60 and again from 1693 to 1774. About six years later we started to look for evidence of particular family groupings which were known as far back as the later Medieval period, based on some characteristic of that particular family – whether it was where they came from, or, based on a family connection, or a description. So the grouping defined as Roro or Glengyle were based on places (as was later Glencarnock). There was also those MacGregors who were connected to an individual known as Gregor McIan  and this family were sometimes later referred to as ‘of Brackley’. Gregor McIan was a patronymic (for definition see later), whereas Brackley was a place in Glenorchy. All these families were understood to have originated in Glenorchy/Glenstrae in Argyllshire even if by the mid-16th century they had spread out into Dunbartonshire, Stirlingshire and Perthshire.
     The MacGregors of Glengyle were not originally designated by their place of origin but by their connection to an earlier ancestor known as Dougal Ciar. To quote from Peter Lawrie’s ‘glendiscovery’ website http://www.glendiscovery.com/amelia-vol2-chap18.html): ‘this Dougal was distinguished as Dougal Ciar from some peculiarity, probably grey or mouse-coloured hair and eyes. It was universally believed by the rest of the Clan that the House ranked fourth or fifth in point of seniority… They were a very turbulent race and seem generally to have acted independently’.
     It was natural that the search for MacGregor identities should at some point focus on one of our most famous characters, the well-known Rob Roy MacGregor who was born in 1671 and died in 1734.
     Rob Roy became particularly well-known after Sir Walter Scott wrote his novel of that name in 1817 and, a bit like spin-offs of the present day, there was quite a lot of hype round the novel, as well as derivatives such as the stage play created in 1818 (not to mention Berlioz’s Overture Rob Roy of 1831). It was not long before every MacGregor in Scotland it seemed (and beyond) assumed that they were descended from Rob Roy, through one of his male children – especially those of James More who reportedly had 14. It didn’t seem to matter that all these families came into existence in about three generations.
     In more recent times Dr John Ward, who died not long ago, made a mammoth collection of individuals’ family trees which purported to have their origins in Rob Roy. Many of these he was able to speculatively relate together in one large tree, but there were an equally large number that he simply listed in another file as Rob Roy connected. Rob Roy is buried in Balquhidder church graveyard (despite what may be claimed on the information board) but there is, or has been, no real progress in identifying positively a male individual who is definitely a descendent of Rob Roy himself. This is partly to do with the fact that parish registers are often deficient, and entries were not recorded, particularly if the birth took place at some distance from the parish church.
     Some of the claims for descent from Rob Roy are through the female line which can’t really be tested effectively, and some claims which have come through Y-chromosome DNA, but unfortunately haven’t held up genetically – assuming of course that Rob Roy was indeed descended from the Argyllshire MacGregors.
During the last year however a more credible descent has been tested.  First using a 37 marker test to ensure that the signature was what was expected from an Argyll MacGregor descendant, and then, when this proved positive, the Clan Gregor Society paid for an upgrade to Y700, formerly known as Big Y.
     The family tree begins:


What is interesting about this tree is that there are just two links for which the evidence is not completely certain and the first is verifying that the James born 1755, is the same one who married Mary Ferguson and subsequently had Donald who married Elizabeth Stewart. Such proofs are not straightforward in Scottish records when there were several McGregor children in the Balquhidder/Strathyre area with the same name. A list of those who signed the Chief’s ‘recognition’ in 1787 has three males called James any one of whom might be  married to Mary Ferguson.  However, there is also a list of children in the parish of Balquhidder. The archive handlist dates this to 1830 but it cannot be that date and comparison with the list of 1787 shows that it is from that date (and demonstrates that the list of those who recognised the Chief in 1787 were all males from a family regardless of age).
     We know from parish registers that the James who married Mary Ferguson had a male child Duncan in 1785 and that at the time he was living in Ruskachan.  The list of children  [Stirling Archive PD60/651] has the following entries:

Dougal Keers Family

Duncan in Minachallwaroo
         John           1
         Robert                 2

Patrick in Blarerioch
         Malcom     1

Dougal in woodend Stronslany
         John           1
         Gregor                 2
         Donald                3
         Malcom     4

Gregor in Gartnafuaran
         Dougal                 1
         Gregor                 2

Duncan in Stronair
         Malcom     1
         Alexr          2
         Robert                 3
         Allan          4

John in Criganmore
         Gregor                 1
         Duncan      2
         James                  3
Twins         John and Donald          2

Donld         Coock in Ruskachan
         Dougal                 1

James in Ruskachan
         Duncan      1

The last named James must be the same individual. The significance of this part of the list is that James is identified as being of the Dougal Ciar family which is the family to which Rob Roy MacGregor belonged.
     So, we know that this James had a second son Donald in 1787 and therefore the last link to be proved is whether Donald who married Elizabeth Stewart is the same Donald who married Elizabeth Stewart [or Stuart] as the death certificate of one of the family shows. There is only one existing parish register marriage entry for a Donald and Elizabeth Stewart – and that is in Port of Mentieth. However, in 1804 Donald would have been just 17, which is young for a marriage in Scotland but not impossible although the parish register does not suggest that either party was a minor. He could have misrepresented his age and said he was 21 – the marriage took place in Port of Mentieth which is a parish  some 20 miles or more from Balquhidder. If Donald did then in fact move to Ayrshire immediately after his marriage and the children were born there, then he is the Donald who appears in the 1841 census aged 60 [1841 census rounded ages down] with daughter Janet [age 30] and son Donald [age 15].  Donald declares he was not born in Ayrshire but, since he was deceased by the 1851 census, confirmation of a Perthshire birthplace is not possible. If Donald is the son of James his age should be 54 [rounded down to 50] but it is possible that he continued to falsify his actual age. However, just to add to the confusion the death certificate of Donald’s daughter Catherine [Wilkes] says that her mother’s name was Janet Stewart, but the Muirkirk parish register suggests that this death informant was wrong.
     This long genealogical diversion was a necessary preamble to considering the results of the Y700 DNA test which has been done for this line. The chart below, prepared by Prof. Neil McGregor in Australia shows the result in relation to other individuals who have tested to Y700 [result are anonymised to conform to European data protection regulations and FtDNA requirements].




Fig. 1: Argyllshire MacGregors Y700 results analysed by Prof. Neil McGregor

There are other claimants for Glengyle family heritage but as with kit IN61406 there are gaps in the parish record and more or less at the same time – the end of the 18th and the beginning of the nineteenth centuries. For many years the Vice Chairman of the Clan Gregor Society, Peter Lawrie, has been trying to prove that his mother’s ancestor, Duncan, who died in 1826, having been a sailor, is the same Duncan as is stated in this extract from a genealogy of part of the Glengyle family:

Gregor McGregor  of Glengyle died  21 Aug 1777 age 88
sons     John
             Donald /Daniel  shipmaster Glasgow dsp 24 Sep 1791

              Robert of  Corarklet born 1743 m. Isobell daughter of John Graham of Drunkie
Son Donald  alive 1775
              Duncan (McGregor or Graham), sailor,  Glasgow,  served  heir 1793 to uncle     
                 Donald
              
In the chart above this line is represented by kit IN13885. The problem is that the nearest matching kit 133637 has a genealogical tree which is generally believed to go back to the MacGregors of Roro.
     So, in both cases there is one piece of genealogical information missing that is crucial to prove a definitive connection to Glengyle and would allow us to state absolutely that this was the Rob Roy’s genetic signature. Given how many families believed from Victorian times onward that they were related to Rob Roy it is ironic that just a hundred and thirty years later it is proving so difficult to assert with the same confidence as was shown in 1890 when the railings were placed round Rob Roy’s grave and the report stated that in attendance was  “Mr Norman Macgregor, Lloyds, London direct descendant of Rob Roy”.
The chart above links individuals together with shared DNA characteristics and the same relationships can be expressed as a box tree as it is on the FamilyTreeDNA website which they have based on the original work by Alex Williamson for “The Big Tree”.


Fig. 2: the Argyllshire MacGregor family groups from Y700 – FtDNA chart

The darker outline is the group within which kit IN61406 currently fits. At the moment this is our best candidate for Rob Roy’s Y chromosome signature, but as can be seen the evidence is still incomplete and whether or not we have two separate Glengyle tree claimants of which one is correct and one not, or in fact neither is correct, is still an open question!

         Clan surnames Grier (Greer, McGreer, Grierson): DNA results

Before discussing this specific surname, I thought it would be helpful to repeat the section on clan names and septs from my 2018 blog as this is an area of Scottish clan history which is not really understood well.
     What is a clan? Six hundred years ago this question was quite simple to answer. You were associated with a clan if you had been born with the name – in the MacGregors’ case that might be expressed as Gregor, Grigor, MacGregor, McGrigor, McGregor and a whole range of alternative spellings such as, for example, McGreagor (an attempt to render the Gaelic phonetically into English?). At that time too spelling had not been standardised - so one might find Mckgregor, M’gregor and so on. You were also a member of the clan if your name was an accepted variant, such as Grierson, or Grier, Greig/Grieg/Grig etc. These were considered to be shortened or anglicised versions of the main clan name. So, Grier-son equals Gregor-son and Grier is the same name without the ‘son’ on the end.
Whether or not these accepted names were genetically related to the main line was not the point, since a clan was a collection of related surnames. Members of the clan recognised as Chief the head of the main line (the Chief of the MacGregors for example), and often, especially in the early days relied on him for protection, or rather, on his ability to pull a ‘federation’ of individuals together to ensure, usually armed, protection, or, as a means of seeking retribution on another group for some offence.
     There were others associated with the clan whose names were accepted as belonging to septs of the clan. Sometimes the same name would appear several lists of accepted septs for different clans – such is the case for the surname King, for example. As well, some descriptive words used as surnames were understood to have been borne by people associated with the clan, and such descriptive surnames are found in many clan lists: Bain (or Ban) or its anglicised equivalent White; Roy meaning Red; Dhu or Dow meaning dark or black, are some examples.
     Finally, there were people who answered none of these ‘qualifications’ but who lived as ‘part-takers’ on the land which was under the Chief’s influence. Grant and Menzies rental documents of the 18th century reveal instances where individuals adopted the name of the local chief where formerly they were called only by their patronymics. A patronymic shows the genealogy of an individual back two generations – so my patronymic would be Richard McEwan VicPeter: here McEwan is not a surname but shows that my father was Ewan (not THAT Ewan). On a rental document you might find John McGregor VicPatrick which means that John’s father was Gregor and his grandfather Patrick: if John were a very poor inhabitant – a cottar – it might be that his family had lost the knowledge that they were genetic MacGregors and so they ended up taking the surname Grant. It’s unusual to find this situation among MacGregors because of their turbulent history but it happens in other clans. Paradoxically, even though the MacGregor name was proscribed [forbidden] for so long [1603-60 and 1693 to 1774] many families held on to the knowledge that they were MacGregors despite having been forced to take other surnames. Some families never changed back to MacGregor when it was finally possible to do so – which is why in the DNA project we see individuals called Drummond, Stirling, Campbell etc who are genetically MacGregors - their ancestors never readopted the name when it was safe to do so.

   Taking the Grier family as an example this year (and including all variant spellings, such as Grierson and McGreer etc.) we will first look at the 37 marker results. The number of individuals who have tested at that level  [37 markers] is over 60, but the number of results will decrease as we look at the results for 67, and then further decrease for 111 markers.



Fig 3: Grier (Greer, Grierson etc) 37 marker results

What is immediately clear from this is that there are at least 4, if not 6 or 7  different origins for the surname. A recent Facebook post which derived its information from an e-book ‘The Tribe Within’, states:
    “The Greer name is found in the British Isles, but its origin according to DNA is from the north-west coast of the Emerald Island. The Greer story [dominated by DNA tribal marker (haplogroup) R1b-L513, Subgroup A1] can trace their origins to the Finn Valley in Donegal, Ireland from 50 BCE. Perhaps the journey begins with the Clanna Dedad; Deda, son of Sen or Deda Mac Sin. The Greer surname origin is from a Northern Ui Neill [R1b-L513] tribe.”  Undoubtedly this will be true for some individuals with the surname Greer but it cannot be true for all since some have a genetic origin which is clearly Norwegian Viking as well as at least one other group’s genetic origin being potentially different (for the moment we will not consider single isolated results).
     There is a possible explanation for this genetic diversity which relates to the acquisition of surnames. Surnames did not come into usage in Ireland until well after the end of the Viking invasions and it is possible that different individuals adopted the name at the same time but did not know, or it did not matter, that they were not genetically related.
That genetic diversity has a considerable time depth can readily be seen when we examine the slightly smaller group of individuals who have tested to 67 marker level.

     The diagram is very similar to the 37 marker one but with greater discrimination between individual results representing the more specific branches.



Fig 4: Grier (Greer, Grierson etc) 67 marker results

What we see from this diagram quite clearly is how close in genetic time many of the individuals are. The Viking group for example are all descended from a single individual probably branching at kit 552222. The Irish branch are very strongly related from one individual (leading for example to 97279) and then a second branch slightly later on (leading to 333215 for example).  The branching is clearer in this expanded view below:

Fig 5: Grier ‘Irish-related’ group expanded chart

In the Viking group the known ancestries (5 out of 19) are Irish: County Antrim, County Down, County Cavan, County Tyrone. Other place of origin labels say Scotland but give no detail, but of those that did 67 markers, ALL match genetically with the group who have specific Irish locations for their ancestry. In the Scots/Irish group kit 57019 has ancestry going back to County Fermanagh but all the others have no specific location declared except kit 42120 who simply locates his ancestry to Ireland. However, where the origins become particularly interesting is in the group which contains kit 7874 because a number of these participants have clear genealogical origins going back to Scotland, often to the south-west corner. There is a printed genealogy for this area for the Griersons of Lag which suggests a movement from Scotland to Ireland in the 17th century, but not the other way around, and yet this group has a Y chromosome signature which is generally associated with Ireland suggesting movement across the Irish Sea on more than one occasion. Kit 2342 locates to County Meath in Ireland.

     The next grid shows the Time to Most Recent Common Ancestor [TMRCA] for the 67 testers – the figures give estimated years between any two individuals at the intersection of across and down. From this it is obvious both how close some individuals are and how far apart others. The rates of mutation are FtDNA’s based on 50% probability with generations set to an average of 30 years:


Fig 6: Time to Most Recent Ancestor grid for Grier 67 marker testers.

Finally, here is the chart for those who have tested to 111 markers.


Fig 7: Grier 111 markers with added SNP labels

In this diagram the boxed labels are the SNPs which have been tested. I-M223 and I-M253, and R1b-M269 are SNPs from many thousands of years ago whereas  those in smaller boxes are usually the currently-known terminal SNPs and may date from just a few, to several hundred, years before the present. It is likely that any others close to the individual who has tested these SNPs will themselves have this SNP (as well, perhaps, as others).
     I repeat the information given at the end of my 2018 blog:
Professor Neil McGregor who analyses the Clan Gregor data remarked that “the best recommendation is that people get Y700 as everybody seems to have between 3 and 8 separate SNPs which will allow them to be separated from everybody [else], other than from their own immediate family or first cousins. Some of them [those who have tested under Y700] appear to have a cluster of SNPs which appear to have mutated together and may represent one mutation. A mutation seems to be as low as once per generation through to once every 4-5 generations – seems related to the number of STR [the marker scores that participants start with] mutations as well.
      The clan seems to be divided into two major clusters and this would appear to be early on. The section I am in has at least 3-5 sub-branches as does the other major group. The dividing SNP appears to be BY28714”.
     Just to repeat that I can do comparisons of STR results for individuals – comparing the participant with up to 10 to 12 others. I would repeat Neil’s encouragement to do Y700 if you can – please ask me for further information if needed [richardmcgregor1ATyahoo.co.uk substituting @ for AT].

Charts were constructed using Dee McGee’s Utility at http://www.mymcgee.com/tools/yutility.html?mode=ftdna_mode, using a 50% level of confidence, on Doug MacDonald’s mutation rate, an average of 30 years per generation and with no modal results assigned. The graphic representations of phylogenetic trees are made by Splitstree:
D. H. Huson and D. Bryant, Application of Phylogenetic Networks in Evolutionary Studies, Mol. Biol. Evol., 23(2):254-267, 2006

This site uses cookies from Google to deliver its services and analyze traffic. Your IP address and user-agent are shared with Google along with performance and security metrics to ensure quality of service, generate usage statistics, and to detect and address abuse.


Article 0

$
0
0

Welcome to the MacGregor Project DNA annual blog for 2021.  This past year with the arrival of the virus has had an inevitable effect on the progress of the DNA project as everyone has been concerned to deal with the changes in lifestyle which have been needed.  I thought therefore that I would use some of this update to explore some ideas, opportunities and limitations of autosomal testing – typically called ‘All my Ancestry’ or ‘Family Finder’, and also to share some thinking on the MacGregor Big Y results.

     It is well known now that only males carry the Y chromosome and that this particular part of DNA is usually associated with surname (except for cases of adoption etc). It follows that for surname studies only males with the surname of interest can be tested for connection with a specific surname. For more recent family connections it can often be possible to find a male of the surname to do the test on a person’s behalf but going further back in time does not work so easily when thinking about autosomal (i.e. Family Finder) testing as I will explain. Inevitably genetic scientists will say that the following explanation does not cover all the possibilities and in some senses tends towards simplification: there is a certain inevitability about this when dealing with a wide and diverse audience.

     I have often been asked “why can’t I  be part of the surname project because my great great grandmother was a MacGregor?”, or “my ancestor was Rob Roy, can I do the DNA project to prove it (where the connection is through Rob Roy’s daughter or granddaughter)?”, or “why do the results of Family Finder not show in the surname project?”, or “why are surnames not included in the surname project?”.  I hope to answer all these with the following text.

     Everyone inherits autosomal chromosomes from their male and female ancestors. The proportions received from each ancestor varies, and also these tend to disappear over time as they are replaced by the DNA of succeeding generations. Otherwise, we would have millions of bits of autosomal  chromosome going back into prehistory: having said which it IS possible that some of a person’s autosomal chromosome has survived from prehistory to the present day. Every generation doubles the number of forebears which you have: 2 parents, 4 grandparent, 8 great grandparents, 16 great great grandparents and so on. As you go back in time that number increases, so, around 1300, in the Medieval period, each person will have over a million ancestors and working on the principle that most people are related by ‘6 degrees of separation’ there must be many common ancestors between any two individuals.   

      We all have 46 chromosomes in total and two of these are the X and the Y: if you got a Y and an X chromosome from your parents then you are a male, if you got two X chromosomes then you are female. That leaves 44 chromosomes (2 x 22 pairs) for Family Finder to compare.  Family Tree DNA’s Family Finder distinguishes between matches which arise from autosomal chromosomes (the 22 pairs) and the X chromosome and results are therefore shown for both autosomal chromosomes and also for what FtDNA refer to as X Match

[But note that, surprisingly, of the 5 matches on X Match out of the first 1000 Family Finder matches which I have, there is no detail in the graphics provided – I would be interested to know if others have found this to be the case also). 


To explain this further let’s now take a specific example or two:On the familysearch.org website you can create your own  fan chart of ancestry, and in an article Jessica Grimaud presents a filled-in fan chart: (https://www.familysearch.org/blog/en/genealogy-fan-chart/)


 Fig. 1 Fan chart from www.familysearch.org


There is an amazing amount of genealogical material presented here and it is relatively rare for a researcher to be able to document quite so many generations.  As a  comparison here is my own fan tree which lacks detail from my mother’s side because I have not been able to ascertain with any certainty her Russian German mother’s ancestry.


Fig. 2 Fan chart Richard McGregor


I will begin by discussing an example of how this fan tree was used to identify a common ancestor with myself in Family Finder.  I do not have a fan chart for the person who I match but there is a family tree for this person deposited on the Family Tree DNA website which contains sufficient information.  We are listed as being 3rd to 5th cousins and these are the places that we have autosomal matches:


Fig. 3 Chromosome matches for myself and the other participant


This may look at lot but when compared with someone who is a 1st to 2nd cousin you can immediately see the difference. If you count the number of shared chromosomes in Fig 4 and Fig 5 you will see that the latter has many more. That indicates the first problem – the further back in time you go the less SNPs in common you will have with someone else and if it is 5 or 6 generations back then you have to work out which of up to 64 ancestors you actually match to:


Fig. 4: Richard McGregor matching chromosomes for 1st to 2nd cousin


Going back to the Fig 4 Family Tree DNA do a version of these results in graphic form which comes out as below (the shading represents the areas of match):


Fig. 5. Graphic form of shared SNPs between Richard McGregor and Family Finder match 3rd- 5th cousin


Working back with our researched genealogies we came back to a meeting point with a Georg Konrad Major born in 1797, two of whose children started the separate lines we descend from.  I have over 4000 matches on Family Finder and only a few of these can actually be traced back to a specific individual shared between us – part of the reason for this is that not all participants share ancestral names or genealogical trees on the website.  I have very few 1st to 3rd cousins and while it has been relatively easy to find the genealogical connections with these, when the relationship suggested is more distant, it becomes rather more difficult.   There are quite a few examples where Family Finder participants have given ancestral names which match with those in my tree but none has led to a definite link. 

 

Hopefully it is obvious that finding specific links with Family Finder relies on having access to genealogies which are as complete as possible and the closer the genetic connection the easier it is to identify the connection. When we are trying to identify a common ancestor who lived more than 250 years ago,  it is vital to  be able to eliminate other possibilities.


*    *     *


Update on MacGregor DNA


This year has seen more participants upgrading to the Y700 test which has resulted in an expansion of the genetic tree of connections. There have been a large number of terminal SNPs identified, and on the grid at www.familytreedna.com/public/macgregor these are shown in the green colour. Sometimes these just represent a SNP that has been singly tested and is not the terminal SNP, and the best way to determine if you are looking at a terminal SNP is to assume that it will have a haplogroup (such as R or I) followed by a BY, FGC, or FT number [Big Y, Full Genome Corp or FamilyTree]. Quite often individuals who have tested to Big Y have SNPs which are not shared with others who have Big Y tested: these SNPs are known as Private SNPs and will be so identified until there are other participants found who have any of these SNPs – these will then be shown as shared SNPs and will indicate common ancestry between the individuals who bear them. When there are no SNP matches  between individuals the Big Y results default back to the most commonly held SNP which is why some of those genetic MacGregors who have tested with Big Y default back to S690, which is the SNP that all genetic MacGregors have. As more results come in an individual with S690 might  find that one of the Private SNPs becomes the terminal SNP. As I just indicated, in that case it might be reasonably assumed that the two (or more) individuals concerned share a more recent ancestor normally bearing the surname of interest.


Below I have copied the Big Y Block Tree graphic which is currently shown for the MacGregor group descending from Ian Cam. It is based on the format developed by Alex Williamson [The Big Tree]. I have removed individuals’ names for data protection reasons – however the terminal SNP can be cross checked on the results grid link given above to identify the earliest known ancestor where this has been filled in by the individual participant.


 Fig 6: Block Tree (current) for MacGregor Ian Cam group


As another example I have chosen a McGregor result from what is labelled ‘McGregor distant group’ on the results grid:


 Fig 7: Block tree diagram for a McGregor in the McGregor distant group


In this case the surname of participants varies across the grouping , starting with McKellar for the first two lines of descent followed by Robertson; no name given; Campbell; Stewart; McKellar, McKellar (which is the McGregor participant’s shared match); followed by McKellar; then unnamed at the far right,  All Big Y participants can see their results presented in this way through their results page. However, it is important to say that if  an individual’s terminal SNP has been determined without going through Big Y then this block graphic is NOT given in the participant’s results because some of the other SNPs which contribute to the Block Tree  will not have been sequenced. For example, I determined my own terminal SNP BY4303 by deduction and testing, and not through Big Y, so I cannot see who else is close to me.


[Note: McKellar is considered a sept of Clan Campbell and is found across Scotland with authorities variously placing the origin in Argyllshire, Angus and/or Strathclyde].


Finally a McGregor distant result, but with a known Sutherlandshire connection:


Fig 8: Block tree result for a McGregor in the McGregor distant group whose forebear came from Sutherlandshire in the north of Scotland


In this case the rest of the Block Tree group matches are almost entirely called Nicholson (except for a solitary MacDonald).  The individual’s McGregor result box does include an additional named individual whose surname is also McGregor – and in that case this participant and the other McGregor share an ancestor in the more recent past. Who this was might be established by comparing genealogies – the difficulty being that many Sutherland parish records are deficient for pre 19th century.


[Note: the name Nicholson in Scotland is said to derive from a lawyer who lived in Aberdeen in the 16th century, but there was also a family associated with the Isle of Skye who petitioned the Lord Lyon for recognition – which they got, on condition they were designated Clan MacNeacall of Skye]


Our own analyses which have been conducted by Prof. Neil McGregor is shown below (as at October 2020, although the previous date of May has been retained). This gives broadly the same result as Fig 7 above and represents the current state of our knowledge:


Fig 9: Ian Cam group Big Y results as presented by Prof. Neil McGregor


We have however identified an issue related to the first chip used by FtDNA.

You will see from the grid given below that the top group lack some results that are available for the lower group. We have contacted FtDNA about this:


 Fig 10: Partial results grid Ian Cam group


You will have noticed that this year I have not used spider diagrams as in previous years. This is partly because comparatively little has changed over the year, but also because I prefer to offer individuals the possibility of seeing a spider diagram for their own result which is designed for them alone. You can see these spider diagrams by looking at previous versions of this blog. If you would like me to generate one for you ask me offline and suggest up to 12 individuals with whom you would wish to be compared by reference to their kit number. You can email me on chairmanATclangregor.org [change AT to @ - I just want to avoid spam!).





Article 0

$
0
0

                           MacGregor Project Blog 2022

Welcome to my annual blog for the MacGregor DNA project. This past year has not seen a great deal of development in the relationship between genealogy and DNA as far as the project is concerned, although it has been good to observe that people have used the opportunity of being at home during the pandemic to do more genealogy, and to order Y700 tests which will, during next year, as the results begin to gather, potentially allow for more focused research on the various family  relationships which are being revealed by the SNP links. 
     To begin this discussion I thought it would be useful to show how SNPs can be used to suggest origins, not just genetic, but geographic. This year, therefore, I am presenting case study based on the McAdam, Macpherson, and Irish results. The second half of this blog will consist of an update on Gregg (Greig/Gragg) and Gregory SNPs, and conclude with comments on the Rob Roy descendant discussion, following on from last year’s blog.

                                McPherson/McAdam

During the year, I was contacted by a McPherson participant who could not work out why in his Y700 results his nearest matches were McAdams. The genealogical origins of the two surnames are quite different. Macpherson as a surname is mostly associated with a group of families originating in the south of Inverness-shire – round the areas of Kingussie, Newtonmore, Insch and Laggan (see this parish map for location http://www.scotlandsfamily.com/parish-map-inverness.htm). The McAdams on the other hand mostly originated in south west Scotland below the central belt, more or less the opposite end of the country. They also were found in Ireland and there is a fairly direct connection between Southwest Scotland and Ireland.

     If we look at the spider diagram for the McAdams [also Adam, McAdam and McCadam] (Figure 1) who are in the MacGregor Project we can see that there is a very large group, and then several apparently unrelated singles (such as 499181) or pairs (such as 510026 and 49834) results.


Figure 1: McAdam/Adam/McCadam


Next, in Figure 2, the kit numbers for those results which seem to suggest an Irish origin, which, in a good number of cases, is confirmed by known genealogy (plus IN100960 McPherson)


Figure 2:  McAdam/Adam/McCadam group connected with Ireland (bottom right)


Figure 3 shows the spider chart for the Irish group (all the charts are based on 67 markers):

Figure 3: the Irish group (kit numbers only)

In Figure 4 I have overlaid those McAdam results onto the Irish results and included the kit numbers with the addition of the first letters of the surnames (this being all the program will allow me to do). However, you will see that the McAdams in that tight group of results is embedded within the Irish results – for example kit 165907 or kit 183682. This suggests that these surname groups arose in the same location but in different individuals, creating strong genetic relationships which are not indicated by the surnames. You can see that the Irish group, which appears to have three separate branches coming from the main stem, contains various surnames including White, Black, and, what is especially notable, a tight and separate group of Grier/Greers.

Figure 4: McAdam results and McPherson IN100960 embedded in the Irish group


One thing I do need to mention at this stage is the limitation of STR testing and subsequent mapping of relationships. Since STR mutations are random, comparisons of results can be influenced by the occurrence of a mutation earlier in the sequence of 37, 67 or 111 markers – in other words if a mutation occurs in the first 25 numbers as read from left to right the program assumes that this is a significant mutation, because, in the early days of DNA testing for genealogy certain markers were chosen because they mutated more rarely than others.  With the program assuming that a mutation in this area is more significant it is possible for an individual’s results to be grouped with others which have that same mutation rather than the program ignoring that and focusing on later mutations so occasion misalignments may take place.  This limitation alone makes the use of terminal SNP testing more reliable for grouping individuals together (and why Y700 uses the SNP tree format designed by Alex Williamson and which Family Tree DNA call Block Tree). I discuss the meaning of ‘terminal SNP’ later (see the subsection heading below).

    Now to return to the question which I began with. In Figure 3 kit number IN100960 is McPherson and it is quite clearly in the same group as the McAdams but the line is longer indicating greater predicted passage of time since the shared ancestor. The most obvious interpretation of this result is that IN100960’s ancestor originally came from Ireland pre surnames. It is said that Ireland seems to have had the earliest use of surnames but by the 12th century it was only really the highest class who would be using them, while most of the population would be known by patronymics (e.g. John son of Patrick son of John). The Ancestry blog website https://blogs.ancestry.com/cm/history-of-irish-surnames-is-yours-here/ says the following:

‘It was around the 1100s, as the population was increasing, that people in the upper social classes started taking hereditary surnames (those that remain fixed over the generations); others didn’t need surnames, or even get around to them, until the 1500s’

     Perhaps the earliest that that IN100960’s ancestor could have come over to Scotland would have been in the settlements that took place not long after 500AD, but it could equally be much later especially since the distance between Ireland and the Mull of Kintyre in Scotland is just 15 miles across the North Channel between the two land masses.  Over several centuries one can imagine the movement of people who were originally settled in Argyllshire migrating further north to Inverness-shire, so that one of IN100960’s ancestors found himself in the lands of the MacPhersons, and, instead of using patronymics started to use the surname of the local landowners.

     What I have just suggested follows the DNA and social history and is perfectly possible as an interpretation of IN100960’s DNA result, but it should be noted that it is only one interpretation. There are still comparatively few individuals who have DNA tested to find their terminal SNP but of those that are available, and checking not just the terminal SNP, but also those which came immediately before it shows that there are related individuals in Scotland and Ireland. It is possible to look at results which have been mapped onto a  very useful SNP mapping website:

https://www.genetichomeland.com/homeland.asp. You have to register to use this site but once registered (it’s free) you can map the results for most SNPs which gives you geographic locations for these SNPs and you can save a limited number of maps for individual SNPs to your account.

     Finally, it is important to remember that there are other reasons for the adoption of a surname, and in particular death of father and the adoption of a stepfather’s surname, and illegitimacy, where the child is named for the mother’s surname, being the most common.


                              Grieg(Greig/Grig/Gragg/Gregg)/Gregory


Having previously explored the results for Grieg and Gregory I thought that it would be interesting to see if there was any link between them, always remembering that while those who test are a random sample of the population they include a high proportion of those in America who may in fact be related as a result of genealogical connections which have occurred in America/Canada over the last 400 years.

     Figure 5 shows the complete picture of the two surname groups in order to show those results which lie on the right of the Figure and belong to quite separate genetic groups. Starting from the top, 275997 is Gregory but the haplogroup is J which has its origins in the fertile crescent or further east in west Asia. The larger group with 877623 etc contains the surnames Gregory and Gregg – and these have Viking origins (haplogroup I). 239031 (Gregor) is haplogroup T which also originated in West Asia.  Finally, 212046 is from Haplogroup R1a which spread from eastern Europe but may have originated in India.


Figure 5: Gregg/Gregory all results


Figure 6 is a close-up of the much larger collection of Griegs (Greig/Gregg/Grig/Gragg) most of which can be seen beginning with ‘Gr’ while the Gregories are shown as “GY’. What this shows are two discrete large unrelated groups, one of Greigs (including 29673) and one of Gregories (including 8100). Then, apart from three much smaller groups, [the first in the bottom right which has a collection of Griggs, Graggs and a Gregg (e.g. 372126) and the second near the middle bottom of Gregg and Gragg (including 851482), and third, a quartet of Gregories (including 374579)], almost all the rest are either singles or pairs of results which lie some distance from their common point of origin (the split point where their lines divide). Only one pair looks as though they could have been related, namely, 950624 Griggs and N52932 Gregory but they only share a point of origin and are not otherwise a related pair.

Figure 6: close up of Grieg (all variants) and Gregory results


                                Terminal SNPs


Figure 7 adds the terminal SNPs which have, so far, been tested for these two surnames. To understand the significance of a terminal SNP it is important to understand that, in almost all cases, a SNP is a point on the Y chromosome which has become a fixed point in the genetic tree. Very early on in DNA genealogy the SNP R-M269 was the most recent SNP used to classify individual results for the R haplogroup common in Europe, but that SNP appeared sometime between 4000 and 10,000 years before the present. More recent SNP research and identification has been able to create a family tree of SNPs reaching down through time towards the present. So, a terminal SNP represents the most recent occurrence currently known to research. The company YFull publish a tree where some of the SNPs in sequence have approximate dates assigned – however more recent SNPs are mostly as yet undated.

Figure 7: Figure 6 with tested terminal SNPs added


In Figure 7 the four SNPs which are seen in the Gregg results [top middle] are all connected – some of the SNPs shown occurred earlier than others in time but they all connect together so that the individuals whose results they show are close cousins genetically of each other. Current mapping puts this SNP group in Scotland.  The screenshot in Figure 8, which is taken from the Gregg Block Tree shows how the lines spread out from the common terminal SNP BY135575 (courtesy of familytreedna.com). 

Figure 8: FT108241 as terminal SNP with later terminal SNPs BY212591 and FT106941 in the other individuals


The large Gregory group has a different terminal SNP – there are only two individuals in this group who have tested but their terminal SNPs are likely to proceed from BY34346, a completely different origin from the Griegs (all variants). All the other terminal SNPs shown for Gregories are either unique to an individual or shared by a small group. In some cases, for example 214992 Gragg and Gregg BY173545, there is a close genetic relation but the Gregg individual has acquired a later terminal SNP.


                                  Descendants of Rob Roy MacGregor


In the last two DNA blogs I have shared some of our DNA research to try to identify an individual who is definitely descended from Rob Roy MacGregor, and discussed some of the difficulties in verifying descent through paper genealogies and available records. As well as trying to identify a Rob Roy descendant we are also hoping to identify the Glengyle family of MacGregors of which he was a part.  In my report, I explored the possibility that an individual participant in the DNA Project was descended from Rob Roy’s grandson James who had a son Donald but suggested that it was a concern that this Donald who married Elizabeth Stewart in 1804 was probably too young to be the correct one, and, in addition, terminal SNPs suggested a closer connection to the Chief’s line of Glencarnoch. I am grateful to John Andrew Hutchison who contacted me with a huge amount of information from years of research into the various families who lived in and around Balquhidder and surrounding parishes. John has a family connection to the MacGregors and has been trying to link families together across generations, a task made much more difficult when individual members of families moved around and away from their ancestral homes.There is simply too much information from John to try and summarise here in any meaningful way, particularly as it involves consideration of a large number of different MacGregor families in historical records, as well as family records going from the 18th century onwards. However the main thrust of the research suggests that Donald who married Elizabeth Stewart was  probably not of the Glengyle branch but possibly of the family who were designated in the Balquhidder children list of 1787 (which I repeat here complete below) as of Brackley, which could explain our participant’s close genetic connection to the Chief’s line – of Glencarnoch  - and/or possibly to the Roro family.    


This list is found in Central Archives Stirling with the call number PD60/651. It is incorrectly dated as possibly 1830.


A list of the young children of the tree of McGregor with their names and Designations within the parish of Balquidder [all listed McGregor unless otherwise stated]


Bracklys family [sic]

Inverlochlairgemore

Patrick

Robert1

Gregor2

Donald3

Alexander4

William5


Alexr in Lickseridan

Peter1

Alexr2

Dun3


Alexr in Gartnafuaran

John1

Duncan2

James3

Gregor4

Malcom5


Dun in Inverlochlairgemore

Donald1

Alexr2


Peter1

John2

2 orphans


Dun in Craganmore

James1

Alexr2


Peter in Milltown

Dun1

Alexr2


Donald in Rinacraig

Donald1

John2

Dougal3

Alexr4

Alpin5

all orphans


Dun in Rinacraig   

Duncan1

in all 27


Dougal Keers Faimily


Duncan in Minachallwaroo

John1

Robert2


Patrick in Blarerioch

Malcom1


Dougal in woodend Stronslany

John1

Gregor2

Donald3

Malcom4


Gregor in Gartnafuaran

Dougal1

Gregor2


Duncan in Stronair

Malcom1

Alexr2

Robert3

Allan4


John in Criganmore

Gregor1

Duncan2

James3

Twins John and Donald 2


DonldCoock in Ruskachan

Dougal1


James in Ruskachan

Duncan1

in all 20


McGregors Faimily

Donald in Ardchullary

Hugh1


Robert in Adchullary

Donald1

James2


Robert in Rinacraig

Peter1

John2


Alexr in Rinacraig

Peter1

John in Rinacriag

Peter[ ]


Robert in Achtowmore

James1


John in Achtowmore

Hugh1


Hugh in Achtowmore

John1

in all 9


Roro Faimily

Duncan in Stronslary

John1

Donald2

Peter3

Hugh4


Duncan in Balcnoick

Alexr1


John in Lichscridan

William1


Alexr in Dalreach

John1

Hugh2


Hugh    Drover

James1


Patrick Decest Balifoil

Duncan1

John2

Donald3

in all 12


A mistake in Brackly Faimily that they not aded togather (sic)

John Decest in Gartnafuaran

Donald1

John2

Robert3

Jobert (sic)4


Donald in midleachton

Peter1

Hugh2

John3


John Decest in midle achtow

Duncan1

John2

Gregor3


Dun in Glentarkan

James 1

Peter2

John3

17

Brackly Faimily

27 (+) 13  (=) 40 of them


As usual I end this blog with the offer to compare a single result with up to 11 others using the spider charts, or to offer further advice on further DNA testing, particularly Y700 and terminal SNPs (and consequently the relationship of differing groups of the same name to each other). Contact me on richardmcgregor1ATyahoo.co.uk (substitute @ for AT).

I am grateful to Central Archives, Stirling, familytreedna.com. and the authors of the spider diagram program Splitstree, Daniel Huson and David Bryant.












Article 0

$
0
0

MacGregor DNA Project 2023

Welcome to the MacGregor DNA Project blog for 2023. I began last year’s DNA blog looking at how SNP results could be used to understand historical connections between surname groups and how different family groupings are suggested by subgroupings of the results. Over the past year, there has not been considerable development in terms of DNA testing other than the fact that individuals have realised the usefulness of Y700 for identifying closer family relationships. I am going to use the initial part of this blog to explore further this idea based on the fact that family tree DNA have introduced a new feature which uses scientific tools to explore potential connections between participants. I am taking examples taken from the MacGregor project and based on the main Argyllshire line as we already have some traditional sources which suggest when family groups split off from the main branch.

     I want to begin with an example which appears to show a family connection within the last 250 years, and which suggested that individuals that appeared in two family trees could in fact be brothers – which in turn led to a search in parish registers for an appropriate family living in the area from where it was believed these individuals lived.  In this ftdna example our 2 participants are suggested to have close familial connection dating to approximately 1802. As it happens that is slightly too late as we appear to be dealing with two brothers butt given the proximity to 1800 of the averaging process it was determined that these two participants ancestors were more than likely related. 

 Fig 1: two closely genetically related individuals – with SNP BY23340.

 

It seems that BY23340 is a relatively modern terminal SNP as can be seen in this graphic analysis:


Fig 2a: graphic representation of the DNA relationship between two individuals and Fig 2b the Mean which suggests a date of 1802.

 

In Fig 3. we see how this is projected forward:


 Fig 3: How this translates into two related lines today.

 

This parish register research suggested that two brothers James and Robert were the children of Alexander who lived in Stevenston, Ayrshire  in the second half of the 18th century.


          SNP – R-S696

In the MacGregor results one SNP, which was identified almost a decade ago by Jim Walker,  has significance as it is borne by the Chief’s family and two others (both related to emigrants to America whose earlier histories are not recorded).  The SNP is labelled R-S696.

  Fig 4: The genetic descendant to S696

 

The graphic in Fig 4 is slightly misleading as it only relates two SNPs whereas in fact R-A14374 is a descendant of S690 and we already know that S690 arose about 1300-1350. The suggested split of A14374 sometime in the early to mid 1500s seems very likely, and that S696 did indeed happen not too long after the banning (proscription) of the MacGregor name following the Massacre of Glenfruin in 1603.  The fact that two of the participants with S696 ended up in America but with no known genealogies available suggests that they may have gone there during the time of proscription to escape the personal danger and legal difficulties that were in force at that time. Proscription lasted from 1603 to 1660 when Charles II became King after the Commonwealth collapsed. Proscription was reimposed in 1692 following the unsuccessful attempt to restore the catholic King James II to the throne since the MacGregors were largely supporters of the Stuart cause. The second period of proscription was principally legal in nature but lasted through to November 1774. 

 Fig 5:  other lines related to S696. 

 

You can see from this diagram that there is a younger line (FT140264) from the same descent. One of these participants has a genealogy suggesting a Glengyle MacGregor connection which has been discussed in the last three blogs (Rob Roy MacGregor was a Glengyle). In summary: there was a gap in the documentary evidence at c1800 which threw  doubt on that family connection and that instead this participant comes from the main MacGregor family (that is, of Brackley)

     One final example will show, I hope, how accurate this scientific predicting could turn out to be. It is well known within MacGregor genealogies that the main line split into 4 main houses – Glengyle or more correctly Dugall Keir (as later epitomised by Rob Roy), as mentioned, was one, MacGregor (main line – Brackley) another, Roro (based on a location in Glenlyon near Fortingall in Perthshire) was a third and the family (Sleik [sliochd = tribe]).  of Gregor vcIan.

     The family based in Roro were known to be there from about 1460, so this Time Tree for SNP R-BY54364, carried by a descendant who has a good claim to have Roro descent, is therefore suggestive that the claim is accurate.


 Figs 6a/b: the genetic split that suggests the origin of the Roro family of MacGregors

 

The above discussion of SNP connections is clearly still somewhat speculative, but, since we know that S690 split off around 1300, or slightly later, leading to the establishment of the MacGregor main bloodline, then the results do begin to help us identify family groupings post 1300. We can see that so far there are seventeen family lines currently identified, some reaching back to the late Medieval/early Renaissance, and others (as in Fig 1) much later. This shows the importance of as many individuals undertaking the Y700 test, particularly those who have more comprehensive genealogies, so that more inter-family relationships can be established.  To end this section here is the current family descent tree for S690 as revealed by Y700.

Fig 7:  MacGregor Y700 test results as at December 2022

 

For the second half of this blog I want to go back to questions I am often asked:

 

1)    What is a clan?

Six hundred years ago this question was quite simple to answer. You would have been born with the name or you would have adopted it, rather than being called by a patronymic (e.g. John MacPatrick vicDonald = John son of Patrick son of Donald).  The name itself then becomes complicated with non-standardised spelling and attempts to render Gaelic into English which meant that, in the MacGregors’ case, the word might be expressed as Gregor, Grigor, MacGregor, McGrigor, McGregor with a whole range of alternative spellings. There were also accepted variants, such as Grierson, or Grier, Greig/Grieg/Grig etc. These were understood to be shortened (or ‘anglicised’) versions of the main clan name. So, Grier-son equals Gregor-son and Grier is the same name without the ‘son’ on the end.  Whether or not the people bearing these accepted names were genetically related to the main family line was not the point, since the clan was composed of members using a collection of related surnames who recognised, as Chief, the head of the main line (the Chief of the MacGregors for example).

     The time of Proscription (when using the name MacGregor was banned), also caused a great many different surnames to be associated with Clan Gregor, many as the result of the adoption of an alias (such as Stirling, Drummond, Bain, Black, Campbell – hence Rob Roy’s alias of Robert Campbell).

 

2)    What is a sept?

Other clan members bearing different names were from septs or associated groups of the clan. Sometimes the same name can appear on several lists of accepted septs for different clans.  This is the case for the surname King, for example: for MacGregors this surname was certainly thought to connect to the clan motto ‘Royal is my Race’.

 

3)    We have male MacGregors in our tree, how can they test?

Go to www.familytreedna.com, use the surname search box for ‘MacGregor’ and follow hotlinks on that name until you get to the project join page then select Y chromosome testing for one of the males – do at least 37 markers or any number of markers above this number.

 

4)    Why are there no family grids/list of results for Ancestry/Family Finder results and can I not prove my genetic connection to MacGregors that way and appear on the list?

 

From here I am largely repeating some text from my 2017 blog.

 

There has been a noticeable increase in the number of individuals taking the test known as ‘Family Finder’, or something similar, rather than Y chromosome or mtDNA tests. This has probably been as a result of quite aggressive marketing by Ancestry.com [it has a variety of website endings depending on where it is based] in particular. This has promoted the equal use of DNA testing for both males and females, and tied it into the submission of family trees which individual testers can use to identify the same family name(s) with others who have tested and submitted their genealogies. What has perhaps been rather glossed over in this is the fact, firstly, that DNA gets ‘lost’ over time – if it didn’t, we would have the DNA of billions of ancestors in our bodies, and, secondly, that it is only a tiny portion of our DNA which is currently being examined for genealogical purposes.

     You do not inherit 25% of your autosomal ancestry from each of your 4 grandparents. This is because your autosomal DNA is randomly recombined, and not in equal proportions from each parent, and so the more you go back in time the percentage inherited from people in a particular generation becomes smaller and smaller and therefore the more distant the ancestor is the more difficult it becomes to identify what you received from that person. What then are the chances of that same bit of DNA being preserved from a specific ancestor in yourself and someone else? For example, if your name is, say, Smith, and your male MacGregor ancestor lived 10 generations ago on your mother’s side it is really not feasible with today’s technology to identify that ancestor’s DNA by looking at your DNA today. The tests which are offered by Ancestry, Family Tree DNA etc. only try to identify links to 5/6 generations back. The key thing to remember is that if you and someone else have, say, people called Brown in your trees it does not necessarily mean that you have a recent ancestor in common, or indeed that you have any ancestor called Brown in common at all. For these tests of connection to work properly, you, and the person you are comparing with, need to have as much genealogical information as possible on every ancestral line in your respective trees, going back 5 or 6 generations (and that you both have a significant shared portion of DNA). This test works best for up to third cousin. You have 4 grandparents, 8 great grandparents and 16 great great grandparents and beyond that it is usually very difficult to have all ancestral lines documented.

     So, you and the person you are comparing with both have to put all your family information into one of the computer genealogy programs (like Family Tree Maker, Reunion etc.), save it as a GEDCOM file, and upload it to whichever DNA company with which you have both tested your autosomal DNA. The possible links between the two family trees are then highlighted in some way which allows a comparison of ancestry to be made in order to explore if there is a family match on some surname. As /I said in the previous paragraph the fact that there is a match on surname does not necessarily mean that it is the same family, only that there is a surname in common. Clearly the more unusual the surname, the more likely that the match will be with the same family.

    I give an example from my own ancestry. Using the compare options on the top of the page I find this result:

   Fig 8:  comparison results for FMS (full mitochondrial sequence) and X-match.

 

For x-match the x means ‘not a match’ and the ‘-‘ means no result. The 9 matches that I do have all have 3 variants except for two individuals who match me either exactly or with one variant.  For obvious reasons I have edited out the names if those who match. This is My mother’s side (and her mtDNA) and unfortunately her family history knowledge was non-existent. 

 Fig 9: a search using Family Finder

 

I have 171 pages of matches, and this is fairly common, but almost all are 3rd to 5th cousin or distant, and only one is described as a first cousin to second cousin. In fact I am his second cousin. Interestingly his son is described as my second to fourth cousin, and he is actually my second cousin once removed. None of the other participants show surnames I can relate to, and about half of the participants have given no family genealogy background at all which means that it is impossible to work out the relationship, although in most cases a contact email is available. My experience here is quite typical of what happens with Ancestry-based tests and it would be good if more people who do DNA tests for genealogy could be encouraged to upload trees – or even just a list of ancestral surnames.

     As a project administrator I am very aware of the need to keep participants’ personal details secure and in familytreedna I can only undertake changes if given ‘Advanced Access’ (and the default for familytreedna is ‘Limited’ so I am unable to update most participants’ earliest known ancestor tab which means that the results grid has gaps). I would encourage everyone who reads this to update their information – it helps both the participant and others.

     To end with, I repeat my final paragraph from last year’s blog, which can easily be accessed from the side menu:

     You will have noticed that again I have not used spider diagrams. I now prefer to offer individuals the possibility of seeing a spider diagram (Y chromosome only) for their own result, generated for them alone. You can see the sort of spider diagrams I mean by looking at previous versions of this blog. If you would like me to generate one for you, ask me offline and suggest up to 12 individuals with whom you would wish to be compared by reference to their kit number (for me to be able to do that these 12 need to be in the MacGregor project). You can email me on chairmanATclangregor.org [change AT to @ - I just want to avoid spam!).
















MacGregor DNA Project update 2024

$
0
0
This year’s DNA update is rather later than usual, partly because it’s not been a year where very much new has happened so it is difficult to know what might be of interest in a blog like this. I have previously discussed aspects of Scottish clanship and how what we see in the DNA reflects the diversity of origins for clans and their associated names. 
     I will begin by repeating an important fact about clans and kinship. Clans were never derived from a single surname; they were always collections of related names and septs. That means we should not expect everybody who is associated with the clan to have the same genetic profile. In many cases, we have to remember that surnames were adopted long after they first appeared, particularly in Scotland because individuals who were less well placed in society, and therefore did not sign documents or make legal agreements, often were unable to write their names. 
      It is true that in Scotland there was a move in the 18th century to provide at least an elementary education for all children, and in some cases children from very poor backgrounds who had particular abilities were not then confined to their social class. Much of the early work in education was done by the SSCPK (Scottish Society for the Propagation of Christian Knowledge)

The Society began to establish schools in the Highlands with the aim of reducing Jacobitism and resisting the rise of Roman Catholicism. The first school was opened on St Kilda in 1711. By the end of that year, the SSPCK had five schools, by 1715 twenty-five, by 1758 176, and by 1808 189, by which time 13,000 children were attending the schools [Wikipedia]

 In the Highlands particularly, surnames were not so commonly used, particularly where individuals were simply known by their patronymics - for males this was Christian name followed by son of (Mc) father’s Christian name followed by grandson of (vic) grandfather’s Christian name. It’s not surprising, then that when it came to adopting a surname, individuals chose surnames that were local, influential, or had some connection to them – it might even be the landowner’s name. In the next section of this blog, I will discuss a new dating mechanism which is attached to Big Y results. I am using the SNP data from the main MacGregor bloodline because there are sufficient results within it to begin to try and begin to make sense of family connections going back into historic time. This therefore is a principle which can be applied to other groups of surnames both within and beyond the project.
     What Big Y has shown is that clan groups are made up of many many lines which split off from each other over thousands of years. One of the latest developments which FtDNA offer is ‘Discover (Your Haplogroup story)’. This option gives you the ability to track when your terminal SNP (given by Big Y) [terminal here means the SNP which is identified as most recent in historical time to you] is estimated to have occurred, and then to track SNPs back in time to see roughly when the line split to form a new line and then back from that into more distant history. 
     I will say more about the dating offered by FtDNA shortly but for the moment here is an example. This individual’s terminal SNP is BY74650 estimated to have split off from S690 in 1300CE and the individual’s ancestor was ‘born’ around 1400CE (common era – that is 600+ years ago) and two individuals on current testing have that terminal SNP. The sequence back in time then goes as follows: S690 split from BY144 around 1050 with 50 individuals from that line. [comment: this is possibly misleading as it suggests that people with S690 should be identifiable who are NOT called MacGregor, [Gregor the genetic origin of the clan bloodline probably WAS born about 1300], but to date NO other individuals with S690 and different surnames [other than those adopted as aliases during proscription] have been identified. Also misleading is the ‘50 individuals’ bearing S690 whereas in fact in the DNA project everybody called MacGregor who is in the main bloodline group has that SNP, and that is over 200 individuals. It suggests that the 50 individuals noted by FtDNA are those who still have S690 as their terminal SNP). Going back in time from S690: 

BY144 split from S703 at around 750CE (55 individuals) 
S703 split from FGC 59322 around 550CE (55 individuals) 
FGC59322 split from S695 around 500CE (56 individuals) 
S695 split from S691 around 450CE (148 individuals) 
S691 split from S744 around 400CE (251 individuals) 
S744 split from Z-16325 around 350CE (802 individuals) 
Z-16325 split from L1065 in around 150CE 

Without going into detail, it’s clear that there are anomalies and approximations here. Another company, Yfull, also does dating estimates – but not for all the later SNPs – their date for S695 occurring is about 600CE and most recent ancestor 650CE, while for BY144 occurring about 650CE and most recent ancestor 1150CE. If you examine the detail on BY144 which these estimates are made at YFull you see that the range is given as between 1220 and 1447 years ago therefore the estimate is given as 1333 +30 years = 1363 years ago (there is a formula which is given on the website, so my explanation is somewhat simplistic). Anyway, 1363 years from 2023 gives 660CE as the presumed date of origin. The map given is strongly UK located with much fainter occurrence in Scandinavia [https://www.yfull.com/tree/R-S695/].  
     The Yfull method for estimating time to most recent ancestor on the other hand is much more straightforward and is based on three estimates added together and divided by 3 – however again the formula for achieving the estimates is much more detailed. However, the estimate for TMRCA is 851 years before present – that is, 1172CE. Whereas the mechanism in FtDNA is not apparent, Yfull is much clearer as to how approximations have been calculated. 
     For the SNPs they do cover (and S690 is not one of them) the mapping component is helpful, but not recent enough. A website which maps more recent SNPs is www.genetichomeland.com. To view maps for a SNP you must register but registration is free, and you can store up to 10 maps. Note that Google Chrome seems throw up an ‘unavailable for legal reasons’ problem so a non-Google browser is better (Firefox worked best for me). 
     You will notice that the time estimates given by FtDNA are always rounded up or down to 50 or 100 years. The point is that these are truly estimates of when a split occurred. So, it’s important NOT to take the dates as absolute, and particularly when trying to relate two or more individuals together. A gap of 50 years between SNPs could therefore be anything from 1 to 3 generations (or more). 
     I have drawn out a terminal SNP chart using the approximated years given by FtDNA to show how some of the DNA lines are connected to each other. This is particularly true of the line:
 
FGC17836 to FGC17835 to  FT203761 to MF133765 

which, using the earliest ancestor information given looks as if it might relate to the area north from the top of Loch Tay – those 2 testers who claim descent from John who died in Ontario might look to that area for origins.



It is interesting to note how many of the different genetic lines have an alias associated with them, reflecting the clan’s turbulent history. It might be obvious to say this but the point where the SNP divides is not, particularly in the historic SNPs, to be thought of as a place where historic divisions of MacGregor families (such as Roro and Glengyle [Dugal Ciar] took place. They are simply the point where a DNA mutation took place. This is well illustrated with the only division that produced two subsequent branches here, namely S696 and FT140264 whose shared ancestor had mutation A14374 but currently no one in the project has retained that mutation as their terminal SNP. In fact, this happens all the time in different family groups - whenever a division occurs, that indicates a shared ancestor. And in some cases, like the Stirlings, that is apparently a very recent shared ancestor. 
     No significance should be drawn from mutation happening very early on historically, nor the kit tester having that alias (e.g. Pressley), since almost all the aliases will have been taken post proscription of the MacGregor name. In towns before that there is evidence of individuals dropping the Highland ‘Mac’ because Highlanders were not particularly welcome – so we have Johnsons as an alias before 1600 as in these two entries: 

 “1575. June 25. Advocatis against McGregor and others, George McGregour alias Johnstoun, Johne and George McGregor his sones, Duncan McGregor & Burgesses of Perth to have broken Lawborrowis5. Delayed till 15 Jan. next. “1576. June. 2. Charter and infeftment given to Gregour McGregour alias Johnstoune, burgis of Perth, by Dene Adam Forman Prior of the Charterhouse of Perth. is rescinded by the Court of session in so far as regards the sunny half of three quarters of St. Leonards. Lee. - Decreets of Court of Session. 

     It is worth noting that the alias Drummond is in 2 different generations (see BY39563 line) – one might hope it was the same family, but Drummond was rather a common alias for different branches of the MacGregors to take whereas Pressley came as rather a surprise. 
    Also, I have noticed that Ireland comes up several times when considering aliases – most notably the Boyds. What is rather unusual about the Boyds is that they split into three distinct family groups (split from S690 that is) at about the same time – presumably between 1600 and 1700 as a result of proscription and relocating to Ireland. In the book Illustrations, Historical and Genealogical of King James’s Army Lists there is the following rather fascinating entry which may give some clues to the Boyds who are MacGregors: 

 ‘…about this time an Alexander McDonnell was appointed by Lord Tyrconnel Governor of Galway; he, in the progress of the war, became a Brigadier-General. Colonel O'Kelly, in his Excidium Macario, says he was a "soldier of fortune, raised by merit from the ranks;" and Croker, in his brief notes on that little work, adds that he was otherwise called. 'McGregor,' and was of Drumsna, County of Leitrim. He married in 1685 the Lady Jane Nugent, a sister of Thomas Nugent, afterwards created Lord Riverston. In December 1690, he was removed from the Government of Galway. It is remarkable that in the Outlawries of 1691 he is styled Alexander McDonnell, alias Gregor, alias Boyde, of Clonin, County of Westmeath. At the same time were attainted in Antrim, Daniel Mc Donnell of Dunluce…’ 

   A similar process to the one done here can be applied to other surname groups, as long as there are sufficient entries from BigY tests. For example, many of the Greigs share a common Y line when the SNPs are related to each other. Anyone can check the SNP sequence of mutations to find out if the DNA results point to a common ancestor. Simply put the SNP label and ‘DNA’ into Google and select the resulting ‘FtDNA Discover’ link. The most significant Greig line mentioned is: 

BY135575 [c.900CE] to FT108241 [c.1550CE] to BY212591 [c.1600]  to FT106941 [c.1750] to BY166266 [c.1900]. 

 As always if you find a problem with doing this process you can contact me at richardmcgregor1Atyahoo.co.uk [substitute @ for AT]

Article 0

$
0
0

 

              MacGregor DNA Project  Blog update 2025 

It has been a rather quiet year for DNA developments. In terms of the tests which have been available over the past couple of years there is nothing new. Y700 still remains really the best Y chromosome test around but only males can do it, and it relates directly to male surnames. What has been developed by FamilytreeDNA are tools for understanding a person’s male genetic past on the Y chromosome using SNPs (rather than the ‘usual’ STR results). STRs or ‘short tandem repeats’ are necessarily less accurate for genealogical matching because they mutate randomly – in some individuals hardly at all, while in others, mutations occur with greater frequency (reasons evinced have been age of father, diet, exposure to radiation etc).  
    However, as I will explore in a moment with an example in this blog, it seems that the relatively small genetic pool of results from those with male ancestry in the Highlands of Scotland means that some individuals have a large number of ‘private’ SNPs in their results – in other words nobody tested so far has a SNP that is shared which would make it no longer ‘private’. Familytreedna has developed a tool to use when two or more individuals share a common SNP, which gives an approximate date for when that SNP was shared. Last year I explored how this gives a dating sequence for family groups of those called MacGregor (or alias names) who descended from the Argyllshire MacGregors. SNPs (or Single Nucleotide Polymorphism) are points on the Y chromosome that occur and stay fixed (as is believed). 
    This year I was contacted separately by two individuals, named MacGregor, who currently share the same terminal SNP, R-BY3023. They are not related to the Argyllshire MacGregors, or indeed to the other main MacGregor groups (that is, from Rosshire, from Ireland and from near Perth). They do share that SNP with quite a few other individuals called Campbell, MacDonald, Cameron but most particularly, MacKenzie. 
     While one of the two MacGregors has a terminal at BY3023 the other is currently waiting for results of BigY which should prove interesting to see if he comes out with a closer match to the MacKenzies. 
    But that is where the problem arises and in part interpretation is severely handicapped by the incomplete (and sometimes non-existent parish registers). And that’s also where it gets interesting:
    Taking one of the family genealogies I was sent, the nineteenth century line, thanks to census and statutory records, is well established, so that there is direct evidence for Hugh born 1822/3 – he gives his age consistently in the census records and when he dies in 1881 his age is given as 58. And, he is also consistent in claiming that he was born in Kilmallie parish, Invernesshire. However, there is no record of a Hugh being born in 1822/3 at all let alone to Angus McGregor and Catherine Cameron who are named on his death certificate. 
    What records there are for Angus in Kilmallie are as follows (I have included some other entries for what comes next):
1773 Angus/Mary Cameron John, Kinlochleven, 8 June 
1774 Duncan/Florence Cameron Ann, Achintore, 5 July 
1776 Donald/Sarah Cameron Marion, Inchrie, 28 Apr 
1782 Ewen/Sarah Boyd Janet, Blarachern, 17 Mar 
1783 Ewen/Ann Boyd Rachel, Blarachern, 16 Nov 
1785 Donald/Sarah Cameron Donald, Inchrie, 20 Jan 
1785 Angus/Sarah McMillan Flory, Blarmacfeltach, 19 Apr 
1794 Harry/Betty Wilson Elspet, Maryburgh, 30 Aug 
1797 Harry/Beatrice Wilson Robert, Maryburgh, 17 June
1800 Harry/Beatrice Wilson Beatrice, Maryburgh, 13 Jul 
1803 Harry/Beatrice Wilson Archibald, Maryburgh, 30 May 
1806 Harry/Eliza Wilson Catharine, Maryburgh, 9 Mar 
1810 Ewen/Rachel McMillan Catharine, Blaraclurin, 23 Jun 
1813 Ewen/Rachel McMillan Ann, Blarnaclerach [Blarnaderach] 28 Aug 
1818 Angus/Catherine Cameron Ewen, Blarmacfoldach, f 8 Nov 
1823 Angus/Catharine McGregor,[ - ], John, 5 Dec 
1825 Angus/Catharine Robertson, Rachel, Blachorin [dist Fort William], 25/26 Jun 
1827 Angus/Catharine Robertson Sarah, Badacheanan Corran, 18 Aug/8 Sep 
1829 Ewen/Ann Margaret 19/19 May 
1829 Angus/Catharine Robertson Ewen, Blarchoachan, [Corran Dist] 16 Apr/17 May 
1834 Angus/Catharine Robertson Catharine, Blarmachfailan , 21 Mar/6 Apr 
1839 Angus/Catharine Robertson Susan, Blarmacfoldach, 23 Jul/8 Sep 

 Key: father/mother child, place, one date = baptism, 2 dates = birth/baptism, f = in fornication

There is a tombstone inscription in the Fort William Episcopal Churchyard: In loving memory of Angus McGregor and wife Catherine Cameron, this tombstone is erected by direction of their son Donald McGregor Achnacarry who died Banavie June 11th and was buried here June 13th, 1892 
 
FORT WILLIAM [gravestone from GD50/233 in National Archives of Scotland] Henry (McGregor) imo son Charles died 25 May 1811 age 12y 9mo 
 
In the census record there are some entries that show an Angus: 
 1. 1841 
 Angus McGregor born c 1786 (possibly 1782-86 because of census rounding down), wife Catherine (40), son Donald, 14, daughter Sarah 8, at Blarmacfoldich 
2. And: in Fort Wiliam [Maryburgh] 
Angus McGregor age born 1791 (1788-91), wife Ann age 60, grandson? Peter a 5 
3. And: 
Angus MacGregor age 85 (1752-56), Ann (daughter?) age 30 
4. And: 
Catherine MacGregor age 35 (born 1802-06), Ewen age 10, daughter Annabell age 8, daughter Catherine age 6, son Donald age 4 at Blarmacfoldich 

5.In the 1851 census 
Angus MacGregor born c1783 woollen weaver Fort William, wife Ann born c1776 
6. And: 
Angus McGregor born c1779 widower, farmer Muirsheorlich, children, Donald 23, daughter Sarah, 20, son James 13 
7. And: Angus McGrigor born c1791, wife Catherine age 49 (born 1802), son Ewen age 21, son Donald age 14 and son Alexander age 6 at Blarmacfoldich 

1 and 6 go together, 2 and 5, and 4 and 7 (3 is clearly dead by 1851). But there is no Hugh aged c17/18 (rounded down to 15) in 1841 or aged 27/28 in 1851 

 There is a death record for Angus in 1863, a farmer? aged 82 (therefore born c1781) with parents given as Donald McGregor and Sarah Cameron, and he is the widower of Catherine Cameron – the informant being Donald McGregor, son at Auchintore. So, that ties up with the tombstone inscription, and therefore 1 and 6 in the census. Is this the same Donald who had a son Hugh in 1823? It doesn’t look like the the Ewen aged 10 is the same as Hugh as heshould be given as at least 15. 
   However, if the wife in 4 and 7 is Catherine Cameron that suggests that the wife for 1 and 6 is Catherine Robertson. 
   Look though at the names in the parish register – Sarah Boyd becomes Ann Boyd, Sarah Boyd becomes Ann Boyd and Betty Wilson becomes Beatrice: you can’t but get the feeling that whoever recorded wife’s names wasn’t too accurate. So, you wonder if John son of Angus and Catherine McGregor in 1823 was actually a child Hugh by Catherine Robertson. OR is that Catherine Cameron again?? OR is it Ewen because that is interchangeable, apparently, with Hugh? 
    I have included Harry in the list above because according to Joseph Foss he is related in some way (but again it all depends on the DNA connection! 
    I should say that there is a letter from Duncan in Blarmacfoldich in 1825 in the PD60 Central Archives deposit saying that his brother Angus is bearing a letter of recommendation from John Gregorson of Ardtornish and that the family had been farming in that area since Prescription - so that suggests that they were there a while before Rob Roy and his son James Mhor. The significance for this is that it is claimed that Donald father of Angus was the last child of James Mhor son of Rob Roy. That would be doubtful if they had been farming there before Rob Roy. And where is Hugh – before he turns up in 1851 in Ballantrae Ayrshire aged 29 married with wife Elizabeth aged 22, son Angus 2 and son John aged 1 month? 
   The second family who are related through the early SNP BY-3023 (estimated to have arisen about 250CE [used to be labelled AD] may provide a clue. They also descend from an Angus – this Angus died in 1866 at the age of 68 according to his son, but on his marriage certificate (2nd marriage) in 1855 he says he is 50 and then in the censuses he gives his age as 45 in 1851 and 59 in 1861 so clearly didn’t actually know his age. But, presumably, when he said his father was Duncan, (a mason) and mother Margaret Fraser he knew that for a fact. We can find Duncan in the parish registers. Duncan was actually for a time in “Locheil’s Highland Regiment of Foot” and there are children born to them: Catherine in Paisley (however we cant be sure if this the same couple as he is a ‘threadmaker’ according to the record), a daughter (unnamed) born in 1799 in Boleskine and Abertarff , Invernesshire [bottom end of Loch Ness] (the father was ‘not near’ so the child required a sponsor), and then in 1805 the same couple (he is still a solder) had a son Duncan in Edinburgh. Angus’s son knew that his father’s father had been a soldier, but he said that Angus’s father was Angus not Duncan (however he also says the regiment was the 78th so that could be checked). Duncan was no doubt from the Highlands and since Lochiel’s lands include the Fort William area it seems likely that Duncan came from the same parish and family as the family of Hugh that was discussed above, but the parish records are incomplete, and as you can see what someone’s children thought their grandparents were called was not always accurate. 
    I know this was a long diversion but I wanted to show what the steps were to trying to find out the facts and how DNA can be used to confirm or deny a story, If you have followed the thread so far you will have guessed that Angus who married Catherine Cameron (1 and 6) was the son of Donald McGregor and Sarah Cameron but not entered in the parish record. The significance of this, as I noted earlier, is that Joseph Foss maintained that Donald was a son of James Mohr MacGregor, son of Rob Roy born in 1753 – more or less at the time James fled to France where he died in 1754. So, the DNA for Hugh son of Angus McGregor and Catherine Cameron would not necessarily show any connection to the Argyllshire MacGregors (always assuming Rob Roy’s line was legitimate) and the confusion of males with the same christian name creates a problem. In other words Donald born 1753?was not a child of James Mohr (unless Rob Roy’s line did not descend from the Argyllshire MacGregors. Of course, there is one little fly in the ointment – that Hugh (not entered in the parish record) had a sister called Annabell (also not entered) – which of course was the name of James Mohr’s wife (Annabell McNicol). Wouldn’t it be good if genealogy was straightforward.? So we are still looking for a probable male MacGregor descendant of Rob Roy!!


For the second part of this blog, I wanted to deal briefly with the Greig, Gregor, and Gregory families (no matter how spelt) to see if there was any overlap now that a number of BigY SNP results are available. This process involved downloading and combining all the Y chromosome 67 STR marker data from the FamilytreeDNA results page for the several surnames and editing it so that only the kit numbers appeared. I then ran the number sequence through Dean McGee’s Y DNA tool to produce a grid which shows the apparent relative closeness of some family groups by the colour matching – note the relationship time is not necessarily correct as I asked for a high percentage of probability. The resulting grid is Fig, 1. I say ‘apparent’ because this seems to show the relative closeness of families with different surnames, but the dates start to go back into the past when the closest relatives of the same name are ignored. There would have been less colour if I had opted for a lower percentage of probable time when individuals were related.


Fig 1 name relationship grid. From these, 6 more related groups can be seen – the largest being Gregories.


A well as this, Dean’s programme produced a Phylip grid – effectively the above in another form  -  and  that was uploaded into Daniel Huson and David Bryant’s program Splitstree (as usual)  which produced a spider diagram of  results as below (Fig. 2):



Fig 2:  raw chart of kits for the various surnames


At this point I inserted the surnames attached to each kit number and then added in red the BigY terminal SNPs (Fig 3).



Fig 3: the grid name and Big Y terminal SNP data


From this you can now see the various family groups and more particularly the high incidence of relationship among some members who share a surname, BUT, also  that others of that same surname are much less closely related. The SNP results suggest the common ancestor date, but again this needs to be treated with caution because the connection date could be a lot further back in  historic time than it appears. It does seem that ARE some branches which produced different surnames (for example Gregg and Gregory), but their common root is far back along the line, potentially pre surnames. Other than this there does not seem presently to be a direct link between or among these different surnames.

     As always, this is just my interpretation, but do feel free to contact me at richardmcgregor1ATyahoo.co.uk (substitute @ for AT). My thanks to Dean McGee, and the creators of Splitstree.



Viewing all 31 articles
Browse latest View live