DNA Genealogy
We'll focus mostly on Y-DNA, as this site is about the Taylor surname,
passed down by males.
Genetic Genealogy
Genetic genealogy is the name for the field of study using DNA patterns to identify specific ancestors within a genealogical
time frame -- the last 800 to 1,000 years. A related, but separate, field is "genetic anthropology" which
uses DNA to trace the developments & migrations of the human race during
pre-historic times.
Basic principle
The basic principle of each field is that both Y-DNA (from fathers) & mtDNA
(from mothers) are passed down to offspring almost without change through many
generations. Genetic genealogy consists of looking for sufficient similarities in
DNA patterns to say that two patterns are essentially the same.
How it Works
When two or more DNA test results are compared and found to be identical
(or nearly so) we conclude that the individuals from which the samples come
share a common ancestor. The comparison is made based on the number of loci
tested in common. (One can't compare data versus no data.)
Step-by-step:
- Order a test from a reputable company with good after-test service and a
Taylor surname project. Decide how many markers you want tested.
- Receive the test (sample) kit in the mail. It will consist of a pair of
cheek swabs, instructions, and a container to mail the samples back.
- Swab the inside of your cheek as instructed, put the swabs in the
container & mail it back to the laboratory.
- Receive your individual results in the mail or by e-mail.
- Compare your results with others to find matches
and determine the genetic distance. Here is
where the payoff comes and where a project support group
can be very helpful.
- If you find matches, focus your documentary research on identifying the
most recent common male ancestor.
How much testing do I need?
DNA testing costs money; there's no denying it. And, it's a waste to
spend more on it than you need to. The typical 12-marker panel (the minimum
offered) costs about $150; a 37-marker panel about $350. The 12-marker panel
is less than half the cost of the 37 markers.
However, more of a waste is to spend some money and not get the benefits.
And, there are ways to save money and still get a good
result. So, we'll make some
recommendations for Taylors considering Y-DNA testing:
We'll start with these facts:
- Taylor is a common surname and was even more common in the past than it
is presently, after immigration introduced more surnames. Name alone isn't a factor to discriminate
one family line from another.
- Most Taylors are of English, Scots or Irish descent & fall into the R1b haplogroup, for whom the first 12
markers tested may match many people with no common ancestry for thousands
of years. {If you already know that you're not R1b, you
could probably get away with the smaller test.}
- More markers provide a greater base for confidence in a
CMA and may allow more precise determination of the probable
TMRCMA. (See Rule 2 below.)
- Upgrading a Y-DNA test to additional markers is more expensive than
ordering sufficient markers initially. {We do recommend
upgrading, however, if you have a 12-marker test resulting in many "false
positive" matches. }
Recommendations
- We recommend an "industrial-strength" initial Y-DNA test for
Taylors, no fewer than 25 markers and 37
markers is better. A 24 of 25 (or 35 of 37) marker match is considered
indicative of of a CMA where an 11 of 12 match is not.
- We recommend joining a Taylor Surname Project at the testing company of
your choice. This will get you a better price on the testing and may yield
better after-test service, depending on the activity of the project's
volunteers. For example, the current FTDNA price for a 37-marker Y-DNA test
as part of the Taylor Surname project is $259, a saving of $90 over
the non-project price.
- We recommend following up your test results by posting them to a free
database such as www.ysearch.org &/or
www.ybase.org. This will give you the
ability to find matches among those tested by other companies and
potentially maximizes the payoff from your investment.
Who to have do the test?
There are several companies in this field and sometimes the choices are
bewildering. While we have our own preferences, we recommend that you evaluate the options and make up your own
mind. Charles Kerschner's
website gives a list and links under the heading "Testing Companies and Organizations";
scroll down his page to find it.
Also see a sub-page,
http://www.kerchner.com/kerchdna.htm, or
http://freepages.genealogy.rootsweb.com/~bonsteinandgilpin/dna/ydnaco.htm.
These pages give information to help select & contact a testing company.
We have been happy about our experience with
Family Tree DNA. It has excellent technical standards, good service and a
fine group of Taylor Surname Project volunteers (including me) for post-test
support.
Interpreting results:
Making sense out of the report you get back from the laboratory and turning it into usable information is
the most difficult, and often frustrating, part of the process. Your results
mean nothing except in comparison to others. A greatly over-simplified view of
interpretation is this:
Identical results
Establish (with sufficient STR markers) certainty of a common ancestor but
do not provide a reliable timeframe for when this ancestor may have lived. Y- & mtDNA
change so infrequently that the most usual case is no change for many
generations.
Minor differences
Allow use of probabilities to estimate when this ancestor may have lived.
The more differences there are, the more likely it is that more generations
have passed.
Major differences
Establish unlikelihood of a common ancestor in the direct male or female
line.
We'll continue this discussion below,
after you've had a chance to get some basic concepts.
Some terms are essential to understanding genetic genealogy:
- Genealogy = A field of study which uses documentary records and
other techniques & resources to identify a person's ancestors, family
relationships, and other pertinent information such as dates & places of
birth, marriage and death. The field is limited by the information available
to (except in rare instances) historic times.
- Haplotype = A specific pattern in Y-DNA or mtDNA, unique to a
line of descendants. If two people share the same haplotype, they must share
a common ancestor within a genealogic time frame.. Haplotypes are described
in short-tandem-repeat (STR) analysis as a series of loci (place) names and their
corresponding allele counts.
- Haplogroup = A group of haplotypes which have some common
characteristics. Two people who share the same haplogroup may not
share a common ancestor within the past 20,000 years. Haplogroups are
classified and named according to their (believed) origins and are most
useful in genetic anthropology. The name starts with an initial letter, then
alternating numbers & letters are added to identify subdivisions.
For example, most men with the Taylor surname (like most with European
ancestry) belong to the R1b haplogroup. However, the Family Tree DNA Taylor
project has identified 27 groups of matching haplotypes among their
participants, plus about 80 other single haplotypes. Of those 107 separate
haplotypes, about 100 are in the R1b haplogroup.
Haplogroups, being so general, are not particularly useful in genetic genealogy.
A haplogroup is too broad a description of ancestry to be of much
service.
- Mitochondrial DNA = the DNA contained in mitochondria within the bodies of our cells. The
purpose of mitochondria is to supply energy to the cell so that it can function.
DNA from mitochondria is abbreviated "mtDNA".
- Y-chromosome = one of the two kinds of sex-determining chromosomes. Men have one X- & one Y-chromosome.
Women have two X-chromosomes. DNA from the Y chromosome is abbreviated
"Y-DNA".
- Autosomal DNA = the DNA of a non-sex chromosome (neither X-
nor Y-).
Autosomal DNA determines much of what we're about, hair & eye color for
example.
It is inherited from both mothers and fathers in ways which are not
completely understood, though seeming to average -- for groups -- about half
from each parent .
- STR (short tandem repeats) is the typical analysis method for
Y-DNA. It identifies loci (places or addresses) on the Y-chromosome and counts the
number of repeated patterns at each locus. Here is an example of a STR result with 30
tested loci:
| Locus |
Y393 |
Y390 |
Y19 |
Y391 |
Y385
a |
Y385
b |
Y426 |
Y388 |
Y439 |
Y389 |1 |
Y458 |
Y389 |2 |
Y458 |
Y459 a |
Y459 b |
| Alleles |
13 |
24 |
14 |
10 |
11 |
14 |
12 |
12 |
11 |
13 |
13 |
29 |
16 |
9 |
9 |
|
| Locus |
Y455 |
Y454 |
Y447 |
Y437 |
Y448 |
Y449 |
Y464 a |
Y464 b |
Y464 c |
Y464 d |
Y460 |
AN01T AP4 |
Y464g ||a |
Y464g ||b |
Y456 |
| Alleles |
9 |
11 |
11 |
25 |
19 |
29 |
16 |
9 |
9 |
11 |
11 |
25 |
15 |
19 |
16 |
Red font indicates "faster-mutating" (e.g., 1/250 generations) loci.
Differences in alleles may be less significant than "slower-mutating" (e.g.,
1/400 generations) loci.
- SNP (single nucleotide polymerase) identifies specific mutations of the DNA.
It is the typical analysis method for definitive identification of
haplogroups, for deep ancestry studies and for mtDNA. For example, those in
the R1 haplogroup share the M343 mutation; the R1b group share the P25
mutation; the R1b1b group share the P297 mutation.
(Mutations are named by the code for the research group discovering them and
the order in which they're discovered.)
- Marker has either of two meanings:
- In STR testing, it refers to a
place (locus) on the Y-chromosome, such as "Y385a" in the table above.
- In SNP testing, it refers to a specific mutation, such as "P297", which marks
off one haplogroup from another.
- Alleles (or allele value) = the count of the number of of short
tandem repetitions at a specific locus (marker).
- Surname Project = A Y-DNA study of a particular surname. In many
cultures, a family name passes from father to son which correlates with
Y-DNA inheritance. Though there are exceptions, this facilitates use of
Y-DNA to identify paternal lines. Joining a project provides benefits such
as discounted testing and post-test support in interpreting results.
- Direct Descendancy = An unbroken line of descendancy through
either a paternal or maternal line:
- Father to son to son's son to son's son's son, etc., OR
- Mother to daughter to daughter's daughter, etc.
There may be no intervening children of the opposite gender or the direct descendancy line is
broken and the Y-DNA or mtDNA will not be inherited.
- Match = the degree to which DNA results are similar, often
expressed as a ratio of markers (loci) in agreement as the numerator, and total
markers tested in common as the denominator, e.g., 12/12, 24/25, 35/37.
- Genetic Distance = A measure of the difference between two
haplotypes.
- Simplified method: Differences in allele counts are added across all
loci tested in common.
- If there is a difference in allele counts of 1 in one locus,
genetic distance is 1. An allele difference of one each in two loci
or 2 in one locus gives a genetic distance of 2.
- Differences are absolute; a minus on one does not cancel a plus
on anther.
- The simplified method is the one commonly meant in genetic
genealogy.
- The true scientific method is more precise & useful for
inter-species comparison (e.g., humans vs. chimpanzees), but it is more complex
mathematically :
- D =−log(In) ,
- where In = (Σ Pix•Piy)÷[(ΣPix²)·( ΣPiy²)]½.
- Pix is the proportion of allele i in population X, Piy is the proportion of allele i in population Y.
- Common Male Ancestor = the ancestor established by Y-DNA to be
shared by the individuals who have been tested. It is abbreviated "CMA".
"MRCMA" refers to the most recent CMA, reasoning that if one ancestor
is shared previous ancestors will also be shared. "TMRCMA" refers to
the time (in generations) back to the most recent common male
ancestor.
- Confidence level = the confidence, expressed as a percentage, one
may have in a statistical estimate. For example, a 90% confidence level
means that one can be 90% certain a statement is true but there is a 10%
chance the statement is false.
- Non-paternal event = any event which results in a child bearing the
surname of someone other than his or her biological father. Non-paternal
events may include
adoptions & name changes.
So, you have your Y-DNA results for at least 25 markers in hand or on
the website and you want to know if any others in this computer database
share a CMA with you.
Rule 1:
Comparison of results is a series of comparisons, one against another, one
pair at a time.
One set of results (yours) is constant and others are examined for agreement; each set of markers
(loci) with allele values are compared. Then, those
with the matching parameters you set are selected for review. Computers
can do this so fast you don't notice, but that's what's going on.
- To be compared, both markers must have a test value.
Markers that haven't been tested by both cannot be compared. You don't know
whether they match or don't, so any conclusions must be based on those
markers tested in common.
- Naturally, conclusions based on more markers are better than those
based on fewer markers. A comparison of 37 markers tested in common is
worth more than a comparison of 12 markers tested in common.
- Conclusions based on 12 markers or fewer are suspect as to whether a
CMA exists within a genealogical time frame.
Rule 3:
Genetic distances (differences between allele
values) are cumulative & absolute. A minus one on one marker isn't canceled by a
plus one on another marker; they add to give a genetic distance of 2.
Rule 4:
All markers (loci) aren't exactly the same, nor is the
significance of their differences. Some markers are believed to mutate
faster than others. In general, the first 12 markers tested by most
labs are the slower-mutating markers. Differences in faster-mutating markers
may be less significant than differences in slower-mutating markers.
Rule 5:
The greater the percentage of markers that agree, the
higher the chance is that you and the other person share a common male
ancestor.
Rule 6:
The greater the percentage of markers that agree, the
more likely it is that the shared common male ancestor is within a fewer
number of generations.
Rule 7:
The smaller the percentage of markers that agree, the less likely it is that
you and the other person share a common male ancestor and that any CMA is within a genealogical
time frame.
What do we mean by identical, minor differences & major
differences?
-
Identical results: Are those with no
differences in any of the loci tested in common. When 25 or more loci are
tested in common and agree exactly, the probability of a CMA within the
most recent few generations is almost 100%. Due to inherent limitations, an
exact match on only 12 loci is not considered as relevant.
- Minor differences: A 24/25, 35/37 or 64/67
match carries a high probability of a CMA
within the most recent few generations.
- Major differences:
Establish unlikelihood of a common ancestor in the direct male or female
line.
Probabilities:
DNA questions such as "Do I and another person share a common ancestor?"
and "When might that ancestor have lived?" are subject to answer by the
application of probability theory. For an article on this subject see "Probabilities
in DNA".
One line of Taylors is descended from sons of an Abraham Taylor of
Baltimore County, Maryland; the sons arrived in Craven County in 1729 and
enough of their descendants have been tested to establish Abraham's
haplotype.
Approximately 10 men throughout North America have been found to share the haplotype
and most documented as his heirs. (One or two seem to have descended from
another branch that remained in England longer.)
Click here to view this haplotype.
Genetic anthropology also uses Y-DNA & mtDNA, with somewhat different techniques
& purposes. It seeks to trace the
broad outlines of human development & migration in prehistoric times.
Essentially, it identifies haplogroups and their prevalence in various parts
of the world to determine how the haplogroups relate to each other and how
ancient groups of people migrated. It is sometimes referred to as "deep ancestry".
Genetic anthropology, for example, provides the scientific basis for the
book, "The Seven Daughters of Eve".
This is an important field of study, contributing much to our understanding
of the human race. It is, however, not genealogy; it is anthropology. Someday, the fields of
genetic genealogy and genetic anthropology may connect; that day is not now nor in
the near future.
Yet a third use of DNA analysis is to attempt to identify the genetic
origins of ALL of our ancestors, by proportion. This field uses autosomal DNA
and matches patterns found against databases of prevalence of those patterns. It might be
thought of as a middle ground between genetic genealogy (specific ancestors)
and genetic anthropology (deep ancestry).
Reports generated with this technique typically contain places of
origin and percentages of ancestry. For African-Americans, it may identify
African tribes of association.
This approach is sometimes sought by those who have little hope of finding
specific ancestors by the usual genealogical techniques.
This author is skeptical, for these reasons:
- It is not genealogy. We define genealogy as study to identify specific
individuals as our ancestors and facts about them. This approach can not satisfy the definition.
- Inheritance of autosomal DNA is insufficiently understood. Overall, the
science
has not advanced significantly since Mendel's studies of white & red flowers.
On average, over a sufficiently large group, inheritance seems to be half from
mothers & half from fathers; but averages are inapplicable to individuals. We do not
know the rules which govern what percentage we get from our maternal
grandfather versus our paternal grandmother. If we do not know this, we can
not reason backward to prior generations.
- Adequacy of the databases from which conclusions are drawn is suspect. Only a very small
fraction
of the world's population has undergone any sort of DNA testing and even
fewer have had autosomal DNA tested.
- Proprietary technology is undisclosed. We do not know how the reports
are arrived at and independent scientists can not evaluate the methodology
or methodologies.
Some results reported on mailing lists seem (to say the least) bizarre. One might
not be able to absolutely rule out that one's English third-great-grandmother had
an affair with a Chinese sailor (e.g., "3% Asian") during the
Victorian era, but the chance seems remote in the usual instances.
In short, a Ouija board is a cheaper alternative, and possibly as accurate.
Other Sites & Resources
Here are websites which the author has found to be helpful and
authoritative:
- Charles Kerschner has
published an informative and helpful website, covering many aspects of
genetic genealogy. (We've unashamedly borrowed some of his concepts here.)
- International Society of Genetic Genealogy,
founded by DNA project coordinators, maintains a website to introduce the
subject to the public.
- Family Tree DNA is a provider of genetic genealogy services. Its website
provides much useful information. {Disclosure: The author
volunteers on FTDNA's Taylor Surname Project.}
Contact Author