What is an Unique Ancestor?
A unique ancestor is simply a person who is an ancestor, notwithstanding that they may have contributed to our descent in more than one direct line.
How few or how many Unique Ancestors do we actually have?
I present herewith a hypothesis for a distribution curve that may give us a clue in the quantisation of the unique ancestral counts. There is a complex relationship between the total ancestors we have in a binary tree sense and those who are unique.
For some time now I have been interested in the mathematics of ancestry. My interest was triggered by the early April 1997 discussion thread in soc.genealogy.medieval on the counts of ancestors to a particular generation, and the expectation of the number of unique ones that count included. Subsequently, I have found that other people share this same interest, and who have followed similar lines of analysis to mine.
There appeared to be a strange divergence in examples previously given between the growth rates of total and unique ancestors that warranted further examination. I therefore wrote a small computer program of my own to assist in the study of these effects. For a specific individual in a database, the program counts the recorded ancestral links by the unique people involved. I have used this information to extrapolate an empirical curve for unique ancestors. Another programmer has also written a program to perform these same calculations, reference to which is shown below.
Theory and Example Data
I emphasise that my "research" is basically empirical, not theoretical. I am also not an expert in mathematics, merely an interested observer and applicator. The phenomenon I describe must have previously been noted, and perhaps there are other explanations of which I am unaware.
For the purpose of these notes, I have defined a few special terms used in the table, as follows.
The following details for Prince William of Wales from my own database cover the range to 80 generations (including the data of some legendary or mythical people), but it serves to illustrate the effects involved. Appropriate figures are shown in scientific notation for brevity. Whether or not we know exactly who our ancestors were by name, we can still be sure that people existed in those ancestral roles!
Since 1997 I have been researching and recording the ancestors of Prince William with kind assistance from a number of other people. In this time his ancestors in my database have grown from 13,000 to about 54,100 currently. In 1998 at the 40 generation point my records showed 21,400 out of a database total of 22,200. At that time my 40 generation projection of Estimated Uniques was 185,220. Compare that to now, where with 53,451ancestors at that same point the projection is 240,135. Considering the huge change in the number of people recorded, there is still a quite similar result for the projection.
The data we have is in effect a statistical sample, albeit a biased one. The greater the sample size, the higher the reliability of the information. In the above case, the percentages of Recorded Knowns to Total Ancestors at each 5 generation step are 100%, 78%, 44%, 32%, 22%, 12%, 4%, 1% and really tiny after 40 generations. The most recent generations therefore provide more reliable and complete data. For this reason I have only shown Estimated Uniques in the above table to 50 generations. Even at that point I believe the number is well overstated due to the sample size effect.
My database is not a particularly
large one, but the growth in relationships is still significant. Furthermore,
as the number of known people linked to the ancestral tree for the studied
individual grows, the relationship counts grow at a larger but varying
rate. These relationships result from the many complex interactions between
descendants. Historically, the volume of data is unfortunately weak on
the female side. When the data of later generations is examined where
full inclusion of female ancestry applies, a much richer set of complex
links result. As more people are added to the "tree" and linked
to existing ancestors, the Recorded Knowns grow significantly. This has
happened over the last five years in my data.
Prince William's Ancestry
The information I have compiled so far on the ancestry of Prince William shows some interesting numbers. It is a work in progress and details change day by day.
We often hear and see the comment that the royal families are inbred. This is generally a fallacy and derives from an emotive impression of the effects of remote cousin marriages. In the case of Prince William I have analysed his ancestry using my program ATMatch, described below. This shows that in 12 generations of parental ancestry he has a Wright Coefficient of Inbreeding of 0.0052% representing 33 common parental ancestors occupying 113 ancestral positions (out of 16,380). In addition, his Coefficient of Ancestry (a related measure showing where he has common ancestry regardless of which parent has the links) is 1.835% representing 701 ancestral repetitions occupying 2,821 ancestral positions. These coefficients are very insignificant.
Conclusion and Projection
My conclusion is that we all probably all have less than 1,000,000 real unique ancestors, within 80 generations.
This conclusion arises from my observation of the data available to me, in particular the first 40 generations. The following indicative curve shows my predicted pattern of incremental unique ancestors for succeeding generations. The vertical axis is the number of uniques per generation, and the horizontal the number of generations. The latter parts of the curve are logically derived, and may ultimately end with two people if taken to the limit.
There are mobility factors in the population that affect groups of people in different ways and alter the ancestral composition accordingly. The land owning classes tend to live and marry within small geographic regions, whereas itinerant workers move around the country so spreading their genes. This seems to be partially offset in the case of royals, as there is considerable mixture across wider social and political boundaries. It will take a lot more investigation before these patterns are clear.
The shape is similar to that of the skewed statistical F Distribution, but I do not profess to have a sensible explanation for this result. The peak appears to occur at around 25 to 30 generations, and moves a bit over the centuries. This may reflect the consequences of wars and famines.
For any individual, a hypothetical set of unique ancestors could accumulate as follows:-
I hope that these notes provoke some further thought and contributions on this topic.
On request I have provided some further details from my study of Prince William's ancestry. Click Here to view the extra information.
My simple FTRIPLET analysis program is freely available here. It is a useful tool to review an individuals relationships within their databases. It is menu driven and runs under DOS or the command prompt in Windows 95 and NT and reads its data from a GEDCOM file.
Please use the program to examine
the fascinating results of the relationships in your own databases, and
be sure to let me know if you find any particularly complex or interesting
ones. I would appreciate the data to help further research in the theory.
This Windows program provides a means
for comparing two separate ahnentafel lists for the same individual to
identify the similarilies and differences between them. I use it to compare
the information in my database with that of another person.
In addition to my own FTRIPLET and ATMATCH analysis programs, I have made use of a utility program written by Torben B. Andersen. This program is command line driven and provides a number of very useful facilities for examination and manipulation of GEDCOM files. It is called GEDUTIL.
It runs significantly faster than my own program in the calculation of unique ancestors, though providing a different set of other features.
Among its features is the ability to: -
The program was freely available
on e-mail request to the author Torben B. Andersen but may no longer be available.
Site Search Facility
You are visitor to this page. Thank you.