GEDCOM database table download

Started by Private User on Monday, December 5, 2011
Problem with this page?

Participants:

  • Private User
    Geni member
  • Geni Pro
  • Private User
    Geni Pro
Showing all 8 posts

Hi guys, I am a database developer and was curious to know if anyone else knows how to download the GEDCOM format into a flat file or a database table in order to do relational analysis.

I was thinking of doing some simple analysis, for example in a file with this type of information:

Unique Key
First Name
Last Name
Birth Year
Death Year
Country of origin
Fathers Name/Key
MothersName/Key
Generation Level

Or something similar to this.

Randy Stebbing Private User may want to comment or may know who else would have insight.

By the way, have you been to www.gedmatch.com which does data mining / analysis of DNA data (for genealogy purposes)? I have worked in this area a bit (data mining / analysis of large data collections) and would love to see analytical tools applied to the combined GEDCOM / DNA data.

GEDCOM is a plain text file, so it is just to extract the fields you want.

Well its not in tabular format, thats the problem, I wonder if one of the Geni developers could handle this for me or I could use one of the GEDCOM viewer programs. Ill figure it out eventually.

The types of questions I'd like to answer in a relational database are for example, what percentage of ancestors originated in what country times 1/# of ancestors supposed to be at that ancestor generation level.

Also, percent ancestors missing at that relationship level, i.e. where is that branch terminus? And what generation do my branches generally terminate at?

Also, what # of ancestors are repeated in various branches of the trees?

I found a few free programs that will convert the GEDCOM to a flat file, problem solved.

https://www.google.com/#hl=en&cp=4&gs_id=g&xhr=t&q=...

Hey guys I was able to answer an interesting question about my background using a relational database. Its about my ancestors birth country of origin:

I used the formula 1/generation # calculated using a recursive query (like grandparents are 25% great grandparents are 1/8 or 12.5%) times country of origin within the tree:

The totals are:

birth in England 37.899%
Unknown/Tree Missing 51.7%
Born in USA, no further info 7.68%
Ireland 1.22%
France .744%
Wales .313%
Germany .287%
Sweden .092%
Scotland .056%

This goes back 22 generations so its far enough back to get most migrations from Europe. By name, I'm assuming my background is probably more like 95% English.

There are some analysis / visualization tools in Geni too. Have you seen them?

Showing all 8 posts

Create a free account or login to participate in this discussion