GEDCOM Files: Mac - PC
August 31, 2005 by admin
Note: This is a very long article. If your not doing a frequent exchange of GEDCOM files with other people, or you don’t have any problems when you do so, you should probably skip this. This grew out of a problem I had with a relative. We did much of what was listed below, until we were comfortable in knowing what the problems were on both sides, and how to work around them.
If your having problems with GEDCOM files, going between Macs and PCs (even Macs to Macs or PCs to PCs), there are a few things you can do:
Check whether you are exporting using ASCII, UNICODE or ANSEL. If your exporting using ASCII, switch to UNICODE or ANSEL, if your using UNICODE switch to..well, you get the point.
If you are importing from a PC, have them generate three GEDCOMs in the three different formats just mentioned, and check all three out.
It may not be a character set problem, it maybe your program or their program is generating too much information (too many tags, etc.), or one of the programs is writing data that is too long for the field and the going back and forth is cutting it off. Technically, the originating program may have no problem with storing data that falls outside of the GEDCOM standard, but when it goes to export and puts that information into the GEDCOM file, it’s either cutting it off, or placing it somewhere outside of the normal GEDCOM structure.
It can be tedious to find where any problems are happening. Many genealogy applications will store data they couldn’t import properly in another area, maybe it’s under some kind of research notes, or whatever.
That doesn’t always help you though, other than tell you that, yes, there is a problem, and here is the data. It doesn’t say “next time, export in this format instead”. It can help you though, if you determine there are only a few minor errors - you can then work with the other erson on how they have their data organized - if it’s just one or two tags having problems, you can maybe have them place the data in another tag, or somewhere else in the file.
Your best bet is for each of you to create a new file, with maybe three generations in a family, with no more than 10 individuals. Try to fill out all the tags that you can, print your file and have them print their file (both printings should be in some kind of format that shows as much data about several individuals as possible).
Then exchange these 10 person GECOM files, and then each print again. Put the printouts side by side, and see how everything came out.
You can also take a standard blank GEDCOM file, edit it with a text editor (but be careful, some text editors want to turn it into an RTF format, or have their own extended characters appended), and repeat the same process as the simple 10-person GEDCOM mentioned above.
You can find blank GEDCOM files here:
http://heiner-eichmann.de/gedcom/gedcom.htm - this site has a lot of information about GEDCOM files, and about the different character sets and fields, and it’s a good starting point.
I would recommend trying Smultron (http://smultron.sourceforge.net) on the Mac side, and stay away from the built-in Mac and Windows text editors, as they trend towards putting everything in RTF format. On the Windows side, I would say give Metapad (http://www.liquidninja.com/metapad/) a try. Both Metapad and Smultron have the ability to export clean text files.
Another way, you can go and download a GEDCOM file from half a dozen places on the internet, just random GEDCOM files, each of you import them, print out the full details for a few individuals out of each one, and compare notes. That sounds even more tedious, but it might help you pinpoint where problems are occuring, because at this point your (hopefully) trying otu GEDCOMs that were created on completely different applications that what you and the other person are using.
Yes, that increases the chances that there will be problems, since your adding to the equation, but at the same time you maybe able to learn a little bit more.
You can also fall back to an older standard on some applications when generating a GEDCOM file, and repeat the above process.
If you are not going to be exchanging data very frequently, it might not be worth it to go through all of the above. There is a point at which it almost makes as much sense to print the data out, or import the data into it’s own family file, and cut and paste the information into your main or research family file.
I would definitely recommend against merging data from one or more GEDCOM files generated by applications other than what you are using, for the simple fact that you should always be careful about what you import (i.e. fact-checking), as well as if there are slight differences between what is originally generated and what is imported, you might not find them until later on, and the problem goes from being relatively harmless to hours and hours of fixing (or simply reverting to an older GEDCOM if you keep good, regular backups.
More information on the GEDCOM 5.5 standard here : http://homepages.rootsweb.com/~pmcbride/gedcom/55gctoc.htm
GEDCOM 6 XML, that is supposed to fix all of our problems, will be a standardized format in time for your gg-grandchildren to use (or rather the automated software applications that go out and do the research for them
)
followup

Comments