What is Source Extractor?
Source Extractor is a (free) program to help you extract sources from notes in a GEDCOM file. Early versions of PAF didn't support Sources, so people often entered their source information in the Notes area. If you have a GEDCOM file with source information in the Notes area and you want to generate Sources for that information, Source Extractor will help you do this.
How does it work?
Source Extractor works as a three-step process:
First Run the Notes Extractor over your GEDCOM file. This creates an alphabetically-sorted list of all of the note lines in your GEDCOM.
Second Edit this file to group the lines together into lines representing each source. You should leave blank lines between the groups. In the next step, a source will be added to the gedcom file for each group, and citations to that source will be added every time a note matches one of the lines in the group. For example, if you had:
Allen County Marriage Records Allen Co Marr Recs Allen Cty Marriage Recs Allen County Birth Records
in your grouped notes file, then two sources would be added to your gedcom file in the next step: one for Allen County Marriage Records and one Allen County Birth Records. For every individual with a note line of "Allen County Marriage Records" or "Allen Co Marr Recs" or "Allen Cty Marriage Recs" we would add a citation to the first source, and for every individual with a note line of "Allen County Birth Records" we will add a citation to the second source.
Third Run the Sources Extractor over your GEDCOM file, giving it the grouped notes file you just created. Sources Extractor will generate an updated GEDCOM file with new sources and source citations based upon your grouped notes file.
How can I get it?
You can get the program from http://www.quass.org/pafutils.jar. Instructions for running it appear below
How do I run it?
Warning: In its current state, Source Extractor is not for the faint of heart. It requires installing Java onto your computer (a ~20MB download) and entering data into a Command Prompt window. Also, I've tested this with Java 5.0 but not 6.0, although it should work.
1. Download JRE 5.0 or 6.0 from Sun:
2. Make sure you can run java
3. Save pafutils.jar (available from http://www.quass.org/pafutils.jar) (available from http://www.quass.org/patterns.txt) to someplace easy to get to from the Command Prompt window. I'd suggest doing the following, but you can save it anywhere.
4. Create a gedcom file from your pedigree and save it in the directory you just created
6. Review the notes file you just created
7. Review the patterns file I sent you
8. Group the note lines into sets of lines corresponding each source to create
Allen County Marriage Records
9. Once you have grouped the notes file, you need to create the source citations
Who wrote it?
The program was originally written by Josh Monson, with modifications by Dallan Quass and guidance from Don Snow.
I know Java - how can I help?
Leave me a message. If other people are interested in contributing I'll put it on SourceForge.