
Duplicate check

Operation

Omega is designed so that each person appears only once in a database. Persons who have been entered twice are called duplicates.

To search a specific directory for duplicates, use the "Duplicate Check" menu item in the directory's context menu. To find duplicates of a specific person, use the "Duplicate Check" menu item in the person's context menu in the directory's detail view, or the "Duplicate Check" menu item in the Search menu of the index card.

Unless automatic duplicate checking is disabled in the Omega data entry settings, a newly created person is checked automatically before being saved to determine whether they are a potential duplicate of a person already recorded.

Procedure

The program uses a heuristic procedure to check whether it finds pairs of index cards that are so similar in terms of name, dates of birth, places, parents and spouses that they could be duplicate entries of the same person.

The procedure does not compare the exact names of the persons being checked but their phonetic equivalents, so that, for example, SCHMITT Francisca and SCHMIDT Franziska are identified as possible duplicates.
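The idea of phonetic matching can be illustrated with a short sketch. It uses the well-known American Soundex code as a stand-in; this page does not say which phonetic algorithm Omega actually uses, so the function below is an assumption for illustration only.

```python
# Illustrative only: American Soundex as a stand-in phonetic key.
# Omega's actual phonetic algorithm is not documented on this page.
SOUNDEX_CODES = {}
for letters, digit in (("BFPV", "1"), ("CGJKQSXZ", "2"), ("DT", "3"),
                       ("L", "4"), ("MN", "5"), ("R", "6")):
    for ch in letters:
        SOUNDEX_CODES[ch] = digit


def soundex(name):
    """Return the four-character Soundex key of a name."""
    letters = [c for c in name.upper() if c.isalpha()]
    if not letters:
        return ""
    result = letters[0]
    prev = SOUNDEX_CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "HW":
            continue  # H and W do not separate letters with the same code
        code = SOUNDEX_CODES.get(ch, "")
        if code and code != prev:
            result += code
        prev = code  # vowels have no code and reset the previous code
    return (result + "000")[:4]


print(soundex("SCHMITT"), soundex("SCHMIDT"))
print(soundex("Francisca"), soundex("Franziska"))
```

Both spellings of the surname share one key, as do both spellings of the given name, so a comparison based on such keys treats the two entries as the same name.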

To compare two people, a similarity score is determined, which is higher the more similar the two people are. For example, matching dates lead to an increase in the similarity score, while contradictory dates lead to a decrease.
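A minimal sketch of such a score follows. The `Person` fields, the weights, and the rule that missing data is neutral are all hypothetical; Omega's real criteria and weights are not given on this page.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Person:
    name_key: str                      # phonetic key of the name (assumed)
    birth_year: Optional[int] = None
    birth_place: Optional[str] = None


# Hypothetical weights, chosen only for this example.
NAME_BONUS = 5
MATCH_BONUS = 2
CONFLICT_PENALTY = 3


def compare_field(a, b):
    """Matching values raise the score, contradictory values lower it."""
    if a is None or b is None:
        return 0  # missing data neither confirms nor contradicts
    return MATCH_BONUS if a == b else -CONFLICT_PENALTY


def similarity(p, q):
    """Return a score that is higher the more similar p and q are."""
    score = NAME_BONUS if p.name_key == q.name_key else 0
    score += compare_field(p.birth_year, q.birth_year)
    score += compare_field(p.birth_place, q.birth_place)
    return score


a = Person("S530 F652", 1820, "Köln")
b = Person("S530 F652", 1820, None)    # same year, place unknown
c = Person("S530 F652", 1795, "Köln")  # contradictory year
print(similarity(a, b), similarity(a, c))
```

In this sketch the pair with the matching birth year scores higher than the pair with the contradictory one, even though the latter agrees on the birth place.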

If the similarity score reaches or exceeds the threshold set in the Omega search and check settings, the two persons are reported as potential duplicates. A higher threshold results in fewer potential duplicates being detected, while a lower threshold produces more false positives.
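The effect of the threshold can be sketched as a simple filter. The person identifiers, scores, and threshold values below are invented for illustration and do not correspond to Omega's actual scale.

```python
def potential_duplicates(scored_pairs, threshold):
    """Keep the pairs whose score reaches or exceeds the threshold."""
    return [(a, b, s) for a, b, s in scored_pairs if s >= threshold]


# Hypothetical (person, person, similarity score) triples.
scored_pairs = [("I1", "I2", 9), ("I1", "I3", 6), ("I4", "I5", 3)]

strict = potential_duplicates(scored_pairs, 8)   # higher threshold
lenient = potential_duplicates(scored_pairs, 5)  # lower threshold
print(len(strict), len(lenient))
```

With the stricter setting only the most similar pair is reported; the more lenient setting also reports a weaker match, which may turn out to be a false positive.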

Evaluation of the results

After the duplicate check, a report is displayed that lists the potential duplicates of each person checked, along with their similarity score.

In addition, a person list named Duplicates "..." is created. It contains every person for whom at least one sufficiently similar other person exists in the file.
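Conceptually, this list is the set of all persons that occur in at least one reported pair. A small sketch, using made-up person identifiers:

```python
def build_worklist(flagged_pairs):
    """Collect every person that appears in at least one flagged pair."""
    worklist = set()
    for a, b in flagged_pairs:
        worklist.add(a)
        worklist.add(b)
    return sorted(worklist)


# Hypothetical pairs reported as potential duplicates.
flagged_pairs = [("I1", "I2"), ("I1", "I3")]
print(build_worklist(flagged_pairs))
```

Each person appears once in the list, no matter how many potential duplicates were found for them.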

To review and, if necessary, resolve the potential duplicates found, proceed as follows:

  1. The results list Duplicates "..." is your worklist.
  2. Open each person in this results list one by one.
  3. Run the duplicate check from the Open menu of the index card to view the check result for the selected person.
  4. The person selector that opens lists the duplicates found. From a person's context menu you can view a duplicate or merge it with the person on the index card. Merging is the default action when you double-click the duplicate in the person selector with the left mouse button.
  5. Use the submenu in the menu bar of the index card to remove the person from the list of duplicates in the person directory, and thus from your worklist, once you have finished with that person.

Note: If automatic duplicate checking is enabled in the Omega data entry settings, auxiliary data structures (cross-indexes) are built before a person is saved to a record for the first time. For large records with many people, this can take some time. See the tip on speeding up this process in the Data Sources and Records section.