MyHeritage have announced the publication of a massive new collection of 982 million names, extracted from their US and Canadian historical newspaper collections.
Historical newspapers are some of the most important sources for genealogical information because they are very rich in detail. Newspapers can often add color and personality to the dry facts that are often the output of other genealogical sources such as census records.
Table of Contents
About the New Index
The collection is an index of names that were extracted from existing free-text US and Canadian newspaper collections on MyHeritage. The free text in these collections was generated from the scanned images of newspapers using Optical Character Recognition (OCR) technology, which converts images into text.
The new Newspaper Name Index does not replace the free-text newspaper collections, but is added on top of them as a separate collection. What’s more, this name index is the fruit of only half of MyHeritages newspaper holdings and the other half of the name index is currently being generated and will be published soon, so that nearly one billion additional records will soon be added.
Records in the index include a person’s name, a snippet of text mentioning them in the newspaper, and the newspaper’s publication title, date, and place of publication.
Each record includes a scanned image of the original newspaper article. Some records will also include additional searchable information such as the name of a spouse and the place of residence based on the information extracted by the machine learning algorithms. Year range and place coverage in this collection vary greatly.
The new Newspaper Name Index will make it much easier for you to locate exciting details about ancestors that people may have missed in prior searches. With the addition of this huge collection, there are now 15.1 billion historical records on MyHeritage.
Read more at: MyHeritage