Tools

DIGGER

DIGGER contains geocoded place names for 102 million news items from the digitised newspapers from Delpher.

Historical newspapers OCR ground-truth

A dataset consisting of 2000 pages historical newspaper groundtruth, OCR and images.

Newspaper ngram collection

This dataset was generated by PoliticalMashup and contains yearly counts for word ngrams for n ranging

Europeana Newspapers NER

Data set for evaluation and training of NER software for historical newspapers in Dutch, French, Austrian

Ground-truth IMPACT project

Collection of 99,95% correct OCR of books, newspapers, parliamentary papers and radio bulletins meant for training

Example set

This collection consists of a small selection of our digitised publications from the years 1870-1871.

You are here