DBNL OCR Data set Product - 05 Jun 19 This data set consists of 220 texts digitised by the DBNL in TEI and txt (OCR).
Dictionary viewer Product - 16 Dec 16 The Dictionary viewer visualises the appearance of a word list in the newspaper corpus over time.
DIGGER Product - 05 Aug 20 DIGGER contains geocoded place names for 102 million news items from Delpher.
Entangled Histories: Ordinances of the Low Countries Product - 04 Dec 19 This special collection is made up of 108 books of ordinances published in the Early Modern Era.
Europeana Newspapers NER Product - 30 Nov 16 Data set for evaluation and training of NER software in Dutch, French, Austrian and German.
Example set Product - 30 Nov 16 This collection consists of a small selection of our digitised publications from the years 1870-1871.
Frame generator Product - 22 Mar 17 Tool for extracting topics, keywords and their co-occurrence patterns from a Dutch corpus.
Genre classifier Product - 22 Mar 17 The Genre classifier predicts the genre of a Dutch newspaper article, using plain text as input.
Is your OCR good enough? Product - 22 Mar 21 Comprehensive assessment of the impact of OCR quality in Dutch newspaper, journal and book collections.
jpylyzer Product - 30 Nov 16 Jpylyzer is a validator and feature extractor for JP2 (JPEG 2000 Part 1) images.