Skip to main content
Layer 1
Logo KB Lab
Hoofdnavigatie
Datasets
Tools
Tutorials
News and events
Blogs
About us
Affiliated researchers
Team
Contact
Secondary menu
NL
Open Menu
zoeken
Dataset
DIGGER
DIGGER contains geocoded place names for 102 million news items from Delpher.
Entangled Histories: Ordinances of the Low Countries
This special collection is made up of 108 books of ordinances published in the Early Modern Era.
DBNL OCR Data set
This data set consists of 220 texts digitised by the DBNL in TEI and txt (OCR).
Narralyzer
Narralyzer finds and visualises characters in texts and the relationships between them.
Newspaper ngram collection
This dataset contains yearly counts for word ngrams from the KB newspaper collection.
Frame generator
Tool for extracting topics, keywords and their co-occurence patterns from a Dutch corpus.
Genre classifier
The Genre classifier predicts the genre of a Dutch newspaper article, using plain text as input.
Europeana Newspapers NER
Data set for evaluation and training of NER software in Dutch, French, Austrian and German.
Ground-truth IMPACT project
Collection of 99,95% correct OCR of books, newspapers, parliamentary papers and radio bulletins.
Example set
This collection consists of a small selection of our digitised publications from the years 1870-1871.
Python API
Simple API to access KB collections using Python.
ALTO Edit
ALTO Edit is a simple browser-based post correction tool for ALTO XML files.
Scansion generator
The Scansion generator is a tool developed to detect meter in Dutch poetry.
Newspaper ngram viewer
The PoliticalMashup ngram viewer visualises the frequency of a phrase in Delpher newspapers.