Skip to main content
Layer 1
Logo KB Lab
Hoofdnavigatie
Datasets
Tools
Tutorials
News and events
Blogs
About us
Affiliated researchers
Team
Contact
Secondary menu
NL
Open Menu
zoeken
Tool
DBNL OCR Data set
This data set consists of 220 texts digitised by the DBNL in TEI and txt (OCR).
Narralyzer
Narralyzer finds and visualises characters in texts and the relationships between them.
SIAMESET
The SIAMESET dataset consists of images and metadata of advertisements from two Dutch newspapers.
SIAMESE
Tool to identify visual trends in advertisements in Dutch historical newspapers.
Newspaper ngram collection
This dataset contains yearly counts for word ngrams from the KB newspaper collection.
Frame generator
Tool for extracting topics, keywords and their co-occurence patterns from a Dutch corpus.
Genre classifier
The Genre classifier predicts the genre of a Dutch newspaper article, using plain text as input.
KBK-1M
The KBK-1M Dataset is a collection of 1,603,396 images and accompanying captions from 1922 – 1994
Example set
This collection consists of a small selection of our digitised publications from the years 1870-1871.
Python API
Simple API to access KB collections using Python.
KB Newspapers image count
Graphical overview of images in KB Newspapers.
Scansion generator
The Scansion generator is a tool developed to detect meter in Dutch poetry.