Skip to main content
Layer 1
Logo KB Lab
Hoofdnavigatie
Datasets
Tools
Tutorials
News and events
Blogs
About us
Affiliated researchers
Team
Contact
Secondary menu
NL
Open Menu
zoeken
Dataset
Historical newspapers OCR ground-truth
A dataset consisting of 2000 pages historical newspaper groundtruth, OCR and images.
SIAMESET
The SIAMESET dataset consists of images and metadata of advertisements from two Dutch newspapers.
KBK-1M
The KBK-1M Dataset is a collection of 1,603,396 images and accompanying captions from 1922 – 1994
Europeana Newspapers NER
Data set for evaluation and training of NER software in Dutch, French, Austrian and German.
Example set
This collection consists of a small selection of our digitised publications from the years 1870-1871.