Skip to main content
Layer 1
Logo KB Lab
Hoofdnavigatie
Datasets
Tools
Tutorials
News and events
Blogs
About us
Affiliated researchers
Team
Contact
Secondary menu
NL
Open Menu
zoeken
Dataset
Automatically extract XML content with Python
A quick-start into working with XML files using Python. The course covers various XML formats.
Historical newspapers OCR ground-truth
A dataset consisting of 2000 pages historical newspaper groundtruth, OCR and images.
SIAMESET
The SIAMESET dataset consists of images and metadata of advertisements from two Dutch newspapers.
KBK-1M
The KBK-1M Dataset is a collection of 1,603,396 images and accompanying captions from 1922 – 1994
Example set
This collection consists of a small selection of our digitised publications from the years 1870-1871.