KB Lab
Automatically extract XML content with Python
A quick-start guide to working with XML files in Python. The course covers various XML formats.
Dutch Novels 1800-2000
A dataset containing a corpus of 1,346 novels from DBNL.
Historical newspapers OCR ground-truth
A dataset consisting of 2,000 pages of historical newspaper ground truth, OCR and images.
CHRONReader
With CHRONReader you can search in Delpher's newspaper images using categories and keywords.
xportal
A simple search interface for Delpher data, used for testing and demonstration.
Dictionary viewer
The Dictionary viewer visualises the appearance of a word list in the newspaper corpus over time.
KBK-1M
The KBK-1M dataset is a collection of 1,603,396 images and accompanying captions from 1922 to 1994.
Europeana Newspapers NER
Dataset for evaluating and training NER software in Dutch, French, Austrian and German.
Ground-truth IMPACT project
A collection of 99.95% correct OCR of books, newspapers, parliamentary papers and radio bulletins.
Example set
This collection consists of a small selection of our digitised publications from the years 1870-1871.
Python API
Simple API to access KB collections using Python.
ALTO Edit
ALTO Edit is a simple browser-based post-correction tool for ALTO XML files.
Newspaper ngram viewer
The PoliticalMashup ngram viewer visualises the frequency of a phrase in Delpher newspapers.