KB Lab
Tool
Extracting text from EPUB files in Python
Johan van der Knijff published a brief introduction to extracting unformatted text from EPUB files.
NL-Menu
NL-Menu was the first Dutch web index; it was restored in 2018.
Ot & Sien dataset
Data for developing automatic visual object recognition tools for children’s books.
Assisted keyword assignment using Annif
Annif can be used to make cataloging more efficient by suggesting authors and keywords.
Historical newspapers OCR ground-truth
A dataset consisting of 2000 pages of historical newspaper ground truth, OCR and images.
Brinkeys Tool
Brinkeys is a demo that suggests a number of Brinkman topics for a dissertation based on its contents.
Web Collection Chinese Netherlands
This web collection contains archived websites from the Chinese community in the Netherlands.
CHRONReader
With CHRONReader you can search in Delpher's newspaper images using categories and keywords.
CHRONIC
The CHRONIC dataset consists of metadata for 313K newspaper images classified using computer vision.
SIAMESET
The SIAMESET dataset consists of images and metadata of advertisements from two Dutch newspapers.
SIAMESE
Tool to identify visual trends in advertisements in Dutch historical newspapers.
xportal
A simple search interface on Delpher data used for testing and demonstration.
Europeana Newspapers NER
Dataset for evaluating and training NER software on Dutch, French, Austrian and German texts.
Example set
This collection consists of a small selection of our digitised publications from the years 1870-1871.
ALTO Edit
ALTO Edit is a simple browser-based post correction tool for ALTO XML files.
jpylyzer
Jpylyzer is a validator and feature extractor for JP2 (JPEG 2000 Part 1) images.