Datasets & Tools


A simple search interface on Delpher data used for testing and demonstration.

Ground-truth IMPACT project

Collection of 99,95% correct OCR of books, newspapers, parliamentary papers and radio bulletins meant for training

Example set

This collection consists of a small selection of our digitised publications from the years 1870-1871.

Python API

Simple API to access KB collections using Python.


ALTO Edit is a simple browser-based post correction tool for ALTO XML files.

You are here