Tools

Is your OCR good enough?

Description This webpage contains information about the datasets used and code developed as part of the

Frame generator

Tool for extracting topics, keywords and their co-occurence patterns from a Dutch corpus.

Ground-truth IMPACT project

Collection of 99,95% correct OCR of books, newspapers, parliamentary papers and radio bulletins meant for training

Example set

This collection consists of a small selection of our digitised publications from the years 1870-1871.

Keyword generator

A command-line tool to extract significant keywords from a collection of sample texts.

ALTO Edit

ALTO Edit is a simple browser-based post correction tool for ALTO XML files.

You are here