Tools

DBNL OCR Data set

This data set consists of 220 texts digitised by the DBNL, available as TEI and as plain text (OCR).

Narralyzer

Narralyzer finds and visualises characters in texts and the relationships between them.

Frame generator

Tool for extracting topics, keywords and their co-occurrence patterns from a Dutch corpus.

Europeana Newspapers NER

Data set for evaluating and training NER software on historical newspapers in Dutch, French and German (Austrian).

Ground-truth IMPACT project

Collection of 99.95% correct OCR of books, newspapers, parliamentary papers and radio bulletins meant for training

Keyword generator

A command-line tool to extract significant keywords from a collection of sample texts.

Scansion generator

The Scansion generator is a tool developed to detect meter in Dutch poetry.

DBNL ngram viewer

An ngram viewer counting terms and phrases in the Digital Library of Dutch Literature (DBNL).
