Datasets & Tools

DBNL OCR Data set

This data set consists of 220 texts digitised by the DBNL in TEI and txt (OCR).

Narralyzer

Narralyzer finds and visualises characters in texts and the relationships between them.

Frame generator

Tool for extracting topics, keywords and their co-occurence patterns from a Dutch corpus.

Keyword generator

A command-line tool to extract significant keywords from a collection of sample texts.

Scansion generator

The Scansion generator is a tool developed to detect meter in Dutch poetry.

You are here