Tools

Word embedding playground

Word embedding playground provides tools for training and fine-tuning word embedding models.

DIGGER

DIGGER contains geocoded place names for 102 million news items from the digitised newspapers from Delpher.

Newspaper ngram collection

This dataset was generated by PoliticalMashup and contains yearly counts for word ngrams for n ranging

Frame generator

Tool for extracting topics, keywords and their co-occurence patterns from a Dutch corpus.

Genre classifier

The Genre classifier predicts the genre of a Dutch newspaper article, using plain text as input.

Example set

This collection consists of a small selection of our digitised publications from the years 1870-1871.

Keyword generator

A command-line tool to extract significant keywords from a collection of sample texts.

You are here