Skip to main content
Layer 1
lab

Hoofdnavigatie

  • Datasets
  • Tools
  • Tutorials
  • News and events
  • Blogs
  • About us

Secondary menu

  • NL

Text-analysis-23

Icoon dataset

DBNL OCR Data set

This data set consists of 220 texts digitised by the DBNL in TEI and txt (OCR).
Europeana Newspapers NER 1

Europeana Newspapers NER

Data set for evaluation and training of NER software in Dutch, French, Austrian and German.
Icoon dataset

Ground-truth IMPACT project

Collection of 99,95% correct OCR of books, newspapers, parliamentary papers and radio bulletins.

Filters

Content

  • Newspaper (4)
  • Book (3)
  • Parliamentary paper (2)
  • Radio bulletin (2)
  • Journal (1)
  • (-) Manually corrected text (3)

Category

  • Enrichment (1)
  • (-) Text analysis (3)

File format

  • ALTO (2)
  • TEI (1)
  • TXT (1)

Copyright

  • Other CC-licence (1)
  • (-) Public domain/CC0 (3)

Product

  • (-) Dataset (3)

In the KB Lab you can find experimental tools and data built for and from the digital collection of the KB, National Library of the Netherlands.

Footer-menu

  • Terms of use
kb-logo