Dutch Novels 1800-2000

Dataset that contains a corpus of 1346 novels from DBNL.

Is your OCR good enough?

Comprehensive assessment of the impact of OCR quality in Dutch newspaper, journal and book collections.

Entangled Histories: Ordinances of the Low Countries

This special collection is made up of 108 books of ordinances published in the Early Modern Era.

Ground-truth IMPACT project

Collection of 99.95% correct OCR of books, newspapers, parliamentary papers and radio bulletins.

Example set

This collection consists of a small selection of our digitised publications from the years 1870-1871.

In the KB Lab you can find experimental tools and data built for and from the digital collection of the KB, National Library of the Netherlands.