Newspaper-26

Historical newspapers OCR ground-truth

A dataset consisting of 2,000 pages of historical newspaper ground truth, OCR, and images.

CHRONIC

The CHRONIC dataset consists of metadata for 313K newspaper images classified using computer vision.

SIAMESET

The SIAMESET dataset consists of images and metadata of advertisements from two Dutch newspapers.
An example of an image and caption extracted from the front page of the January 27th, 1951 issue of De Nieuwsgier.

KBK-1M

The KBK-1M dataset is a collection of 1,603,396 images and accompanying captions from 1922 to 1994.

In the KB Lab you can find experimental tools and data built for and from the digital collection of the KB, National Library of the Netherlands.
