Historical newspapers OCR ground-truth
A dataset consisting of 2,000 pages of historical newspaper ground truth, OCR, and images.
The CHRONIC dataset consists of metadata for 313K newspaper images classified using computer vision.
The KBK-1M dataset is a collection of 1,603,396 images and accompanying captions from 1922 to 1994.