Canonizer Product - 24 Jan 22 The Canonizer demonstrates how well canonicity can be classified based on the text of a novel.
Analysing hyperlinks in the KB web collection Blog - 14 Apr 22 Introducing the link analysis blog series, about the KB web archive, upcoming blogposts and links to published blogposts.
Let's get some data - Link analysis part 1 Blog - 15 Apr 22 About the reason for starting with this research, formulating a research question, selecting harvests and getting some data. Also containing information about extracting data and metadata fr...
Tagging the news from the Indies to automatically trace people and places through multiple historical languages Blog - 15 Apr 22 This blog will elaborate on how images of newspapers were automatically transcribed and manually labelled with tags.
Finding Phrases to Recognise Entities: Curating historical corpora in different languages to construct a single register of persons, places, groups and quantities Blog - 20 Apr 22 This blog will attend to whether historical NER can be improved by applying it to natural languages from two different language families and given the mixed language use in colonial newspape...
Digital Film Listings (DIGIFIL): Automatically Extracting Film Programming Information from Digitised Newspapers Blog - 04 Aug 20
KB National Library of the Netherlands adopts OpenJPEG for Delpher and more! Blog - 07 Oct 19 This guest blog post by René van der Ark describes the KB's process of adopting the open source library OpenJPEG.
Examining a multi-layered approach for classification of OCR quality without Ground Truth Event - 05 Mar 21