LEEUWARDER COURANT

271 YEARS OF NEWSPAPER HISTORY ONLINE

In 2023, the Dutch newspaper De Leeuwarder Courant celebrates its 271st anniversary. No other newspaper can claim a longer publication history under the same title. Every day, avid readers get their news from one of the most respected newspapers in the Netherlands, either in print or via the online edition.

ARCHIVE SPECIFICATIONS

Archive Period: 1752-2023

Pages in the archive: 2,223,976

Articles in the archive: 23,160,270

Number of words in the archive: 6,321,178,891

BEFORE ADDING THE ARCHIVE:

  • Unique visitors per month: 350,000

  • Page views per month: 3,000,000

  • Average number of page views per visit: 3

AFTER ADDING THE ARCHIVE:

  • Unique visitors per month: 600,000

  • Page views per month: 9,500,000

  • Average number of page views per visit: 9
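The relative growth implied by the before and after figures can be worked out directly. The following is a minimal Python sketch; the variable and function names are illustrative and not part of any X-CAGO tooling.

```python
# Figures copied from the "before" and "after" lists above.
before = {"unique_visitors": 350_000, "page_views": 3_000_000, "views_per_visit": 3}
after = {"unique_visitors": 600_000, "page_views": 9_500_000, "views_per_visit": 9}


def percent_change(old, new):
    """Relative change from old to new, expressed in percent."""
    return (new - old) / old * 100


# Percentage change per metric, rounded to one decimal place.
changes = {name: round(percent_change(before[name], after[name]), 1) for name in before}

for name, change in changes.items():
    print(f"{name}: {change:+.1f}%")
```

On these figures, unique visitors grew by roughly 71%, monthly page views by roughly 217%, and page views per visit by 200%.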

SUMMARY OF THE PROJECT

X-CAGO, a media intelligence solutions and technology company based in Roermond, the Netherlands, is the exclusive provider of digital and archiving services for the publisher of De Leeuwarder Courant. The contract includes making the latest issue available online, as well as digitising and making available online almost 1 million pages of news from the 271 years the newspaper has been published. The first issue of the newspaper was printed in July 1752.

As today's news is tomorrow's history, the cultural heritage contained in the archives of De Leeuwarder Courant is considered one of the most important historical sources in the Netherlands. The newspaper's archive covers seminal events and figures of history, including the French Revolution, Napoleon, the Titanic, Hitler, Stalin and the World Cup, as well as obituaries and local news.

The initiator of this project is the Foundation Digital Archive Leeuwarder Courant, a cooperation of De Leeuwarder Courant, Tresoar and the Frisian Historical and Literary Centre. Digitising the vault of the Leeuwarder Courant has long been on the agenda of all project partners, and X-CAGO's superior technologies were key to making the rich content of this extraordinary newspaper archive accessible.

THE DIGITISATION OF THE ARCHIVE

X-CAGO's approach, which differs from that of many other providers, is to scan directly from bound books and other hard copies rather than digitising from microfilm. Microfilm has many disadvantages: lack of colour, lack of authenticity and inferior optical character recognition (OCR) results. Relying on X-CAGO's advice and reputation, the Foundation opted to digitise from the original sources.

By digitising the almost one million newspaper pages from hard copy, a rich source of Dutch and Frisian culture and heritage is unlocked and made available to the public via the internet. Moreover, this content is preserved for future generations, making it equally accessible to journalists, scientists, politicians, bankers, students and parents, whether it is news from today or an article from 250 years ago.

WHY DIGITISE THE ARCHIVE?

WHAT MAKES THIS PROJECT UNIQUE?

271 years of continuous journalism from a single, important and influential source, offering a wealth of Dutch and Frisian cultural heritage that stimulates and influences scholarly research in many disciplines.

Diversity: The content spans several languages, including Dutch, Frisian and French, and reflects how language use has changed over the last 271 years.

IMPLEMENTATION

X-CAGO is carrying out this project with a turnkey end-to-end solution using the most advanced video scanners currently available on the market. This includes:

  • Scanning the newspaper pages;

  • Segmentation of the pages into articles and advertisements;

  • Labelling of headlines, captions and images;

  • Tracking of the reading order and publication on the internet.
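The steps listed above suggest a simple data model: a scanned page is split into segments (articles and advertisements), each segment carries labelled elements (headlines, captions, images), and a reading order ties the segments together. The sketch below is one hypothetical way to represent this in Python; the class and field names are assumptions for illustration and do not describe X-CAGO's actual formats.

```python
from dataclasses import dataclass, field


@dataclass
class Element:
    kind: str        # assumed labels: "headline", "caption", "image" or "body"
    text: str = ""


@dataclass
class Segment:
    kind: str            # "article" or "advertisement"
    reading_order: int   # position of the segment in the page's reading order
    elements: list = field(default_factory=list)


@dataclass
class Page:
    number: int
    segments: list = field(default_factory=list)

    def in_reading_order(self):
        """Segments sorted by their reading-order position."""
        return sorted(self.segments, key=lambda s: s.reading_order)

    def headlines(self):
        """Headline texts of all articles, in reading order."""
        return [
            e.text
            for s in self.in_reading_order()
            if s.kind == "article"
            for e in s.elements
            if e.kind == "headline"
        ]


# Hypothetical sample page; the texts are placeholders, not archive content.
page = Page(number=1, segments=[
    Segment("article", 2, [Element("headline", "Second story")]),
    Segment("advertisement", 3, [Element("body", "Sample advertisement")]),
    Segment("article", 1, [Element("headline", "First story"),
                           Element("caption", "Sample caption")]),
])

print(page.headlines())
```

Keeping the reading order as an explicit field, rather than relying on the order in which segments were detected, lets a segmentation step and an ordering step run independently.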

A recurring problem when digitising historical content is changes in spelling over time. In addition, sometimes completely different words with different meanings may have been used. Without updating spelling and language usage, significant problems can arise in finding the information sought. To solve these problems, X-CAGO developed its own web-based solution called ‘Archive ExPress’. As a result of this project, X-CAGO now includes Frisian in the list of supported languages. ‘Archive ExPress’ also has the ability to eliminate typical OCR errors.
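To illustrate why spelling changes matter for search, the sketch below maps a few historical Dutch spellings to their modern forms before matching a query against an article. It is a simplified assumption of how such normalisation might work, not a description of how ‘Archive ExPress’ is implemented; the word list and function names are hypothetical.

```python
import re

# Small illustrative sample of historical-to-modern Dutch spellings.
# This mapping is an assumption for the example, not X-CAGO's data.
HISTORICAL_TO_MODERN = {
    "mensch": "mens",
    "visch": "vis",
    "nederlandsch": "nederlands",
}


def normalise(text):
    """Lower-case the text, split it into words and modernise known spellings."""
    words = re.findall(r"\w+", text.lower())
    return [HISTORICAL_TO_MODERN.get(word, word) for word in words]


def matches(query, article_text):
    """True if every normalised query word occurs in the normalised article."""
    article_words = set(normalise(article_text))
    return all(word in article_words for word in normalise(query))


# A modern query finds an article written in the older spelling.
print(matches("Nederlands vis", "Het Nederlandsch bedrijf verkocht visch"))
```

Without the normalisation step, a reader searching for the modern spelling would miss articles that only contain the older form.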

SPELLING AND QUALITY CONTROL

ADVICE & SOLUTIONS

FOR MORE INFORMATION

Please contact X-CAGO for more information at sales@x-cago.com

SERVICES YOU MAY BE INTERESTED IN

  • SUPERSET

    X-CAGO currently processes more than 5,000 newspaper and magazine titles from PDF input files into one or more XML/JSON output formats.

  • WEB CRAWLING

    This is the conversion of articles on web pages into a consistent XML/JSON output format. This is achieved through the use of a high-precision web crawler.

  • HISTORIC DIGITISATION

    This involves the digitisation of hard copy archival content for media companies / publishers.

  • ARCHIVE EXPRESS

    Archive ExPress successfully captures, stores, researches, publishes, distributes and syndicates content from both print media (newspapers, magazines, books, catalogues, etc.) and digital media.

  • CONTENT TRANSLATIONS

    X-CAGO can provide fast and reliable automated content translations in no fewer than 30 languages. New languages are added regularly.

  • ABOUT US

    Create new revenue opportunities through X-CAGO’s Software Media Solutions made just for you.