The PastReader 2025 competition at IberLEF focuses on the automatic transcription of digitized historical Spanish press texts, addressing challenges such as OCR errors, low-quality pages, and complex newspaper structures. It includes two main tasks: correcting errors in OCR-generated text and end-to-end information extraction, aiming to advance the automation of historical text retrieval.
Forum
Year
2025
Link to publication

