On the 10th and 11th of April 2014, the Succeed project will hold a hackathon at the University of Alicante. Its aim is to look for ways of improving state-of-the-art open-source tools for digitisation of textual content such as books and newspapers.
Over two days, developers will work together in small groups to discuss, roadmap and plan the future development of existing tools. Some of the topics up for discussion are:
- How to train the Tesseract OCR engine.
- Creation of XSLT stylesheets for format conversion, e.g. hOCR, PAGE, FRXML.
- Debian package generation.
This hackathon provides a unique opportunity to meet developers involved in digitisation projects from all over Europe. The PSNC representative will take part in the workshop. Participation in the event is free of charge, but to ensure it, you should reserve your place. Participants are also encouraged to take a look at last year’s hackathon’s outcomes and background information.