

Methods for Automated Text Digitisation

Research output: Book/Report (Report)

  • David Owen
  • Quentin J. Groom
  • Alex Hardisty
  • Thijs Leegwater
  • Myriam van Walsum
  • Noortje Wijkamp
  • Irena Spasic
  • Mathias Dillen
  • Laurence Livermore
  • Sarah Phillips
  • Zhengzhe Wu
In this document we describe an effective approach to the automated text digitisation of specimen labels. These labels contain much useful data about the specimen, including its collector, country of origin and collection date. Our approach to automatically extracting these data takes the form of a pipeline. We make recommendations for the pipeline’s component parts based on current state-of-the-art technologies.
Original language: English
Number of pages: 133
Publication status: Published - 31-Jan-2019

