Scribe Identification in Medieval English Manuscripts

Research output: Contribution to conference › Paper › peer-review


In this paper we present work on automated scribe identification on a new Middle English manuscript dataset dating from around the 14th to 15th centuries. We discuss the image and textual problems encountered in processing historical documents, and demonstrate the effect of accounting for manuscript style on the writer identification rate. Using the grapheme codebook method with a modification to the distance measure, we achieve a Top-1 classification accuracy of up to 77%. The performance of the Sparse Multinomial Logistic Regression classifier is compared against five k-NN classifiers. We also consider classification against the principal components and propose a method for visualising the principal component vectors in terms of the original grapheme features.
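As a rough illustration of the pipeline the abstract describes, the sketch below represents each document as a normalised histogram over a grapheme codebook and assigns a query to the scribe of its nearest reference histogram (Top-1, 1-nearest-neighbour). The chi-square distance is a common choice for histogram features and is assumed here; it is not the modified distance measure of the paper, which the abstract does not specify. The function names, the codebook size of 4, and the toy data are all hypothetical.

```python
from collections import Counter


def codebook_histogram(grapheme_ids, codebook_size):
    """Normalised histogram of codebook entries observed in one document.

    grapheme_ids: index of the nearest codebook entry for each extracted
    grapheme (the extraction and codebook training are assumed done elsewhere).
    """
    counts = Counter(grapheme_ids)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("document contains no graphemes")
    return [counts.get(i, 0) / total for i in range(codebook_size)]


def chi_square_distance(p, q):
    """Chi-square distance between two histograms (assumed, not the paper's)."""
    return sum((a - b) ** 2 / (a + b) for a, b in zip(p, q) if a + b > 0)


def top1_scribe(query_hist, reference):
    """Return the label of the reference histogram closest to the query.

    reference: list of (scribe_label, histogram) pairs.
    """
    nearest = min(reference, key=lambda item: chi_square_distance(query_hist, item[1]))
    return nearest[0]


# Toy example with a hypothetical 4-entry codebook and two known scribes.
reference = [
    ("scribe_A", codebook_histogram([0, 0, 1, 2, 0, 1], 4)),
    ("scribe_B", codebook_histogram([3, 3, 2, 3, 1, 3], 4)),
]
query = codebook_histogram([0, 1, 0, 0, 2], 4)
predicted = top1_scribe(query, reference)
```

In this toy setting the query shares most of its mass with the first scribe's histogram, so `predicted` is `"scribe_A"`. A k-NN variant would take a majority vote over the k closest reference histograms rather than the single nearest one.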
Original language: English
Number of pages: 4
Publication status: Published - 23 Aug 2010
Event: 20th International Conference on Pattern Recognition (ICPR 2010) - Istanbul, Turkey
Duration: 23 Aug 2010 to 26 Aug 2010


Conference: 20th International Conference on Pattern Recognition (ICPR 2010)

Bibliographical note

20th International Conference on Pattern Recognition, ICPR 2010, Istanbul, Turkey, 23-26 August 2010
