Scribe Identification in Medieval English Manuscripts

Research output: Contribution to conferencePaperpeer-review

Abstract

In this paper we present work on automated scribe identification on a new Middle-English manuscript dataset from around the 14th - 15th century. We discuss the image and textual problems encountered in processing historical documents, and demonstrate the effect of accounting for manuscript style on the writer identification rate. The grapheme codebook method is used to achieve a Top-1 classification accuracy of up to 77% with a modification to the distance measure. The performance of the Sparse Multinomial Logistic Regression classifier is compared against five k-nn classifiers. We also consider classification against the principal components and propose a method for visualising the principal component vectors in terms of the original grapheme features.
Original languageEnglish
Pages1880-1883
Number of pages4
DOIs
Publication statusPublished - 23 Aug 2010
Event20th International Conference on Pattern Recognition (ICPR 2010) - Istanbul, Turkey
Duration: 23 Aug 201026 Aug 2010

Conference

Conference20th International Conference on Pattern Recognition (ICPR 2010)
Country/TerritoryTurkey
CityIstanbul
Period23/08/1026/08/10

Bibliographical note

20th International Conference on Pattern Recognition, ICPR 2010, Istanbul, Turkey, 23-26 August 2010

Cite this