Language model based on POS tagger

Bartosz Ziolko, Suresh Manandhar, Richard C. Wilson, Mariusz Ziolko

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Language models are necessary for any large-vocabulary speech recogniser. Two main types of information can be used to support language modelling: syntactic and semantic. One way to apply syntactic modelling is to use POS taggers. Morphological information can be statistically analysed to provide the probability of a sequence of words using their POS tags. Results for Polish language modelling are presented.
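As an illustration only, the idea of scoring a word sequence through its POS tags can be sketched as a tag-bigram model with add-alpha smoothing. This is not the method of the paper, whose details are not given in the abstract; the function names, the `<s>` start marker, the smoothing scheme, and the toy tag labels (`adj`, `subst`, `fin`) are all assumptions made for this sketch.

```python
import math
from collections import Counter

START = "<s>"  # hypothetical sentence-start marker


def train(tag_sequences):
    """Count tag bigrams and their left contexts from tagged sentences."""
    context = Counter()   # how often each tag appears as a left context
    bigram = Counter()    # how often each (previous tag, tag) pair appears
    tagset = set()
    for tags in tag_sequences:
        prev = START
        for tag in tags:
            context[prev] += 1
            bigram[(prev, tag)] += 1
            tagset.add(tag)
            prev = tag
    return context, bigram, tagset


def log_prob(tags, context, bigram, tagset, alpha=1.0):
    """Add-alpha smoothed log-probability of a POS tag sequence."""
    v = len(tagset)
    total = 0.0
    prev = START
    for tag in tags:
        numerator = bigram[(prev, tag)] + alpha
        denominator = context[prev] + alpha * v
        total += math.log(numerator / denominator)
        prev = tag
    return total


# Toy training data: each sentence is reduced to its tag sequence.
# The tag labels are illustrative, not taken from the paper.
toy_corpus = [
    ["adj", "subst", "fin"],
    ["subst", "fin"],
    ["adj", "subst", "fin", "subst"],
]
context, bigram, tagset = train(toy_corpus)

# A tag order seen in training should score higher than an unseen one.
seen = log_prob(["adj", "subst", "fin"], context, bigram, tagset)
unseen = log_prob(["fin", "adj", "subst"], context, bigram, tagset)
print(seen > unseen)
```

In a recogniser, a score of this kind could be used to rerank competing word hypotheses after each hypothesis has been tagged; how the tag statistics are actually combined with the acoustic model in the paper is not stated in the abstract.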

Original language: English
Title of host publication: SIGMAP 2008: Proceedings of the International Conference on Signal Processing and Multimedia Applications
Editors: P Assuncao, S Faria
Place of Publication: Setubal
Publisher: INSTICC-INST SYST TECHNOLOGIES INFORMATION CONTROL & COMMUNICATION
Pages: 177-180
Number of pages: 4
ISBN (Print): 978-989-8111-60-9
Publication status: Published - 26 Jul 2008
Event: International Conference on Signal Processing and Multimedia Applications - Oporto
Duration: 26 Jul 2008 → …

Conference

Conference: International Conference on Signal Processing and Multimedia Applications
City: Oporto
Period: 26/07/08 → …

Keywords

  • POS-tagging
  • language modelling
  • speech recognition
  • Polish
