A comparison of standard spell checking algorithms and a novel binary neural approach

Research output: Contribution to journalArticlepeer-review

Abstract

In this paper, we propose a simple, flexible, and efficient hybrid spell checking methodology based upon phonetic matching, supervised learning, and associative matching in the AURA neural system. We integrate Hamming Distance and n-gram algorithms that have high recall for typing errors and a phonetic spell-checking algorithm in a single novel architecture. Our approach is suitable for any spell checking application though aimed toward isolated word error correction, particularly spell checking user queries in a search engine. We use a novel scoring scheme to integrate the retrieved words from each spelling approach and calculate an overall score for each matched word. From the overall scores, we can rank the possible matches. In this paper, we evaluate our approach against several benchmark spellchecking algorithms for recall accuracy. Our proposed hybrid methodology has the highest recall rate of the techniques evaluated. The method has a high recall rate and low-computational cost.
Original languageEnglish
Pages (from-to)1073-1081
Number of pages8
JournalIEEE Transactions on Knowledge and Data Engineering
Volume15
Issue number5
DOIs
Publication statusPublished - Sept 2003

Bibliographical note

Copyright © 2003 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

Keywords

  • binary neural spell checker
  • integrated modular spell checker
  • associative matching
  • ERRORS
  • WORDS

Cite this