Physics – Physics and Society
Scientific paper
2007-03-21
Physics
Physics and Society
8 pages, 3 figures, to be published in the proceedings of HLT-NAACL Workshop - TextGraphs 2
Scientific paper
The difficulties involved in spelling error detection and correction in a language have been investigated in this work through the conceptualization of SpellNet - the weighted network of words, where edges indicate orthographic proximity between two words. We construct SpellNets for three languages - Bengali, English and Hindi. Through appropriate mathematical analysis and/or intuitive justification, we interpret the different topological metrics of SpellNet from the perspective of the issues related to spell-checking. We make many interesting observations, the most significant among them being that the probability of making a real word error in a language is propotionate to the average weighted degree of SpellNet, which is found to be highest for Hindi, followed by Bengali and English.
Basu Anupam
Choudhury Monojit
Ganguly Niloy
Mukherjee Animesh
Thomas Markose
No associations
LandOfFree
How Difficult is it to Develop a Perfect Spell-checker? A Cross-linguistic Analysis through Complex Network Approach does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.
If you have personal experience with How Difficult is it to Develop a Perfect Spell-checker? A Cross-linguistic Analysis through Complex Network Approach, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and How Difficult is it to Develop a Perfect Spell-checker? A Cross-linguistic Analysis through Complex Network Approach will most certainly appreciate the feedback.
Profile ID: LFWR-SCP-O-382865