Computer Science – Computer Vision and Pattern Recognition
Scientific paper
2010-03-30
Proc. Int. Conf. on Information Technology and Business Intelligence (2009) 117-125
The objective of this work is to develop an Optical Character Recognition (OCR) engine for the information Just In Time (iJIT) system that recognizes handwritten textual annotations in lowercase Roman script. The open source Tesseract OCR engine, released under the Apache License 2.0, is used to develop user-specific handwriting recognition models, viz. the language sets, for this system, where each user is identified by a unique identification tag associated with the digital pen. To generate the language set for a user, Tesseract is trained with labeled handwritten data samples of isolated and free-flow text in Roman script, collected exclusively from that user. The system is tested on five different language sets, with free-flow handwritten annotations as test samples. It successfully segmented and subsequently recognized 87.92%, 81.53%, 92.88%, 86.75% and 90.80% of the handwritten characters in the test samples of the five users, respectively.
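The per-user lookup described in the abstract, where the pen's identification tag selects that user's language set, might be sketched as follows. This is a minimal illustration and not the authors' implementation: the pen IDs, the language-set names, and both function names are hypothetical, and it assumes the `tesseract` command-line tool is installed with the corresponding trained data available. Only the standard `tesseract <image> <outputbase> -l <lang>` invocation is used.

```python
import subprocess

# Hypothetical mapping from a digital-pen ID to the name of that user's
# trained Tesseract language set. These names are illustrative only;
# the paper does not list them.
PEN_TO_LANGUAGE_SET = {
    "pen-001": "user1",
    "pen-002": "user2",
}


def build_tesseract_command(image_path, output_base, pen_id):
    """Return the Tesseract CLI call for the user who owns pen_id."""
    try:
        language_set = PEN_TO_LANGUAGE_SET[pen_id]
    except KeyError:
        raise ValueError(f"no language set registered for pen {pen_id!r}") from None
    # The -l option selects which trained language data Tesseract loads.
    return ["tesseract", image_path, output_base, "-l", language_set]


def recognize(image_path, output_base, pen_id):
    """Run Tesseract; recognized text is written to <output_base>.txt."""
    command = build_tesseract_command(image_path, output_base, pen_id)
    subprocess.run(command, check=True)
```

Keeping the command construction separate from the subprocess call makes the lookup testable without Tesseract installed. An unknown pen ID raises an error rather than falling back to a generic language set, since the approach described here depends on user-specific models.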
Basu Subhadip
Ikeda Hisashi
Rakshit Sandip
Title: Recognition of Handwritten Textual Annotations using Tesseract Open Source OCR Engine for information Just In Time (iJIT)