Optical character recognition errors



Optical character recognition errors and their effects on natural language processing

FREE-DOWNLOAD [PDF] D Lopresti – Proceedings of the second workshop on Analytics for …, 2008
is optical character recogni- tion, the conversion of the scanned input image from bitmap  Optical
character recognition per- forms quite well on clean inputs in a known font.  introduce many errors
involving punctuation characters, which has an impact on later-stage processing