Document Imaging Solution's Software...The new standard in document imaging
Document Imaging Solutions, Inc. Ph: 616-847-5055 PDF Based Document Imaging Software
To OCR or Label Electronic Documents for Retrieval
When scanning documents, the decision to use Optical Character Recognition (OCR) versus labeling should revolve around data mining and document retrieval. For documents, or the information contained within them, to be searchable, the electronic documents must be indexed. The imaging industry does its customers a disservice when it advocates OCR as the preferred method of indexing documents for search. OCR has been promoted on the claim that it automates the process. Although our document imaging systems allow you to either label or OCR a document for indexing, the choice of method needs to be made on a document-by-document basis. Understanding the drawbacks associated with each method will help clarify when OCR or labeling is preferred.

Search Results:
OCR is the process of converting text on a scanned image into text that is searchable. One then executes a full-text search on the OCR document using words and phrases known to be in the document. The OCR process is extremely sensitive to image quality and to font differences within the document. As a result, the output is seldom flawless. If the OCR process claims to be 95% accurate, then on average one character in twenty is recognized incorrectly. Errors are introduced when characters bleed and touch one another, or when the scanner picks up ghost images from the reverse side of the document. This inaccuracy requires an operator to manually correct the suspect characters, and the time spent on those corrections often negates any time gained by using OCR in place of labeling.
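To see what a 95% character accuracy figure means for search, the short Python sketch below turns it into word-level numbers. It is an illustration only: it assumes character errors are independent of one another (real OCR errors tend to cluster on poor-quality regions of a page), and the 2,000-character page size is a hypothetical figure chosen for the example. The function names are likewise made up for this sketch.

```python
def word_accuracy(char_accuracy: float, word_length: int) -> float:
    """Probability that every character of a word is recognized correctly.

    Assumes each character is recognized independently of the others,
    which is a simplification of how real OCR errors behave.
    """
    return char_accuracy ** word_length


def expected_char_errors(char_accuracy: float, total_chars: int) -> float:
    """Expected number of misrecognized characters in a block of text."""
    return (1.0 - char_accuracy) * total_chars


if __name__ == "__main__":
    accuracy = 0.95  # the accuracy figure used in the article

    # Chance that a search term of a given length survives OCR intact.
    for length in (5, 8, 12):
        chance = word_accuracy(accuracy, length)
        print(f"{length}-letter word recognized intact: {chance:.0%}")

    # Hypothetical page size, used only to illustrate the correction workload.
    page_chars = 2000
    errors = expected_char_errors(accuracy, page_chars)
    print(f"Expected corrections on a {page_chars}-character page: {errors:.0f}")
```

Under these assumptions, an eight-letter search term comes through OCR intact only about two times in three, and an exact full-text search misses the document the rest of the time unless an operator has corrected the text first.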
Fuzzy Search:
Appropriate Conditions for OCR:
Conclusion:
Copyright© 1998-2007 Document Imaging Solutions, Inc. All Rights Reserved.