When you retrieve the OCR recognition details, each character is returned with a confidence level that indicates how reliable its recognition is. Separate word-level confidence values give an additional indication of accuracy. Advanced font and location information allows the OCR library to create text representations of the original file with a similar layout.
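As a rough illustration of how character and word confidence values might be used together, the sketch below flags words for manual review. It does not use the ImageGear API: the `RecognizedChar` and `RecognizedWord` types, the 0.0 to 1.0 confidence scale, and both thresholds are assumptions made for this example only.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class RecognizedChar:
    # Hypothetical container; not an ImageGear type.
    text: str
    confidence: float  # assumed scale: 0.0 (no confidence) to 1.0 (certain)


@dataclass
class RecognizedWord:
    # Hypothetical container; not an ImageGear type.
    chars: List[RecognizedChar]
    confidence: float  # separate word-level value, same assumed scale

    @property
    def text(self) -> str:
        return "".join(c.text for c in self.chars)


def words_needing_review(words, char_threshold=0.6, word_threshold=0.8):
    """Return the text of words whose weakest character or word-level
    confidence falls below the given (illustrative) thresholds."""
    flagged = []
    for word in words:
        # An empty word has no evidence at all, so treat it as weakest.
        weakest = min((c.confidence for c in word.chars), default=0.0)
        if weakest < char_threshold or word.confidence < word_threshold:
            flagged.append(word.text)
    return flagged


sample = [
    RecognizedWord(
        [RecognizedChar("O", 0.99), RecognizedChar("C", 0.97), RecognizedChar("R", 0.98)],
        confidence=0.98,
    ),
    RecognizedWord(
        [
            RecognizedChar("c", 0.95),
            RecognizedChar("1", 0.41),  # likely a misread "l"
            RecognizedChar("e", 0.93),
            RecognizedChar("a", 0.90),
            RecognizedChar("r", 0.92),
        ],
        confidence=0.70,
    ),
]

print(words_needing_review(sample))  # ['c1ear']
```

Checking both levels is a design choice: a single weak character can hide inside an otherwise confident word, while a low word-level value can signal a problem even when every character looks plausible on its own.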
The ImageGear OCR engine processes all data in Unicode. The output can be formatted for a specific code page, and multiple output options are available, such as:
- Image over PDF
- Text-based PDF
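To make the Unicode-to-code-page step mentioned above more concrete, here is a minimal sketch using only Python's built-in string encoding. It is not ImageGear code; the helper name `encode_for_code_page` and the choice of Windows-1252 as the default target are assumptions for illustration.

```python
def encode_for_code_page(text: str, code_page: str = "cp1252") -> bytes:
    """Encode Unicode text for a specific code page.

    Characters that the target code page cannot represent are
    replaced with '?' instead of raising an error.
    """
    return text.encode(code_page, errors="replace")


# "é" exists in Windows-1252, "Ω" does not and becomes "?".
print(encode_for_code_page("Café Ω"))  # b'Caf\xe9 ?'

# The euro sign is missing from ISO-8859-1 (latin-1).
print(encode_for_code_page("€", "latin-1"))  # b'?'
```

The example shows why the target code page matters: text that is fully representable in Unicode may lose characters when narrowed to a single-byte code page.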