ImageGear Features: OCR

ImageGear: OCR

Supports auto- or user-defined zone capture capture in Western and Asian languages.

OCR

Improve character recognition with OCR, optimize recognition with data matching/lookup, regular expressions, and custom character sets, and set confidence values to control when operators should review results. ImageGear provides full page and zonal optical character recognition (OCR) for both Western and Asian languages such as Chinese, Japanese, and Korean. ImageGear’s automatic language detection features enable OCR completion.

OCR can be purchased as an add-on to provide a complete document imaging library for your application development.

Language Support

Asian languages with horizontal and vertical text are supported in the Asian OCR edition. The languages supported are:

Chinese – Traditional
Chinese – Simplified
Japanese
Korean

Full Page and Zonal OCR

With our auto-zoning and segmentation, your users have the ability to:

Automatically segment a page into individual zones for processing
Process an entire image or individual region of the page
Define zones by a user, loaded from a file, or detected automatically by the engine

Image Pre-Processing for Maximum Accuracy

What happens before OCR? Take a look at the OCR pre-processing steps:

Advanced image processing methods are available to improve OCR accuracy
Auto inversion functionality detects if the image needs to be inverted for highest accuracy
Automatic image orientation detects and adjusts images so they are properly oriented
Deskew methods detect image misalignment and automatically correct it, improving segmentation and recognition accuracy
Despeckling methods remove minor dots and imperfections in the image capture process

Superior Results Processing

When you get the OCR recognition details, each character is returned with a confidence level to show accuracy. Separate word confidence values provide an additional accuracy indication. Advanced font and location information allows the OCR library to create text representations of the original file with a similar layout.

The ImageGear OCR engine processes all data in a Unicode format. The data output can be formatted for a specific code page with multiple output options such as:

Image over PDF
Text-based PDF
XML

Try out OCR yourself. Schedule a demo today.

Features to Take Your Application to the Next Level

Image Processing

Enable powerful image cleanup, correction, and transformation functions.

OCR

Supports auto- or user-defined zone capture capture in Western and Asian languages.

File Format Control

Read, write, edit and compress PDF files, and maintain compliance with PDF archiving and preservation.

Image Compression

ImageGear provides comprehensive image compression for a wide variety of image file types.

Conversion

Convert images from one format to a variety of others in seconds with our document conversion SDK.

PDF Processing

Read, write, edit, and compress PDF files, and maintain compliance with PDF archiving and preservation.

AcroForms

Easily integrate interactive forms into your application with ImageGear’s AcroForms function.

Upgrade Your Application’s Potential with Accusoft

Product Owners

Backed by 30 years of software development experience, and over 40 patents, Accusoft can help you save development time and get to market faster!

Developers

With robust APIs and SDK’s, and best-in-class support, Accusoft can help you securely embed document & image processing technology in your web applications and system solutions.