Integrate Efficient, Accurate Optical Character Recognition (OCR) in Your Application
Effective document management is now a top priority for organizations, but for many, it remains a challenge. As noted by recent AIIM survey data, companies are struggling to handle both the documents they have and the rapid uptake of new information. In fact, 43 percent said their biggest priority is effectively leveraging the structured and unstructured content they already have, while 57 percent are focused on understanding the overwhelming big data. Optical character recognition (OCR) is a critical component of document management.
For software development firms, this poses a particular challenge. Products are no longer feature complete without critical end-user functions such as advanced optical character recognition and powerful search. However, adding this functionality is not as easy as it sounds. Building it from the ground up requires time, effort, and continued maintenance, which is a large undertaking for any company.
Accusoft’s ImageGear SDK offers a way to bridge the OCR gap with comprehensive image processing and manipulation capabilities that both streamline software development and deliver on end-user expectations.*
What is ImageGear?
ImageGear easily integrates into existing applications to deliver cutting-edge document management functionality at scale. Available for both .NET and C/C++ frameworks, ImageGear allows developers to quickly deploy and white-label key features including image processing, manipulation, conversion, and PDF and document search.
The ImageGear OCR add-on delivers highly accurate optical character recognition to any .NET (C#) or C/C++ application. It provides full-page character recognition for more than 100 languages, including Western languages and Asian languages such as Korean, Japanese, and Chinese. It can also recognize multiple languages within a single image for enhanced document management. Other OCR features include:
- Automatic page segmentation into individual zones for processing
- Type assignment per zone based on defined flows, tables, or graphics
- Table detection with advanced technology to enhance data reconstruction output
- Entire page or individual region image processing
- Zone definition by the user, from existing files, or automatically by the OCR engine
In addition, software developers can enhance ImageGear OCR functionality with both predefined and customizable dictionaries, and can use regular expressions to validate results.
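To illustrate the idea of validating recognized text with regular expressions, here is a minimal sketch in plain C#. It does not use the ImageGear API: the `OcrFieldValidator` class, the field names, and the patterns are all hypothetical and only show how application code might check OCR output after recognition.

```csharp
using System;
using System.Collections.Generic;
using System.Text.RegularExpressions;

// Hypothetical helper for illustration only; not part of the ImageGear API.
public static class OcrFieldValidator
{
    // Illustrative patterns for fields that often appear on scanned forms.
    private static readonly Dictionary<string, Regex> Patterns = new Dictionary<string, Regex>
    {
        { "date",   new Regex(@"^\d{4}-\d{2}-\d{2}$") },              // e.g. 2021-03-15
        { "zip",    new Regex(@"^\d{5}(-\d{4})?$") },                 // e.g. 33602 or 33602-1234
        { "amount", new Regex(@"^\$?\d{1,3}(,\d{3})*(\.\d{2})?$") }   // e.g. $1,250.00
    };

    // Returns true when the recognized text matches the pattern registered
    // for the given field type; unknown field types and null text fail.
    public static bool IsValid(string fieldType, string recognizedText)
    {
        if (recognizedText == null || !Patterns.TryGetValue(fieldType, out Regex pattern))
        {
            return false;
        }
        return pattern.IsMatch(recognizedText.Trim());
    }
}
```

A check like `OcrFieldValidator.IsValid("zip", "336O2")` fails because the letter O was recognized in place of a zero, which lets the application flag the field for review instead of storing bad data.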
Why Optical Character Recognition (OCR) Matters to End-Users
Advanced OCR integration makes it easier for end-users to find what they’re looking for, when they’re looking for it. Instead of forcing users to find additional apps that deliver specific services, in-app OCR delivers increased satisfaction by streamlining user search functionality.
Common use cases include:
- Legal eDiscovery — The eDiscovery process is a critical — and often complex — stage of legal case preparation. Firms need to quickly find key terms, phrases, and images within legal documents to ensure they meet both client expectations and compliance obligations. With many forms now scanned and stored in non-standard file formats that contain form fields, text boxes, and digital imagery, OCR is essential to help lawyers streamline the process of eDiscovery at scale.
- Financial Document Processing — Clients now expect loan applications and credit card applications to be processed at scale and speed. This is especially critical as firms embrace the idea of remote work — both staff at home and those in the office need end-to-end OCR functionality to deliver complete document management.
- Insurance Documentation Assessment — Insurance claims are both complex and comprehensive, requiring complete documentation from clients, contractors, and compliance agencies. As insurance firms move to tech-first frameworks to enhance document processing, speed, and accuracy, OCR makes it easy for staff to find specific data and ensure documentation is complete.
Integrating OCR
Advanced OCR functionality won’t deliver expected outcomes if integration is cumbersome and complex. ImageGear streamlines this process with easy SDK implementation for both .NET and C/C++.
ImageGear .NET can be easily deployed on multiple platforms. These .NET deployments include ASP.NET functions such as image display, thumbnail display, annotation support, and cloud capture along with WPF printing and annotation support. ImageGear for C/C++, meanwhile, offers support for several platforms as well. Check out the developer resources section to see an updated list.
How Your Clients Use Optical Character Recognition (OCR)
PDFs remain the go-to file format for many industries, offering standardized image and text conversion along with the ability to easily set or restrict document permissions. The problem? Scanned, image-only PDFs contain no searchable text, making it hard for end-users to quickly find the text or data they need.
ImageGear makes it easy to OCR PDFs using the ImGearRecPage.Recognize method, which leverages the zone list of the image to deliver accurate OCR. If this list is empty, the method automatically calls the page-layout decomposition process (auto-zoning) to complete the OCR process.
The following C# example loads a multi-page TIFF, recognizes each page, and writes the recognized text to an output file.
using System.IO;
using ImageGear.Core;
using ImageGear.Formats;
using ImageGear.Evaluation;
using ImageGear.Recognition;

namespace ImageGearTest
{
    class Program
    {
        static void Main(string[] args)
        {
            // Initialize evaluation license.
            ImGearEvaluationManager.Initialize();
            ImGearEvaluationManager.Mode = ImGearEvaluationMode.Watermark;

            // Initialize the Recognition Engine.
            ImGearRecognition igRecognition = new ImGearRecognition();

            // ImageGear assemblies require explicit initialization at application startup.
            ImGearCommonFormats.Initialize();

            // Open a FileStream for our output document.
            // FileMode.Create truncates an existing file so stale text from a previous run is not left behind.
            using (FileStream outputStream = new FileStream(@"c:\temp\outputDoc.txt", FileMode.Create, FileAccess.ReadWrite))
            {
                // Open a FileStream for our source multi-page image.
                using (FileStream multiPageDocument = new FileStream(@"c:\temp\test.tif", FileMode.Open))
                {
                    // Load the multi-page document, starting at page 0.
                    // A page count of -1 specifies that all pages shall be loaded.
                    ImGearDocument doc = ImGearFileFormats.LoadDocument(multiPageDocument, 0, -1);

                    // Determine the number of pages in the multi-page image.
                    int numPages = ImGearFileFormats.GetPageCount(multiPageDocument, ImGearFormats.UNKNOWN);

                    // Recognize each page of the multi-page document and add the results to outputStream.
                    for (int pageNumber = 0; pageNumber < numPages; pageNumber++)
                    {
                        // Cast the current page to a raster page and import that page.
                        using (ImGearRecPage igRecPage = igRecognition.ImportPage((ImGearRasterPage)doc.Pages[pageNumber]))
                        {
                            // Preprocess the page.
                            igRecPage.Image.Preprocess();

                            // Perform recognition.
                            igRecPage.Recognize();

                            // Add OCR results to the outputStream.
                            igRecognition.OutputManager.WriteDirectText(igRecPage, outputStream);
                        }
                    }
                }
            }

            // Dispose of objects we are no longer using.
            igRecognition.Dispose();
        }
    }
}
OCR Access and Analysis
Advanced OCR isn’t enough in isolation — developers must also empower end-users to quickly access and analyze OCR output. ImageGear offers multiple options to help streamline this process, such as:
- Storage of Output as Code Pages
- Export to Text Format
- Export to PDF
- Export to MRC PDF
- Export to a Formatted Document
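Once OCR output has been exported to text, an application can search it with ordinary .NET code. The sketch below is one possible approach, not an ImageGear feature: `OcrTextSearch` and `FindLines` are hypothetical names, and the method simply reports which lines of the exported text contain a search term, ignoring case.

```csharp
using System;
using System.Collections.Generic;
using System.IO;

// Hypothetical helper for illustration only; not part of the ImageGear API.
public static class OcrTextSearch
{
    // Returns the 1-based numbers of the lines that contain the term, ignoring case.
    public static List<int> FindLines(TextReader reader, string term)
    {
        if (reader == null)
        {
            throw new ArgumentNullException(nameof(reader));
        }
        if (string.IsNullOrEmpty(term))
        {
            throw new ArgumentException("Search term must not be empty.", nameof(term));
        }

        var hits = new List<int>();
        int lineNumber = 0;
        string line;

        // Read the exported text one line at a time so large files are not held in memory.
        while ((line = reader.ReadLine()) != null)
        {
            lineNumber++;
            if (line.IndexOf(term, StringComparison.OrdinalIgnoreCase) >= 0)
            {
                hits.Add(lineNumber);
            }
        }
        return hits;
    }
}
```

For a text file on disk, such as one produced by an export to text format, the reader could be a `StreamReader` opened on that file. Taking a `TextReader` rather than a path keeps the method easy to exercise with in-memory text.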
Find Your Best Fit
ImageGear OCR makes it easy for end-users to quickly search critical documents, find the data they need, and analyze optical character recognition output, but don’t take our word for it. Seeing is believing. Test ImageGear in your own environment and discover the difference of advanced OCR.
*Optical character recognition is an ImageGear add-on and must be requested upon purchase of a license.