Technical FAQs

Question

What are the absolute essentials for embedding the PrizmDoc Viewer into my web page/application?

Answer

Viewer API (viewercontrol.js)

The Viewer API is the base building block of the Viewer. We ensure that API changes are backward compatible with point releases (for example, PrizmDoc v13.5 → PrizmDoc v13.6) and will not introduce breaking changes unless critical. With major releases we also endeavor to ensure backward compatibility with previous releases of the Viewer API.

HTML Templates (viewerCustomization.js) and CSS (viewercontrol.css, viewer.css)

The Viewer that is shipped with the product will be maintained and enhanced from release to release. The Viewer HTML and CSS markup will change with each release. Once you have begun to modify your markup, it is recommended that you consider subsequent PrizmDoc releases as sample code, in which you would evaluate product changes and choose to incorporate all or parts of those changes into your customization.

JavaScript files (viewer.js)

The Viewer JavaScript that lies above the Viewer API is unobfuscated and open for customization. While we expect many developer needs will be satisfied through configuration parameters and minor HTML or styling changes, some developers will desire to modify viewer.js for more advanced customization. You should carefully consider your development and ongoing maintenance strategy to ensure that future releases of PrizmDoc are easy to integrate into your customizations. We cannot guarantee backward compatibility of viewer.js in future releases as it is central to the functionality of the Viewer.

For information on integrating PrizmDoc Viewer, see the Getting Started section of the documentation.

For more in depth customization, see the PrizmDoc Customization section of the documentation.

The COVID-19 pandemic may not be the first pandemic to impact global supply chains, but its impact has been orders of magnitude greater than previous public health crises. Businesses around the world have been forced to contend with the twin challenges of transitioning to a remote workforce and overcoming severe supply chain disruption. As 2020 winds down to a close, it’s becoming clear that the pandemic may well represent a “new normal” for logistics strategy rather than a temporary situation.

3 Critical Supply Chain Challenges 

From implementing new technology to establishing relationships with alternative vendors, organizations across multiple industries are taking steps to meet the challenges confronting their supply chains in both the immediate and long-term future. One of the biggest areas of focus is barcode integration. Although they may not seem like they belong at the forefront of innovation, barcodes are scanned more than six billion times every day and serve an incredibly important role in modern supply chain management.

Here are a few of the biggest challenges organizations are facing:

1. Supply Chain Visibility

Speed and flexibility are competitive advantages in today’s on-demand marketplace. Customers want products delivered faster and will quickly lose faith in brands that fail to meet their expectations. According to a 2019 survey of online consumers, 72.7% of respondents were unlikely to order from a company again after a poor delivery experience. Given those high stakes, it’s incredibly important for organizations to maintain high levels of visibility into their supply chains.

Barcode technology makes it possible to track products and materials throughout the entire production and distribution process. A quick scan is all that’s needed to update an item’s status every time it’s processed at a location. That information feeds into inventory tracking software that allows businesses to get a more accurate picture of how shipments move through their supply chain. This visibility makes it easier to identify potential problems and enhance overall logistics efficiency. 

2. Digital Transformation

Despite the growing emphasis on digital transformation across the economy, the average supply chain is still held back with inefficient, legacy infrastructure. Small and medium-size businesses face the greatest challenges, with only 21% of them using digital tools to integrate their supply chain operations with other departments. In the retail industry alone, bad processes and data disconnects cost companies over $500 billion annually. As the holiday season draws nearer and promises to bring with it a massive increase in online shopping, having a fully digital supply chain in place with enough speed to handle that demand is essential.

Implementing the right digital technology tools can help organizations to streamline their supply chains and gather the data they need to make better decisions. Barcode integration software is a foundational component of this transformation because it forms a bridge between the physical and digital elements of logistics management. Shipments can be instantly scanned into tracking systems to provide a wealth of data points across the supply chain, making it easier to adapt to disruptions and develop better processes. Barcode libraries also need to be able to scan and decode both traditional 1D codes and newer 2D codes quickly and accurately to keep products moving swiftly and smoothly to consumers.

3. Automation

As many organizations have put social distancing protocols in place during the pandemic to protect the health of both employees and customers, automation has become more important to supply chain performance than ever before. These COVID-19 pressures are sure to accelerate what was already a growing push toward implementing automated software and systems. Gartner even projected in 2019 that 30% of warehouse workers will be replaced by collaborative robots by 2023, which will further drive the demand for automated scanning and inventory management tools to accompany them.

For these automated systems to work effectively, they will need robust software that can withstand the difficult working conditions of warehouses, assembly lines, and distribution centers. A barcode integration that struggles to accurately read damaged or broken barcode images, for instance, won’t last very long in such an environment. The software powering automated systems must be able to function under a variety of conditions and be adaptable enough to overcome obstacles without constant oversight from on-site workers.

Unleash Your Supply Chain With Barcode Xpress

Building software capable of meeting the needs of a truly digital supply chain is no easy task. That’s especially true when it comes to integrating barcode reading capabilities into supply chain applications. With so many barcode formats in use, developers need an SDK with the ability to accurately detect, read, and write both common and uncommon barcode types used across all industries.

Accusoft’s Barcode Xpress is a multi-language barcode integration capable of reading and writing more than 30 different barcode types quickly and accurately. A truly robust SDK solution, Barcode Xpress leverages the imaging functionalities of ImagXpress to repair damaged orbroken barcodes that other readers are unable to read.

Speed

Barcode Xpress is capable of detecting and decoding multiple barcodes on a page at speeds of up to 1,000 pages per minute. It doesn’t matter where the barcode is located, what type of code it is, or how many are present. The control is able to read them regardless of orientation within milliseconds.

Accuracy

Every time Barcode Xpress detects a barcode, it not only provides the information encoded within the image, but also reports a confidence value to ensure high levels of accuracy. Damaged, broken, or poorly printed barcodes can also be scanned thanks to the SDK’s image processing capabilities. Black noise, white noise, erasures, low resolution, white line streaks, and other common barcode problems are reconstructed and scanned automatically to keep the supply chain moving.

Versatility

With the ability to detect, read, and write over 30 different barcode types, Barcode Xpress is available in multiple configurations for Windows and Linux-based operating systems. The SDK library supports 24-bit color images, 8-bit grayscale images, and 1-bit black and white images, as well as providing more than 80 additional image processing and editing functions thanks to its ImagXpress integration.

Experience Barcode Xpress Today

Easy to deploy and utilize, Barcode Xpress is a true all-in-one solution for desktop and web applications seeking barcode recognition capabilities. Just a few lines of code is all it takes to unlock the full potential of your supply chain technology.

Find out just how simple barcode integration can be with Barcode Xpress. Sign up today to download a free trial or talk to one of our SDK specialists today to learn more.

OCR form

An automated forms processing solution can significantly improve accuracy and efficiency when it comes to managing large quantities of documents containing structured content. Whether an organization needs to digitize existing records or is continuously processing new documents within application workflows, having a versatile optical character recognition (OCR) component working to identify and extract text from multiple languages allows them to capture data more effectively.  Solid OCR form capture is critical.

Although a good OCR engine operates quickly and efficiently, the process of recognizing and extracting text is a highly complex undertaking that can be impacted by a variety of factors. Under optimal conditions, for example, the OCR component within Accusoft’s FormSuite can generate results quickly and accurately, with the ability to read several languages from around the world. However, if an application’s forms processing workflow is not set up efficiently or overlooks a few important considerations, recognition performance may suffer in terms of speed and accuracy.

6 Ways to Achieve the Best Results with the Accusoft OCR Component in FormSuite

 

1. Pay Attention to Image Resolution

As a general rule, OCR components should be provided with high resolution images so the recognition engine is able to distinguish the details that would otherwise be missed on low resolution images. This helps them to recognize the differences between “l” and “i” or “O” and “0” (zero), which results in better, more accurate results.

However, there could be a problem if the image resolution is too high. These images require much more time to process without delivering any benefits since the required letter properties are clearly distinguishable in a lower resolution.

To strike a balance between speed and accuracy, it’s better to scan all images in a 150-400 dots-per-pixel range. This allows the recognition engine to identify all possible letter properties and avoid being bogged down with analyzing a lot of data at the same time.

2. Don’t Lose Image Properties While Preparing to Recognize

To achieve the best results, it’s important to provide the recognition engine with a few helpful hints. In some cases, resolution properties may be lost while an image is being prepared for recognition, leading to worse than expected results. This happens most frequently when working with System.Drawing.Image or SystemDrawing.Bitmap classes directly during operations like clipping, merging, or reducing the bit depth. 

In this case, the best solution is to make sure that HorizontalResolution and VerticalResolution properties are set correctly and reflect initial image resolution values. The ScanFix component within FormSuite can perform this task automatically and is designed to be compatible with the OCR component to help achieve better recognition results.

3. Clean Up Underlined Text Before Recognition

Specks, dirt, and other imperfections within the source image can significantly reduce recognition quality. Sometimes, however, even a seemingly good image can be recognized incorrectly when there are underlined words like URLs, emails, or specifically formatted generic text. 

From the software’s point of view, this kind of text isn’t very different from other types of image distortion. ScanFix’s LineRemovalOptions can clean up the text by eliminating lines that could interfere with recognition. The API also features special parameters that ensures characters with low hanging elements (such as “j” or “y”) will be restored after line removal to avoid another potential recognition problem.

4. Use Long-Living Objects to Avoid Recognition Performance Drop

Creating a new instance requires OCR engine initialization and loading neural network data suitable for specific recognition parameters. This process is not resource free because of the data complexity and may cause delays from ~200 msec to 2 sec depending on the hardware and recognition properties. 

Existing Accusoft OCR instances may be reused to recognize other images with different properties. This will speed up the overall process because initialization will be done only once during the first AnalyzeField call and subsequent calls will be much cheaper in terms of computing resources.

5. Assign Instances to Their Own Worker Threads

Objects are thread safe and can be called from different threads. However, assigning an object to its own thread can avoid extra locking. One of the simplest ways to do this is to use C# Parallel.ForEach loop and create ConcurrentQueue with the pre-allocated objects. 

This ensures that the number of threads will not exceed the number of available CPUs. Any available instance can then be automatically assigned to recognize the images in their own thread while extra possible threads will wait until busy instances will be free to acquire.

Other common patterns are producer-consumer and map-reduce, which are more complex to implement but provide better flexibility when managing input data.

6. Dispose Objects to Avoid Memory High Memory Consumption

This is a generic rule for the C# to call a Dispose for the objects which use non-managed resources. FormSuite’s OCR component uses an external recognition engine, so it is highly recommended to call Dispose when the instance will not be required anymore. This can avoid a situation where the memory will not be available for different parts of the application, especially when a high amount of data exists for post-processing or the amount of available memory is low because of the different processes running in parallel.

Get Accurate OCR Data Capture Results with FormSuite

When properly configured and incorporated into a forms processing workflow, the FormSuite OCR component can accelerate automated data capture and reduce manual errors. Its zonal field recognition capabilities allow it to hone in on predefined field types to improve processing speed and accuracy. Developers can also adjust confidence values for recognition results to determine how frequently manual review is necessary. 

To get a hands-on look at how FormSuite incorporates OCR seamlessly into its collection of forms processing tools, schedule a free trial today.

form workflow automation

Forms have long been used to provide organizations with important information about their customers. For a financial services or insurance company, that information might be used to determine eligibility for a loan or set a policy rate. Legal teams and healthcare providers, on the other hand, often use them to quickly gather information that could be relevant to a client’s case or a patient’s care. By building form workflow automation into their applications, developers can provide these organizations with the tools they need to improve efficiency and provide better service to their customers.

A Better Way to Capture Data with Form Workflow Automation

At its core, a forms workflow is designed to capture data from completed forms and route that information to the appropriate destination. That end point will vary based on the application. In some cases it could be used to autopopulate database entries. Other systems may feed it into machine learning algorithms to identify trends or provide predictive insights. Before any of that can happen, however, automated workflows with forms recognition capabilities need to be in place to identify various form types and extract information from them using various forms of optical recognition.

The primary benefits of workflow automation are speed and accuracy. By building a forms workflow within their applications, developers can help their customers process submitted forms much more efficiently than they could by hand. Even if manual data entry wasn’t so prone to human error, it would still be a waste of valuable resources to have skilled employees performing such a repetitive, routine task. Automating this sort of work is often the first step in maximizing performance in other areas of an organization because it frees up resources that can be directed toward higher-value tasks.

Say Goodbye To Paper (Mostly)

Organizations have talked about going “paperless” for decades, but they frequently find it much more difficult to do so in practice. That’s largely because physical forms continue to be used across many industries. Converting these paper forms into digital format as quickly as possible is critically important. Without some way of incorporating them into an automated workflow, inefficiencies and manual errors will continue to creep back into business processes. 

A forms workflow needs to be able to handle scanned forms images in addition to purely digital documents. Robust forms identification tools are essential for this process because they have the ability to match any submitted form to a library of predefined templates. Without identification capabilities, applications would need to be given specific information about every form. At best, submitted forms would need to be manually presorted before they could be scanned and uploaded for processing rather than being converted into digital format all at once and identified automatically.

Recognition and Extraction

Once forms are scanned, uploaded, and identified into an application, the data capture process can begin. While digital forms can easily send information contained in fields to the proper destination, a scanned form is just a static document image. Even if the form was filled out digitally and never existed as a paper document, the fields may not be responsive or the entire form may be nothing more than a flattened PDF image. In these cases, the only way to reliably capture data is to implement some type of optical recognition.

Optical Character Recognition

For machine printed text, forms workflows can deploy Optical Character Recognition (OCR) to identify and extract information from an identified form. High-quality OCR engines can read multiple languages, allowing them to capture data from almost any source and send it to the next phase of an automated workflow. When extracting text, OCR tools can be set to carry out full-page extraction, which pulls text from the entire form, or zonal extraction, which focuses the data capture effort on a smaller, predetermined area. The latter approach is much more common with forms processing because it allows the application to set parameters on each zone to enhance performance. If the OCR engine is instructed to look for only numbers in one field and specific regular expressions in another, it will be able to identify and extract text faster and more accurately.

Intelligent Character Recognition

Of course, many physical forms submitted for processing will not be filled out with standardized digital fonts, but rather by hand using a pen or pencil. For these handwritten forms, Intelligent Character Recognition (ICR) will need to be deployed to read and extract field contents. Although identifying handwritten text is a much more challenging undertaking, the combination of a powerful ICR engine and good form design can greatly improve accuracy and processing times to keep information moving through automated workflows.

Optical Mark Recognition

Forms frequently use checkboxes or fillable bubbles to indicate important information. When scanned images are run through a forms workflow for processing, applications need to be able to quickly identify the presence of a mark and apply the conditional information associated with it. Today’s forms workflow tools utilize Optical Mark Recognition (OMR) to detect the presence or absence of marks automatically. They can also check the entire form to determine what information might be missing, such as essential fields or signatures.

Unlock Your Form Workflow Automation Potential with the FormSuite Collection

Building an automated workflow for forms processing requires a variety of software tools and specialized imaging expertise. It’s a challenging task that becomes even more difficult when developers are facing tight deadlines for other application features. With the right forms workflow SDKs, software teams can rapidly integrate the features needed to identify a variety of forms and capture vital data using full-page or zonal text recognition.

Accusoft’s Forms Collection bundles our powerful forms toolkits into a single, easily deployed package. Whether you’re using FormFix to identify and align forms, cleaning up scanned images for better recognition results with ScanFix Xpress, or deploying fast, accurate OCR and ICR with SmartZone, FormSuite provides all the SDK resources your team needs to unlock your application’s workflow automation potential. Learn more about what’s included with the FormSuite Collection by downloading our detailed fact sheet.

TAMPA, Fla. – On September 22, 2020, Accusoft announced its latest SDK, ImageGear PDF. This integration enables developers to add a variety of PDF functionalities into an application.

“We are proud to add ImageGear PDF as the latest addition to our product portfolio,” says Jack Berlin, CEO of Accusoft. “We recognized a need in the market for a more robust PDF solution that developers could use to enhance their products. Using our proprietary technology, I knew we could bridge that gap.”

ImageGear PDF gives end-users the ability to merge multiple PDFs, split a PDF into multiple PDFs, rearrange pages within a PDF, add pages or remove pages in a PDF, and more. The SDK adds programmatic annotation capabilities as well as compression, signature, comparison, and data capture.

“ImageGear PDF is a great tool for developers looking to enhance their application,” says Mark Hansen, Sr. Product Manager of SDKs. “Accusoft has a variety of different PDF solutions, but we wanted to add a more robust SDK that solves PDF pain points more efficiently.”

ImageGear PDF is available with an optical character recognition (OCR) add-on feature, which programmers can use to search for specific characters within a document, highlight different sections, and markup the output for easier viewing and collaboration. To learn more about ImageGear PDF, please visit our website at accusoft.com/products/imagegear-collection/imagegear-pdf/.

About Accusoft:

Founded in 1991, Accusoft is a software development company specializing in content processing, conversion, and automation solutions. From out-of-the-box and configurable applications to APIs built for developers, Accusoft software enables users to solve their most complex workflow challenges and gain insights from content in any format, on any device. Backed by 40 patents, the company’s flagship products, including OnTask, PrizmDoc™ Viewer, and ImageGear, are designed to improve productivity, provide actionable data, and deliver results that matter. The Accusoft team is dedicated to continuous innovation through customer-centric product development, new version release, and a passion for understanding industry trends that drive consumer demand. Visit us at www.accusoft.com.

On March 10, 2021, Accusoft announced the arrival of the free-to-use Accusoft PDF Viewer, the latest addition to its family of PDF solutions. An entirely client-side integration with no complicated server dependencies, this lightweight JavaScript PDF viewer also features a responsive UI for out-of-the-box mobile support.

“We’re excited to offer this free version of the Accusoft PDF Viewer to developers,” says Jack Berlin, CEO of Accusoft. “Our team worked hard to build a viewer that’s a step above what you can get from open source offerings. We think it’s going to solve a lot of the problems developers typically encounter with existing PDF libraries.”

Accusoft PDF Viewer integrates into an application quickly and easily with just a few snippets of code. It runs entirely within the browser to deliver an optimized viewing experience across all devices. The intuitive UI controls allow users to zoom, pan, jump to page, navigate thumbnails, and pinch-to-zoom on mobile screens with ease. And thanks to lightning fast full-text search, locating essential information is easier than ever.

“Accusoft PDF Viewer is great for developers because it allows them to maintain complete control over documents without having to set up any cumbersome server infrastructure,” says Mark Hansen, Product Manager. “Having a responsive UI that adapts to mobile displays will also increase their flexibility tremendously.”

The free version of Accusoft PDF Viewer allows developers to quickly add powerful viewing capabilities to their web applications. We’re currently working on additional features (such as annotation and eSignature) that will be included in an upgraded paid version.

To learn more about Accusoft PDF Viewer or download it for a first-hand look, please visit our website.

About Accusoft:
Founded in 1991, Accusoft is a software development company specializing in content processing, conversion, and automation solutions. From out-of-the-box and configurable applications to APIs built for developers, Accusoft software enables users to solve their most complex workflow challenges and gain insights from content in any format, on any device. Backed by 40 patents, the company’s flagship products, including OnTask, PrizmDoc™ Viewer, and ImageGear, are designed to improve productivity, provide actionable data, and deliver results that matter. The Accusoft team is dedicated to continuous innovation through customer-centric product development, new version release, and a passion for understanding industry trends that drive consumer demand. Visit us at www.accusoft.com.

Image compression has become such a ubiquitous aspect of the digital world that the average person doesn’t give it much thought. Even when they encounter textbook compression problems, such as running out of space for photos on their phones, waiting on a slow-loading webpage, or working with an overly pixelated image, they may not consider how effective compression techniques could resolve these issues.

Today’s software developers, by contrast, spend a lot of time thinking about how to incorporate better compression solutions into their applications. That’s why they frequently turn to image compression SDKs to help their end users better manage large and highly-detailed image files.

The Enduring Need for Image Compression

Although advancements in hard drive technology and easily scalable cloud storage have reduced many traditional data management concerns, large image files can still pose significant challenges. Many organizations that can’t utilize cloud storage options for compliance reasons or find the cost of those platforms prohibitively high. 

While they may be able to add more on-premises storage easily enough, this option can also quickly become quite costly. Companies often need to procure much more storage than they may need on a day-to-day basis in order to meet redundancy requirements. Scaling physical storage also locks firms into burdensome equipment refresh cycles.

But simply storing images is only part of the challenge of data management. Large files are more difficult to move, even if an organization has a customized solution in place. If images can’t be shared quickly and easily through a secure platform, users may turn to riskier third-party applications.

Image compression alleviates these problems by reducing the overall size of image files. By compressing image files, organizations can maximize their storage potential and share files more easily. Image compression can also improve website and application performance by reducing the time it takes to load images. 

Although there are many different methods of compressing images, they all involve algorithms that use a variety of shortcuts to reduce the overall size of pixel data. In some instances, compression involves the elimination of image data, which can degrade the image quality and make it impossible to return to its original size (lossy or irreversible compression). Other techniques retain the original image data, but can’t achieve the same level of compression (lossless or reversible compression). 

Image Compression SDKs and Your Applications

While there are many compression options available in commercial imaging software, organizations often need the ability to compress image files within their core business applications without any external dependencies. Opening an image file with another program not only takes additional time and disrupts efficient workflow, but it also creates the potential for security risks and version confusion.

Consider, for instance, a medical provider that needs to send a high-resolution MRI scan to another provider. If the file is too large to deliver electronically, someone may try to get around the problem by using another program to compress the scan and then send it as an attachment over email or share it through a cloud platform. Suddenly, the confidential image file has been accessed by potentially vulnerable third-party applications, which creates a serious compliance issue. To make matters worse, the compressed image may not be associated with the patient’s file in the EHR system. And that’s not even getting in the question of whether or not the compression technique used damaged the image integrity!

An image compression SDK like ImageGear allows developers to integrate the ability to compress and convert image files into their applications without compromising security, efficiency, or quality. Optimized, standards-based compression libraries with support of formats like TIFF, PDF, PDF/A, JPEG 2000, JPEG, and DICOM deliver fast compression/decompression capabilities while ensuring that images remain high quality. 

The primary advantage of integrating image compression capabilities directly into an application is the lack of third-party dependencies. This is crucial for software that is gathering and managing image files because it doesn’t cause any workflow disruptions. With an image compression SDK integration, image files can be shrunk down to more manageable sizes programmatically, which aids significantly in automated processes. Since the images are being compressed entirely within the application, it’s also easier to maintain strict version and access control throughout the life cycle of the file.

Image Compression SDKs vs Open Source Solutions

Many developers turn to open source compression libraries when looking to integrate image compression features into their applications. While this often seems like an easy, low cost solution, open source codecs can lead to unforeseen problems over time. Since many of them are not actively maintained, troublesome bugs can go unresolved and security gaps can create serious privacy risks.

One infamous example of this problem involved the widely used “Cornell Codec,” one of the first open source libraries that supported lossless JPEG compression. Developed in 1994, it was quickly adopted by many healthcare applications that needed to compress high-resolution medical images like MIRIs, CT scans, and X-Rays. 

Unfortunately, the codec had a problem. When it compressed images into DICOM files (the industry standard used in medical imaging applications), it produced an error that made them unreadable when they were decompressed. Since the Cornell Codec was an open source solution embedded into numerous applications, the problem went unresolved for many years until Accusoft developed a code based workaround for our customers.

By choosing a well-supported image compression SDK like ImageGear for their application’s compression needs, developers can rest easier knowing that they’re deploying a tried and true solution that won’t create unexpected problems for their customers. Another benefit of a comprehensive image compression SDK is that it will provide a variety of compression libraries that can accommodate almost any file type and use case. ImageGear, for example, supports more than a dozen unique image compression types, including JPEG (lossy/lossless/progressive), RAW, ASCII, and Deflate.

ImageGear: More Than an Image Compression SDK

Image compression is just one of ImageGear’s many powerful document and image processing features. A versatile code-based solution, ImageGear allows developers to quickly integrate image conversion and cleanup features to their application along with editing, annotation, viewing, scanning, and printing capabilities. With support for a huge number of today’s leading document and image file formats as well as medical imaging support with ImageGear Medical, this SDK toolkit delivers the functionality developers need to get their applications to market faster. See what ImageGear can do for your application today by downloading a free trial.

Document image cleanup is a vital step in building an efficient and accurate processing workflow. In a perfect world, every file an organization receives would be in pristine, high-resolution condition so it could be processed quickly and easily. Unfortunately, the reality is that documents come in all sizes, conditions, and formats. Companies can receive vital information in the form of email, traditional mail, fax, or even text. Documents scanned into a crooked, low-resolution file are just as likely to be received alongside digital versions submitted entirely through a web application.

This poses a significant challenge for software developers building the next generation of automation solutions. Without some way of cleaning up document images, companies that still rely upon manual processes will struggle to read and process files. More importantly, poor image quality interferes with optical character recognition (OCR) engine accuracy, making more human interaction necessary to verify recognition results. By integrating document image cleanup tools into their applications, developers can enhance the speed and accuracy of their automated processes and help their customers leverage the full potential of digital transformation.

7 Essential Document Image Cleanup Features Your Application Needs

There are a few essential document image cleanup tools that should be considered absolutely essential for any application that has to manage multiple file formats. To see these tools in action and understand why they’re so vital, let’s take a look at how these features work in ImageGear, Accusoft’s powerful document and image processing SDK integration.

1. Despeckling

Speckles can appear on document images for a variety of reasons. In some cases, they are unwanted image noise created during the original scanning process (the classic “salt and pepper” noise), but in other instances, they’re simply the result of dust particles on the surface of a scanned document or on the scanner itself. They are frequently encountered when converting old documents into digital form. Speckling not only interferes with OCR engine performance, but can also make it difficult to maintain image fidelity when compressing or converting files. 

ImageGear can reduce or eliminate speckling as part of the document image cleanup process. There are two ways to approach speckle removal:

  • Despeckle Method: This function removes color noise from 1-bit images by taking the average color value in a square area around the speckle and replacing its pixels with that value.
  • GeomDespeckle Method: This function uses the Crimmins algorithm to send the image through a geometric filter, reducing the undesired noise while preserving edges of the original image. This process is applied only to 8-bit grayscale images.

2. Image Inversion

With so many documents being scanned, converted, and transferred between applications, there’s a greater likelihood of something going wrong along the way. One of the most frequent problems is image inversion, which swaps pixel colors and turns a standard white background with black text into a black background with white text. This mix-up can render documents completely unreadable by OCR engines.

ImageGear can be configured to automatically recognize when image inversion is necessary. The invert method can also be used to immediately change the color of each pixel contained in the entire image, turning white to black and black to white.

3. Deskewing

Skewed document images are both cumbersome to manage and challenging for OCR engines to read accurately. Unfortunately, manually scanned documents are often uneven, and the problem is only becoming worse now that many people are using their phone cameras as makeshift document scanners. That’s why the first step in the document image cleanup process is often deskewing, which rotates and aligns the images to enhance recognition accuracy.

The deskewing process often involves more than just rotating a document, especially where images taken by a digital camera are concerned. ImageGear’s 3D deskew feature corrects for perception distortion, which can occur whenever a document is scanned by a handheld camera, using a sophisticated algorithm.

4. Blank Page Detection

Many documents converted into digital format contain information on both sides. If they are fed into a scanner along with single page documents, the resulting file will contain multiple blank pages. This might not seem like much of a problem, but if there is enough speckling or noise around the edge of the image, an application may try to apply an OCR engine to it and generate an error result. Blank page detection can quickly identify any image that is blank or mostly white and flag it for deletion.

5. Line Removal

Although they may not seem very troublesome at first glance, lines can create a number of problems for OCR engines. When lines and printed text overlap, it can be difficult for the engine to distinguish between the two. In some instances, the engine may even misread a line as a letter or number. Removing lines from a document prior to OCR reading ensures that the remaining text will be recognized more quickly and analyzed more accurately.

ImageGear supports both solid line removal and dotted line removal. The first method automatically detects and removes any horizontal and vertical lines contained in the document (like frames or tables), while the second method determines which dotted lines to remove by measuring the number and diameter of dots.

6. Border Removal

When scanned documents don’t align properly with the boundaries of the scanner or were copied onto paper that was larger than the original image at some point, the remaining space is often filled in with black. These borders are not only unsightly, but they also interfere with other document image cleanup processes. Although they can usually be cropped out easily, the cropping process alters the proportions of the image, which could create more problems later.

Removing these large black regions is easy with ImageGear’s CleanBorders option. It focuses on the areas near the edge of the page, which typically should not contain any important image data. 

7. Remove Hole Punches

Important documents were often stored in binders before they were prepared for digitization. When scanned, the blank space from the hole punch leaves a large, black dot along the edge of the document. Unfortunately, these holes sometimes overlap with text or could be picked up as filled-in bubbles by an optical mark recognition (OMR) engine.

ImageGear can identify and remove punch holes created by common hole punchers, including two, three, and five hole configurations. The RemovePunchHoles method can be adjusted to account for differing hold diameters in addition to different locations.

Unlock Your Application’s Document Image Cleanup Potential with ImageGear

Although ImageGear can perform a variety of document handling functions such as viewing, conversion, annotation, compression, and OCR processing, its document image cleanup capabilities help applications overcome key content management challenges and enhance performance in other areas. Improved document image quality allows data to be extracted more quickly, enhances the viewing experience, and reduces complications when it comes to file compression and conversion.

Learn more about the ImageGear collection of SDKs to discover how they can deliver versatile document and image processing to your applications.