Technical FAQs

What are the best quality images to use when processing form data and recognition?

Question

What are the best quality images to use when processing form data and recognition?

Answer

In all cases, you’ll want to have your images as clear and as clean as possible. For any particular procedure, please consider the following: OCR and ICR: Capture images in at least 300 DPI resolution. Ideally, working in black and white will allow the objects of interest on your image to be better defined and recognized. Free the image form all noise as much as possible. As if a human was reading it, you’ll want the text objects on the image to be as legible as possible. For ICR, make sure that the characters are printed (no cursive text, etc). Barcode recognition: As with OCR and ICR, capture images in at least 300 DPI and working with black and white content can provide excellent results. You’ll also want to make sure that the bars in the barcodes are clearly defined on the image and are not mal-formed (for example, the barcodes should have the proper start and stop sequence, etc). As always, clear as much noise from the image as possible. Forms matching and registration: As with the prior 2 items above, capture your documents in at least 300 DPI. Make sure that your resolution is consistent between your form templates and incoming batch images as well. Form templates should only contain data that is common to every image that is being processed (i.e. – Form fields, the text that appears on the blank form itself, etc). The template should not have filled-in field information as this will affect the forms matching process.

View Detail >

What quality should my images be for processing form data and recognition using FormSuite?

Question

What quality should my images be for processing form data and recognition using FormSuite?

Answer

In all cases, you want to have your images as clear and as clean as possible. For any particular procedure, please consider the following:

OCR and ICR: Capture images in at least 300 DPI resolution. Ideally, working in black and white allows the objects of interest on your image to be better defined and recognized. Free the image form all noise as much as possible. As if a human were reading it, you want the text objects on the image to be as legible as possible. For ICR, ensure that the characters are printed (no cursive text, etc).

Barcode recognition: As with OCR and ICR, capture images in at least 300 DPI and working with black and white content can provide excellent results. Ensure that the bars in the barcodes are clearly defined on the image and are not malformed (for example, the barcodes should have the proper start and stop sequence, etc). Clear as much noise from the image as possible.

Forms matching and registration: As with the prior 2 items above, capture your documents in at least 300 DPI. Ensure that your resolution is consistent between your form templates and incoming batch images. Form templates should only contain data that is common to every image that is being processed (i.e. Form fields, the text that appears on the blank form itself, etc). The template should not have filled-in field information as this will affect the forms matching process.

View Detail >

Why the Ability to Watermark Documents Matters for Applications

The ability to watermark documents is essential for many businesses. By utilizing watermarks, organizations can quickly brand their assets and take steps to protect their valuable intellectual property. When identifying mission critical document features for their applications, developers need to keep in mind all the ways that watermarks continue to be deployed for a variety of use cases.

What Is a Watermark?

A watermark is a semi-transparent line of text or an image that is embedded into a file to indicate that it belongs to a person or organization and cannot be reproduced or distributed without permission. In most instances, watermarks are visible, but some text-based watermarks are embedded in such a way that they only become visible after the document is scanned or printed.

Digital watermarks are one of the earliest and most effective forms of security for documents and images. They are most commonly used by photographers who want to protect their images from unauthorized usage. Their watermark is superimposed over the image, making it difficult to remove without also damaging the source image in the process.

The same basic principle applies to documents, especially PDF files that are easy to download, view, and distribute. Many organizations continue to add specialized watermarks to their documents to protect both confidential information and intellectual property. Understanding which types of watermarks to use and why is an important aspect of effective document security.

Why Should You Watermark Documents?

There are a few reasons why an organization may want to watermark documents. Although they’re most often associated with security, they can be used for other purposes as well. Here are a few major use cases involving document watermarks:

Classify Documents More Easily

File management can be a challenge for organizations of all sizes. While steps like naming conventions can help somewhat, it can be difficult to keep track of which files can be shared openly and which ones should only be opened and viewed within a secure application environment. Watermarks can be added to documents to indicate how they should be handled. A contract that’s labeled “Confidential,” for instance, makes it clear to the user that they should follow the proper security precautions when viewing the file, such as not leaving the screen open unattended or not sharing their screen with another participant on a video call. It also removes any ambiguity about the nature of the document.

Track Your Documents

While many free watermarking tools only allow users to add a generic logo or text to a document, dedicated viewing solutions like PrizmDoc Viewer can create watermarks that contain unique identifiers, making it easier to track documents throughout their lifecycle. This makes it easier to track down the source of a leak or breach if a file ends up somewhere unexpected. In addition to the security benefits, customized watermarks that include a user name, time stamp, or revision number helps organizations maintain better version control over their documents and avoid any confusion over which file is the most up-to-date.

Discourage Unauthorized Distribution

Watermarks are an easy way to indicate ownership of a document and its contents. While many files, and PDF files especially, are protected by some form of digital rights management (DRM) that limits who can open, edit, or copy them, this form of security is often circumvented by inventive tactics like taking a screenshot, capturing video footage, or even photographing a screen with an external device.

If the captured document has a watermark, it will be quite difficult for someone to distribute the file without revealing the owner. While it’s possible to remove a watermark without damaging the document contents, doing so can be a difficult and time-consuming process that often isn’t worth the effort. More importantly, if the watermark contains identifying information, it’s often possible to find out where a security breach occurred and then locate those responsible for it.

How to Watermark Documents

While many organizations understand the value of adding watermarks to their documents, they often don’t have the tools in place to do so effectively at scale. There are a number of free watermarking web applications available, but these sites typically require a file to be uploaded first, which creates a huge security risk for most businesses. In many cases, this process is also irreversible because it burns the watermark into the document itself.

Microsoft Word can easily add watermark text or images to a document, but they’re also quite easy to remove. In fact, a quick Google search points to multiple resources for removing Word-based watermarks from both DOCX and PDF files.

PrizmDoc Viewer supports multiple forms of document watermarks natively within an application, including identifying text, diagonal text, and images. More importantly, since the watermark is associated with the document viewing session, it isn’t applied to the source document itself, which remains secure within the main application. Watermark document properties can be specified to include more than one watermark in a viewing session, even supporting mixed types on different areas of the page.

Text Watermarks

The watermark API for PrizmDoc Viewer allows users to adjust the location, size, and style of the text. Information included can be adjusted based on document needs.

Diagonal Text Watermarks

PrizmDoc Viewer can apply a single directional text watermark across the center of the page. By including control characters in the text string, it’s possible to create line breaks in the text.

Image Watermarks

The watermark API can also add a transparent PNG file to the document, which is ideal for displaying a company logo. PrizmDoc Viewer can adjust the opacity of the watermark regardless of whether it’s an image or text.

By adding multiple watermarks, organizations can easily brand their files across the company while also providing effective tracking information for individual documents.

Start Watermarking Documents Today with PrizmDoc Viewer

A fully-featured HTML5 viewer, Accusoft’s PrizmDoc Viewer is easily integrated into web applications in need of comprehensive viewing and document management capabilities. In addition to watermarking, it delivers conversion, OCR, annotation, and redaction support for more than 100 different file types. Get a hands-on view of how easily PrizmDoc Viewer can apply custom watermarks to your documents with our watermarking API demo.

View Detail >

Privacy in Practice: Empowering Your Patient Portal with PrizmDoc Cloud API Functionality

Powerful patient portals are now essential for healthcare organizations to deliver high-quality care, even at a distance. Despite advancements around functionality, however, challenges remain. As noted by Healthcare Info Security, many healthcare providers still struggle with providing solutions to patients in an easily accessible portal that provides the security they require by law.

Here, HIPAA compliance is critical. Healthcare organizations need portal solutions that deliver valuable information without undermining regulatory requirements around data security and handling. Creating innovative, secure patient portals demands HIPAA-compliant tools that deliver advanced viewing and redaction tools while keeping privacy in practice.

The State of Healthcare Security

Security remains a problem for healthcare organizations as attackers ramp up efforts to access private patient and operational information. The healthcare industry saw more than 41 million records breached in 2019 and new attack vectors are now emerging as hackers look to leverage pandemic pressures and breach corporate security. It’s no surprise, then, that last year saw 28,261 HIPAA complaints, the highest number ever recorded, as organizations deployed more user-friendly technology and attackers looked to capitalize on potential weaknesses.

Ramping up security in patient portals and meeting emerging patient needs is a priority for organizations. Accusoft’s PrizmDoc Cloud, a HIPAA compliant solution, is capable of offering user-friendly portal capabilities inside your own secure application. The right combination of existing technology and cloud-based application programming interfaces (APIs) can take your patient portal to the next level. Let’s break down five key cloud-based APIs that can help patient portals deliver on practical potential.

Robust Document Viewing with PrizmDoc Cloud API

Effective medical care depends on documents. From patient consent forms to test results and referrals from other healthcare practices, documents form the core of custom-built treatment plans. While the transition to electronic health records (EHRs) has helped reduce the complexity and confusion that comes with paper-based processes, this digital transition has introduced the challenge of document diversity.

From typical Word documents to Excel spreadsheets and scanned images of handwritten forms, patients need the ability to access documents on-demand, while healthcare organizations must ensure that patient access options are both secure and HIPAA-compliant.

The PrizmDoc HTML5 Document Viewing API offers document and image viewing while also streamlining the process with key features including:

Responsive Web UI — Patients and staff can easily view documents and images that are scaled to fit their tablet, laptop, or mobile phone.
Configurable Controls — Organizations can easily enable or disable tabs, localization, rendering options, and encryption within their patient portal.
Microsoft Office (MSO) Conversion — Healthcare agencies can integrate true native viewing of Word, Excel, and PowerPoint documents.

Reliable PII Redaction

Data privacy is paramount for HIPAA compliance. As noted by Managed Healthcare Executive, this is especially critical in the world of COVID-19. With telehealth now the “new normal” — and likely to continue long after the pandemic subsides — organizations must ensure that protection of personally identifiable information (PII) remains intact.

While robust encryption and identity access management (IAM) tools form part of this function, redaction is another critical aspect. Consider the case of children. As noted by the Health Info Security piece, although parents typically have complete access to the medical records of children under 12, PII for those between the ages of 13 and 18 — such as mental health records — may be restricted. For healthcare agencies, this requires patient portal solutions that allow parents access to some data while also protecting specific PII. Here, robust redaction APIs that allow organizations to obfuscate key information are critical to meet regulatory requirements without compromising ease-of-access.

Regulated Image Compression

Images form a critical component of effective patient prognosis and treatment plans, and while DICOM files used in high-fidelity imaging are often a priority for medical agencies, there’s also a need for image compression solutions that enable the portability of more common image types such as JPEGs.

Consider the simple case of patient identification. By attaching high-quality photos to patient records, medical staff are better equipped to ensure the individual they’re assisting — virtually or in-person — is the patient linked to the account. High-quality JPEG photos are also useful to record and track the progress of specific physical ailments over time. Cloud-based image compression APIs streamline this process with the ability to compress individual or multiple files, set desired quality, remove metadata, and set JPEG mode output.

Rapid File Conversion

Complexity remains a challenge for healthcare records management. As patients visit general practitioners (GPs) and specialists, data volumes rapidly increase, in turn making it difficult for doctors to find specific information and create comprehensive treatment plans.

Multi-file combination and conversion to popular formats such as PDF helps solve this problem — not only can healthcare staff create files that are easily viewed by doctors and patients alike but administrators can also set key permissions around editing, annotating, and printing to ensure information remains secure. File format conversion with PrizmDoc Cloud APIs can help enhance patient portals with key features including:

Easy combination of multiple files into single PDFs
Data security with optional password protection
Specific section or entire file conversion
Searchable output formats

Relevant Watermarking

Last on our list of patient portal APIs is watermarking. By labeling key documents with unique healthcare watermarks, organizations can both improve front-line security and enhance HIPAA compliance. By training staff to only accept and process watermarked images and documents, companies can reduce the risk of potential compromise. If attackers attempt to spoof or modify key documents they can be easily detected because they won’t carry corporate watermarks. These marks also form a key component of auditing and data tracking if healthcare agencies are evaluated for HIPAA compliance by providing a visible chain of custody around document creation, storage, and access.

User-friendly patient portals are critical for healthcare companies to survive in the “new normal” — and embrace what comes next. But speedy access requires a robust security balance; document viewing, redaction, compression, conversion, and watermarking APIs from Accusoft can deliver privacy in practice and capitalize on patient portal potential. Try PrizmDoc Cloud API.

View Detail >

Scalable Vector Graphic (SVG) Format: Seeing the Big Picture

The scalable vector graphic (SVG) format continues to enjoy steady adoption across the web. According to data from W3Techs, SVG now accounts for 25 percent of website images worldwide. But it wasn’t always this way. In 1998, it became apparent that vector-based graphics had a future on the web, and the W3C received six different file format submissions from technology companies that year. Some were mere proposals ready for a complete revamp, while others were proprietary products that W3C wasn’t permitted to modify. Instead of forging a format from one of the submissions, however, W3C’s SVG working group decided to start from the ground up — and SVG was born.

While the file format had lofty ambitions, focusing on common use rather than specific syntax, the original iteration was cumbersome and complex. However, SVG has improved year after year after year. With increased support came more streamlined functionality and usable features. Now, SVG is often the first choice for meeting the evolving demands of scalable, responsive, and accessible web content.

What is a Scalable Vector Graphic (SVG) and how does it work?

Today, SVG is the de-facto standard for vector-based browser graphics. But what exactly is this file format, and how does it work?

Based on XML, SVG supports three broad types of objects:

Vector graphics including paths and outlines that are both straight and curved
Bitmap images such as .jpeg, .gif, and .png
Text

What sets SVG apart from bitmap-based images is the use of lines and curves along the edges of graphical objects. Because bitmap images use a fixed set of pixels, scaling them up creates blurriness where the edges of pixels meet. In the case of vector images, meanwhile, a fixed-shape approach allows the preservation of smooth lines and curves no matter the image size.

SVG also offers the benefit of interoperability. Because it’s a W3C open standard, SVG plays well with both other image format and web markup languages including JavaScript, DOM, CSS, and HTML. This allows the format to easily support responsive design approaches that scale websites and web content based on the user device rather than defining standardized size parameters. Thanks to the curves and lines of SVG, scaling presents no problem for responsive designers looking to ensure consistency across device types.

The Benefits of SVG

While scalability is often cited as the biggest benefit of SVG, this format also offers other advantages, including:

Responsiveness — Images can be easily scaled up or down and modified as necessary to meet web design and development demands.
Accessibility — Since SVG is text-based, content can be indexed and searched, allowing both users and developers to quickly find what they’re looking for.
Performance — Image rendering is quick and doesn’t require substantive resources, allowing sites to load quickly and completely.
Use in Web Applications — Browser incompatibilities and missing functions often frustrate web design efforts, forcing developers to use multiple tool sets and spend time checking content and images for potential format conflicts. SVG, meanwhile, offers powerful scripting and event support, in turn allowing developers to leverage it as a platform for both graphically rich applications and user interfaces. The result? Better-looking sites that enhance the overall user experience.
Interoperability — Because SVG is based on W3C standards, the format is entirely interoperable, meaning developers aren’t tied to any specific implementation, vendor, or authoring tool. From building their own framework from the ground up to leveraging third-party SVG applications, web developers can find their format best-fit.

SVG in PrizmDoc Viewer

Accusoft’s PrizmDoc Viewer offers multiple ways for developers to make the most of SVG elements at scale, such as:

File Transformation — Conversion is critical for effective and efficient web design. If development teams need different file transformation tools for every format, the timeline for web projects expands significantly. PrizmDoc Viewer streamlines this process with support for the conversion of more than 100 file types — including PDFs, Microsoft Office files, HTML, EML, rich text, and images — into browser-compliant SVG outputs. In practice, this permits near-native document and image rendering that’s not only fast, but also accessible anytime, anywhere, and from any device.
HTML5 Functionality — Using SVG in PrizmDoc Viewer is made easier thanks to native HTML5 design. The use of HTML5-native framework not only improves load times with smaller document sizes but means that PrizmDoc Viewer works in all modern web browsers — while also dramatically enhancing document display quality.
Pre-Conversion — One of the biggest challenges with viewing large documents in a browser is delay. Pages toward the end of the document may take longer to load and frustrate users looking to quickly find a specific image or piece of information. PrizmDoc Viewer solves this problem with a pre-conversion API that returns the first page as an SVG while the rest of the document is being converted, allowing users to interact with documents as conversion takes place and lowering the chance that files will experience format-based delays.

SVG hasn’t always been the go-to web image format. Despite a promising start based on open, interoperable standards, the lack of early support and specific use cases for vector-based file formats saw SVG sitting on the sidelines for decades.

The advent of on-demand access requirements and mobile-first development realities has changed the conversation. SVG is now continuously gaining ground as companies see the benefit in this scalable, streamlined, and superior-quality file format. Get the big picture and see SVG in action with our online document viewing demo, or start a free PrizmDoc Viewer trial today!

View Detail >

How Strategic Acquisitions Helped Accusoft Drive Innovation and Growth

For over 30 years, Accusoft has developed groundbreaking digital imaging technology that has revolutionized the way applications manage and process content. From high-resolution image compression and file format conversion to data extraction and barcode recognition, Accusoft technology can be found in some of the most sophisticated and widely used software in the world. The company’s enduring success is built upon a combination of patented engineering innovations and key strategic acquisitions.

Humble Beginnings and Early Success

The origins of Accusoft go back to the late 1980s, when founder and CEO Jack Berlin got a hands-on look at one of the first digital cameras at a trade show and developed an interest in the nascent technology.

“I just fell in love with it and wanted to get involved in digital imaging,” he recalls. That desire led Berlin to found Pegasus Imaging Corporation in September of 1991. “It started as a hobby, but it became serious pretty fast.”

The company quickly found its niche selling innovative image compression and decompression technology that software developers could incorporate into their applications. Over the next decade, Pegasus Imaging made a name for itself thanks to advancements in lossy and lossless JPEG compression that were adopted by leading photo imaging companies such as Kodak, Canon, Nikon, Fujifilm, and Konica Minolta.

In 1998, Pegasus Imaging expanded into medical imaging with the groundbreaking PICTools Medical SDK. The company quickly made a splash in the market by solving the infamous “DICOM bug” associated with the Cornell codec, an open-source codec frequently used for lossless JPEG compression. Pegasus Imaging’s engineers were able to identify why the codec rendered some images unreadable after decompression and provide a means of both compressing images safely and decompressing files once thought to be corrupted beyond repair. That ability to quickly solve imaging problems helped make PICTools Medical a major success with some of the world’s largest medical technology companies, including GE Healthcare, McKesson, Philips Medical Systems, Siemens Medical Solutions, and Toshiba.

Evolution and Expansion

Although Pegasus Imaging enjoyed a great deal of success in its first decade, Berlin knew the company needed to grow if it was going to retain a competitive edge in the 21st century.

“The challenge with the technology business is you have to improve it or lose it,” he says. “You don’t get to sell the same product ten years later. You have to continue to innovate.”

The early 2000s saw Pegasus Imaging embark on a series of acquisitions that helped expand its business and continue pushing the boundaries of innovating in digital imaging. Each merger involved a strategic consideration of how it would position the company to evolve to meet the complex challenges of the future.

“People typically want to buy growth, but we look at it differently,” Berlin says. “We’re a big believer that the reason to buy companies is their customers, their products, and their people. Products are worth nothing without their people.”

In 2004, Pegasus Imaging acquired TMSSequoia, which had developed the most sophisticated structured forms processing and document image cleanup technology in the world. The people behind that technology, however, proved even more valuable to the growing company, contributing to the development of several proprietary innovations over the years that followed. Their ongoing contributions can be seen in products like FormSuite, which remains an industry leader in forms processing technology.

The TMSSequoia acquisition, along with the development of its lightning fast ImagXpress toolkit, set Pegasus Imaging on the path to becoming a much bigger player in the document imaging market. Its next major acquisition came in 2008 with the purchase of AccuSoft, a Massachusetts-based imaging software company best known for its powerful ImageGear SDK. No sooner had the deal closed, then Pegasus Imaging expanded once again by acquiring the UK-based Tasman Software, which had developed groundbreaking barcode recognition technology that would eventually be incorporated into the popular Barcode Xpress SDK product.

Over the course of just a few years, Pegasus Imaging had greatly expanded its portfolio of document imaging and processing solutions by strategically identifying the technology and engineering talent that could fuel the company’s growth. To Berlin, making the most of those acquisition opportunities are critical to sustained success in a competitive industry:

“It’s grow or die. We’re up against larger and larger competitors. Consolidation gives you market share and economies of scale. If profitability goes up, you can invest in new product development, marketing, and growth.”

In 2009, Pegasus Imaging changed its name to Accusoft Pegasus to rebrand the company as a key player to watch in the document imaging and processing industry.

Integration and Innovation

The acquisition of AccuSoft expanded the company’s market share and expertise, but it also created a new challenge for the engineering team. AccuSoft’s ImageGear SDK frequently competed directly with Pegasus Imaging’s ImagXpress. While there was no question that the newly rebranded Accusoft Pegasus would continue to support customers using these products, strategic decisions had to be made about their respective futures.

As the team evaluated each product, it quickly became clear they were the result of very different approaches to software development.

“ImagXpress was much easier to use, but it didn’t have all the platforms and features of ImageGear,” Berlin recalls. “Their idea was to throw everything but the kitchen sink into the product, and if somebody complained, fix it. Pegasus wanted to make everything perfect and add features very slowly. I think there was a happy medium there somewhere.”

ImageGear’s ability to support multiple platforms and the rapid development of modern compilers eventually made it a more attractive SDK for developers, but the Accusoft Pegasus team incorporated the best features of ImagXpress into it to make ImageGear an even better product. Throughout the integration, the company put new processes into place to strike a balance between speed and perfection. Product management structures were also established to improve feedback loops, lay down feature roadmaps, and keep to development timelines without compromising quality.

All of these lessons would prove invaluable when Accusoft Pegasus acquired Adeptol in 2011 and added the company’s AJAX Document Viewer to its portfolio. While Adeptol’s browser-based viewing technology was undoubtedly innovative and correctly predicted what future web applications would require from viewing integrations, the product itself was Flash-based. In addition to being a proprietary technology, Flash contained multiple security vulnerabilities and was already on the decline by 2011.

The Accusoft Pegasus team got to work rebuilding the viewer using HTML5, making it much easier for developers to incorporate viewing features into web-based applications. As the new product took shape, it also allowed the company to expand into the growing market for cloud-based API integration products. Released as Prizm Content Connect in 2012, the HTML5 viewer would be rebranded in 2016 as PrizmDoc.

30 Years of Growth and Innovation

In 2012, Accusoft Pegasus rebranded once again to become Accusoft. Today, the company remains a pillar of the tech community in Tampa, FL even as it strives to expand its business globally. In 2021, Accusoft celebrated its 30th anniversary, a significant milestone for a privately held, employee-owned corporation that began as a hobby.

For Berlin, those humble origins and the long journey to success are what make a company like Accusoft special.

“It’s not just dollars and cents, but a sense of pride and competitiveness,” he says. “It stops being about money at some decimal point. It’s about the people and the legacy and seeing what you can do. We give back a lot to our community. I really enjoy that. We participate in our community, both in tech forums and charity drives and work days. If we cease being Accusoft or cease being Accusoft in Tampa, FL, that’s gone.”

Accusoft and Snowbound Join Forces

That same sense of pride and passion was shared by Simon Wieczner, President, CEO, and co-founder of Snowbound Software. Established in 1996, Snowbound first made a name for itself in the document imaging market with the RasterMaster conversion and imaging SDK. RasterMaster supports hundreds of formats and uses proprietary technology to quickly convert, archive, and display files in high resolution without loss of fidelity.

As two of the leading innovators in document imaging integrations, Snowbound and Accusoft routinely found themselves competing for the same customers over the years. “We were competitors, but not fierce competitors,” Wieczner recalls. “We would mostly run into each other at trade shows and talk about the market.”

Berlin first floated the idea of merging the two companies around 2015, but it wasn’t until much later that acquisition conversations turned serious. With so much money being invested in the tech industry over the previous decade, Snowbound had already received substantial interest from potential buyers, but most of the offers didn’t sit well with Wieczner.

“There’s been a craze over the last few years of growth equity companies looking for SDK companies,” he says. “They were offering good dollars, but without understanding our technology. So there was a little bit of distrust on my part. Some actually wanted to outsource everything and that would totally destroy the company.”

Many potential suitors were primarily interested in incorporating Snowbound’s technology into their own products rather than supporting existing customers and continuing to sell into the market. As conversations with Accusoft continued to progress, Wieczner realized that he’d already found the ideal acquisition partner.

“Accusoft understood the market, what our company did, and how we could fit together,” he says. “That’s why we felt ready to move on together.”

“Both Simon and I are passionate about the success of the company,” Berlin says. “It’s what small business people do. We tend to worship entrepreneurs that get in, build a shell, and get out with a billion dollars, but I don’t know if they’re fulfilled because there’s such fulfillment in seeing what you’ve built accomplish something.”

In late 2022, the companies moved ahead with a deal that saw Accusoft acquire Snowbound. The merger brings Snowbound’s RasterMaster^® and VirtualViewer^® products into the portfolio of Accusoft solutions:

RasterMaster^®: A Java-based conversion and imaging SDK, RasterMaster can be incorporated into applications for all computing platforms, including Unix, Linux, Windows, and Mac. The SDK supports hundreds of document types through its proprietary format library and provides a variety of document imaging tools, such as annotations, redactions, OCR, text extraction, and image cleanup.
PrizmDoc^® for Java, formerly VirtualViewer^®: Developed to meet the demanding needs of the banking industry, PrizmDoc® for Java is an HTML5-based viewer that can be easily incorporated into web applications to allow users to view, annotate, and redact documents and images from any platform, anywhere. PrizmDoc® for Java’s library of APIs and out-of-the-box connectors for popular ECM applications (including Alfresco, IBM, and Pegasystems) make it a powerful tool for developers looking to quickly integrate advanced document workflow capabilities.

The Future

In keeping with Accusoft’s long history of strategic acquisitions, the Snowbound merger brings with it an influx of new customers, new technology, and new talent into the Accusoft family. Those resources will help the company to continue investing in innovation to compete in an increasingly high-stakes market.

“It’s a real challenge to incorporate other companies and take that risk, but they’ve done well with it,” Wieczner says. “We were deliberate in selecting an organization with a leadership team and product portfolio that would continue to grow, develop, and nurture what we have built at Snowbound.”

With three decades of experience and success to draw upon, Accusoft is better positioned than ever to meet the evolving needs of its customers and deliver a new generation of document imaging products powered by groundbreaking technology.

View Detail >

How ImageGear Medical Is Enhancing EHR Applications & Systems

For today’s healthcare organizations, having a versatile electronic health records (EHR) system is essential for running an efficient practice and connecting to other medical providers. Thanks to EHRs, practices can ensure that they’re getting a complete picture of a patient’s health and treatment history, which allows them to deliver much better care outcomes. As developers continue to refine the usability of these systems, they need to consider how they can improve core features like healthcare electronic document management and medical imaging support.

Managing Medical Documents

A typical EHR system has to be able to handle quite a lot of document types. Anyone who has visited a healthcare provider is quite familiar with the myriad forms used to gather patient information. Many of those forms end up being converted into digital formats that need to be managed within the EHR system. Then there are digital versions of lab reports, physician notes, invoices, and financial documents.

While EHR systems may utilize databases to store much of the information they need, healthcare providers still need to be able to produce physical documents and view digital files in many situations. This could include communicating information to patients, complying with regulatory requests, or filing a financial claim of some kind. More importantly, they also rely on digital documents to enter data into the EHR system. The push toward interoperability between EHR systems has improved information sharing, but there are still many instances where medical records are delivered in the form of a document that needs to be managed securely.

Document Conversion

If an EHR application lacks the right file conversion capabilities, viewing and extracting data from those documents could prove difficult. The last thing a practice wants to do is actually remove them from the secure EHR system to open and convert the files using separate software that may not be compliant when it comes to handling healthcare information. Even if the external application is secure, transferring files over, converting them, and then transferring them back is both inefficient and creates unnecessary risk (especially if someone forgets to delete the original file or move it back into the EHR environment).

ImageGear Medical has a document conversion feature that supports a wide range of file types, allowing developers to build EHR applications capable of quickly converting incoming documents. They can even set up their solution to perform conversion tasks programmatically to help streamline workflows and minimize human error. This helps practices to get a better handle on document management, ensuring that they will be able to do everything they need with files completely within the EHR application.

Managing DICOM Files

One of the biggest challenges healthcare organizations face is with managing medical imaging files. When providers need to send X-Rays, MRIs, or CT Scans, they use a standardized file format known as Digital Imaging and Communications in Medicine (DICOM) files. These files are more than just image files, however. They contain extensive datasets that provide a patient’s information along with image pixel data for multi-dimensional medical scans. A DICOM file can be quite large due to the high-resolution image data used by most medical imaging equipment.

Although most EHR systems are capable of transmitting DICOM files (via a DICOM out or DICOM send feature), they usually can’t actually view them in their native format. Since Windows doesn’t recognize them as image files, additional viewing software is typically needed to open and view them. This is why physical storage, like discs and flash drives, are often used to transfer DICOM files along with the necessary viewing software.

ImageGear Medical helps to solve the DICOM dilemma thanks to its extensive conversion and compression capabilities. By decoding the complex data contained within the file, ImageGear Medical can convert DICOM files into image formats that are much easier to view and manage. This is especially useful for smaller practices that don’t have a picture archiving and communication system (PACS) capable of storing, retrieving, distributing, and viewing high-quality medical images.

Converting DICOM files makes it possible for healthcare professionals to view them on any device connected with their EHR system. That could include tablets or other IoT devices that healthcare technology companies are rolling out to put critical medical data on the front lines of everyday care. Developers can also use ImageGear Medical’s conversion tools to allow their EHR system to share viewable versions of diagnostic scans with patients, allowing practices to make good on the promise of providing patients access to their essential health data at all times.

The sheer size of DICOM files makes them difficult for many practices to manage. Simply compressing them tends to degrade the image data, which can create significant problems when files are unpacked and opened for viewing. Losing even a small degree of image quality can make it much harder to render an accurate diagnosis. In some cases, poorly designed compression can even make it nearly impossible to uncompress again at all. Thanks to powerful lossless compression technology, ImageGear Medical makes it easier to share medical images between providers without damaging the integrity of the original data.

Expand EHR Capabilities with ImageGear Medical

Accusoft’s imaging, conversion, and compression technology has been supporting the needs of the healthcare industry for decades. As developers work to expand the capabilities of their EHR applications, our engineers are busy improving the medical SDKs that will provide them with the features they need to stand out in a competitive market.

ImageGear Medical utilizes a combination of efficient code and elegant APIs to deliver the document and image processing tools EHR systems require. For a closer look at this dynamic SDKs capabilities, check out our extensive developer resources today or download a free trial to get started.

View Detail >

Accusoft Corporation Launches the Women In Tech Scholarship

TAMPA, Florida, February 9, 2022 — Accusoft announces the launch of the Women In Tech Scholarship for 2022. The new scholarship program is aimed at female students passionate about pursuing a career in the Science, Technology, Engineering, or Mathematics fields.

The scholarship is open to a female-identifying undergraduate student pursuing a two or four-year degree and studying STEM. Accusoft will award $3,000 in scholarship to the winner. Apply at https://bold.org/scholarships/women-in-tech-scholarship/

The tech field has been exploding recently and offers lucrative job opportunities. Unfortunately, women are still quite underrepresented in all STEM fields, including tech. Despite making up roughly half of the workforce, less than 27% of tech positions are held by women and only 22% of Fortune 500 tech companies have female leaders.

This scholarship aims to support women pursuing STEM so they can find successful careers and create a more diverse workforce.

“Accusoft is a software company with a mission to make the world in which we live and work a better place,” said Jack Berlin, CEO of Accusoft. “We do this by inventing new ways to help organizations become more efficient and innovative, promoting the continuous improvement of our community, and working to minimize our impact on natural resources. We hope that our new scholarship will not only encourage women to go into the tech field but also to give them the resources to complete their degree.”

To apply, students must write an essay detailing why they are interested in STEM and what their career goals are. Apply at https://bold.org/scholarships/women-in-tech-scholarship/

Applications are due May 29th, 2022. The winner will be announced on June 30, 2022.

About Accusoft

Founded in 1991, Accusoft is a software development company specializing in content processing, conversion, and automation solutions. From out-of-the-box and configurable applications to APIs built for developers, Accusoft software enables users to solve their most complex workflow challenges and gain insights from content in any format, on any device. Backed by 40 patents, the company’s flagship products, including OnTask, PrizmDoc™ Viewer, and ImageGear, are designed to improve productivity, provide actionable data, and deliver results that matter. The Accusoft team is dedicated to continuous innovation through customer-centric product development, new version release, and a passion for understanding industry trends that drive consumer demand. Visit us at www.accusoft.com.

View Detail >

How Medical Imaging Annotation Is Transforming Healthcare

Having the right technology in place is essential for healthcare organizations seeking to deliver better patient outcomes. That’s why medical technology developers are working hard to build the next generation of software tools that will help medical professionals to deliver care more effectively.

Annotation features provide a number of benefits in these ongoing efforts. Although typically associated with editing and workplace collaboration, medical annotations also have a very different and very specific role when it comes to diagnostic imaging and patient health records.

Enhancing Healthcare Collaboration with Annotations

One of the most straightforward use cases for medical annotation is communicating important information regarding diagnostic images. As images like MRIs and X-rays are passed back and forth between providers, radiologists, technicians, and clinicians, the ability to add comments and point out important details greatly reduces the chance of confusion or of some critical detail being overlooked.

The challenge in these cases, however, is to annotate images and documents without altering the integrity of the original files. This requires healthcare technology developers to build solutions that can retain an unaltered version of the file even as multiple collaborators view and make comments.

Medical Annotation and Machine Learning

Healthcare solutions are rapidly incorporating sophisticated machine learning tools to analyze large quantities of data and make a quick, accurate diagnosis of conditions. Before these powerful tools can perform that diagnostic work, they need to be properly trained to know what they’re looking for, especially when it comes to very nuanced differences between scanned images and seemingly unrelated details in patient records.

By using annotation tools, medical technology specialists can provide excellent guidance for machine learning development. An MRI scan, for instance, contains so much information that an AI-driven program isn’t going to know what to look for unless the key elements are called out with annotations that indicate certain parts of the image or provide comments about noteworthy aspects.

The DICOM Dilemma

While many software integrations allow developers to incorporate annotation tools for common file formats like PDF and JPEG, the healthcare sector presents a unique challenge in the form of DICOM files. This industry-specific format contains both images and important metadata identifiers that provide information about the image itself and the patient in question. While there are ways to extract images from DICOM files and convert them into a more manageable format, doing so could endanger compliance status or permanently degrade the image quality.

Developers working on healthcare technology solutions need to make sure they can not only deliver annotation tools, but also the ability to add annotations to DICOM files without altering the source file itself.

Mastering Medical Annotation with ImageGear Medical

ImageGear Medical provides a broad range of XML-based annotation features that allows healthcare software developers to implement UI elements for marking up both images and documents. Since this powerful imaging SDK also gives users the ability to create and view DICOM files, it can quickly enhance the functionality of medical applications to enhance collaboration and ensure diagnostic accuracy.

Once integrated into an application with a viewing UI, ImageGear Medical supports several commonly-utilized annotation marks that makes it easy for users to highlight certain aspects of an image, comment on them, and even cover up some elements using filled-in graphical objects. Annotations can also be grouped in layers to make them easier to manage and distinguish from one another.

ImageGear Medical annotation objects for DICOM include:

Text: Adds descriptive text using a variety of fonts, colors, and sizes. Opacity can be adjusted and the text object can appear with or without a border.
Point: Places a coordinate point on the image or document, which can be used to support other annotation marks.
Polyline: A series of connected straight lines formed by dragging and clicking a mouse or pointer.
Curve: Used for creating spline curve marks. Users can select multiple vertices and tensions when creating curves.
Ellipse: A circular outline mark that can be used to indicate important elements of an image or document. When filled, it can also cover up areas of the image.
Polygon: Like the ellipse, it can be filled or unfilled and is typically deployed to cover or highlight some aspect of an image or document. Polygons are especially useful for medical annotation because they can capture more lines and angles than simple rectangles or circles.

In order to maintain the integrity of the original image, ImageGear Medical stores annotations as a separate file that is overlaid upon the image during display. While annotations can be merged, or “burned in” the file, keeping them separate ensures that the original image itself is not altered directly. This is incredibly important when it comes to DICOM files, which often need to be kept on file for baseline comparisons on a future diagnosis.

Enhance Healthcare Flexibility with ImageGear Medical

Annotations and DICOM viewing support are just the beginning of ImageGear Medical’s expansive feature set. It also provides advanced filtering tools for sharpening and smoothing as well as image cleanup functions like despeckling, noise removal, and deskewing. With support for several dozen medical image and document formats, ImageGear Medical can easily convert files into easy-to-manage formats and compress files for efficient storage.

Available for .NET and C/C++ environments, ImageGear Medical can turn your healthcare application into a powerful annotation platform with full support for DICOM files. Start your free trial of this powerful SDK to discover first-hand how it can empower your medical annotation solution.

View Detail >

When loading HTML files in PrizmDoc, will external images be rendered?

Question

If I upload an HTML document to PrizmDoc that has image tags in it, will those images be rendered in the Viewer or will only the text be displayed?

What if the image tags are referencing local images?

Answer

When loading an HTML file, PrizmDoc will render image tags that are sourced from publicly accessible URLs. The relative links may not render though, as it’s likely that the source will not be found.

In PrizmDoc 13.2, a server-side configuration option called:

security.htmlRendering.blockExternalContent

was introduced to control whether or not externally-referenced HTML content will be blocked.

See the release notes for more information.

View Detail >

When loading HTML files in PrizmDoc, will external images be rendered?

Question

If I upload an HTML document to PrizmDoc that has image tags in it, will those images be rendered in the Viewer or will only the text be displayed?

What if the image tags are referencing local images?

Answer

In PrizmDoc 13.2, a server-side configuration option called:

security.htmlRendering.blockExternalContent

was introduced to control whether or not externally-referenced HTML content will be blocked.

See the release notes for more information.

View Detail >

OCR vs ICR: What’s the Difference?

The days of manually transcribing scanned documents into an editable, digital document are thankfully long behind most organizations. Error-prone manual processes have largely given way to automated document and forms processing technology that can turn scanned documents into a more manageable form with a much higher degree of accuracy.

Much of transition was made possible by the proliferation of optical character recognition (OCR) and intelligent character recognition (ICR). While they perform very similar tasks, there are some key differences between them that developers need to keep in mind as they build their document and form processing applications.

How Does Character Recognition Technology Work?

Character recognition technology allows computer software to read and recognize text contained in an image and then convert it into a document that can be searched or edited. Since the process involves something that humans can do quite easily (namely, reading text), it’s easy to assume that this would be a rather trivial task for a computer to accomplish.

In reality, getting a computer program to correctly identify text and convert it into editable format is an incredibly complex challenge complicated by a wide range of variables. The problem is that when a computer examines an image, it doesn’t see people, backgrounds, or text as distinct images, but rather as a pattern of pixels. Character recognition technology helps computers distinguish text by telling them what patterns to look for.

Unfortunately, even this isn’t as straightforward as it sounds. That’s because there are so many different text fonts that depict the same characters in different ways. For example, a computer must be able to recognize that each of the following characters is an “a”:

When humans read text, they have a mental concept of what the letter “a” looks like, but that concept is incredibly flexible and can easily accommodate a broad range of variations. Computers, however, require precision. Programmers must provide them with clear parameters that help them to navigate unexpected variations and identify characters accurately.

Pattern Recognition

The earliest versions of character recognition developed in the 1960s relied on pattern recognition techniques, which scanned images and searched for pixel patterns that matched a backlog of font characters stored in memory. Once those patterns were located, the software could translate the characters into searchable, editable text in a document format. Unfortunately, the patterns had to be an exact pixel match, which severely limited how broadly the technology could be applied.

One of the first specialized fonts developed to facilitate pattern recognition was OCR-A. A simple monospace font (meaning that each character has the same width), OCR-A was used on bank checks to help banks quickly scan them electronically. Although pattern recognition libraries expanded over the years to incorporate common print fonts like Times New Roman and Arial, this still presented serious limitations, especially as the variety of fonts continued to grow. With one popular font finding website indexing more than 775,000 available fonts in 2021, pattern recognition needed to be supplemented by another approach to character recognition.

Feature Detection

Also known as feature extraction, feature detection focuses on the component elements of printed characters rather than looking at the character as a whole. Where pattern recognition tries to match characters to known libraries, this approach looks for very specific features that distinguish one character from another. A character that features two angular lines that come to a point and are crossed by a horizontal line in the middle, for instance, is almost always an “A,” regardless of the font used. Feature detection focuses on these qualities, which allows it to identify a character even the program has never encountered a particular font before. As the printed examples above demonstrate, however, this approach needs to take several ways of rendering the character “A” into consideration when setting parameters.

Most character recognition software tools utilize feature detection because it offers far more flexibility than pattern recognition. This is especially valuable for reading document images with faded ink or some degradation that could prevent an exact pattern match. Feature detection provides enough flexibility for a program to be able to identify characters under less than ideal circumstances, which is important for any application that has to deal with scanned images.

OCR vs ICR: What’s the Difference?

Optical character recognition (OCR) is typically understood to apply to any recognition technology that reads machine printed text. A classic OCR use case would involve reading the image of a printed document, such as a book page, newspaper clipping, or a legal contract, and then translating the characters into a separate file that could be searched and edited with a document viewer or word processor. It’s also incredibly useful for automating forms processing. By zonally applying the OCR engine to form fields, information can be quickly extracted and entered elsewhere, such as a spreadsheet or database.

When it comes to form fields, however, information is frequently entered by hand rather than typed. Reading hand-printed text adds another layer of complexity to character recognition. The range of more than 700,000 printed font types is insignificant compared to the near infinite variations in hand-printed characters. Not only must the recognition software account for stylistic variations, but also the type of writing implement used, the quality of the paper, mistakes, steadiness of hand, and smudges or running ink.

Intelligent character recognition (ICR) utilizes constantly updating algorithms to gather more data about variations in hand-printed characters to identify them more accurately. Developed in the early 1990s to help automate forms processing, ICR makes it possible to translate manually entered information into text that can be easily read, searched, and edited. It is most effective when used to read characters that are clearly separated into individual areas or zones, such as fixed fields used on many structured forms.

Both OCR and ICR can be set up to read multiple languages, although limiting the range of expected characters to fewer languages will result in more optimal recognition results. Critically, ICR does not read cursive handwriting because it must still be able to evaluate each individual character. With cursive handwriting, it’s not always clear where one character ends and another begins, and the individual variations from one sample to another are even greater than with hand-printed text. Intelligent word recognition (IWR) is a newer technology that focuses on reading an entire word in context rather than identifying individual characters.

To learn more about how OCR vs ICR technology and how they can transform your application when it comes to managing documents and automated forms processing, download our whitepaper on the topic today.

View Detail >

Featured Content

Video: Exploring the Future of AI for Improved Document Management for your ECM

Featured Content

eGuide: Digital Transformation

SDK Technologies

Accusoft SDKs

Featured Content

Improve Form Processing Text Recognition Results with Regular Expressions

Experience Docubee

Featured Content

Meet Docubee

Technical FAQs

What Is a Watermark?

Why Should You Watermark Documents?

Classify Documents More Easily

Track Your Documents

Discourage Unauthorized Distribution

How to Watermark Documents

Text Watermarks

Diagonal Text Watermarks

Image Watermarks

Start Watermarking Documents Today with PrizmDoc Viewer

The State of Healthcare Security

Robust Document Viewing with PrizmDoc Cloud API

Reliable PII Redaction

Regulated Image Compression

Rapid File Conversion

Relevant Watermarking

What is a Scalable Vector Graphic (SVG) and how does it work?

The Benefits of SVG

SVG in PrizmDoc Viewer

Humble Beginnings and Early Success

Evolution and Expansion

Integration and Innovation

30 Years of Growth and Innovation

Accusoft and Snowbound Join Forces

The Future

Managing Medical Documents

Document Conversion

Other Essential Document Features

Managing DICOM Files

Expand EHR Capabilities with ImageGear Medical

Enhancing Healthcare Collaboration with Annotations

Medical Annotation and Machine Learning

The DICOM Dilemma

Mastering Medical Annotation with ImageGear Medical

Enhance Healthcare Flexibility with ImageGear Medical

How Does Character Recognition Technology Work?

Pattern Recognition

Feature Detection

OCR vs ICR: What’s the Difference?