Join us for an engaging webinar, as we unravel the potential of AI for revolutionizing document management.
Watch Now
Enable your employees to remain productive throughout the document management process.
Read More
Learn how SmartZone uses a regular expression engine integrated into the recognition engine to achieve the best possible accuracy on data that can be defined by a regular expression.
Docubee is an intelligent contract automation platform built to help your team success
What quality should my images be for processing form data and recognition using FormSuite?
In all cases, you want to have your images as clear and as clean as possible. For any particular procedure, please consider the following:
OCR and ICR: Capture images in at least 300 DPI resolution. Ideally, working in black and white allows the objects of interest on your image to be better defined and recognized. Free the image form all noise as much as possible. As if a human were reading it, you want the text objects on the image to be as legible as possible. For ICR, ensure that the characters are printed (no cursive text, etc).
Barcode recognition: As with OCR and ICR, capture images in at least 300 DPI and working with black and white content can provide excellent results. Ensure that the bars in the barcodes are clearly defined on the image and are not malformed (for example, the barcodes should have the proper start and stop sequence, etc). Clear as much noise from the image as possible.
Forms matching and registration: As with the prior 2 items above, capture your documents in at least 300 DPI. Ensure that your resolution is consistent between your form templates and incoming batch images. Form templates should only contain data that is common to every image that is being processed (i.e. Form fields, the text that appears on the blank form itself, etc). The template should not have filled-in field information as this will affect the forms matching process.
How can I improve the performance and memory usage of scanning/recognition in Barcode Xpress?
Barcode Xpress supports a number of optimization settings that can improve your recognition performance, sometimes up to 40%, along with memory usage. The best way to optimize Barcode Xpress is to fine-tune the properties of the Reader class to be specific to your application’s requirements.
Reader
BarcodeTypes
MaximumBarcodes
Area & Orientation
ScanDistance
Finally, BarcodeXpress Professional edition does not impose a 40 page-per-minute limit on processing.
How do I use a Network Drive path for Image and ART storage in my ImageGear .NET web application?
In an ImageGear .NET web application, you have to define the location of the images and annotations directory in the storageRootPath and artStorageRootPath configuration property. In the current version of ImageGear .NET, the storageRootPath and artStorageRootPath do not work with a network drive path \\SERVER-NAME\sharefilename.
storageRootPath
artStorageRootPath
\\SERVER-NAME\sharefilename
The workaround for this would be to create a Symbolic link from a local directory to the network drive directory.
> mklink /d "local path" \\SERVER-NAME\sharefilename
web.config: storageRootPath="local path" artStorageRootPath="local path"
In PrizmDoc, why do I fail to load/convert Excel documents with the error “Exception from HRESULT: 0x800AC472”?
The error message Exception from HRESULT: 0x800AC472 is usually associated with a failure involving an Excel document, found in the MsOfficeConverter.log. Below are some known triggers of it:
Exception from HRESULT: 0x800AC472
MsOfficeConverter.log
If the user is logged in as “SYSTEM”, “LocalSystem”, or any other non-user-account variant, this will cause PrizmDoc to fail when using MSO services. This is expected behavior when working with Microsoft Office documents in PrizmDoc. Please see step 6 of the Windows Installation documentation regarding this:
http://help.accusoft.com/PrizmDoc/latest/HTML/webframe.html#windows-installation.html
“Specify the login account (account name and password) that PrizmDoc Server will run under. If you are using the Microsoft Office (MSO) Conversion add-on, please make sure that the “login account” is a real user account with Administrator rights. Running PrizmDoc under the LocalSystem user or another Microsoft Windows integrated service account is not supported for this option.”
It’s also crucial that the copy of Microsoft Office on the system has been activated. A not-licensed, not-activated, expired, or trial license will all cause Microsoft Office to not work with PrizmDoc.
More information: https://help.accusoft.com/PrizmDoc/latest/HTML/windows-requirements.html
“The installed copy of Microsoft Office must be activated in order for PrizmDoc’s Microsoft Office Conversion Service to work properly. Not licensed, not activated, an expired or trial version of Microsoft Office will not work with PrizmDoc.”
Your default printer must be the Microsoft XPS Document Writer when working with Excel documents in PrizmDoc. Specifying another printer could possibly lead to this exception.
More information: http://help.accusoft.com/PrizmDoc/latest/HTML/natively-render-mso-documents.html
“The Microsoft Office Conversion Service requires the Microsoft XPS Document Writer printer driver to be installed for the best conversion performance and rendering fidelity of MS Excel documents”
Ensure the Print Spooler service is started and the Microsoft XPS Document Writer is the default printer.
There is a known issue with version 13.3 of PrizmDoc where completely blank Excel files are not loadable in the Viewer. They will fail to load and throw the aforementioned HRESULT exception. This has been fixed in PrizmDoc version 13.6.
HRESULT
In short, please set up the PrizmDoc service correctly to run with a real user account, ensure the copy of Microsoft Office has been activated, and make sure the default printer is set to “Microsoft XPS Document Writer”, then restart the service. This should fix this particular issue in most cases.
For more reading on considerations that Microsoft recommends when running their client-side MSO applications on the server, see this article:
Considerations for server-side Automation of Office
If I have a PDF document that only has an embedded image in it (no text objects, etc.), can PrizmDoc Viewer take it and create a searchable PDF file from it?
Yes. PrizmDoc’s Content Conversion Services can take an image-only PDF and create a searchable PDF file from it. This can be done by modifying the input.dest.pdfOptions.ocr options object; see our documentation here.
input.dest.pdfOptions.ocr
If you are attempting to make a searchable PDF from an existing PDF document, please note that the source PDF file should be an image-only PDF. PrizmDoc will not create a searchable file from already-existing vector content.
This feature was introduced in PrizmDoc 13.1, please see our Release Notes for more information.
Are there any specific applications or folders which should be excluded in order for PrizmDoc to work on a system with antivirus software?
As a suggestion, the first place to start would be your antivirus logs to ensure there are no entries specific to the following services:
conhost.exe
node.exe
policyhost.exe
PrizmApplicationServices.exe
wmiprvse.exe
If there are, you can exclude those files or the following folders where PrizmDoc is installed by default. If you modified the folder name for PrizmDoc, ensure to include that name.
C:\Prizm
C:\ProgramData\Accusoft
Please consult your antivirus documentation for instructions on how to add exclusions.
With a fully-running PrizmDoc environment (version prior to v10.5), I’m noticing times where the system becomes unresponsive. That is, at some point, it no longer processes requests being made to the backend and I have to restart PrizmDoc in order to alleviate it. What could be causing this?
Prior to version 10.5, PrizmDoc used ProxyServer as its image processing system. Our Development Team eventually began seeing that the ProxyServer exhibited architectural problems that lead to a multitude of issues, such as unresponsiveness.
Starting in version 10.5, the Development Team created the Prizm Content Connect Imaging Service (PCCIS), which is far more stable.
To resolve this issue we suggest upgrading to the latest revision of PrizmDoc. Please see the product page for more information found here: https://www.accusoft.com/products/prizmdoc/overview/
My document has Asian characters (CJK, etc.), which are not displaying correctly in PrizmDoc Viewer; what steps can I take to view them?
In some cases, the reason is due to the fonts not being installed on the operating system. We have outlined some commands to install fonts for select operating systems below:
In CentOS 6 do:
yum groupinstall "Chinese Support" yum groupinstall "Japanese Support" yum groupinstall "Korean Support" yum groupinstall "Kannada Support" yum groupinstall "Hindi Support"
In CentOS 7 do:
yum groupinstall "fonts"
In Ubuntu do:
sudo apt-get install language-pack-ja sudo apt-get install japan* sudo apt-get install language-pack-zh* sudo apt-get install chinese* sudo apt-get install language-pack-ko sudo apt-get install korean* sudo apt-get install fonts-arphic-ukai fonts-arphic-uming fonts-ipafont-mincho fonts-ipafont-gothic fonts-unfonts-core
I have a PDF of a form that I’m sending to PrizmDoc to have it auto-detect, but PrizmDoc does not find any fields in the document. What would cause this?
Currently only PDF files with embedded AcroForms will be auto-detected. If the PDF document has an embedded image of a form, PrizmDoc will not find any results from auto-detection.
How do I store and retrieve documents in subdirectories of the configured documents directory in PrizmDoc PAS?
You can retrieve documents from subdirectories when dealing with local files. Simply pass the subfolder in the fileName parameter when creating the viewing session. You can test this by manually placing a document inside a subfolder and making the following POST request:
fileName
POST
`http://localhost:3000/ViewingSession` { "source": { "type": "document" "fileName": "folder/document.pdf" } }
How do I ensure temp files are deleted when closing ImageGear .NET?
All PDF objects are based on underlying low-level PDF objects that are not controlled by .NET resource manager and garbage collector. Because of this, each PDF object that is created from scratch should be explicitly disposed of using that object’s Dispose() method.
Also, any ImGearPDEContent object obtained from ImGearPDFPage should be released using the ImGearPDFPage.ReleaseContent() in all cases.
This should cause all temp files to be cleared when the application is closed.
What does it mean when I see “Email Address is not Registered” when entering in an email in the Evaluation Dialog?
You will see this error if you have not registered on the Accusoft website.
To register your email address, please visit the following link below:
https://my.accusoft.com/Account/FirstTimeUser?Length=7