11-10-24 06:59 PM
Good Afternoon Everyone,
https://docs.blueprism.com/bundle/decipher-idp-2-3/page/user-guide/configuration.htm#Timeout
States that:
OCR per page timeout is "The timeout in seconds to OCR a page."
Data capture per page timeout is "The timeout in seconds for Data Capture processing a single page."
What affect do these two timeout values actually have on the results provided by rules-based training?
We have a situation where a rules-based.td file from Decipher IDP 2.2 yields similar but not as accurate results when used with the September release of Decipher IDP 2.3.2
I understand that rules-based training captures data from the following elements, referred to as 'hints', defined in the DFD:
Keywords
Data types
Lists
Regex
Formula
Location (after user feedback)
But what actually happens DURING the process of capturing data when Decipher exceeds the 'Data capture per page timeout' period?
Does Decipher simply display the best results found prior to the timeout rather than the best results possible if provided additional time with a larger timeout period? Similarly, would increasing RAM improve the results provided if/when Decipher ever exceeds the Data capture per page timeout period? Is there a way for an Administrator to tell if/when a submitted batch exceeds a 'Data capture per page timeout' or an 'OCR per page timeout'?
Much Appreciated,
JD
Answered! Go to Answer.
14-10-24 02:54 PM
Hi JD,
The timeouts are there to protect your machine from getting stuck on a page forever, due to either a system error or a corrupted page. Theoretically these limits should never be hit, assuming your machine has adequate resource (CPU and RAM).
I don't believe any results would be provided from the respective client if the timeout is hit.
Every update to Decipher improves the Capture engine to improve data extraction, so I would expect to see an improvement in performance between those 2 versions. If this is not your experience, please raise a support ticket with as much detail as you can provide so that we can investigate.
Thanks
Tuesday
14-10-24 02:54 PM
Hi JD,
The timeouts are there to protect your machine from getting stuck on a page forever, due to either a system error or a corrupted page. Theoretically these limits should never be hit, assuming your machine has adequate resource (CPU and RAM).
I don't believe any results would be provided from the respective client if the timeout is hit.
Every update to Decipher improves the Capture engine to improve data extraction, so I would expect to see an improvement in performance between those 2 versions. If this is not your experience, please raise a support ticket with as much detail as you can provide so that we can investigate.
Thanks
Tuesday