<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Initial batch load fails, Restarting at OCR role captures in Product Forum</title>
    <link>https://community.blueprism.com/t5/Product-Forum/Initial-batch-load-fails-Restarting-at-OCR-role-captures/m-p/113224#M50691</link>
    <description>&lt;P&gt;&lt;a href="https://community.blueprism.com/t5/user/viewprofilepage/user-id/15488"&gt;@Ben.Lyons1&lt;/a&gt;&amp;nbsp;I am using version 2.3. Here is the screenshot from the about section for the full version.&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Decipher version.png" style="width: 597px;"&gt;&lt;img src="https://community.blueprism.com/t5/image/serverpage/image-id/39174iCC1BCB6D4D97A7A5/image-size/large/is-moderation-mode/true?v=v2&amp;amp;px=999" role="button" title="Decipher version.png" alt="Decipher version.png" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
    <pubDate>Tue, 20 Aug 2024 12:13:02 GMT</pubDate>
    <dc:creator>douglas.h.burke</dc:creator>
    <dc:date>2024-08-20T12:13:02Z</dc:date>
    <item>
      <title>Initial batch load fails, Restarting at OCR role captures</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Initial-batch-load-fails-Restarting-at-OCR-role-captures/m-p/113194#M50682</link>
      <description>&lt;P&gt;I have been troubleshooting getting certain fields to read. I determined today that the fields fail on the initial batch load. If I return the batch, restart it at the OCR step in batch admin, and get back to data verification (no issue with class verification), the fields read as expected.&lt;/P&gt;&lt;P&gt;I found the logs on the machine running the client services, but nothing stands out as an issue.&lt;/P&gt;</description>
      <pubDate>Mon, 19 Aug 2024 15:59:33 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Initial-batch-load-fails-Restarting-at-OCR-role-captures/m-p/113194#M50682</guid>
      <dc:creator>douglas.h.burke</dc:creator>
      <dc:date>2024-08-19T15:59:33Z</dc:date>
    </item>
    <item>
      <title>Re: Initial batch load fails, Restarting at OCR role captures</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Initial-batch-load-fails-Restarting-at-OCR-role-captures/m-p/113209#M50685</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.blueprism.com/t5/user/viewprofilepage/user-id/15473"&gt;@douglas.h.burke&lt;/a&gt;,&lt;BR /&gt;It sounds like you have a PDF that includes both vector data and non-vector data. That means some of the text is embedded as metadata and can be read without using OCR; often this text can be selected when opening the document in Adobe Reader.&lt;/P&gt;&lt;P&gt;But then some of the data is 'flattened' and can't be selected.&lt;/P&gt;&lt;P&gt;Decipher can extract the vector data without using OCR, and in some situations it will skip the OCR stage, which is why you could read the data after returning the batch to the OCR stage.&lt;/P&gt;&lt;P&gt;This was more common in older versions, but an update in Decipher 2.2 introduced automatic functionality that extracts the vector data and checks for additional data using OCR, providing a blend of both in data verification (see feature detail below).&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="BenLyons1_0-1724138892082.png" style="width: 400px;"&gt;&lt;img src="https://community.blueprism.com/t5/image/serverpage/image-id/39173iCAE0BE1DFA4FFD0C/image-size/medium/is-moderation-mode/true?v=v2&amp;amp;px=400" role="button" title="BenLyons1_0-1724138892082.png" alt="BenLyons1_0-1724138892082.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;What version of Decipher are you currently using?&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;</description>
      <pubDate>Tue, 20 Aug 2024 07:28:51 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Initial-batch-load-fails-Restarting-at-OCR-role-captures/m-p/113209#M50685</guid>
      <dc:creator>Ben.Lyons1</dc:creator>
      <dc:date>2024-08-20T07:28:51Z</dc:date>
    </item>
    <item>
      <title>Re: Initial batch load fails, Restarting at OCR role captures</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Initial-batch-load-fails-Restarting-at-OCR-role-captures/m-p/113224#M50691</link>
      <description>&lt;P&gt;&lt;a href="https://community.blueprism.com/t5/user/viewprofilepage/user-id/15488"&gt;@Ben.Lyons1&lt;/a&gt;&amp;nbsp;I am using version 2.3. Here is the screenshot from the about section for the full version.&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Decipher version.png" style="width: 597px;"&gt;&lt;img src="https://community.blueprism.com/t5/image/serverpage/image-id/39174iCC1BCB6D4D97A7A5/image-size/large/is-moderation-mode/true?v=v2&amp;amp;px=999" role="button" title="Decipher version.png" alt="Decipher version.png" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 20 Aug 2024 12:13:02 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Initial-batch-load-fails-Restarting-at-OCR-role-captures/m-p/113224#M50691</guid>
      <dc:creator>douglas.h.burke</dc:creator>
      <dc:date>2024-08-20T12:13:02Z</dc:date>
    </item>
    <item>
      <title>Re: Initial batch load fails, Restarting at OCR role captures</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Initial-batch-load-fails-Restarting-at-OCR-role-captures/m-p/113232#M50695</link>
      <description>&lt;P&gt;That's strange, 2.3 should get the vector and OCR data.&lt;/P&gt;&lt;P&gt;Is the document a PDF? Are you able to select the text when opened in Adobe Reader?&lt;/P&gt;&lt;P&gt;What happens if you convert it to a JPEG and upload it to Decipher?&lt;/P&gt;&lt;P&gt;What are the languages/regions set to in the Document Type and Batch Type?&lt;/P&gt;</description>
      <pubDate>Tue, 20 Aug 2024 13:51:20 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Initial-batch-load-fails-Restarting-at-OCR-role-captures/m-p/113232#M50695</guid>
      <dc:creator>Ben.Lyons1</dc:creator>
      <dc:date>2024-08-20T13:51:20Z</dc:date>
    </item>
    <item>
      <title>Re: Initial batch load fails, Restarting at OCR role captures</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Initial-batch-load-fails-Restarting-at-OCR-role-captures/m-p/113247#M50698</link>
      <description>&lt;P&gt;Yes, it is a PDF and the field text is selectable when opening in Adobe Reader.&lt;/P&gt;&lt;P&gt;Converting the file to JPEG and uploading it gave the same result on the initial load, and reprocessing at the OCR step was worse in that it still did not read the text.&lt;/P&gt;&lt;P&gt;The Batch Type and Document Type have only English as the primary language and no secondary language.&lt;/P&gt;</description>
      <pubDate>Tue, 20 Aug 2024 16:27:44 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Initial-batch-load-fails-Restarting-at-OCR-role-captures/m-p/113247#M50698</guid>
      <dc:creator>douglas.h.burke</dc:creator>
      <dc:date>2024-08-20T16:27:44Z</dc:date>
    </item>
    <item>
      <title>Re: Initial batch load fails, Restarting at OCR role captures</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Initial-batch-load-fails-Restarting-at-OCR-role-captures/m-p/113264#M50702</link>
      <description>&lt;P&gt;That doesn't sound like Decipher is performing properly. I would recommend raising a support ticket so we can investigate.&lt;/P&gt;</description>
      <pubDate>Wed, 21 Aug 2024 07:01:40 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Initial-batch-load-fails-Restarting-at-OCR-role-captures/m-p/113264#M50702</guid>
      <dc:creator>Ben.Lyons1</dc:creator>
      <dc:date>2024-08-21T07:01:40Z</dc:date>
    </item>
    <item>
      <title>Re: Initial batch load fails, Restarting at OCR role captures</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Initial-batch-load-fails-Restarting-at-OCR-role-captures/m-p/116804#M52281</link>
      <description>&lt;P&gt;G'Afternoon&amp;nbsp;&lt;a href="https://community.blueprism.com/t5/user/viewprofilepage/user-id/15473"&gt;@douglas.h.burke&lt;/a&gt;&amp;nbsp;&lt;span class="lia-unicode-emoji" title=":grinning_face:"&gt;😀&lt;/span&gt;&amp;nbsp;Did you end up opening a ticket, and has this been resolved for you yet? I have a somewhat similar issue with a PDF document. We are using Decipher IDP version 2.3.2 (&lt;A href="https://docs.blueprism.com/en-US/bundle/decipher-idp-2-3/page/release-notes/rn-home.htm" target="_blank"&gt;Release notes&lt;/A&gt;). My ticket was opened Oct 16th. I would love to hear if/how you were able to remedy the issue you described here.&lt;/P&gt;&lt;P&gt;Thanks in advance,&lt;/P&gt;&lt;P&gt;JD&lt;/P&gt;</description>
      <pubDate>Thu, 07 Nov 2024 18:32:25 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Initial-batch-load-fails-Restarting-at-OCR-role-captures/m-p/116804#M52281</guid>
      <dc:creator>JD_CPU</dc:creator>
      <dc:date>2024-11-07T18:32:25Z</dc:date>
    </item>
    <item>
      <title>Re: Initial batch load fails, Restarting at OCR role captures</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Initial-batch-load-fails-Restarting-at-OCR-role-captures/m-p/116827#M52296</link>
      <description>&lt;P&gt;Yes, I did open a ticket, but it was really the update from 2.3 to 2.3.2 that resolved my issue. After the update there was still some additional configuration needed in the document form definition (DFD) to refine the performance and get all fields to identify consistently. We were lucky that the documents we are processing are structured PDFs, so we turned on "Strict Position" in the Misc parameter for most fields in the DFD.&lt;/P&gt;&lt;P&gt;We also learned that anything beyond a single line should be defined as multi-line. This might be obvious to others, but I am new to OCR and DFD creation, so it was not obvious to me. Our PDFs were digitally signed, so while this is not a traditional editable multi-line input/paragraph field, the details of the digital signature are rendered on the document across multiple lines. My recommendation: when defining regions in the data verification step, if you scale a region larger than a single line at the font size, set the field as multi-line in the DFD.&lt;/P&gt;&lt;P&gt;Also, if you have processed multiple samples, the model itself could be bad. After our update, we created DFDs with only the fields we had issues with, which narrowed down what we had to troubleshoot. Once we had a better understanding of how the fields needed to be set in the DFD, we created a new DFD so we could train the model with all the correct field settings.&lt;/P&gt;</description>
      <pubDate>Fri, 08 Nov 2024 15:29:52 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Initial-batch-load-fails-Restarting-at-OCR-role-captures/m-p/116827#M52296</guid>
      <dc:creator>douglas.h.burke</dc:creator>
      <dc:date>2024-11-08T15:29:52Z</dc:date>
    </item>
  </channel>
</rss>

