<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic RE: Decipher recognizes extra char in the target field in Product Forum</title>
    <link>https://community.blueprism.com/t5/Product-Forum/Decipher-recognizes-extra-char-in-the-target-field/m-p/66380#M18985</link>
    <description>&lt;P&gt;Hello Ben,&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I had applied similar regex as you have proposed, but still, it was picking the data incorrectly... I have decided to remove the format expression value, and Decipher started to recognize the value without any doubts and extra chars.&lt;/P&gt;
&lt;P&gt;But now it's unclear how it will be performing in PROD. Potentially it might pick up completely random value for that field &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;Thanks for reply!&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;&lt;BR /&gt;&lt;BR /&gt;------------------------------&lt;BR /&gt;Kind regards, &lt;BR /&gt;&lt;BR /&gt;Dmitrij Mamajev&lt;BR /&gt;Senior RPA Developer&lt;BR /&gt;Substorm AB&lt;BR /&gt;Gothenburg - Sweden&lt;BR /&gt;------------------------------&lt;BR /&gt;</description>
    <pubDate>Fri, 17 Mar 2023 12:32:00 GMT</pubDate>
    <dc:creator>dmma</dc:creator>
    <dc:date>2023-03-17T12:32:00Z</dc:date>
    <item>
      <title>Decipher recognizes extra char in the target field</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Decipher-recognizes-extra-char-in-the-target-field/m-p/66376#M18981</link>
      <description>&lt;P&gt;Hello,&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I am trying to train Decipher to extract data from invoices, and on some invoices Decipher is extracting ID number incorrectly, by adding extra 'O' char (or sometimes instead Zero char '0' it extracts data as 'O' letter).&lt;/P&gt;
&lt;P&gt;Any suggestions, who had similar issue, how to resolve it?&lt;/P&gt;
&lt;DIV class="media" style="overflow: hidden; zoom: 1;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="8969.png"&gt;&lt;img src="https://community.blueprism.com/t5/image/serverpage/image-id/9144i19C965D3BDAEBEED/image-size/large?v=v2&amp;amp;px=999" role="button" title="8969.png" alt="8969.png" /&gt;&lt;/span&gt;&lt;/DIV&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Thanks.&lt;/P&gt;&lt;BR /&gt;&lt;BR /&gt;------------------------------&lt;BR /&gt;Kind regards, &lt;BR /&gt;&lt;BR /&gt;Dmitrij Mamajev&lt;BR /&gt;Senior RPA Developer&lt;BR /&gt;Substorm AB&lt;BR /&gt;Gothenburg - Sweden&lt;BR /&gt;------------------------------&lt;BR /&gt;</description>
      <pubDate>Wed, 15 Mar 2023 15:53:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Decipher-recognizes-extra-char-in-the-target-field/m-p/66376#M18981</guid>
      <dc:creator>dmma</dc:creator>
      <dc:date>2023-03-15T15:53:00Z</dc:date>
    </item>
    <item>
      <title>RE: Decipher recognizes extra char in the target field</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Decipher-recognizes-extra-char-in-the-target-field/m-p/66377#M18982</link>
      <description>&lt;DIV&gt;Hello,&lt;/DIV&gt;
&lt;DIV&gt;This is a known issue with the tesseract engine. Its not only 0/O, but also other "similar" characters like 4/A, 8/B, 5/S etc.&lt;BR /&gt;According to github&lt;/DIV&gt;
&lt;DIV&gt;e.g.&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;A href="https://github.com/tesseract-ocr/tesseract/issues/2738" rel="noopener noreferrer" target="_blank"&gt;https://github.com/tesseract-ocr/tesseract/issues/2738&lt;/A&gt;&lt;/DIV&gt;
&lt;DIV&gt;The issue should be fixed in tesseract version 6.&lt;/DIV&gt;
&lt;DIV&gt;Unfortunately, there is no solution to this. I have added an extra validation field so that such cases are detected. Fortunately, the IBAN has a precisely defined number of characters.&lt;/DIV&gt;
&lt;DIV&gt;You can also try to improve the quality of your documents (300 dpi minimum). This will reduce number of duplications.&lt;/DIV&gt;
&lt;DIV&gt;&lt;/DIV&gt;
&lt;DIV&gt;BR&lt;/DIV&gt;
&lt;DIV&gt;Marius&lt;/DIV&gt;&lt;BR /&gt;&lt;BR /&gt;------------------------------&lt;BR /&gt;Marius Erbert&lt;BR /&gt;------------------------------&lt;BR /&gt;</description>
      <pubDate>Thu, 16 Mar 2023 09:26:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Decipher-recognizes-extra-char-in-the-target-field/m-p/66377#M18982</guid>
      <dc:creator>marius-erbert</dc:creator>
      <dc:date>2023-03-16T09:26:00Z</dc:date>
    </item>
    <item>
      <title>RE: Decipher recognizes extra char in the target field</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Decipher-recognizes-extra-char-in-the-target-field/m-p/66378#M18983</link>
      <description>&lt;P&gt;&amp;nbsp;Thanks &lt;a href="https://community.blueprism.com/t5/user/viewprofilepage/user-id/26339"&gt;@MariusErbert&lt;/a&gt;&lt;/P&gt;
&lt;P&gt;I was thinking maybe somebody came up with some formula solution in Decipher how to tackle this issue &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;I was trying to do some formula to replace combination of chars, but I am still not sure how Decipher Formula works. There is not enough of information.&lt;/P&gt;
&lt;P&gt;I thought about docs resolution, but the thing is that docs are sent by ~100 different vendors, so it would be an effort to request each and every vendor to send their invoices in higher resolution &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Thanks anyway!&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;&lt;BR /&gt;&lt;BR /&gt;------------------------------&lt;BR /&gt;Kind regards, &lt;BR /&gt;&lt;BR /&gt;Dmitrij Mamajev&lt;BR /&gt;Senior RPA Developer&lt;BR /&gt;Substorm AB&lt;BR /&gt;Gothenburg - Sweden&lt;BR /&gt;------------------------------&lt;BR /&gt;</description>
      <pubDate>Thu, 16 Mar 2023 12:51:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Decipher-recognizes-extra-char-in-the-target-field/m-p/66378#M18983</guid>
      <dc:creator>dmma</dc:creator>
      <dc:date>2023-03-16T12:51:00Z</dc:date>
    </item>
    <item>
      <title>RE: Decipher recognizes extra char in the target field</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Decipher-recognizes-extra-char-in-the-target-field/m-p/66379#M18984</link>
      <description>&lt;P&gt;Hi Dimitrij,&lt;/P&gt;
&lt;P&gt;Depending on how consistent the field format is, you could use Format Expression. This is not just used to validate but also match data, potentially correcting mis-recognised characters. E.g. [A-Z]{2}[0-9]{9}[A-Z]{1}[0-9]{2} or similar, depending on the actual format variables.&lt;/P&gt;
&lt;P&gt;Formulas have 2 separate functions, validation and calculation, generally these should not be mixed. For validation it would be used on an assigned field that appears in the document, a calculated field should not be assigned to a field in the document.&lt;/P&gt;
&lt;P&gt;Have you watched the &lt;A href="https://bpdocs.blueprism.com/decipher-2-1/en-us/user-guide/formula-language.htm?tocpath=Interface%7CAdmin%20panel%7CDocument%20form%20definitions%7C_____4"&gt;video on formulas&lt;/A&gt; in the online help?&lt;/P&gt;
&lt;P&gt;Thanks&lt;/P&gt;&lt;BR /&gt;&lt;BR /&gt;------------------------------&lt;BR /&gt;Ben Lyons&lt;BR /&gt;Senior Product Specialist - Decipher&lt;BR /&gt;Blue Prism&lt;BR /&gt;UK based&lt;BR /&gt;------------------------------&lt;BR /&gt;</description>
      <pubDate>Thu, 16 Mar 2023 15:58:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Decipher-recognizes-extra-char-in-the-target-field/m-p/66379#M18984</guid>
      <dc:creator>Ben.Lyons1</dc:creator>
      <dc:date>2023-03-16T15:58:00Z</dc:date>
    </item>
    <item>
      <title>RE: Decipher recognizes extra char in the target field</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Decipher-recognizes-extra-char-in-the-target-field/m-p/66380#M18985</link>
      <description>&lt;P&gt;Hello Ben,&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I had applied similar regex as you have proposed, but still, it was picking the data incorrectly... I have decided to remove the format expression value, and Decipher started to recognize the value without any doubts and extra chars.&lt;/P&gt;
&lt;P&gt;But now it's unclear how it will be performing in PROD. Potentially it might pick up completely random value for that field &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;Thanks for reply!&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;&lt;BR /&gt;&lt;BR /&gt;------------------------------&lt;BR /&gt;Kind regards, &lt;BR /&gt;&lt;BR /&gt;Dmitrij Mamajev&lt;BR /&gt;Senior RPA Developer&lt;BR /&gt;Substorm AB&lt;BR /&gt;Gothenburg - Sweden&lt;BR /&gt;------------------------------&lt;BR /&gt;</description>
      <pubDate>Fri, 17 Mar 2023 12:32:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Decipher-recognizes-extra-char-in-the-target-field/m-p/66380#M18985</guid>
      <dc:creator>dmma</dc:creator>
      <dc:date>2023-03-17T12:32:00Z</dc:date>
    </item>
  </channel>
</rss>

