Decipher

 View Only
last person joined: 12 hours ago 

A community for Blue Prism Decipher users.

  • 1.  Decipher Not learning based on Previous Corrections

    Posted 11-11-2023 07:57

    Hi @Ben Lyons,

    I have a set of 11 images with same structure, for the first time i assigned the regions for each field during manual verification but when the second time i passed the same 11 images still its wrongly selecting the regions, its not learning from what i corrected previously.

    1.

    For The Company Name its considering only first line, in the previous batch i already assigned the region to entire text box but still its considering only 1st line. I have selected the flag Multiline as well.

    Same issue with Company Profile Field also, its not extracting the data after First line and if we assign the manually also the text is not correct like a is extracted as @, some new characters are coming which are not present in the image also.

    2.

    The PE Ratio field is mapping to Sedol it should map to correct once as i mentioned the exact keyword in DFD.

    The characters like O,I are being read as Zero(0).One(1) by decipher for Outstanding shares field

    For the Chairman field also i assigned the correct region but when we send a new batch its mapping old one.

    3.

    Even if we mention the correct keywords in DFD its mapping to other fields.

    I tried this with same documents 5-6 times still same issue is happening, not recognizing the values i deleted the entire training data and started fresh but still no luck.

    What needs to be done do get the exact values the image quality is also good.
    I am i missing anything during training? As i know for training we no need to enable any setting in decipher it will automatically learn if it is a same template.



    ------------------------------
    If I was of assistance, please vote for it to be the "Best Answer".

    Thanks & Regards,
    Salman Shaik
    ------------------------------


  • 2.  RE: Decipher Not learning based on Previous Corrections

    Posted 11-13-2023 08:14

    Hi Salman,

    Have you tried training Decipher without specifying any keywords? This can produce better results where some keywords are resulting in incorrect data being auto-selected. Where some fields are blank this is likely to avoid Decipher looking for any other text in the area.

    It looks quite structured, have you considered trying the misc parameter FormFields = On?

    Thanks



    ------------------------------
    Ben Lyons
    Senior Product Specialist - Decipher
    SS&C Blue Prism
    UK based
    ------------------------------



  • 3.  RE: Decipher Not learning based on Previous Corrections

    Posted 11-13-2023 09:48

    Hi Ben,

    Initially i tried without specifying any key words only but its not worked so i added keywords still its not worked for me. 

    In my image the Fields position might shift randomly its not fixed so i have not used the FormFields parameter.

    Do you want me to use misc paramter in my case?

    Even i selected the flag as multiline and assigned the proper region its not detecting the multiline for the second time onwards.



    ------------------------------
    If I was of assistance, please vote for it to be the "Best Answer".

    Thanks & Regards,
    Salman Shaik
    ------------------------------



  • 4.  RE: Decipher Not learning based on Previous Corrections

    Posted 11-13-2023 13:34

    Hi Salman,

    Tricky, in this case I wouldn't recommend Form Fields.

    Are you deleting your training data each time you try a new DFD configuration?

    Tesseract OCR is also known to find it harder to read text on a coloured background, maybe try adjusting the image contrast or converting it to greyscale.

    If you'd like some "hands on" help, you may be eligible for some time with our professional services team via a Knowledge Support session. Please check with your SS&C Blue Prism account manager for details.



    ------------------------------
    Ben Lyons
    Senior Product Specialist - Decipher
    SS&C Blue Prism
    UK based
    ------------------------------



  • 5.  RE: Decipher Not learning based on Previous Corrections

    Posted 11-13-2023 14:03

    Hi Ben,

    No, I am not deleting the training data whenever i am changing the DFD Configuration.

    Decipher is not capable enough to read the text on a colored background?

    If i have to adjust the image contrast or converting it to greyscale can i do this setting in decipher for onetime? or i need to do it manually?

    Due to this colored background only the values are extracting wrong? like letter I is extracted as 1, letter 'O' is extracted as 0 (Zero), letter a is extracted as @, multiline text is detected only as a single line?



    ------------------------------
    If I was of assistance, please vote for it to be the "Best Answer".

    Thanks & Regards,
    Salman Shaik
    ------------------------------



  • 6.  RE: Decipher Not learning based on Previous Corrections

    Posted 11-13-2023 14:16

    Hi Salman,

    Check out the best practice guidance on how to configure/train your DFD Decipher IDP best practices (blueprism.com), this will help get you up and running as quickly as possible.

    It's not Decipher so much as it's the Tesseract OCR engine. Decipher uses Tesseract 5 as its primary OCR engine as its the most comprehensive, freely available OCR engine on the market. Our engineering team carry out a number of activities to refine the performance, but we are still working within the available features of the product.

    Image adjustments would be manual, but it might give you an idea on why your results aren't 100%.

    The multiline flag is more to create the space in the validation screen (there are other impacts in how the data is processed e.g. how line breaks are stored), you can train multiline fields without that flag being selected. It may be negatively influenced by previous rounds of training where you've made changes to your DFD without restarting the training.

    Thanks



    ------------------------------
    Ben Lyons
    Senior Product Specialist - Decipher
    SS&C Blue Prism
    UK based
    ------------------------------



Welcome to the Blue Prism Decipher Community

Blue Prism Decipher IDP is Intelligent Document Processing for Blue Prism RPA. This is a community for Decipher users to get support and discuss the implementation and usage of product.

This community has been set up so that we continue to work closely with our Decipher users, both new and established. As our members join, we will be using this community to share information, and also hope this will allow all members to discuss ideas and issues around Decipher.

Decipher IDP Resources

Download DecipherProduct Help

FAQs

Decipher IDP is a Blue Prism product for Intelligent Document Processing. It allows organizations to easily extract, analyse and understand data from a range of documents whether these are PDFs, email attachments, or even scanned paperwork.
Decipher IDP is currently available without additional cost to Blue Prism customers with Production or Business-Critical support.
Please contact your Blue Prism Account Manager or Customer Success Director for information on accessing Decipher. They will be able to request a license key which is required to be able to install Decipher IDP.
Yes, a trial version is available for Blue Prism customers. Please contact your Blue Prism Account Manager or Customer Success Director for information on accessing Decipher. We are planning to offer a trial version available for download from our DX later this year.
All documentation for Decipher IDP is available online on BP DOCS.
An extended FAQ can be found here.
Decipher IDP is currently available for Blue Prism Enterprise customer for on-premise installation.