29-03-23 01:07 PM
I am using Blue Prism's Read with OCR function to extract the NRIC number from a PDF form that is system generated. Although it performs as expected in most situations, it sometimes misinterprets certain characters, such as reading "S" as "$", "I" as ")", and "O" as "0".
Despite experimenting with different combinations of page segmentation, character whitelisting, and scaling, the issue with misinterpretation of characters persists. It would be greatly appreciated if someone could suggest alternative approaches to tackle this problem
System Generated PDF Section.
Configuration as below.