
PDF extraction with checkbox field

Amruthasimplify
Level 5
Hi All,

Can you help me find a way to extract data from a PDF without relying on external applications? My organization requires that we use only Blue Prism approved objects and native tools.
I have tried Global Send Keys, but it does not work well for capturing data from the PDF, which includes text boxes, multi-line fields, and checkboxes. The field positions may also be rearranged in the future, so extraction based on field position is not suitable. The PDF can have more than 3 pages.
Is there an alternative method that captures the data, including checkbox values, more efficiently?

A sample of the fields is shown below.

20323.png

Thanks in advance.

------------------------------
Amrutha Sivarajan
------------------------------
12 REPLIES

Hi Amrutha,

Please find the DLL download links below:

-> https://www.dllme.com/dll/files/itextsharp_dll.html

-> https://www.dllme.com/dll/files/bouncycastle_crypto_dll.html

Please find below an example using C# code.

PDF form:

20319.png

We get the data into a text data item and then use the split action mentioned in the previous step.

PDF data extracted:

20320.png
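
Since the screenshots do not carry over as text, here is a minimal sketch of what such code could look like. It is not the exact code from the screenshots. It assumes iTextSharp 5.x (`itextsharp.dll`) and that the PDF is a fillable AcroForm; the class and method names (`PdfFormDump`, `ReadFormFields`, `ReadAllText`) are hypothetical. Reading fields by name rather than by position should keep working if the fields are rearranged later, and checkbox states come back as values too.

```csharp
// Sketch only: assumes iTextSharp 5.x is referenced. Names below are illustrative.
using System;
using System.Collections.Generic;
using System.Text;                    // StringBuilder
using iTextSharp.text.pdf;            // PdfReader, AcroFields
using iTextSharp.text.pdf.parser;     // PdfTextExtractor

public static class PdfFormDump
{
    // Returns field name -> value for every AcroForm field in the PDF.
    public static Dictionary<string, string> ReadFormFields(string pdfPath)
    {
        var result = new Dictionary<string, string>();
        PdfReader reader = new PdfReader(pdfPath);
        try
        {
            AcroFields form = reader.AcroFields;
            foreach (string name in form.Fields.Keys)
            {
                string value = form.GetField(name);
                if (form.GetFieldType(name) == AcroFields.FIELD_TYPE_CHECKBOX)
                {
                    // A checkbox stores an appearance state name;
                    // "Off" (or an empty value) means it is not ticked.
                    bool isChecked = !string.IsNullOrEmpty(value) && value != "Off";
                    value = isChecked ? "True" : "False";
                }
                result[name] = value;
            }
        }
        finally
        {
            reader.Close();
        }
        return result;
    }

    // Fallback for PDFs without form fields: plain text of all pages,
    // which can then be split in the process.
    public static string ReadAllText(string pdfPath)
    {
        var text = new StringBuilder();
        PdfReader reader = new PdfReader(pdfPath);
        try
        {
            for (int page = 1; page <= reader.NumberOfPages; page++)
            {
                text.AppendLine(PdfTextExtractor.GetTextFromPage(reader, page));
            }
        }
        finally
        {
            reader.Close();
        }
        return text.ToString();
    }
}
```

In a Blue Prism code stage the `using` lines are not typed into the code itself. As far as I know, the DLL goes under External References and the namespaces (`System.Text`, `iTextSharp.text.pdf`, `iTextSharp.text.pdf.parser`) under Namespace Imports on the object's Code Options tab, with the language set to C#. If the PDF is flattened or scanned, `ReadFormFields` will return nothing and only the text fallback applies.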

Please let me know if you have any other issues.



------------------------------
KirtiMaan Talwar
Consultant
Deloitte
------------------------------
KirtiMaan Talwar
IA Consultant
Deloitte USI

Hi Amrutha,

Once you have fetched the data, you can use an Excel macro to read the exact value.



------------------------------
Sahil Chankotra
------------------------------

Hello KirtiMaan,

I am receiving the following error:

"StringBuilder" and "PdfReader" could not be found (are you missing a using directive or an assembly reference?)

Is this related to the iTextSharp version? There are many versions on the site you suggested.

Ndauti_0-1716727359586.png

Thank you

Tim