PDF Extraction

SidramappaHunas
Level 4
Hi,

How can I extract table data from a PDF?


------------------------------
Sidramappa Hunashyal
SA
Accenture
Asia/Kolkata
------------------------------
4 REPLIES

VivekGoel
Level 10
This assumes you are already familiar with Blue Prism (objects, processes, and automating ordinary desktop applications) and have completed the Surface Automation training.

There are two ways to do it:

1- If your PDF is structured and machine readable (typed rather than scanned, so you can select the text with the mouse), you can read it as follows:
Use the PDF reader exe to create your object --> When launching, provide the PDF file path in the command-line parameters input --> Send global send keys "CTRL+A, CTRL+C" --> Use the environment utility to get the text from the clipboard --> Use a regex to find the values you need --> Store each value in a cell of a collection

2- If your PDF is scanned, you can read it as follows:
Use the PDF reader exe to create your object --> When launching, provide the PDF file path in the command-line parameters input --> Use Surface Automation to read each cell from the PDF and store it in the collection cells
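For the first approach, it can help to prototype the regex step outside Blue Prism before building it into the object. The Python sketch below is only an illustration under stated assumptions: the sample text, the column layout (item, quantity, price separated by runs of spaces) and the names `SAMPLE_TEXT`, `ROW_PATTERN` and `extract_rows` are all hypothetical, and real clipboard text from a PDF reader may be laid out quite differently.

```python
import re

# Hypothetical text, standing in for what CTRL+A, CTRL+C might put on the
# clipboard. The layout (columns separated by 2+ spaces) is an assumption.
SAMPLE_TEXT = """Invoice Summary
Item        Qty   Unit Price
Widget A    2     10.50
Widget B    10    3.25
Total             53.50
"""

# One table row: item name, 2+ spaces/tabs, integer quantity,
# 1+ spaces/tabs, price with two decimals. Header and total lines do not match.
ROW_PATTERN = re.compile(
    r"^(?P<item>.+?)[ \t]{2,}(?P<qty>\d+)[ \t]+(?P<price>\d+\.\d{2})$"
)


def extract_rows(text):
    """Return one dict per matching line, similar to rows in a collection."""
    rows = []
    for line in text.splitlines():  # splitlines also copes with \r\n endings
        match = ROW_PATTERN.match(line.rstrip())
        if match:
            rows.append(
                {
                    "item": match.group("item").strip(),
                    "qty": int(match.group("qty")),
                    "price": float(match.group("price")),
                }
            )
    return rows


if __name__ == "__main__":
    for row in extract_rows(SAMPLE_TEXT):
        print(row)
```

Matching line by line keeps the pattern from spilling across rows, and lines that do not fit the row shape (headings, totals) are simply skipped. Once a pattern works on a few real samples, the same expression can be tried in whatever regex action your object uses, though regex dialects differ slightly and the pattern may need adjusting.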

Hope it helps.

------------------------------
Vivek Goel
RPA Architect
Asia/Singapore
If you like this post, please press the "Recommend" button.
------------------------------

Carl__Carter
Staff

Hi Sidramappa,

Please review the 'Interfacing with PDF Documents' training guide, available for free on Blue Prism University. It explains in detail the two methods Vivek described, along with a third option.

Surface Automation training is also available on the University.

Finally, some of our technology partners offer PDF data extraction, which is implemented in a Blue Prism process using Blue Prism Skills. Visit the Blue Prism Digital Exchange for further information on this option.

https://digitalexchange.blueprism.com/dx/




------------------------------
Carl Carter
Developer Program Manager
Blue Prism
Europe/London
------------------------------

Hi Carl,

Can you please share the link to the document "Interacting with PDF"? I could not find it on the Digital Exchange.


------------------------------
Jay Prajapati
------------------------------

Hi Jay,

Once you are in the Blue Prism University, follow the steps below to locate the information about interfacing with PDFs.

  • Click 'Online Courses'
  • Using the 'Search within results...' box, search for 'interfacing with pdf documents'
  • Select your preferred language.

Hope this helps.

------------------------------
Carl Carter
Developer Program Manager
Blue Prism
Europe/London
------------------------------