PDF interaction

ALEKHJAIN
Level 2
Hi everyone, I have to automate a process in which I search a 400-page PDF for a few specific keywords. For every match of any keyword, the output should be:
> Keyword
> Complete sentence containing the keyword
> Associated page number
> Nearest header at the top of the section

The PDF template looks like this:
Header 1 (in bold)
Line 1
Line 2
...
Line 3
Header 2 (in bold)
...
and so on.

Please advise how I can achieve this, or how I can search for text between the headers without hardcoding.
4 REPLIES

Denis__Dennehy
Level 15
Have you seen the guide in the learning area of the Portal called 'Interfacing with PDF Documents'?

Anonymous
Not applicable
Hi Alekh, Denis, how do we find the nearest header at the top of the section?
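
The thread does not answer this directly, so here is one possible approach, sketched in Python rather than as a Blue Prism process. It assumes the PDF text has already been extracted line by line, and that the extraction step can tell you which lines are bold (the `Line`, `Match`, and `find_matches` names are hypothetical, not part of any Blue Prism or PDF library API). The idea is to walk the lines in document order, remember the most recent bold line as the current header, and report that header with every keyword hit. Sentence splitting here is deliberately naive and does not join sentences that wrap across lines.

```python
import re
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Line:
    page: int   # 1-based page number the line was extracted from
    text: str   # text of the line
    bold: bool  # assumption: the extractor reports whether the line is bold


@dataclass
class Match:
    keyword: str
    sentence: str
    page: int
    header: Optional[str]  # None if the match appears before any header


def find_matches(lines: List[Line], keywords: List[str]) -> List[Match]:
    """Return one Match per (sentence, keyword) hit, tagged with the nearest header above."""
    # Whole-word, case-insensitive pattern for each keyword.
    patterns = {
        kw: re.compile(r"\b" + re.escape(kw) + r"\b", re.IGNORECASE)
        for kw in keywords
    }
    results: List[Match] = []
    current_header: Optional[str] = None

    for line in lines:
        if line.bold:
            # A bold line starts a new section; remember it and move on.
            current_header = line.text.strip()
            continue
        # Naive sentence split: break after ., ! or ? followed by whitespace.
        for sentence in re.split(r"(?<=[.!?])\s+", line.text.strip()):
            for kw, pattern in patterns.items():
                if pattern.search(sentence):
                    results.append(Match(kw, sentence, line.page, current_header))
    return results


if __name__ == "__main__":
    sample = [
        Line(1, "Header 1", True),
        Line(1, "The invoice total is due. Pay by Friday.", False),
        Line(2, "Header 2", True),
        Line(3, "No refund applies here.", False),
    ]
    for match in find_matches(sample, ["invoice", "refund"]):
        print(match)
```

Because the header is tracked as state while scanning, nothing about the header names is hardcoded; only the "bold means header" rule from the template is assumed.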

Hi,

Can anyone please help me find out whether we can extract data from a web-embedded PDF which is readable using BP 6.7?

------------------------------
Zaheed Khan
Deputy Manager
WNS, Asia/Kolkata
------------------------------

What do you mean by "readable using BP 6.7"?

Generally speaking, if you can copy and paste the text you need from the PDF by hand, then Blue Prism should be able to as well. If the PDF is actually an image (and not selectable text), then you'll need to use surface automation techniques and OCR to extract that data.

------------------------------
James Man
Professional Services
Blue Prism
Asia/Hong_Kong
------------------------------