cancel
Showing results for 
Search instead for 
Did you mean: 

PDF Automation

ssuray
Level 2

What are the available options for PDF automation in Blue Prism? Currently, there is no built-in VBO for reading digital PDFs, unlike other RPA tools. The existing methods include using keystrokes to copy content, Region mode, iTextSharp DLLs, Boost robotics , many of which come with additional licensing costs , Trying explore if there is any other way ?

3 REPLIES 3

Hi, what is the exact action that you want to perform on PDF? You are right there are different VBOs for different actions and while some of them work fine some require custom code stages.

ssuray
Level 2

Hello Tejas ,

I am considering creating custom code for PDF actions, particularly in Blue Prism, to use in PDF automation. Currently, I have a few usegae in my mind

  • Reading Body:
    • This involves extracting the main content or text from the entire PDF document.
  • Reading Text from Specific Pages:
    • This task focuses on extracting text from specific pages within the PDF. In Boost Robotics, you need to pass the coordinates (X, Y) along with the height and width to define the area from which to extract text. same thing I wanted to replicate.
  • Extracting Specific Pages from Complete File:
    • This involves selecting and extracting certain pages from the entire PDF document. The custom code would identify the pages to be extracted and create a new PDF file containing only those pages. This is useful for splitting documents or isolating relevant sections.

Hello @ssuray ,

You can get PDF Management in DX.

https://digitalexchange.blueprism.com/cardDetails?id=116962

Reading Body: "Extract All Text" action

Reading Text from Specific Pages: "Extract All Text For Page" action

Extracting Specific Pages from Complete File: "Split File" action

------------------------------
Mitsuko
Asia/Tokyo
------------------------------