Blue Prism University


Discuss and learn more about Blue Prism University and certification.

PDF Extraction

  • 1.  PDF Extraction

    Posted 01-30-2020 00:37
    Hi,

    How can I extract table data from a PDF?


    ------------------------------
    Sidramappa Hunashyal
    SA
    Accenture
    Asia/Kolkata
    ------------------------------


  • 2.  RE: PDF Extraction

    Posted 02-02-2020 22:20
    This assumes you are already familiar with Blue Prism objects and processes, have automated ordinary desktop applications, and have completed the Surface Automation training.

    There are two ways to do it:

    1- If the PDF is structured and machine readable (typed rather than scanned, so that you can select the text with the mouse), read it as follows:
    Create your object from the PDF reader exe --> When launching, pass the PDF file path in the command-line params input --> Send the global send keys CTRL+A, then CTRL+C --> Use the Environment utility to get the text from the clipboard --> Use a regex to find the values you need --> Store each value in a cell of a collection

    2- If the PDF is scanned, read it as follows:
    Create your object from the PDF reader exe --> When launching, pass the PDF file path in the command-line params input --> Use Surface Automation to read each cell from the PDF and store it in a collection cell
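
    The regex step of the first method can be sketched outside Blue Prism to show the idea. This is only an illustration in Python, not Blue Prism code: the sample text, the column layout (item, quantity, price separated by two or more spaces) and the names ROW_PATTERN and extract_rows are all assumptions made for the example. In a real process the text would come from the clipboard and the rows would go into a collection.

```python
import re

# Hypothetical example of the text that CTRL+A, CTRL+C might put on the
# clipboard for a typed (non-scanned) PDF. Real layouts will differ.
clipboard_text = """Invoice No: INV-1001
Item        Qty   Price
Widget A    2     10.50
Widget B    10    3.25
Total: 53.50
"""

# Assumed row layout: item name, two or more spaces, an integer quantity,
# whitespace, then a price with two decimals. Header and total lines do not
# match this pattern and are skipped.
ROW_PATTERN = re.compile(
    r"^(?P<item>\S.*?)[ \t]{2,}(?P<qty>\d+)[ \t]+(?P<price>\d+\.\d{2})[ \t]*$",
    re.MULTILINE,
)


def extract_rows(text):
    """Return one dict per table row, similar to rows in a collection."""
    # Clipboard text on Windows usually uses CRLF line endings.
    text = text.replace("\r\n", "\n")
    rows = []
    for match in ROW_PATTERN.finditer(text):
        rows.append(
            {
                "Item": match.group("item"),
                "Qty": int(match.group("qty")),
                "Price": float(match.group("price")),
            }
        )
    return rows


if __name__ == "__main__":
    for row in extract_rows(clipboard_text):
        print(row)
```

    The pattern has to be adapted to each document, because copied PDF text does not always keep table columns aligned on one line.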

    Hope it helps.

    ------------------------------
    Vivek Goel
    RPA Architect
    Asia/Singapore
    "If you like this post, please press the 'Recommend' button."
    ------------------------------



  • 3.  RE: PDF Extraction

    Posted 02-03-2020 05:59

    Hi Sidramappa,

    Please review the 'Interfacing with PDF Documents' training guide, available free of charge on Blue Prism University. It explains in detail the two methods described by Vivek, along with a third option.

    Surface Automation training is also available on the University.

    Finally, some of our technology partners offer PDF data extraction that can be used in a Blue Prism process through Blue Prism Skills. Visit the Blue Prism Digital Exchange for further information on this option.

    https://digitalexchange.blueprism.com/dx/




    ------------------------------
    Carl Carter
    Developer Program Manager
    Blue Prism
    Europe/London
    ------------------------------



  • 4.  RE: PDF Extraction

    Posted 07-14-2020 12:52
    Hi Carl,

    Can you please share the link to the document "Interacting with PDF"? I could not find it on the Digital Exchange.


    ------------------------------
    Jay Prajapati
    ------------------------------



  • 5.  RE: PDF Extraction

    Posted 07-14-2020 14:04
    Hi Jay,

    Once you are in the Blue Prism University, follow the steps below to locate the information about interfacing with PDFs.

    • Click 'Online Courses'
    • Using the 'Search within results...' box, search for 'interfacing with pdf documents'
    • Select your preferred language.

    Hope this helps.

    ------------------------------
    Carl Carter
    Developer Program Manager
    Blue Prism
    Europe/London
    ------------------------------