For reading PDF data there are many options, and some of them are free. We've tested many and have been parsing PDFs for a few years, in case you decide to come back to that solution.
For reading data from web sites:
- An API is the best way to interact with a web site.
- If an API is not available:
a) What is your Blue Prism version?
b) Is this an internal web site or a third-party one?
------------------------------
Konstantin Kazantsev
Solutions Architect
Church and Dwight
America/New_York
------------------------------
Original Message:
Sent: 01-24-2023 15:59
From: Brittany Harding
Subject: Text from PDF Document to be split into collection
Right, I can see how that is confusing. The original process was what I was struggling with, and because I was struggling with it, I decided to put it aside and try something else. I just need to be able to copy and paste the UBRN Information: Registered Organization, Address, and Telephone number.
------------------------------
Brittany Harding
Original Message:
Sent: 01-24-2023 14:13
From: Konstantin Kazantsev
Subject: Text from PDF Document to be split into collection
Hi Brittany,
Can you please clarify the process? From the screenshots above, it looks like the object is trying to read the data from a web page and not a PDF, correct?
------------------------------
Konstantin Kazantsev
Original Message:
Sent: 01-24-2023 09:50
From: Brittany Harding
Subject: Text from PDF Document to be split into collection
Thank you for replying. I am still new to BP, so I am not sure those steps would be easy for me to do.
------------------------------
Brittany Harding
Original Message:
Sent: 01-20-2023 22:07
From: Eric Wilson
Subject: Text from PDF Document to be split into collection
Hello @Brittany Harding,
There are a few ways you could handle this.
- I believe there's some basic built-in OCR capability within Blue Prism (not referring to Decipher, but that's another option) that you might be able to leverage here. I haven't tried it myself, but I'm sure someone in the community can comment on it.
- There are various PDF tools available on the DX. Most of those will likely have some sort of cost associated with them. One example is the PDF Services Export asset. It's a wrapper around an Adobe Services REST API which can be used to export PDFs to other formats, including Office document formats. You could then use the standard Blue Prism Office VBOs to work with the data. The catch here is that I believe you need an Adobe subscription unless you're just testing.
- I believe Microsoft Word will actually open a PDF and automatically convert it to a .DOCX, so you might want to give that a try.
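Whichever route gets the PDF content out as plain text, the remaining step is splitting that text into the fields you need. Here is a minimal sketch in Python, only to illustrate the logic; it is not Blue Prism code, and the same idea would have to be rebuilt with Blue Prism's own stages. It assumes each field sits on its own line as `Label: value`, which is a guess about the document layout. The label names, the `extract_fields` helper, and the sample text are all hypothetical.

```python
import re

# Hypothetical labels: the real document may word or order these differently.
LABELS = ["Registered Organisation", "Address", "Telephone"]


def extract_fields(text, labels=LABELS):
    """Return one row per label found, shaped like a two-column collection."""
    # Match "<label> : <value>" at the start of a line, ignoring case.
    pattern = re.compile(
        r"^\s*(" + "|".join(re.escape(label) for label in labels) + r")\s*:\s*(.*)$",
        re.IGNORECASE,
    )
    rows = []
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        match = pattern.match(line)
        if match:
            rows.append({"Field": match.group(1), "Value": match.group(2).strip()})
        elif rows:
            # Assumption: an unlabelled line continues the previous field
            # (for example, a multi-line address).
            previous = rows[-1]["Value"].rstrip(",")
            rows[-1]["Value"] = f"{previous}, {line}" if previous else line
    return rows


if __name__ == "__main__":
    # Made-up sample text standing in for the exported PDF content.
    sample = (
        "UBRN Information\n"
        "Registered Organisation: Example Clinic\n"
        "Address: 1 High Street,\n"
        "Leeds\n"
        "Telephone: 0113 496 0000\n"
    )
    for row in extract_fields(sample):
        print(row["Field"], "=", row["Value"])
```

Each row has a `Field` and a `Value`, which maps naturally onto a two-column collection. Note that any unlabelled text after the last field would be appended to that field, so a real document would likely need a stop condition.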
Cheers,
------------------------------
Eric Wilson
Director, Integrations and Enablement
Blue Prism Digital Exchange
Original Message:
Sent: 01-20-2023 17:56
From: Brittany Harding
Subject: Text from PDF Document to be split into collection
Hi Guys,
I could really use some help.
I need to move data (Address, Telephone Number, and Organisation) from a PDF document to a Collection, to be used to verify information in a separate process.
Do you have any advice on this?
Thank you
------------------------------
Brittany Harding
------------------------------