15-02-24 06:23 PM
A new component for extracting text from PDF's has been posted to the Digital Exchange. This VBO wraps the open source PdfPig library. The component is limited to extracting all text, text from specific page numbers, and getting the count of pages within the PDF at the moment, but we will be adding additional functionality to it (ex. extracting images) as time permits.
You can find the new connector at the following link:
https://digitalexchange.blueprism.com/dx/entry/3439/solution/pdfpig
PS - Don't forget to grab the support files from the asset page too. There are multiple DLLs required to leverage the PdfPig library.
Cheers,
16-02-24 02:11 PM
Thanks for sharing this asset. @ewilson
19-02-24 06:47 AM
Thanks for sharing. Open source-based PDF assets are something much needed right now in the market. This will be a value addition.
19-02-24 07:14 AM
I think guides are not accessible yet,
20-02-24 11:39 PM
Can you try it again? It's working for me. If you continue to have the issue you might try clearing your browser cache.
Cheers,
2 weeks ago
Hi, we can't seem to get the object working. We have placed the DLLs in the Blue Prism folder and get the following error when running a merge.
2 weeks ago
Hi team,
While downloading the PdfPig asset, it missing the dll, can you provide?
a week ago
@ewilson hi,
Users cannot find the DLL here : https://digitalexchange.blueprism.com/cardDetails?id=137364#
regards
a week ago
@Mohamad_747 this is a known issue. For the time being, please open a ticket with dxsupport@blueprism.com and we can make the additional collateral available via a side channel.
Chers,
Eric