15-02-24 06:23 PM
A new component for extracting text from PDF's has been posted to the Digital Exchange. This VBO wraps the open source PdfPig library. The component is limited to extracting all text, text from specific page numbers, and getting the count of pages within the PDF at the moment, but we will be adding additional functionality to it (ex. extracting images) as time permits.
You can find the new connector at the following link:
https://digitalexchange.blueprism.com/dx/entry/3439/solution/pdfpig
PS - Don't forget to grab the support files from the asset page too. There are multiple DLLs required to leverage the PdfPig library.
Cheers,
16-02-24 02:11 PM
Thanks for sharing this asset. @ewilson
19-02-24 06:47 AM
Thanks for sharing. Open source-based PDF assets are something much needed right now in the market. This will be a value addition.
19-02-24 07:14 AM
I think guides are not accessible yet,
20-02-24 11:39 PM
Can you try it again? It's working for me. If you continue to have the issue you might try clearing your browser cache.
Cheers,