I totally agree with
@Denis__Dennehy Even I wasted a lot of time in one of my projects using the Surface Automation option with Tesseract OCR or using some kind of regular expression once you copy all the contents of a PDF file onto Clipboard or some data item.
In past we have used the iText Sharp DLL VBO's in order to get PDF based tabular data in case they are digital for sure (Again there are some license restriction if I am correct as it is a GNU based license).
We have also used ABBYY Flexicapture 12 Distributed and it works wonderfully in case you have scanned or digital PDF's depending on the scan quality and how you create the Document Definition and Layouts around the tool. (You will need the ABBYY tool knowledge for sure)
I also came across an interesting VBO few weeks back on a sample digital invoice and tested the same, the results were great and it's pretty easy to use. You can find the VBO on Digital Exchange at the following link:
PDF to Excel Converter------------------------------
----------------------------------
Hope it helps you and if it resolves you query please mark it as the best answer so that others having the same problem can track the answer easily
Regards,
Devneet Mohanty
Intelligent Automation Consultant
Blueprism 6x Certified Professional
Website:
https://devneet.github.io/Email: devneetmohanty07@gmail.com
----------------------------------
------------------------------
---------------------------------------------------------------------------------------------------------------------------------------
Hope this helps you out and if so, please mark the current thread as the 'Answer', so others can refer to the same for reference in future.
Regards,
Devneet Mohanty,
SS&C Blueprism Community MVP 2024,
Automation Architect,
Wonderbotz India Pvt. Ltd.