Hi Everyone,
As I have been working recently on a project where I had to read data from different types of PDF documents.
I would like to ask if there is planned in future to create an Object in BP which will deal with PDF manipulation, or just some update which will enable better manipulation with PDF documents.
For now we have just only two possible options how to read data from PDF:
1. We can use just simple copy data with Global Send Keys
2. Use Surface Automation to read certain regions in PDF
I think that is not enough, there are reasons:
1. Copy Paste (Global Send Keys)
- Data are pasted in different structure, not accordingly from top to bottom like in PDF, so If we have document which has large amount of words, tables, etc it is almost impossible to catch (calculate) all needed data. It needs too much Effort to extract the correct data without hard coding in calculation stages, even if it is possible.
2. Surface Automation
- Surface automation is still not 100% working approach, customers usually try to avoid this solution and it can crash the process very easy.
- Imagine we have many different structured PDFs (different templates of PDF which includes data). To process this data it is needed to capture (make Regions) to each PDF template separately. If we have 2-5 templates, it can be done quite easy but if we have 100 different PDFs ,better option is to do it manually.
Thank you
David