Extract of text from PDF
Options
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content
29-04-22 03:07 PM
Hello,
Do you have any good recommendation for the extract of text from a PDF file?
I have been looking at iText and Adobe API, however both are not available for free (for iText a commercial license is required and Adobe has only a free trial version for 6 months).
Looking forward to your replies 🙂
Martyna
------------------------------
Martyna Pokojska
------------------------------
Martyna Pokojska
Arla Foods
Solution Architect
4 REPLIES 4
Options
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content
29-04-22 08:08 PM
Hi @Martyna Pokojska,
Did you try fetching the data by creating a small process which will just launch PDF, performs Ctrl+A to select the data, Ctrl+C to copy the data and then pasting the data in your desired location e.g. notepad?
Or you are specifically looking for external tools only to read the data?
------------------------------
Manpreet Kaur
Manager
Deloitte
------------------------------
Did you try fetching the data by creating a small process which will just launch PDF, performs Ctrl+A to select the data, Ctrl+C to copy the data and then pasting the data in your desired location e.g. notepad?
Or you are specifically looking for external tools only to read the data?
------------------------------
Manpreet Kaur
Manager
Deloitte
------------------------------
Options
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content
04-05-22 06:31 AM
Hi @Manpreet Kaur,
I would like to avoid opening the file, therefore I'm searching for other options.
Additional problem with this PDF is, that part of it is interactive, so even when I'm opening it and save as txt file -> some values are not transferred to txt
------------------------------
Martyna Pokojska
------------------------------
I would like to avoid opening the file, therefore I'm searching for other options.
Additional problem with this PDF is, that part of it is interactive, so even when I'm opening it and save as txt file -> some values are not transferred to txt
------------------------------
Martyna Pokojska
------------------------------
Martyna Pokojska
Arla Foods
Solution Architect
Options
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content
04-05-22 11:14 AM
Hi Martyna,
You can use the following DX Exchange asset which will convert your PDF file to Excel using in-built Office operations hence no licensing issue should happen as long as you have a valid Microsoft Office software suite with you: Function for DX InDev PDF to Excel Converter
Once you have the data written on the excel file, you can use the 'Utility - MS Excel VBO' for extracting the content to a text file if required.
------------------------------
----------------------------------
Hope it helps you out and if my solution resolves your query, then please mark it as the 'Best Answer' so that the others members in the community having similar problem statement can track the answer easily in future
Regards,
Devneet Mohanty
Intelligent Process Automation Consultant | Sr. Consultant - Automation Developer,
Wonderbotz India Pvt. Ltd.
Blue Prism Community MVP | Blue Prism 7x Certified Professional
Website: https://devneet.github.io/
Email: devneetmohanty07@gmail.com
----------------------------------
------------------------------
You can use the following DX Exchange asset which will convert your PDF file to Excel using in-built Office operations hence no licensing issue should happen as long as you have a valid Microsoft Office software suite with you: Function for DX InDev PDF to Excel Converter
Once you have the data written on the excel file, you can use the 'Utility - MS Excel VBO' for extracting the content to a text file if required.
------------------------------
----------------------------------
Hope it helps you out and if my solution resolves your query, then please mark it as the 'Best Answer' so that the others members in the community having similar problem statement can track the answer easily in future
Regards,
Devneet Mohanty
Intelligent Process Automation Consultant | Sr. Consultant - Automation Developer,
Wonderbotz India Pvt. Ltd.
Blue Prism Community MVP | Blue Prism 7x Certified Professional
Website: https://devneet.github.io/
Email: devneetmohanty07@gmail.com
----------------------------------
------------------------------
---------------------------------------------------------------------------------------------------------------------------------------
Hope this helps you out and if so, please mark the current thread as the 'Answer', so others can refer to the same for reference in future.
Regards,
Devneet Mohanty,
SS&C Blueprism Community MVP 2024,
Automation Architect,
Wonderbotz India Pvt. Ltd.
Hope this helps you out and if so, please mark the current thread as the 'Answer', so others can refer to the same for reference in future.
Regards,
Devneet Mohanty,
SS&C Blueprism Community MVP 2024,
Automation Architect,
Wonderbotz India Pvt. Ltd.
Options
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content
04-05-22 03:24 PM
Hi @Martyna Pokojska,
We can achieve this functionality using python code.
Write python code and call it from Process Studio. It will convert all data into Text file. Then we can read that text file.
Python code is easily available on Internet.
------------------------------
Jatin Kalra
Manager
Genpact
Noida UP
------------------------------
We can achieve this functionality using python code.
Write python code and call it from Process Studio. It will convert all data into Text file. Then we can read that text file.
Python code is easily available on Internet.
------------------------------
Jatin Kalra
Manager
Genpact
Noida UP
------------------------------
Jatin Kalra
Manager
Genpact
Noida UP
[Phone]
