
PDF Extract Text not working properly

lorenzwagner
Level 4

Hello, I am currently encountering the following problem. I am using the PDF Management VBO to extract text from a PDF. For this I use an action stage and save the text to a data item. When I run the process, the extraction appears to succeed (the data item indicates that there are 200 characters), but whenever I open the data item to see what the VBO found, there is no text in it. What could be the reason?

The PDF is readable and copyable.

 

Thanks in advance

4 REPLIES

Hi @lorenzwagner,

Try saving the file as .txt and using Read text from file. It is not an ideal approach, but if you could give more details about the PDF and the text that is copied, we would be able to narrow it down.

To eliminate any document-specific issues, could you try extracting text from a simple, known-good PDF? If that works, the problem may be related to the format or content of the original PDF.
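A data item that reports 200 characters but looks empty may contain only whitespace or non-printing characters. That is a guess, not something confirmed in this thread. The sketch below is one way to check it outside Blue Prism, assuming the extracted text has been saved or pasted somewhere Python can read it. The function names `summarize_characters` and `escaped_preview` are hypothetical helpers, not part of the PDF Management VBO.

```python
import unicodedata
from collections import Counter


def summarize_characters(text: str) -> dict:
    """Count characters by broad kind to reveal text that only looks empty."""
    kinds = Counter()
    for ch in text:
        if ch.isspace():
            # Spaces, tabs, line breaks and similar separators.
            kinds["whitespace"] += 1
        elif unicodedata.category(ch).startswith("C"):
            # Control and format characters, e.g. NUL or zero-width space.
            kinds["control_or_format"] += 1
        else:
            kinds["visible"] += 1
    return dict(kinds)


def escaped_preview(text: str, limit: int = 80) -> str:
    """Return the first `limit` characters with invisible ones shown as escapes."""
    return text[:limit].encode("unicode_escape").decode("ascii")


if __name__ == "__main__":
    # Hypothetical stand-in for the extracted text: 200 characters, none visible.
    sample = "\r\n" * 100
    print(len(sample), summarize_characters(sample))
    print(escaped_preview(sample, limit=10))
```

If the summary shows no visible characters, the extraction is returning layout characters rather than the document's text, which would point toward the PDF's internal structure rather than the data item itself.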

Brigiana Kopec Senior Product Support Engineer (Bilingual) – Americas

@Brigianakopec Thank you for your answer. I opened the file with Acrobat and everything is perfectly copyable.

lorenzwagner
Level 4

@Brigianakopec It currently looks something like this: [screenshot: Unbenannt.PNG]