21-11-25 12:41 AM
Hi everyone
We are investigating if there is a better way to extract XML files from PDF forms than how we are doing it currently.
We have a large number of PDF forms that arrive that our bots process and currently we open the file in Adobe Acrobat Pro and go and save the XML to a local drive from inside Acrobat, then load that XML file back into BP before we start to work on the data and whatever we need to do with it.
What we would like is a product that will do that function for us - ideally it would take an input of the PDF file and present a collection of the XML - but even if it quickly dropped an XML file to the drive it would be better than nothing.
Ideally free (because otherwise I need to pay for 60 copies and that is heaps of paperwork but if it works I don't think cost is a problem).
Is the XML that Adobe allows you to save stored in the PDF, or is it created when Acrobat reads the file?
Thanks in advance
Ian
21-11-25 05:47 AM
If you have Acrobat licence, you can export XML using API without open PDF file.
https://developer.adobe.com/document-services/apis/pdf-services/
23-11-25 09:35 PM
Hmm...looked good but it doesn't support Dynamic XFA Format which our forms have. 😞