ZUGFeRD invoices as XML extraction

Proxy_96 — Thu, 12 Jun 2025 09:41:59 GMT

Hi everyone,

I’d like to create an object in Blue Prism to work with ZUGFeRD invoices.
A ZUGFeRD invoice is essentially a regular PDF invoice that also contains an embedded XML file representing the same invoice data.

I’m not sure which extension or approach to use in order to:

Identify whether a given PDF is a ZUGFeRD invoice (I was hoping to use iTextSharp.dll for that, but not sure if that’s feasible),
Extract the embedded XML file from the PDF.

I already have working code that can process the extracted XML — so the most important step for me is to reliably extract the XML from the PDF, no matter the format it’s in.

Has anyone done this before, or does anyone know of a library, extension, or method that could help with this?

Thanks in advance, and have a great day!

topic ZUGFeRD invoices as XML extraction in Digital Exchange

ZUGFeRD invoices as XML extraction