<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic ZUGFeRD invoices as XML extraction in Digital Exchange</title>
    <link>https://community.blueprism.com/t5/Digital-Exchange/ZUGFeRD-invoices-as-XML-extraction/m-p/120993#M4403</link>
    <description>&lt;P&gt;Hi everyone,&lt;/P&gt;&lt;P&gt;I’d like to create an object in Blue Prism to work with ZUGFeRD invoices.&lt;BR /&gt;A ZUGFeRD invoice is essentially a regular PDF invoice that also contains an embedded XML file representing the same invoice data.&lt;/P&gt;&lt;P&gt;I’m not sure which extension or approach to use in order to:&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;&lt;P&gt;Identify whether a given PDF is a ZUGFeRD invoice (I was hoping to use iTextSharp.dll for that, but not sure if that’s feasible),&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;Extract the embedded XML file from the PDF.&lt;/P&gt;&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;I already have working code that can process the extracted XML — so the most important step for me is to reliably extract the XML from the PDF, no matter the format it’s in.&lt;/P&gt;&lt;P&gt;Has anyone done this before, or does anyone know of a library, extension, or method that could help with this?&lt;/P&gt;&lt;P&gt;Thanks in advance, and have a great day!&lt;/P&gt;</description>
    <pubDate>Thu, 12 Jun 2025 09:41:59 GMT</pubDate>
    <dc:creator>Proxy_96</dc:creator>
    <dc:date>2025-06-12T09:41:59Z</dc:date>
    <item>
      <title>ZUGFeRD invoices as XML extraction</title>
      <link>https://community.blueprism.com/t5/Digital-Exchange/ZUGFeRD-invoices-as-XML-extraction/m-p/120993#M4403</link>
      <description>&lt;P&gt;Hi everyone,&lt;/P&gt;&lt;P&gt;I’d like to create an object in Blue Prism to work with ZUGFeRD invoices.&lt;BR /&gt;A ZUGFeRD invoice is essentially a regular PDF invoice that also contains an embedded XML file representing the same invoice data.&lt;/P&gt;&lt;P&gt;I’m not sure which extension or approach to use in order to:&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;&lt;P&gt;Identify whether a given PDF is a ZUGFeRD invoice (I was hoping to use iTextSharp.dll for that, but not sure if that’s feasible),&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;Extract the embedded XML file from the PDF.&lt;/P&gt;&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;I already have working code that can process the extracted XML — so the most important step for me is to reliably extract the XML from the PDF, no matter the format it’s in.&lt;/P&gt;&lt;P&gt;Has anyone done this before, or does anyone know of a library, extension, or method that could help with this?&lt;/P&gt;&lt;P&gt;Thanks in advance, and have a great day!&lt;/P&gt;</description>
      <pubDate>Thu, 12 Jun 2025 09:41:59 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Digital-Exchange/ZUGFeRD-invoices-as-XML-extraction/m-p/120993#M4403</guid>
      <dc:creator>Proxy_96</dc:creator>
      <dc:date>2025-06-12T09:41:59Z</dc:date>
    </item>
  </channel>
</rss>

