<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic RE: Extract of text from PDF in Product Forum</title>
    <link>https://community.blueprism.com/t5/Product-Forum/Extract-of-text-from-PDF/m-p/92713#M42439</link>
    <description>Hi @Martyna&amp;nbsp;Pokojska,&lt;BR /&gt;&lt;BR /&gt;We can achieve this functionality using python code.&lt;BR /&gt;Write python code and call it from Process Studio. It will convert all data into Text file. Then we can read that text file.&lt;BR /&gt;Python code is easily available on Internet.​&lt;BR /&gt;&lt;BR /&gt;------------------------------&lt;BR /&gt;Jatin Kalra&lt;BR /&gt;Manager&lt;BR /&gt;Genpact&lt;BR /&gt;Noida UP&lt;BR /&gt;------------------------------&lt;BR /&gt;</description>
    <pubDate>Wed, 04 May 2022 14:24:00 GMT</pubDate>
    <dc:creator>Jatin__Kalra</dc:creator>
    <dc:date>2022-05-04T14:24:00Z</dc:date>
    <item>
      <title>Extract of text from PDF</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Extract-of-text-from-PDF/m-p/92709#M42435</link>
      <description>&lt;P&gt;Hello,&lt;BR /&gt;&lt;BR /&gt;Do you have any good recommendation for the extract of text from a PDF file?&amp;nbsp;&lt;BR /&gt;I have been looking at iText and Adobe API, however both are not available for free (for iText a commercial license is required and Adobe has only a free trial version for 6 months).&amp;nbsp;&lt;BR /&gt;Looking forward to your replies &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;Martyna&lt;/P&gt;
&lt;BR /&gt;&lt;BR /&gt;------------------------------&lt;BR /&gt;Martyna Pokojska&lt;BR /&gt;------------------------------</description>
      <pubDate>Fri, 29 Apr 2022 14:07:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Extract-of-text-from-PDF/m-p/92709#M42435</guid>
      <dc:creator>pokojskamartyna</dc:creator>
      <dc:date>2022-04-29T14:07:00Z</dc:date>
    </item>
    <item>
      <title>RE: Extract of text from PDF</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Extract-of-text-from-PDF/m-p/92710#M42436</link>
      <description>Hi &lt;A class="user-content-mention" data-sign="@" data-contactkey="f52a292b-4821-4c38-aba1-43c678365f6a" data-tag-text="@Martyna Pokojska" href="https://community.blueprism.com/network/profile?UserKey=f52a292b-4821-4c38-aba1-43c678365f6a" data-itemmentionkey="b3d2e184-f970-4657-9ea8-90a5be4746a6"&gt;@Martyna Pokojska&lt;/A&gt;,&lt;BR /&gt;&lt;BR /&gt;Did you try fetching the data by creating a small process which will just launch PDF, performs Ctrl+A to select the data, Ctrl+C to copy the data and then pasting the data in your desired location e.g. notepad?&lt;BR /&gt;Or you are specifically looking for external tools only to read the data?​&lt;BR /&gt;&lt;BR /&gt;------------------------------&lt;BR /&gt;Manpreet Kaur&lt;BR /&gt;Manager&lt;BR /&gt;Deloitte&lt;BR /&gt;------------------------------&lt;BR /&gt;</description>
      <pubDate>Fri, 29 Apr 2022 19:08:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Extract-of-text-from-PDF/m-p/92710#M42436</guid>
      <dc:creator>ManpreetKaur1</dc:creator>
      <dc:date>2022-04-29T19:08:00Z</dc:date>
    </item>
    <item>
      <title>RE: Extract of text from PDF</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Extract-of-text-from-PDF/m-p/92711#M42437</link>
      <description>Hi @&lt;SPAN&gt;Manpreet Kaur,&lt;BR /&gt;&lt;BR /&gt;I would like to avoid opening the file, therefore I'm searching for other options.&lt;BR /&gt;Additional problem with this PDF is, that part of it is interactive, so even when I'm opening it and save as txt file -&amp;gt; some values are not transferred to txt&lt;BR /&gt;&lt;/SPAN&gt;​&lt;BR /&gt;&lt;BR /&gt;------------------------------&lt;BR /&gt;Martyna Pokojska&lt;BR /&gt;------------------------------&lt;BR /&gt;</description>
      <pubDate>Wed, 04 May 2022 05:31:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Extract-of-text-from-PDF/m-p/92711#M42437</guid>
      <dc:creator>pokojskamartyna</dc:creator>
      <dc:date>2022-05-04T05:31:00Z</dc:date>
    </item>
    <item>
      <title>RE: Extract of text from PDF</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Extract-of-text-from-PDF/m-p/92712#M42438</link>
      <description>Hi Martyna,&lt;BR /&gt;&lt;BR /&gt;You can use the following DX Exchange asset which will co​nvert your PDF file to Excel using in-built Office operations hence no licensing issue should happen as long as you have a valid Microsoft Office software suite with you​​: &lt;A href="https://digitalexchange.blueprism.com/dx/entry/122031/solution/pdf-to-excel-converter" target="_blank" rel="noopener"&gt;Function for DX InDev PDF to Excel Converter&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;Once you have the data written on the excel file, you can use the 'Utility - MS Excel VBO' for extracting the content to a text file if required.&lt;BR /&gt;&lt;BR /&gt;------------------------------&lt;BR /&gt;----------------------------------&lt;BR /&gt;Hope it helps you out and if my solution resolves your query, then please mark it as the 'Best Answer' so that the others members in the community having similar problem statement can track the answer easily in future&lt;BR /&gt;&lt;BR /&gt;Regards,&lt;BR /&gt;Devneet Mohanty&lt;BR /&gt;Intelligent Process Automation Consultant | Sr. Consultant - Automation Developer,&lt;BR /&gt;Wonderbotz India Pvt. Ltd.&lt;BR /&gt;Blue Prism Community MVP | Blue Prism 7x Certified Professional&lt;BR /&gt;Website: &lt;A href="https://devneet.github.io/" target="test_blank"&gt;https://devneet.github.io/&lt;/A&gt;&lt;BR /&gt;Email: devneetmohanty07@gmail.com&lt;BR /&gt;&lt;BR /&gt;----------------------------------&lt;BR /&gt;------------------------------&lt;BR /&gt;</description>
      <pubDate>Wed, 04 May 2022 10:14:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Extract-of-text-from-PDF/m-p/92712#M42438</guid>
      <dc:creator>devneetmohanty07</dc:creator>
      <dc:date>2022-05-04T10:14:00Z</dc:date>
    </item>
    <item>
      <title>RE: Extract of text from PDF</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Extract-of-text-from-PDF/m-p/92713#M42439</link>
      <description>Hi @Martyna&amp;nbsp;Pokojska,&lt;BR /&gt;&lt;BR /&gt;We can achieve this functionality using python code.&lt;BR /&gt;Write python code and call it from Process Studio. It will convert all data into Text file. Then we can read that text file.&lt;BR /&gt;Python code is easily available on Internet.​&lt;BR /&gt;&lt;BR /&gt;------------------------------&lt;BR /&gt;Jatin Kalra&lt;BR /&gt;Manager&lt;BR /&gt;Genpact&lt;BR /&gt;Noida UP&lt;BR /&gt;------------------------------&lt;BR /&gt;</description>
      <pubDate>Wed, 04 May 2022 14:24:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Extract-of-text-from-PDF/m-p/92713#M42439</guid>
      <dc:creator>Jatin__Kalra</dc:creator>
      <dc:date>2022-05-04T14:24:00Z</dc:date>
    </item>
  </channel>
</rss>

