<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Hi Alekh, Denis, in Product Forum</title>
    <link>https://community.blueprism.com/t5/Product-Forum/PDF-interaction/m-p/46225#M2272</link>
    <description>Hi Alekh, Denis,
How do we find the nearest header at the top of the section.??</description>
    <pubDate>Tue, 02 May 2017 16:46:00 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2017-05-02T16:46:00Z</dc:date>
    <item>
      <title>PDF interaction</title>
      <link>https://community.blueprism.com/t5/Product-Forum/PDF-interaction/m-p/46223#M2270</link>
      <description>Hi Everyone,

I have to automate a process where in a PDF of 400 pages I have to specifically find for few keywords and for every match for any of the keyword, the O/P should be : 
&amp;gt;Keyword
&amp;gt;Complete sentence containing keyword
&amp;gt;Associated page number
&amp;gt;Nearest header at top of section.

Template of PDF is like :

Header 1 (in Bold)
Line1
Line2
.
.
.
Line 3

Header 2 (In Bold)
.
.
.
and so on...

Please advise how can I achieve this or search for text in between the headers without hardcoding</description>
      <pubDate>Wed, 08 Mar 2017 15:22:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/PDF-interaction/m-p/46223#M2270</guid>
      <dc:creator>ALEKHJAIN</dc:creator>
      <dc:date>2017-03-08T15:22:00Z</dc:date>
    </item>
    <item>
      <title>Have you seen the guide in</title>
      <link>https://community.blueprism.com/t5/Product-Forum/PDF-interaction/m-p/46224#M2271</link>
      <description>Have you seen the guide in the learning area of the Portal called 'Interfacing with PDF Documents'?</description>
      <pubDate>Wed, 08 Mar 2017 21:48:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/PDF-interaction/m-p/46224#M2271</guid>
      <dc:creator>Denis__Dennehy</dc:creator>
      <dc:date>2017-03-08T21:48:00Z</dc:date>
    </item>
    <item>
      <title>Hi Alekh, Denis,</title>
      <link>https://community.blueprism.com/t5/Product-Forum/PDF-interaction/m-p/46225#M2272</link>
      <description>Hi Alekh, Denis,
How do we find the nearest header at the top of the section.??</description>
      <pubDate>Tue, 02 May 2017 16:46:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/PDF-interaction/m-p/46225#M2272</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2017-05-02T16:46:00Z</dc:date>
    </item>
    <item>
      <title>RE: Have you seen the guide in</title>
      <link>https://community.blueprism.com/t5/Product-Forum/PDF-interaction/m-p/46226#M2273</link>
      <description>Hi,&lt;BR /&gt;&lt;BR /&gt;Can one one please help me if we can extract data from web embedded pdf which is readable using BP 6.7&lt;BR /&gt;&lt;BR /&gt;------------------------------&lt;BR /&gt;Zaheed Khan&lt;BR /&gt;Deputy Manager&lt;BR /&gt;WNS, Asia/Kolkata&lt;BR /&gt;------------------------------&lt;BR /&gt;</description>
      <pubDate>Mon, 12 Oct 2020 07:36:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/PDF-interaction/m-p/46226#M2273</guid>
      <dc:creator>ZaheedulKhan1</dc:creator>
      <dc:date>2020-10-12T07:36:00Z</dc:date>
    </item>
    <item>
      <title>RE: Have you seen the guide in</title>
      <link>https://community.blueprism.com/t5/Product-Forum/PDF-interaction/m-p/46227#M2274</link>
      <description>What do you mean by "readable using BP6.7"?&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;Generally speaking, if you are able to, by hand, copy and paste the text that you need from the PDF, then Blue Prism should be able to as well.&amp;nbsp; If the PDF is actually an image (and not selectable text), then you'll need to use surface automation techniques and OCR in order to extract that data.&lt;BR /&gt;&lt;BR /&gt;------------------------------&lt;BR /&gt;James Man&lt;BR /&gt;Professional Services&lt;BR /&gt;Blue Prism&lt;BR /&gt;Asia/Hong_Kong&lt;BR /&gt;------------------------------&lt;BR /&gt;</description>
      <pubDate>Tue, 13 Oct 2020 01:33:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/PDF-interaction/m-p/46227#M2274</guid>
      <dc:creator>james.man</dc:creator>
      <dc:date>2020-10-13T01:33:00Z</dc:date>
    </item>
  </channel>
</rss>

