<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic RE: Text from PDF Document to be split into collection in Product Forum</title>
    <link>https://community.blueprism.com/t5/Product-Forum/Text-from-PDF-Document-to-be-split-into-collection/m-p/59369#M13034</link>
    <description>Right, I can see how that is confusing. The original process was what I was struggling with, so I decided to put it aside and try something else. I just need to be able to copy and paste the UBRN information: Registered Organization, Address and Telephone number.&amp;nbsp;
&lt;DIV class="media" style="overflow: hidden; zoom: 1;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="26532.png"&gt;&lt;img src="https://community.blueprism.com/t5/image/serverpage/image-id/26664i26A886481280C793/image-size/large?v=v2&amp;amp;px=999" role="button" title="26532.png" alt="26532.png" /&gt;&lt;/span&gt;&lt;/DIV&gt;&lt;BR /&gt;&lt;BR /&gt;------------------------------&lt;BR /&gt;Brittany Harding&lt;BR /&gt;------------------------------&lt;BR /&gt;</description>
    <pubDate>Tue, 24 Jan 2023 16:00:00 GMT</pubDate>
    <dc:creator>BrittanyHarding</dc:creator>
    <dc:date>2023-01-24T16:00:00Z</dc:date>
    <item>
      <title>Text from PDF Document to be split into collection</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Text-from-PDF-Document-to-be-split-into-collection/m-p/59363#M13028</link>
      <description>&lt;SPAN&gt;Hi Guys,&lt;/SPAN&gt;&lt;BR /&gt;I could really use some help.&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;I need to move data (Address, Telephone Number, and Organisation) from a PDF document to a Collection, to be used to verify information in a separate process.&lt;BR /&gt;&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Do you have any advice on this?&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;Thank you&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;------------------------------&lt;BR /&gt;Brittany Harding&lt;BR /&gt;------------------------------&lt;BR /&gt;</description>
      <pubDate>Fri, 20 Jan 2023 17:56:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Text-from-PDF-Document-to-be-split-into-collection/m-p/59363#M13028</guid>
      <dc:creator>BrittanyHarding</dc:creator>
      <dc:date>2023-01-20T17:56:00Z</dc:date>
    </item>
    <item>
      <title>RE: Text from PDF Document to be split into collection</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Text-from-PDF-Document-to-be-split-into-collection/m-p/59364#M13029</link>
      <description>Hello &lt;A class="user-content-mention" data-sign="@" data-contactkey="e1709e63-89c0-4003-b7eb-0184c6ff70dd" data-tag-text="@Brittany Harding" href="https://community.blueprism.com/network/profile?UserKey=e1709e63-89c0-4003-b7eb-0184c6ff70dd" data-itemmentionkey="e42ee742-b268-4c68-acda-1bfa7c4377ce"&gt;@Brittany Harding&lt;/A&gt;,&lt;BR /&gt;&lt;BR /&gt;There are a few ways you could handle this. &lt;BR /&gt;&lt;BR /&gt;
&lt;OL&gt;
&lt;LI&gt;I believe there's some basic built-in OCR capability within Blue Prism (not referring to Decipher, but that's another option) that you might be able to leverage here. I haven't tried it myself, but I'm sure someone in the community can comment on it.&lt;/LI&gt;
&lt;LI&gt;There are various PDF tools available on the DX. Most of those will likely have some sort of cost associated with them. One example is the &lt;A href="https://digitalexchange.blueprism.com/dx/entry/3439/solution/blue-prism---adobe-pdf-services---export" target="_blank" rel="noopener"&gt;PDF Services Export&lt;/A&gt; asset. It's a wrapper around an Adobe Services REST API which can be used to export PDFs to other formats, including Office document formats. You could then use the standard Blue Prism Office VBOs to work with the data. The catch here is that, I believe, you need an Adobe subscription unless you're just testing.&lt;/LI&gt;
&lt;LI&gt;I believe Microsoft Word will actually open a PDF and automatically convert it to a .DOCX, so you might want to give that a try.&lt;/LI&gt;
&lt;/OL&gt;
&lt;BR /&gt;Cheers,&lt;BR /&gt;&lt;BR /&gt;------------------------------&lt;BR /&gt;Eric Wilson&lt;BR /&gt;Director, Integrations and Enablement&lt;BR /&gt;Blue Prism Digital Exchange&lt;BR /&gt;------------------------------&lt;BR /&gt;</description>
      <pubDate>Fri, 20 Jan 2023 22:08:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Text-from-PDF-Document-to-be-split-into-collection/m-p/59364#M13029</guid>
      <dc:creator>ewilson</dc:creator>
      <dc:date>2023-01-20T22:08:00Z</dc:date>
    </item>
    <item>
      <title>RE: Text from PDF Document to be split into collection</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Text-from-PDF-Document-to-be-split-into-collection/m-p/59365#M13030</link>
      <description>Thank you for replying. I am still new to BP, so I am not sure that those steps would be easy for me to do.&amp;nbsp;&lt;BR /&gt;
&lt;DIV class="media" style="overflow: hidden; zoom: 1;"&gt;Within eRS (the electronic referral service), we need to gather the Registered Practice Details, which sit behind a hyperlink that opens a popover window (screenshots 1 and 2). In the object I created, I added a Navigate stage to click on the hyperlink and a Read stage to Get Table Items (screenshot 3).&amp;nbsp;&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;1.
&lt;DIV class="media" style="overflow: hidden; zoom: 1;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="26514.png"&gt;&lt;img src="https://community.blueprism.com/t5/image/serverpage/image-id/26645i87BBC4A6051F7853/image-size/large?v=v2&amp;amp;px=999" role="button" title="26514.png" alt="26514.png" /&gt;&lt;/span&gt;2.
&lt;DIV class="media" style="overflow: hidden; zoom: 1;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="26515.png"&gt;&lt;img src="https://community.blueprism.com/t5/image/serverpage/image-id/26646i94237E0AAB5ED0AF/image-size/large?v=v2&amp;amp;px=999" role="button" title="26515.png" alt="26515.png" /&gt;&lt;/span&gt;&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;BR /&gt;
&lt;DIV class="media" style="overflow: hidden; zoom: 1;"&gt;3&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="26516.png"&gt;&lt;img src="https://community.blueprism.com/t5/image/serverpage/image-id/26649i10CBD7112787E15E/image-size/large?v=v2&amp;amp;px=999" role="button" title="26516.png" alt="26516.png" /&gt;&lt;/span&gt;&lt;BR /&gt;At the bottom you will see the error message that I get when trying to run this process (screenshot 4). I have tried spying in UIA as well, and it does not work either – I believe it could just be our environment, but I could be wrong. I don't know if anyone has experience with this web application, but I'd really appreciate any help you can offer.&lt;BR /&gt;4&lt;span class="lia-inline-image-display-wrapper" image-alt="26517.png"&gt;&lt;img src="https://community.blueprism.com/t5/image/serverpage/image-id/26647iD860811DCC19CBC2/image-size/large?v=v2&amp;amp;px=999" role="button" title="26517.png" alt="26517.png" /&gt;&lt;/span&gt;&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;BR /&gt;&lt;BR /&gt;------------------------------&lt;BR /&gt;Brittany Harding&lt;BR /&gt;------------------------------&lt;BR /&gt;</description>
      <pubDate>Tue, 24 Jan 2023 09:50:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Text-from-PDF-Document-to-be-split-into-collection/m-p/59365#M13030</guid>
      <dc:creator>BrittanyHarding</dc:creator>
      <dc:date>2023-01-24T09:50:00Z</dc:date>
    </item>
    <item>
      <title>RE: Text from PDF Document to be split into collection</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Text-from-PDF-Document-to-be-split-into-collection/m-p/59366#M13031</link>
      <description>Hi Brittany,&lt;BR /&gt;&lt;BR /&gt;May I suggest removing the pictures from points 1 and 2 of your examples above and replacing them with versions in which the customer data is masked? Publication of live customer data is in general frowned upon by the GDPR legislators.&lt;BR /&gt;&lt;BR /&gt;------------------------------&lt;BR /&gt;Happy coding!&lt;BR /&gt;---------------&lt;BR /&gt;Paul&lt;BR /&gt;Sweden&lt;BR /&gt;------------------------------&lt;BR /&gt;</description>
      <pubDate>Tue, 24 Jan 2023 12:45:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Text-from-PDF-Document-to-be-split-into-collection/m-p/59366#M13031</guid>
      <dc:creator>PvD_SE</dc:creator>
      <dc:date>2023-01-24T12:45:00Z</dc:date>
    </item>
    <item>
      <title>RE: Text from PDF Document to be split into collection</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Text-from-PDF-Document-to-be-split-into-collection/m-p/59367#M13032</link>
      <description>Thank you, I have removed it.&lt;BR /&gt;&lt;BR /&gt;------------------------------&lt;BR /&gt;Brittany Harding&lt;BR /&gt;------------------------------&lt;BR /&gt;</description>
      <pubDate>Tue, 24 Jan 2023 12:52:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Text-from-PDF-Document-to-be-split-into-collection/m-p/59367#M13032</guid>
      <dc:creator>BrittanyHarding</dc:creator>
      <dc:date>2023-01-24T12:52:00Z</dc:date>
    </item>
    <item>
      <title>RE: Text from PDF Document to be split into collection</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Text-from-PDF-Document-to-be-split-into-collection/m-p/59368#M13033</link>
      <description>Hi Brittany,&lt;BR /&gt;Can you please clarify the process? From the screenshots above, it looks like the object is trying to read the data from a web page and not a PDF, correct?&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;------------------------------&lt;BR /&gt;Konstantin Kazantsev&lt;BR /&gt;Solutions Architect&lt;BR /&gt;Church and Dwight&lt;BR /&gt;America/New_York&lt;BR /&gt;------------------------------&lt;BR /&gt;</description>
      <pubDate>Tue, 24 Jan 2023 14:14:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Text-from-PDF-Document-to-be-split-into-collection/m-p/59368#M13033</guid>
      <dc:creator>kkazantsev</dc:creator>
      <dc:date>2023-01-24T14:14:00Z</dc:date>
    </item>
    <item>
      <title>RE: Text from PDF Document to be split into collection</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Text-from-PDF-Document-to-be-split-into-collection/m-p/59369#M13034</link>
      <description>Right, I can see how that is confusing. The original process was what I was struggling with, so I decided to put it aside and try something else. I just need to be able to copy and paste the UBRN information: Registered Organization, Address and Telephone number.&amp;nbsp;
&lt;DIV class="media" style="overflow: hidden; zoom: 1;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="26532.png"&gt;&lt;img src="https://community.blueprism.com/t5/image/serverpage/image-id/26664i26A886481280C793/image-size/large?v=v2&amp;amp;px=999" role="button" title="26532.png" alt="26532.png" /&gt;&lt;/span&gt;&lt;/DIV&gt;&lt;BR /&gt;&lt;BR /&gt;------------------------------&lt;BR /&gt;Brittany Harding&lt;BR /&gt;------------------------------&lt;BR /&gt;</description>
      <pubDate>Tue, 24 Jan 2023 16:00:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Text-from-PDF-Document-to-be-split-into-collection/m-p/59369#M13034</guid>
      <dc:creator>BrittanyHarding</dc:creator>
      <dc:date>2023-01-24T16:00:00Z</dc:date>
    </item>
    <item>
      <title>RE: Text from PDF Document to be split into collection</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Text-from-PDF-Document-to-be-split-into-collection/m-p/59370#M13035</link>
      <description>For reading PDF data, there are many options, some of them free. We've tested many and have been parsing PDFs for a few years, in case you decide to come back to that solution.&lt;BR /&gt;&lt;BR /&gt;For reading data from web sites:&lt;BR /&gt;- An API is the best way to interact with the web site.&lt;BR /&gt;- If an API is not available:&lt;BR /&gt;&amp;nbsp; &amp;nbsp;a) What is your Blue Prism version?&lt;BR /&gt;&amp;nbsp; &amp;nbsp;b) Is this an internal web site or one from a third party?&lt;BR /&gt;&lt;BR /&gt;------------------------------&lt;BR /&gt;Konstantin Kazantsev&lt;BR /&gt;Solutions Architect&lt;BR /&gt;Church and Dwight&lt;BR /&gt;America/New_York&lt;BR /&gt;------------------------------&lt;BR /&gt;</description>
      <pubDate>Tue, 24 Jan 2023 21:08:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Text-from-PDF-Document-to-be-split-into-collection/m-p/59370#M13035</guid>
      <dc:creator>kkazantsev</dc:creator>
      <dc:date>2023-01-24T21:08:00Z</dc:date>
    </item>
  </channel>
</rss>

