<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Need an idea for extracting specific information in University Forum</title>
    <link>https://community.blueprism.com/t5/University-Forum/Need-an-idea-for-extracting-specific-information/m-p/97681#M3245</link>
    <description>&lt;P&gt;Hi,&lt;BR /&gt;&lt;BR /&gt;I hope the solution I posted works for you. If you still face any issue, or the output is not in the expected format, please share the output you expect&lt;BR /&gt;&lt;BR /&gt;so that I can suggest a better solution.&lt;BR /&gt;&lt;BR /&gt;Regards&lt;/P&gt;</description>
    <pubDate>Mon, 01 May 2023 06:06:46 GMT</pubDate>
    <dc:creator>plnarayana777</dc:creator>
    <dc:date>2023-05-01T06:06:46Z</dc:date>
    <item>
      <title>Need an idea for extracting specific information</title>
      <link>https://community.blueprism.com/t5/University-Forum/Need-an-idea-for-extracting-specific-information/m-p/97679#M3243</link>
      <description>&lt;P&gt;Hi,&lt;BR /&gt;I have extracted all the text from an editable PDF using Application Modeller and Send Global Keys. Now I need to extract some specific fields from it. Can anyone tell me how to extract this data?&lt;BR /&gt;&lt;BR /&gt;I am new to Blue Prism, so a detailed explanation of the solution would be appreciated.&lt;BR /&gt;&lt;BR /&gt;PDF document:&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="35510.png"&gt;&lt;img src="https://community.blueprism.com/t5/image/serverpage/image-id/37424i59C4CBD49BB7E3BC/image-size/large?v=v2&amp;amp;px=999" role="button" title="35510.png" alt="35510.png" /&gt;&lt;/span&gt;&lt;BR /&gt;Extracted text:&lt;BR /&gt;&lt;BR /&gt;Packing List&lt;BR /&gt;Date: 06/04/2018&lt;BR /&gt;FROM:&lt;BR /&gt;Shipping Company&lt;BR /&gt;123 Memory Lane&lt;BR /&gt;Madison, Wi&lt;BR /&gt;53203&lt;BR /&gt;TO:&lt;BR /&gt;Valued Customer&lt;BR /&gt;4343 Main St&lt;BR /&gt;Suite 100&lt;BR /&gt;Willowcrest, NC&lt;BR /&gt;27007&lt;BR /&gt;Quanity Ordered&lt;BR /&gt;Quantity Shipped&lt;BR /&gt;Product Number&lt;BR /&gt;Description&lt;BR /&gt;Unit Weight&lt;BR /&gt;2&lt;BR /&gt;1&lt;BR /&gt;0045657&lt;BR /&gt;Gizmo #1&lt;BR /&gt;1 lbs.&lt;BR /&gt;2&lt;BR /&gt;2&lt;BR /&gt;9007652&lt;BR /&gt;Gadget #2&lt;BR /&gt;20 lbs.&lt;BR /&gt;Total Items Shipped: 3&lt;BR /&gt;Comments:&lt;BR /&gt;Please retain this packing list for return or exchange.&lt;BR /&gt;Ship Date: 07/04/2018&lt;BR /&gt;&lt;BR /&gt;I need the following information:&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="35511.png"&gt;&lt;img src="https://community.blueprism.com/t5/image/serverpage/image-id/37425iA7F31A3E9898C106/image-size/large?v=v2&amp;amp;px=999" role="button" title="35511.png" alt="35511.png" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 28 Apr 2023 08:33:52 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/University-Forum/Need-an-idea-for-extracting-specific-information/m-p/97679#M3243</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2023-04-28T08:33:52Z</dc:date>
    </item>
    <item>
      <title>Re: Need an idea for extracting specific information</title>
      <link>https://community.blueprism.com/t5/University-Forum/Need-an-idea-for-extracting-specific-information/m-p/97680#M3244</link>
      <description>&lt;P&gt;Possible approaches are string manipulation or regex through the Utility - Strings VBO.&lt;BR /&gt;&lt;BR /&gt;There are limitations, though: the regex relies on keywords such as FROM: and TO:, so all the PDFs must have the same format.&lt;BR /&gt;&lt;BR /&gt;The table data additionally depends on the number of columns (fixed or changing) and on whether every cell contains data.&lt;BR /&gt;&lt;BR /&gt;If any cell is empty, this solution fails,&lt;BR /&gt;&lt;BR /&gt;so cover as many cases as possible with a sufficient number of test files.&lt;BR /&gt;&lt;BR /&gt;You can use the following regex to get the group values:&lt;BR /&gt;&lt;BR /&gt;FROM:\s+(?&amp;lt;FromAddress&amp;gt;[\S\s]+)\s+TO:\s+(?&amp;lt;ToAddress&amp;gt;[\S\s]+)Quanity Ordered[\S\s]+Weight\s+(?&amp;lt;Table&amp;gt;[\S\s]+)Total Items Shipped:\s*(?&amp;lt;TotalItemsShipped&amp;gt;\d+)[\S\s]+Ship Date:\s(?&amp;lt;ShipDate&amp;gt;\d{2}\/\d{2}\/\d{4})&lt;BR /&gt;&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="35504.png"&gt;&lt;img src="https://community.blueprism.com/t5/image/serverpage/image-id/35560iF6CD4CB6EA72704C/image-size/large?v=v2&amp;amp;px=999" role="button" title="35504.png" alt="35504.png" /&gt;&lt;/span&gt;&lt;BR /&gt;Here the initial string is the text extracted from the PDF.&lt;BR /&gt;&lt;BR /&gt;First use the regex above to get the group values, including the table.&lt;BR /&gt;&lt;BR /&gt;The output is a collection, as shown below:&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="35505.png"&gt;&lt;img src="https://community.blueprism.com/t5/image/serverpage/image-id/35559i0C8ADDAD33210C6B/image-size/large?v=v2&amp;amp;px=999" role="button" title="35505.png" alt="35505.png" /&gt;&lt;/span&gt;&lt;BR /&gt;The next step is to read the table data from the Table column.&lt;BR /&gt;&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="35506.png"&gt;&lt;img src="https://community.blueprism.com/t5/image/serverpage/image-id/35561i07CFDEE353FC687E/image-size/large?v=v2&amp;amp;px=999" role="button" title="35506.png" alt="35506.png" /&gt;&lt;/span&gt;&lt;BR /&gt;The output is a string.&lt;BR /&gt;&lt;BR /&gt;You can get the other values the same way, or use a Multi Calculation stage to get all the values at once.&lt;BR /&gt;&lt;BR /&gt;Next, use the regex below to get the table values:&lt;BR /&gt;&lt;BR /&gt;(?&amp;lt;QO&amp;gt;\d+?)\r?\n(?&amp;lt;QS&amp;gt;\d+?)\r?\n(?&amp;lt;PN&amp;gt;\d{7})&lt;BR /&gt;&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="35507.png"&gt;&lt;img src="https://community.blueprism.com/t5/image/serverpage/image-id/35563i97F02CA0BFA56580/image-size/large?v=v2&amp;amp;px=999" role="button" title="35507.png" alt="35507.png" /&gt;&lt;/span&gt;&lt;BR /&gt;The values are returned in a collection, so you can loop through it to read the values you need.&lt;BR /&gt;&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="35508.png"&gt;&lt;img src="https://community.blueprism.com/t5/image/serverpage/image-id/35562i6A6DF5FBA8E2572A/image-size/large?v=v2&amp;amp;px=999" role="button" title="35508.png" alt="35508.png" /&gt;&lt;/span&gt;&lt;BR /&gt;&lt;BR /&gt;Note: as mentioned earlier, there are limitations.&lt;BR /&gt;&lt;BR /&gt;Note: you can rename the groups as required, but the names must not contain spaces (for example, &amp;lt;QO&amp;gt; could be named &amp;lt;Quantity_Ordered&amp;gt;).&lt;BR /&gt;&lt;BR /&gt;I hope this helps. If you still face any issue, please describe it.&lt;BR /&gt;&lt;BR /&gt;Regards&lt;/P&gt;</description>
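      <!--
The two-step extraction described in this reply can be sketched outside Blue Prism as follows. This is only an illustration in Python, not the Utility - Strings VBO itself: Python's re module needs the (?P<name>...) form for named groups where .NET accepts (?<name>...), and the names SAMPLE_TEXT, HEADER_PATTERN, ROW_PATTERN and extract_packing_list are hypothetical. TotalItemsShipped is matched with \d+ here so that multi-digit totals are captured, and the misspelled "Quanity Ordered" is kept because that is how the text was extracted from the PDF.

```python
import re

# Text as extracted from the sample packing list in the question.
SAMPLE_TEXT = """Packing List
Date: 06/04/2018
FROM:
Shipping Company
123 Memory Lane
Madison, Wi
53203
TO:
Valued Customer
4343 Main St
Suite 100
Willowcrest, NC
27007
Quanity Ordered
Quantity Shipped
Product Number
Description
Unit Weight
2
1
0045657
Gizmo #1
1 lbs.
2
2
9007652
Gadget #2
20 lbs.
Total Items Shipped: 3
Comments:
Please retain this packing list for return or exchange.
Ship Date: 07/04/2018"""

# Step 1: header fields plus the raw table block (Python named-group syntax).
HEADER_PATTERN = re.compile(
    r"FROM:\s+(?P<FromAddress>[\S\s]+)\s+TO:\s+(?P<ToAddress>[\S\s]+)"
    r"Quanity Ordered[\S\s]+Weight\s+(?P<Table>[\S\s]+)"
    r"Total Items Shipped:\s*(?P<TotalItemsShipped>\d+)[\S\s]+"
    r"Ship Date:\s(?P<ShipDate>\d{2}/\d{2}/\d{4})"
)

# Step 2: quantity ordered, quantity shipped and product number per table row.
ROW_PATTERN = re.compile(r"(?P<QO>\d+?)\r?\n(?P<QS>\d+?)\r?\n(?P<PN>\d{7})")


def extract_packing_list(text):
    """Return the header fields and a list of table rows, or None if no match."""
    match = HEADER_PATTERN.search(text)
    if match is None:
        return None
    # Strip the trailing line breaks that the greedy groups pick up.
    fields = {name: value.strip() for name, value in match.groupdict().items()}
    fields["Rows"] = [row.groupdict() for row in ROW_PATTERN.finditer(fields["Table"])]
    return fields


result = extract_packing_list(SAMPLE_TEXT)
print(result["ShipDate"])
for row in result["Rows"]:
    print(row)
```

As in the reply, this only works while every document keeps the same keywords and layout. A function that returns None on a failed match makes it easier to route unexpected layouts to an exception path.
      -->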
      <pubDate>Fri, 28 Apr 2023 09:06:47 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/University-Forum/Need-an-idea-for-extracting-specific-information/m-p/97680#M3244</guid>
      <dc:creator>plnarayana777</dc:creator>
      <dc:date>2023-04-28T09:06:47Z</dc:date>
    </item>
    <item>
      <title>Re: Need an idea for extracting specific information</title>
      <link>https://community.blueprism.com/t5/University-Forum/Need-an-idea-for-extracting-specific-information/m-p/97681#M3245</link>
      <description>&lt;P&gt;Hi,&lt;BR /&gt;&lt;BR /&gt;I hope the solution I posted works for you. If you still face any issue, or the output is not in the expected format, please share the output you expect&lt;BR /&gt;&lt;BR /&gt;so that I can suggest a better solution.&lt;BR /&gt;&lt;BR /&gt;Regards&lt;/P&gt;</description>
      <pubDate>Mon, 01 May 2023 06:06:46 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/University-Forum/Need-an-idea-for-extracting-specific-information/m-p/97681#M3245</guid>
      <dc:creator>plnarayana777</dc:creator>
      <dc:date>2023-05-01T06:06:46Z</dc:date>
    </item>
    <item>
      <title>Re: Need an idea for extracting specific information</title>
      <link>https://community.blueprism.com/t5/University-Forum/Need-an-idea-for-extracting-specific-information/m-p/97682#M3246</link>
      <description>&lt;P&gt;Thanks for the solution. It worked for me.&lt;/P&gt;</description>
      <pubDate>Tue, 02 May 2023 10:28:09 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/University-Forum/Need-an-idea-for-extracting-specific-information/m-p/97682#M3246</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2023-05-02T10:28:09Z</dc:date>
    </item>
  </channel>
</rss>

