<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Download File from Web Page in Product Forum</title>
    <link>https://community.blueprism.com/t5/Product-Forum/Download-File-from-Web-Page/m-p/85787#M36736</link>
    <description>&lt;P&gt;Hi,&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;
&lt;P&gt;I have a question about how to download a file from a web page. I know that there is a Download File action in Utility - File Management, which requires a source URL. However, when I spy a link on a web page, it seems impossible to get this source URL with a Read stage.&lt;/P&gt;
&lt;P&gt;For example, google &lt;STRONG&gt;pdf file&lt;/STRONG&gt;. The &lt;STRONG&gt;source URL&lt;/STRONG&gt; of the first result is:&lt;/P&gt;
&lt;P&gt;&lt;A href="http://www.pdf995.com/samples/pdf.pdf" target="_blank" rel="noopener"&gt;http://www.pdf995.com/samples/pdf.pdf&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;However, I cannot get this link from the Google search results page using a Read stage. Any ideas? The Read stage returns the following for the spied element:&lt;BR /&gt;&lt;BR /&gt;Get Document URL (your result may vary):&lt;BR /&gt;&lt;A href="https://www.google.com/search?q=pdf+file&amp;amp;rlz=1C1CHBF_enUS838US838&amp;amp;oq=pdf+file&amp;amp;aqs=chrome..69i57j0l5.1551j0j7&amp;amp;sourceid=chrome&amp;amp;ie=UTF-8#spf=1567693634623" target="test_blank"&gt;https://www.google.com/search?q=pdf+file&amp;amp;rlz=1C1CHBF_enUS838US838&amp;amp;oq=pdf+file&amp;amp;aqs=chrome..69i57j0l5.1551j0j7&amp;amp;sourceid=chrome&amp;amp;ie=UTF-8#spf=1567693634623&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;Get Document URL Domain:&lt;BR /&gt;www.google.com&lt;BR /&gt;&lt;BR /&gt;Get Current Value:&lt;BR /&gt;PDF document - pdf 995&lt;/P&gt;&lt;BR /&gt;&lt;BR /&gt;------------------------------&lt;BR /&gt;Eric Liu&lt;BR /&gt;RPA Developer&lt;BR /&gt;America/Toronto&lt;BR /&gt;------------------------------&lt;BR /&gt;</description>
    <pubDate>Thu, 05 Sep 2019 14:47:00 GMT</pubDate>
    <dc:creator>EricLiu</dc:creator>
    <dc:date>2019-09-05T14:47:00Z</dc:date>
    <item>
      <title>Download File from Web Page</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Download-File-from-Web-Page/m-p/85787#M36736</link>
      <description>&lt;P&gt;Hi,&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;
&lt;P&gt;I have a question about how to download a file from a web page. I know that there is a Download File action in Utility - File Management, which requires a source URL. However, when I spy a link on a web page, it seems impossible to get this source URL with a Read stage.&lt;/P&gt;
&lt;P&gt;For example, google &lt;STRONG&gt;pdf file&lt;/STRONG&gt;. The &lt;STRONG&gt;source URL&lt;/STRONG&gt; of the first result is:&lt;/P&gt;
&lt;P&gt;&lt;A href="http://www.pdf995.com/samples/pdf.pdf" target="_blank" rel="noopener"&gt;http://www.pdf995.com/samples/pdf.pdf&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;However, I cannot get this link from the Google search results page using a Read stage. Any ideas? The Read stage returns the following for the spied element:&lt;BR /&gt;&lt;BR /&gt;Get Document URL (your result may vary):&lt;BR /&gt;&lt;A href="https://www.google.com/search?q=pdf+file&amp;amp;rlz=1C1CHBF_enUS838US838&amp;amp;oq=pdf+file&amp;amp;aqs=chrome..69i57j0l5.1551j0j7&amp;amp;sourceid=chrome&amp;amp;ie=UTF-8#spf=1567693634623" target="test_blank"&gt;https://www.google.com/search?q=pdf+file&amp;amp;rlz=1C1CHBF_enUS838US838&amp;amp;oq=pdf+file&amp;amp;aqs=chrome..69i57j0l5.1551j0j7&amp;amp;sourceid=chrome&amp;amp;ie=UTF-8#spf=1567693634623&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;Get Document URL Domain:&lt;BR /&gt;www.google.com&lt;BR /&gt;&lt;BR /&gt;Get Current Value:&lt;BR /&gt;PDF document - pdf 995&lt;/P&gt;&lt;BR /&gt;&lt;BR /&gt;------------------------------&lt;BR /&gt;Eric Liu&lt;BR /&gt;RPA Developer&lt;BR /&gt;America/Toronto&lt;BR /&gt;------------------------------&lt;BR /&gt;</description>
      <pubDate>Thu, 05 Sep 2019 14:47:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Download-File-from-Web-Page/m-p/85787#M36736</guid>
      <dc:creator>EricLiu</dc:creator>
      <dc:date>2019-09-05T14:47:00Z</dc:date>
    </item>
    <item>
      <title>RE: Download File from Web Page</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Download-File-from-Web-Page/m-p/85788#M36737</link>
      <description>This is almost certainly specific to Google, because the actual hyperlink has an h3 and a div nested inside it. Blue Prism's spy tool picks up the nested div rather than the enclosing A element. It may be worth manually tweaking the attributes of the spied element so that it matches just the &amp;lt;a&amp;gt;&amp;lt;/a&amp;gt; hyperlink, and then checking whether you can read the URL from that.&lt;BR /&gt;&lt;BR /&gt;
&lt;DIV class="media" style="overflow: hidden;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="19105.png"&gt;&lt;img src="https://community.blueprism.com/t5/image/serverpage/image-id/19258iCF914F60DC7240C3/image-size/large?v=v2&amp;amp;px=999" role="button" title="19105.png" alt="19105.png" /&gt;&lt;/span&gt;&lt;/DIV&gt;
&lt;BR /&gt;&lt;BR /&gt;------------------------------&lt;BR /&gt;James Man&lt;BR /&gt;Senior Product Consultant&lt;BR /&gt;Blue Prism&lt;BR /&gt;Asia/Hong_Kong&lt;BR /&gt;------------------------------&lt;BR /&gt;</description>
      <pubDate>Fri, 06 Sep 2019 02:50:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Download-File-from-Web-Page/m-p/85788#M36737</guid>
      <dc:creator>james.man</dc:creator>
      <dc:date>2019-09-06T02:50:00Z</dc:date>
    </item>
    <item>
      <title>RE: Download File from Web Page</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Download-File-from-Web-Page/m-p/85789#M36738</link>
      <description>&lt;P&gt;Hi Eric,&lt;/P&gt;
&lt;P&gt;This might be a bit of overkill, but you could use the HtmlAgilityPack (&lt;A href="https://html-agility-pack.net/select-nodes" target="_blank" rel="noopener"&gt;https://html-agility-pack.net/select-nodes&lt;/A&gt;). Load the page's full HTML into an HtmlAgilityPack document and search for the link you need using XPath. This has to be done in a Code stage in an object.&lt;/P&gt;
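&lt;P&gt;To illustrate the XPath idea, here is a minimal sketch. It is written in Python with the standard library's ElementTree rather than in C# with HtmlAgilityPack. The sample markup, the function names result_links and pdf_links, and the example.com URL are assumptions made up for the illustration. Google's real markup is different and is not well-formed XML, so a real page would need an HTML-tolerant parser such as HtmlAgilityPack. The point is only that the XPath targets the A element that wraps the h3, not the nested div that the spy picks up.&lt;/P&gt;
<![CDATA[
```python
import xml.etree.ElementTree as ET

# Hypothetical, simplified stand-in for a search results page. Real pages are
# rarely well-formed XML, so this only works on the tidy sample below.
SAMPLE = """<div id="search">
  <div class="g">
    <a href="http://www.pdf995.com/samples/pdf.pdf">
      <h3><div>PDF document - pdf 995</div></h3>
    </a>
  </div>
  <div class="g">
    <a href="http://example.com/other.html"><h3>Other result</h3></a>
  </div>
</div>"""


def result_links(markup):
    """Return the href of every A element that directly wraps an h3."""
    root = ET.fromstring(markup)
    # ".//a[h3]" selects A elements at any depth that have an h3 child,
    # i.e. the hyperlink itself rather than the div nested inside it.
    return [a.get("href") for a in root.findall(".//a[h3]") if a.get("href")]


def pdf_links(markup):
    """Keep only the links whose URL ends in .pdf."""
    return [url for url in result_links(markup) if url.lower().endswith(".pdf")]


if __name__ == "__main__":
    print(pdf_links(SAMPLE))
```
]]>
&lt;P&gt;A URL found this way could then be passed to the Download File action as the source URL.&lt;/P&gt;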
&lt;P&gt;Regards, Erik&lt;/P&gt;
&lt;BR /&gt;&lt;BR /&gt;------------------------------&lt;BR /&gt;Erik Christoffer&lt;BR /&gt;Developer&lt;BR /&gt;InsingerGilissen&lt;BR /&gt;Europe/Amsterdam&lt;BR /&gt;------------------------------&lt;BR /&gt;</description>
      <pubDate>Fri, 06 Sep 2019 09:10:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Download-File-from-Web-Page/m-p/85789#M36738</guid>
      <dc:creator>ErikChristoffer</dc:creator>
      <dc:date>2019-09-06T09:10:00Z</dc:date>
    </item>
    <item>
      <title>RE: Download File from Web Page</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Download-File-from-Web-Page/m-p/85790#M36739</link>
      <description>Hi James,&lt;BR /&gt;&lt;BR /&gt;Thanks for replying. Actually, there are plenty of websites that have this type of issue. I am not an expert in web development, but I guess people wrap content in a lot of divs for CSS styling, or their JavaScript renders the HTML this way.&lt;BR /&gt;&lt;BR /&gt;I have tried moving the mouse slightly while spying an element, but that is not very effective. Do you know if Blue Prism is planning any new functionality to solve this issue?&lt;BR /&gt;&lt;BR /&gt;------------------------------&lt;BR /&gt;Eric Liu&lt;BR /&gt;RPA Developer&lt;BR /&gt;America/Toronto&lt;BR /&gt;------------------------------&lt;BR /&gt;</description>
      <pubDate>Fri, 06 Sep 2019 18:02:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Download-File-from-Web-Page/m-p/85790#M36739</guid>
      <dc:creator>EricLiu</dc:creator>
      <dc:date>2019-09-06T18:02:00Z</dc:date>
    </item>
    <item>
      <title>RE: Download File from Web Page</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Download-File-from-Web-Page/m-p/85791#M36740</link>
      <description>Thanks, Erik! I had never heard of this method before. Overkill or not, I am happy to learn a new way of interacting with HTML pages. I will definitely try it out.&lt;BR /&gt;&lt;BR /&gt;------------------------------&lt;BR /&gt;Eric Liu&lt;BR /&gt;RPA Developer&lt;BR /&gt;America/Toronto&lt;BR /&gt;------------------------------&lt;BR /&gt;</description>
      <pubDate>Fri, 06 Sep 2019 18:04:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Download-File-from-Web-Page/m-p/85791#M36740</guid>
      <dc:creator>EricLiu</dc:creator>
      <dc:date>2019-09-06T18:04:00Z</dc:date>
    </item>
  </channel>
</rss>

