<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Duplicate Removal From Collections. in Product Forum</title>
    <link>https://community.blueprism.com/t5/Product-Forum/Duplicate-Removal-From-Collections/m-p/93981#M43390</link>
    <description>Hi,

I have been trying to remove duplicates from a excel file and then inserting them into work queue.

Excel file has 4 fields with 1 unique field ( say Product Code ). File has almost 100+ entries. I don't want non-unique values to be inserted into work queue.

Now what I am doing is using Business Object "Utility Collection Manipulation" and Action Stage "Collection Contains Values" to check every single item in the excel file and then removing the rows with non-unique value.&amp;nbsp;

When I run this in Control Room, It works fine but takes more than 3 minutes to do so (depending on my logic and number of entries in excel file).

I want a better solution to this which can solve this problem in less than 10 seconds. I have heard that there are somethings like dataview, datatable, Code Stage which could have been used here to do this quite quickly internally itself.

Since I am very new to this technology, I don't know what to do. Please help!

Thanks.</description>
    <pubDate>Fri, 29 Mar 2019 12:16:00 GMT</pubDate>
    <dc:creator>AmberKamboj</dc:creator>
    <dc:date>2019-03-29T12:16:00Z</dc:date>
    <item>
      <title>Duplicate Removal From Collections.</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Duplicate-Removal-From-Collections/m-p/93981#M43390</link>
      <description>Hi,

I have been trying to remove duplicates from a excel file and then inserting them into work queue.

Excel file has 4 fields with 1 unique field ( say Product Code ). File has almost 100+ entries. I don't want non-unique values to be inserted into work queue.

Now what I am doing is using Business Object "Utility Collection Manipulation" and Action Stage "Collection Contains Values" to check every single item in the excel file and then removing the rows with non-unique value.&amp;nbsp;

When I run this in Control Room, It works fine but takes more than 3 minutes to do so (depending on my logic and number of entries in excel file).

I want a better solution to this which can solve this problem in less than 10 seconds. I have heard that there are somethings like dataview, datatable, Code Stage which could have been used here to do this quite quickly internally itself.

Since I am very new to this technology, I don't know what to do. Please help!

Thanks.</description>
      <pubDate>Fri, 29 Mar 2019 12:16:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Duplicate-Removal-From-Collections/m-p/93981#M43390</guid>
      <dc:creator>AmberKamboj</dc:creator>
      <dc:date>2019-03-29T12:16:00Z</dc:date>
    </item>
    <item>
      <title>Change Stage logging to …</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Duplicate-Removal-From-Collections/m-p/93982#M43391</link>
      <description>Change Stage logging to 'Errors only' for all the stages in the loop. It'll run in less than 10 seconds (or so) in Control Room after that.</description>
      <pubDate>Fri, 29 Mar 2019 14:32:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Duplicate-Removal-From-Collections/m-p/93982#M43391</guid>
      <dc:creator>david.l.morris</dc:creator>
      <dc:date>2019-03-29T14:32:00Z</dc:date>
    </item>
    <item>
      <title>Here's a simple c# code…</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Duplicate-Removal-From-Collections/m-p/93983#M43392</link>
      <description>&lt;P&gt;Here's a simple c# code stage that'll get you distincts. &lt;BR /&gt;&lt;BR /&gt;Inputs: &lt;BR /&gt;Collection - Collection &lt;BR /&gt;Column - Text (Name of column) &lt;BR /&gt;&lt;BR /&gt;Outputs: &lt;BR /&gt;Sorted Collection - Collection&lt;/P&gt;
&lt;PRE class="language-csharp"&gt;DataView dv &lt;SPAN class="token operator"&gt;=&lt;/SPAN&gt; Collection&lt;SPAN class="token punctuation"&gt;.&lt;/SPAN&gt;DefaultView&lt;SPAN class="token punctuation"&gt;;&lt;/SPAN&gt; 
Sorted_Collection &lt;SPAN class="token operator"&gt;=&lt;/SPAN&gt; dv&lt;SPAN class="token punctuation"&gt;.&lt;/SPAN&gt;&lt;SPAN class="token function"&gt;ToTable&lt;/SPAN&gt;&lt;SPAN class="token punctuation"&gt;(&lt;/SPAN&gt;&lt;SPAN class="token keyword"&gt;true&lt;/SPAN&gt;&lt;SPAN class="token punctuation"&gt;,&lt;/SPAN&gt; Column&lt;SPAN class="token punctuation"&gt;)&lt;/SPAN&gt;&lt;SPAN class="token punctuation"&gt;;&lt;/SPAN&gt;&lt;/PRE&gt;</description>
      <pubDate>Fri, 29 Mar 2019 18:50:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Duplicate-Removal-From-Collections/m-p/93983#M43392</guid>
      <dc:creator>AmiBarrett</dc:creator>
      <dc:date>2019-03-29T18:50:00Z</dc:date>
    </item>
    <item>
      <title>thanks @david and …</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Duplicate-Removal-From-Collections/m-p/93984#M43393</link>
      <description>thanks @david and @amitbarrett ... you guys really helped me out</description>
      <pubDate>Mon, 01 Apr 2019 11:54:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Duplicate-Removal-From-Collections/m-p/93984#M43393</guid>
      <dc:creator>AmberKamboj</dc:creator>
      <dc:date>2019-04-01T11:54:00Z</dc:date>
    </item>
  </channel>
</rss>

