<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic i thought it was 2018...very… in Product Forum</title>
    <link>https://community.blueprism.com/t5/Product-Forum/Fuzzy-Matching/m-p/86127#M37026</link>
    <description>i thought it was 2018...very late to the party</description>
    <pubDate>Thu, 07 Mar 2019 06:13:00 GMT</pubDate>
    <dc:creator>BenKirimlidis</dc:creator>
    <dc:date>2019-03-07T06:13:00Z</dc:date>
    <item>
      <title>Fuzzy Matching</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Fuzzy-Matching/m-p/86121#M37020</link>
      <description>Hello!

Has anyone been able to try the concept of 'Fuzzy Matching' ( &lt;A href="https://en.wikipedia.org/wiki/Fuzzy_matching_(computer-assisted_translation)" target="test_blank"&gt;https://en.wikipedia.org/wiki/Fuzzy_matching_(computer-assisted_translation)&lt;/A&gt; ) by using Blue Prism? I was thinking of creating a VBO specifically for this but find it hard to know where to begin. I read this concept could be used as suggested by Blue Prism's "Increasing Data Quality" data sheet (see documentation). Here's the exact info given in the pdf:

Fuzzy match translation
Data can be corrected or translated using €œfuzzy match€&amp;#157; requirements, provided there is a clear rule for doing so. For example, location names can be corrected using techniques borrowed from spell-checking algorithms to identify the closest match from a €œdictionary€&amp;#157; of approved values. For example, using a €œLevenstein Distance€&amp;#157; calculation the following corrections might be made:
ï‚· €œSollihull hospital€&amp;#157; becomes €œSolihull Hospital€&amp;#157; (corrected the double €œL€&amp;#157; and the capitalisation of €œH€&amp;#157;)
ï‚· €œBlue Prism€&amp;#157; becomes €œBlue Prism Limited€&amp;#157;


So my question is: has anyone done something like this before? Is this done by writing our own Visual Basic code, using a VBO, using a separate program, ... ? 

Thanks for any info related to this subject!

SÃ©bastien</description>
      <pubDate>Mon, 24 Jul 2017 11:01:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Fuzzy-Matching/m-p/86121#M37020</guid>
      <dc:creator>SébastienBulté</dc:creator>
      <dc:date>2017-07-24T11:01:00Z</dc:date>
    </item>
    <item>
      <title>Hi Sebastien - there is no</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Fuzzy-Matching/m-p/86122#M37021</link>
      <description>Hi Sebastien - there is no official VBO available but maybe someone out there has tried it. I would imagine a single code sage would be enough, and searching for '.Net Levenstein Distance' offers many examples. These two look like they will paste straight in, with minimal adjustment.
&lt;A href="https://social.technet.microsoft.com/wiki/contents/articles/28961.leven…" target="test_blank"&gt;https://social.technet.microsoft.com/wiki/contents/articles/28961.leven…&lt;/A&gt;
&lt;A href="https://www.programmingalgorithms.com/algorithm/levenshtein-distance?la…" target="test_blank"&gt;https://www.programmingalgorithms.com/algorithm/levenshtein-distance?la…&lt;/A&gt;</description>
      <pubDate>Tue, 25 Jul 2017 13:46:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Fuzzy-Matching/m-p/86122#M37021</guid>
      <dc:creator>John__Carter</dc:creator>
      <dc:date>2017-07-25T13:46:00Z</dc:date>
    </item>
    <item>
      <title>I've been able to create a</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Fuzzy-Matching/m-p/86123#M37022</link>
      <description>I've been able to create a vbo for it, thanks &lt;span class="lia-unicode-emoji" title=":winking_face:"&gt;😉&lt;/span&gt; I've used the levenstein distance as well as the jaro-winkler ratio, both will prove useful for my OCR needs I believe.</description>
      <pubDate>Tue, 25 Jul 2017 14:37:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Fuzzy-Matching/m-p/86123#M37022</guid>
      <dc:creator>SébastienBulté</dc:creator>
      <dc:date>2017-07-25T14:37:00Z</dc:date>
    </item>
    <item>
      <title>Very good. Just bear in mind</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Fuzzy-Matching/m-p/86124#M37023</link>
      <description>Very good. Just bear in mind that all OCR and fuzzy matching is basically a guess that can be wrong.</description>
      <pubDate>Tue, 25 Jul 2017 22:01:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Fuzzy-Matching/m-p/86124#M37023</guid>
      <dc:creator>John__Carter</dc:creator>
      <dc:date>2017-07-25T22:01:00Z</dc:date>
    </item>
    <item>
      <title>Hello Sébastien,</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Fuzzy-Matching/m-p/86125#M37024</link>
      <description>Hello Sébastien, 
I'm currently facing the same challenge as yours. Could you please share the VBO you have created ?
Thanks a lot !</description>
      <pubDate>Wed, 13 Sep 2017 20:24:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Fuzzy-Matching/m-p/86125#M37024</guid>
      <dc:creator>MahmudBarrak</dc:creator>
      <dc:date>2017-09-13T20:24:00Z</dc:date>
    </item>
    <item>
      <title>late to the party but…</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Fuzzy-Matching/m-p/86126#M37025</link>
      <description>late to the party but lehvenstein distance functions can be useful here for finding matches where the difference between the target string and the input are similar but different</description>
      <pubDate>Thu, 07 Mar 2019 06:12:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Fuzzy-Matching/m-p/86126#M37025</guid>
      <dc:creator>BenKirimlidis</dc:creator>
      <dc:date>2019-03-07T06:12:00Z</dc:date>
    </item>
    <item>
      <title>i thought it was 2018...very…</title>
      <link>https://community.blueprism.com/t5/Product-Forum/Fuzzy-Matching/m-p/86127#M37026</link>
      <description>i thought it was 2018...very late to the party</description>
      <pubDate>Thu, 07 Mar 2019 06:13:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/Fuzzy-Matching/m-p/86127#M37026</guid>
      <dc:creator>BenKirimlidis</dc:creator>
      <dc:date>2019-03-07T06:13:00Z</dc:date>
    </item>
  </channel>
</rss>

