<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic DFD definition for multiline REGEX in Product Forum</title>
    <link>https://community.blueprism.com/t5/Product-Forum/DFD-definition-for-multiline-REGEX/m-p/69460#M22065</link>
    <description>Dear Community,&lt;BR /&gt;&lt;BR /&gt;in my organization we are using decipher in some projects. We have find ourselves with a knowledge limitation.&lt;BR /&gt;&lt;BR /&gt;We are trying to extract a phrase frome PDFs. This phrase can be divided into 2,3,4 or five lines, depending on the structure.&lt;BR /&gt;We can define a Regular expression that contemplates this issue, but it seems that decipher is not able to execute the Regex through multiple lines, so it doesn´t recognise our intended phrase.&lt;BR /&gt;&lt;BR /&gt;Examples:&lt;BR /&gt;&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="9470.png"&gt;&lt;img src="https://community.blueprism.com/t5/image/serverpage/image-id/9639i38C07B0E5D96BC87/image-size/large?v=v2&amp;amp;px=999" role="button" title="9470.png" alt="9470.png" /&gt;&lt;/span&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="9471.png"&gt;&lt;img src="https://community.blueprism.com/t5/image/serverpage/image-id/9641i7FACDB9E75A4229F/image-size/large?v=v2&amp;amp;px=999" role="button" title="9471.png" alt="9471.png" /&gt;&lt;/span&gt;&lt;BR /&gt;We need a DFD with a regex that extracts the phrase - image 1: "DEMARCACIÓN DE CARRETERAS DEL ESTADO EN CATALUÑA" - image 2: DEMARCACIÓN DE CARRETERAS DEL ESTADO EN CASTILLA Y LEÓN OCCIDENTAL"&lt;BR /&gt;&lt;BR /&gt;We have this regex working: &lt;STRONG&gt;(DEMARCACION)[\s\n\r]*(DE)[\s\n\r]*(CARRETERAS)[\s\n\r]*(DEL)[\s\n\r]*(ESTADO EN)[\s\n\r]*(CATALUÑA|CASTILLA-LA MANCHA)&lt;/STRONG&gt; in other languages, but the limitation through lines in decipher doesn´t allow us to succed with it.&lt;BR /&gt;&lt;BR /&gt;Please, if you have an idea to get through this issue, it will help a lot. I have seen other post regarding this problem, but it hasn´t help at all in our case.&lt;BR /&gt;&lt;BR /&gt;Best regardas, have a nice day!&lt;BR /&gt;&lt;BR /&gt;------------------------------&lt;BR /&gt;Arturo Garcia&lt;BR /&gt;------------------------------&lt;BR /&gt;</description>
    <pubDate>Wed, 14 Dec 2022 09:32:00 GMT</pubDate>
    <dc:creator>ArturoGarcia</dc:creator>
    <dc:date>2022-12-14T09:32:00Z</dc:date>
    <item>
      <title>DFD definition for multiline REGEX</title>
      <link>https://community.blueprism.com/t5/Product-Forum/DFD-definition-for-multiline-REGEX/m-p/69460#M22065</link>
      <description>Dear Community,&lt;BR /&gt;&lt;BR /&gt;in my organization we are using decipher in some projects. We have find ourselves with a knowledge limitation.&lt;BR /&gt;&lt;BR /&gt;We are trying to extract a phrase frome PDFs. This phrase can be divided into 2,3,4 or five lines, depending on the structure.&lt;BR /&gt;We can define a Regular expression that contemplates this issue, but it seems that decipher is not able to execute the Regex through multiple lines, so it doesn´t recognise our intended phrase.&lt;BR /&gt;&lt;BR /&gt;Examples:&lt;BR /&gt;&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="9470.png"&gt;&lt;img src="https://community.blueprism.com/t5/image/serverpage/image-id/9639i38C07B0E5D96BC87/image-size/large?v=v2&amp;amp;px=999" role="button" title="9470.png" alt="9470.png" /&gt;&lt;/span&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="9471.png"&gt;&lt;img src="https://community.blueprism.com/t5/image/serverpage/image-id/9641i7FACDB9E75A4229F/image-size/large?v=v2&amp;amp;px=999" role="button" title="9471.png" alt="9471.png" /&gt;&lt;/span&gt;&lt;BR /&gt;We need a DFD with a regex that extracts the phrase - image 1: "DEMARCACIÓN DE CARRETERAS DEL ESTADO EN CATALUÑA" - image 2: DEMARCACIÓN DE CARRETERAS DEL ESTADO EN CASTILLA Y LEÓN OCCIDENTAL"&lt;BR /&gt;&lt;BR /&gt;We have this regex working: &lt;STRONG&gt;(DEMARCACION)[\s\n\r]*(DE)[\s\n\r]*(CARRETERAS)[\s\n\r]*(DEL)[\s\n\r]*(ESTADO EN)[\s\n\r]*(CATALUÑA|CASTILLA-LA MANCHA)&lt;/STRONG&gt; in other languages, but the limitation through lines in decipher doesn´t allow us to succed with it.&lt;BR /&gt;&lt;BR /&gt;Please, if you have an idea to get through this issue, it will help a lot. I have seen other post regarding this problem, but it hasn´t help at all in our case.&lt;BR /&gt;&lt;BR /&gt;Best regardas, have a nice day!&lt;BR /&gt;&lt;BR /&gt;------------------------------&lt;BR /&gt;Arturo Garcia&lt;BR /&gt;------------------------------&lt;BR /&gt;</description>
      <pubDate>Wed, 14 Dec 2022 09:32:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/DFD-definition-for-multiline-REGEX/m-p/69460#M22065</guid>
      <dc:creator>ArturoGarcia</dc:creator>
      <dc:date>2022-12-14T09:32:00Z</dc:date>
    </item>
    <item>
      <title>RE: DFD definition for multiline REGEX</title>
      <link>https://community.blueprism.com/t5/Product-Forum/DFD-definition-for-multiline-REGEX/m-p/69461#M22066</link>
      <description>Hi Arturo,&lt;BR /&gt;&lt;BR /&gt;Is the phrase always near the heading "DIRECCION GENERAL DE CERRETERAS", as this may also be useful without the regex requirement?&lt;BR /&gt;&lt;BR /&gt;Your regex also doesn't appear to use the correct characters e.g. "DEMARCACION" should be "DEMARCACI&lt;SPAN&gt;Ó&lt;/SPAN&gt;N". Have you tried it this way?&lt;BR /&gt;&lt;BR /&gt;Thanks&lt;BR /&gt;&lt;BR /&gt;------------------------------&lt;BR /&gt;Ben Lyons&lt;BR /&gt;Senior Product Specialist - Decipher&lt;BR /&gt;Blue Prism&lt;BR /&gt;UK based&lt;BR /&gt;------------------------------&lt;BR /&gt;</description>
      <pubDate>Thu, 15 Dec 2022 09:43:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/DFD-definition-for-multiline-REGEX/m-p/69461#M22066</guid>
      <dc:creator>Ben.Lyons1</dc:creator>
      <dc:date>2022-12-15T09:43:00Z</dc:date>
    </item>
    <item>
      <title>RE: DFD definition for multiline REGEX</title>
      <link>https://community.blueprism.com/t5/Product-Forum/DFD-definition-for-multiline-REGEX/m-p/69462#M22067</link>
      <description>Hello Ben,&lt;BR /&gt;&lt;BR /&gt;thank you for the reply.&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;Actyally yes, &lt;SPAN&gt;"DIRECCION GENERAL DE CERRETERAS" is allways near the phrase.&lt;BR /&gt;&lt;BR /&gt;We have tried "DEMARCACIÓN" and all kind of possibilities. We have also defined less restrictive Regex and it looks like decipher is not able to extract info that is divided in lines...&lt;BR /&gt;&lt;BR /&gt;For example, If we create an image that has to lines:&lt;BR /&gt;&lt;BR /&gt;"hello&lt;BR /&gt;world"&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;And we define a Regex that accepts all characters (spaces an line jumps also) and words, it only gives us the word "hello".... not both.&lt;BR /&gt;&lt;BR /&gt;Waiting your response... Thank you.&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;------------------------------&lt;BR /&gt;Arturo Garcia&lt;BR /&gt;------------------------------&lt;BR /&gt;</description>
      <pubDate>Thu, 15 Dec 2022 09:51:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/DFD-definition-for-multiline-REGEX/m-p/69462#M22067</guid>
      <dc:creator>ArturoGarcia</dc:creator>
      <dc:date>2022-12-15T09:51:00Z</dc:date>
    </item>
    <item>
      <title>RE: DFD definition for multiline REGEX</title>
      <link>https://community.blueprism.com/t5/Product-Forum/DFD-definition-for-multiline-REGEX/m-p/69463#M22068</link>
      <description>Hi Arturo,&lt;BR /&gt;&lt;BR /&gt;Hmm, that shouldn't be a problem, I've had success with using multi-line regex.&lt;BR /&gt;&lt;BR /&gt;I assume you've seen this thread where I demo it's possible? &lt;A href="https://community.blueprism.com/discussion/bug-using-regex-in-format-expression#bma20a0e34-b004-41b6-8465-07818380d4cd" target="test_blank"&gt;https://community.blueprism.com/discussion/bug-using-regex-in-format-expression#bma20a0e34-b004-41b6-8465-07818380d4cd&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;Thanks&lt;BR /&gt;&lt;BR /&gt;------------------------------&lt;BR /&gt;Ben Lyons&lt;BR /&gt;Senior Product Specialist - Decipher&lt;BR /&gt;Blue Prism&lt;BR /&gt;UK based&lt;BR /&gt;------------------------------&lt;BR /&gt;</description>
      <pubDate>Thu, 15 Dec 2022 09:57:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/DFD-definition-for-multiline-REGEX/m-p/69463#M22068</guid>
      <dc:creator>Ben.Lyons1</dc:creator>
      <dc:date>2022-12-15T09:57:00Z</dc:date>
    </item>
    <item>
      <title>RE: DFD definition for multiline REGEX</title>
      <link>https://community.blueprism.com/t5/Product-Forum/DFD-definition-for-multiline-REGEX/m-p/69464#M22069</link>
      <description>I think a newline is missing after ESTADO, each example shows EN at the start of a new line.&lt;BR /&gt;&lt;BR /&gt;&lt;STRONG&gt;(DEMARCACION)[\s\n\r]*(DE)[\s\n\r]*(CARRETERAS)[\s\n\r]*(DEL)[\s\n\r]*(ESTADO EN)[\s\n\r]*(CATALUÑA|CASTILLA-LA MANCHA)&lt;/STRONG&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;STRONG&gt;(DEMARCACION)[\s\n\r]*(DE)[\s\n\r]*(CARRETERAS)[\s\n\r]*(DEL)[\s\n\r]*(ESTADO)[\s\n\r]*(EN)[\s\n\r]*(CATALUÑA|CASTILLA-LA MANCHA)&lt;/STRONG&gt;&lt;BR /&gt;&lt;BR /&gt;------------------------------&lt;BR /&gt;Ben Lyons&lt;BR /&gt;Senior Product Specialist - Decipher&lt;BR /&gt;Blue Prism&lt;BR /&gt;UK based&lt;BR /&gt;------------------------------&lt;BR /&gt;</description>
      <pubDate>Thu, 15 Dec 2022 10:04:00 GMT</pubDate>
      <guid>https://community.blueprism.com/t5/Product-Forum/DFD-definition-for-multiline-REGEX/m-p/69464#M22069</guid>
      <dc:creator>Ben.Lyons1</dc:creator>
      <dc:date>2022-12-15T10:04:00Z</dc:date>
    </item>
  </channel>
</rss>

