Is anyone using Decipher in non-alphabetic languages?

sumire · ‎15-08-24

Hello,

If anyone uses Decipher in non-alphabetic language, please let me know what improvements you have made to OCR.
I am using Decipher for Japanese, but I am at a loss to learn that the accuracy of character recognition is poor and that Decipher cannot be trained for OCR.

------------------------------
Mitsuko
Asia/Tokyo
------------------------------

steven.boggs · ‎15-08-24

Which version of Decipher IDP are you currently using?

The release of v2.3 increased the number of supported languages and currencies to include Japanese, Chinese, and Korean. With this release, Decipher IDP now supports more than 30 languages -- if you're not doing so already, have you tried using the latest release to see if this improves the character recognition?

sumire · ‎16-08-24

Hello @steven.boggs ,

I use v2.3, it can read Japanese.
However, as you know, there are several types of characters in Japanese, and to be specific, the accuracy of katakana is particularly poor. (ex. "ク", "グ" and "ケ")
In order to use Decipher for business purposes, this problem must be cleared.

------------------------------
Mitsuko
Asia/Tokyo
------------------------------

Sangjun · ‎24-09-24

Hello, sumire sensei.

I am using Decipher ver 2.3 in Korean language. I feel a level of recognition of Korean Letters is good, but the recognition for numbers with Korean Letters is not good. Especially, when dealing with the scanned images.

I wonder there are some ways to enhance the recognition when handling numbers.

sumire · ‎24-09-24

Hello @Sangjun ,

I'm lucky to meet you who has a similar problem.
It seems like it's hard to recognize the Korean numbers, too.
(ex. 7 "칠" and 8 "팔")

If we list the characters that are hard to recognize in each language, will it help you to improve Decipher? @steven.boggs

------------------------------
Mitsuko
Asia/Tokyo
------------------------------

Sangjun · ‎24-09-24

Dear @sumire

My mention means when below

"안녕하세요3780912" from scanned images or papers

IDP successfully capture "안녕하세요", but cannot perfectly capture "3780912" ,
it captures like "3ㄱ80912" .

So, I am testing many conditions of the scanned images when it comes to the resolution, image quality, colored or mono(blank and white image)

sumire · ‎24-09-24

Hello @Sangjun ,

Even in Japanese capturing, there are cases similar to the case you exemplified in Korean letters.
エーエフハチビーニ captures 上ー工フハナヒ""ー二
Decipher often mistakenly captures katakana into symbols or kanji.

------------------------------
Mitsuko
Asia/Tokyo
------------------------------

SS&C Blue Prism Community

Is anyone using Decipher in non-alphabetic languages?