30-10-23 07:00 AM
I often face this issue where Decipher extract table lines as "|".
I wonder why it is not able to recognize the table structure lines and instead extracting values with "|" pipe characters.
And this is happening for all types of invoices. Sometimes it takes pipe, dots, semicolumns etc for the table lines
Is Decipher able to understand the table structures? Here, it looks like going only with the sample headers.
30-10-23 10:53 AM
Hi Tejaskumar,
This would depend on which version of Decipher as there are table extraction improvements in every release. Also whether the region was manually assigned or trained with that part of the table included in the region.
Thanks
30-10-23 11:15 AM
No this is the raw output without training and the table lines are clear and consistent
30-10-23 11:21 AM
Ok, so what happens after training? And what version are you using?
Thanks
30-10-23 11:27 AM
If I fix for 1 line item then it still keeps doing for other lines and other line fields.
There is no vertical column division per say in Decipher line item extraction. Sometimes it combines 2 line fields in 1.
Version:
30-10-23 02:55 PM
I can't think of any good reason for it, might be worth trying some format expressions to filter it. Or restarting your training and using the best practice guide shared earlier.
Failing that, please raise a support ticket and we can look into it.
Thanks
31-10-23 04:44 PM
Using a format expression will just clean up the data after extraction is performed, right?
Will it make Decipher understand the table lines?
02-11-23 07:59 AM
It doesn't exactly work like that. During the Capture stage Decipher uses Format Expressions to aid the extraction from the OCR data and may help identify the correct characters where there's low confidence.
Thanks