Complex Table without Headers and dynamic width

elopez · ‎13-07-22

Hi guys,

I have a problem getting the data from a document because:
- Vertically it has dynamic titles (dates), but horizontally the titles are always the same.
- The number of columns is dynamic: minimum 8, maximum 13.
- The table is centered in the PDF, so its starting position varies between documents.
- There are small variations in the distances between columns from one document to another.

My first try:
- A DFD with a table of 12 text-type columns: 1 column for the horizontal headers (including extra data such as the units that some titles have) and the rest for the data.
- Decipher initially picked up the information well, but then it started skipping lines and merging the date headers with the adjacent column.

Second try:
- A DFD with a table of 13 text-type columns: 1 column for the horizontal headers, another column for the extra data, and the rest for the data.
- Decipher initially picked up the information well, but then it started skipping lines and merging the date headers with the adjacent column.

Third try:
- A DFD with 3 tables: one for the dynamic date headers (text columns), another for the temperatures (1 text column and the rest numeric), and another for the demand (1 text column and the rest numeric).
- ExactsRows and ButtomStop were defined.
- Decipher works if the document is trained, but if I use another document with some variation in width or number of columns, everything has to be completed again. With the flags above, the tables are not auto-completed, and training the documents takes time.

How do you recommend I train these documents?

Ben.Lyons1 · ‎18-07-22

Hi Elbio,

I don't think Decipher currently supports this use case.

It is designed to have static columns and dynamic rows, but in this instance that concept is reversed. Decipher can't be trained to recognise a table consistently with this format.
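To make the "reversed" layout concrete, here is a minimal Python sketch of the reshaping involved. It is not a Decipher feature and does not solve the extraction itself; it assumes the table cells have already been read into a list of rows by some other means. The function name `transpose_table` and the sample dates and values are hypothetical, used only for illustration.

```python
from typing import Dict, List


def transpose_table(grid: List[List[str]]) -> List[Dict[str, str]]:
    """Turn a grid with fixed row labels and a variable number of date
    columns into one record per date (static fields, dynamic rows)."""
    header, *body = grid
    dates = header[1:]  # first header cell sits above the row labels
    records = []
    for col, date in enumerate(dates, start=1):
        record = {"date": date}
        for row in body:
            record[row[0]] = row[col]  # row[0] is the fixed row label
        records.append(record)
    return records


# Illustrative data only: the real documents have 8 to 13 columns.
grid = [
    ["", "01-07-22", "02-07-22", "03-07-22"],
    ["Temperature", "21", "23", "19"],
    ["Demand", "340", "365", "310"],
]
records = transpose_table(grid)
```

After this step each date becomes a row with the same fixed set of fields, which is the static-columns, dynamic-rows shape described above.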

It's something I've seen once before and I imagine there are more use cases, so I would recommend submitting your idea via the Innovate button at the top of the page. The product manager reviews all suggestions throughout our development cycle.

Thanks

Ben

Ben Lyons
Principal Product Specialist - Decipher
SS&C Blue Prism
UK based
