Hi All,
So far as I'm aware there's no actual limit on number of documents/files per batch, but we do recommend a lower count for performance and manual verification reasons. Performance will of course vary depending on your infrastructure, so this can be locally tested to see what works best for your environment. Though I would still consider it from the view of the person who may be manually verifying batches and a batch of 50+ could be more onerous than a batch of 10 or 15 documents (at least during training).
How many vendors should you train?
Well that's a tricky question, but one we can help you answer and the best practice is in the process of being updated for release with v2.2. In the meantime, it's not so much about how many vendors, more how different the layouts are. If every vendor uses the same headers for fields, the tables are all in the exact same layout and the image quality is high, then you won't need to train many in Development.
Decipher is designed to learn as it goes in Production, but you'll want to be sure the DFD configuration is as near to perfect as possible because changes in Production are not advisable. Additionally, you would potentially have difficulty manually verifying thousands of documents in Production, so you'd want your Training Data to be able to handle enough documents automatically that it doesn't create a lot of extra work.
Finally you'll want to be confident that all the auto-captured data is correct, so you'll need to be sure that during UAT no false positive values are sent back to the Blue Prism process. This will be an opportunity to utilise the many validation features available in Decipher, such as Format Expressions, Formulas and Validation Lists.
Does that help?
Thanks
Ben
Ben Lyons
Principal Product Specialist - Decipher
SS&C Blue Prism
UK based