Hi Krishna,
1. NLP is not intended for structured or semi-structured documents, and while I appreciate this may seem "unstructured" it would fall in one of the first 2 categories. This is because the information is laid out in a consistent structure and not unstructured like a contract or agreement.
2. The GPU requirement is to speed up the processing as the NLP model uses a neural network and a regular CPU would take far too long to process the data.
3. Without anchors (e.g. sample headers) near the respective data, it will be very difficult to train consistently. Your best bet is to try using Format Expressions (Regex) for each of the fields. Sometimes you may need to gather more data than needed and format it later in Blue Prism. e.g. get the number 10 and KPL, then remove KPL later.
Thanks
------------------------------
Ben Lyons
Product Consultant
Blue Prism
UK
------------------------------
Ben Lyons
Principal Product Specialist - Decipher
SS&C Blue Prism
UK based