Hi Virender,
There are usually 2 types of PDF documents. PDF documents and PDF Images.
For Documents - it is
usually created using Microsoft Word or Adobe Acrobat, and saved in the read only.pdf format. You can test if your document is truly a PDF document by attempting to copy text from the document using the Windows clipboard.The Image type isoften scanned documents saves as .pdf or .tiff format images. You can't copy text from these images. You can use the ' Reading Text with OCR' technique to extract data. OCR will only work if the image is of a high enough quality, 300dpi is recommended as a minimum.
Once you have captured the PDF document text using one of the techniques outlined above you will need to implement some logic to extract the data you want from the within the text.
Hope this helps.
Thanks,
------------------------------
In Joe Khor
Sr. Product Consultant
Blue Prism
------------------------------
In Joe Khor
Sr. Product Consultant
Blue Prism