PDF to XML
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content
08-02-23 06:35 AM
------------------------------
Aiman Nishat
------------------------------
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content
08-02-23 07:53 AM
Would opening the PDF in Word, and then having Word save the document in .XML format do the trick?
------------------------------
Happy coding!
---------------
Paul
Sweden
------------------------------
Paul, Sweden
(By all means, do not mark this as the best answer!)
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content
15-02-23 07:31 AM
Hi @AimanNishat ,
One way of achieving this is by leveraging code stage in blue prism.
1. Makes sure below namespaces are imported in your object
Imports System
Imports System.Collections.Generic
Imports System.Text
Imports Bytescout.PDFExtractor
Imports System.Diagnostics
2. Call below code in your code stage
Dim extractor As New XMLExtractor()
extractor.RegistrationName = "demo"
extractor.RegistrationKey = "demo"
' Load sample PDF document
extractor.LoadDocumentFromFile("test.pdf")
extractor.SaveXMLToFile("output.XML")
------------------------------
Sonam Sharma
------------------------------
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content
15-02-23 08:51 PM
Hi @Sonam Sharma ,
Can you please share the dll file of PDFExtractor as well?
------------------------------
Manpreet Kaur
Manager
Deloitte
*If you find this post helpful mark it as Best Answer
------------------------------
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content
16-02-23 05:33 AM
PFA !
------------------------------
Sonam Sharma
Manager, Blue Prism
SS&C
------------------------------
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content
16-02-23 05:40 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content
16-02-23 07:51 AM
Great !!
Thanks @Sonam Sharma !!
Any specific guidelines to use the attached code?
------------------------------
Manpreet Kaur
Manager
Deloitte
*If you find this post helpful mark it as Best Answer
------------------------------
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content
17-02-23 06:47 AM
Hi Sonam,
I have imported all the namespace and added attached dll and getting below error could you please help.
Internal : Could not execute code stage because exception thrown by code stage: Could not load file or assembly 'Bytescout.PDFExtractor, Version=13.3.0.4514, Culture=neutral, PublicKeyToken=f7dd1bd9d40a50eb' or one of its dependencies. The system cannot find the file specified.
------------------------------
Saumitra Kumar Sharma
------------------------------
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content
20-02-23 05:41 AM
Did you place the dll file at location "C:\Program Files\Blue Prism Limited\Blue Prism Automate" ?
------------------------------
Sonam Sharma
Manager, Blue Prism
SS&C
------------------------------
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content
20-02-23 06:53 AM
Hi @Sonam Sharma It's already resolved Thanks
------------------------------
Saumitra Kumar Sharma
------------------------------
