AJAX Forums

Could We Convert Any PDF File Into An XML One?

This is a discussion on Could We Convert Any PDF File Into An XML One? within the XML and XSLT forums, part of the Beginners AJAX category; If the master pdf file carried a text other than English?Will the resulting text of the xml file be of any use, if one required carrying out some analysis ...


Go Back   AJAX Forums > Beginners AJAX > XML and XSLT

Register FAQ Members List Calendar Search Today's Posts Mark Forums Read
Old 06-21-2007, 11:54 AM   #1 (permalink)
Junior Member
 
Join Date: Jun 2007
Posts: 1
Rep Power: 0 rsintheatre is on a distinguished road
Could We Convert Any PDF File Into An XML One?

If the master pdf file carried a text other than English?Will the resulting text of the xml file be of any use, if one required carrying out some analysis based on specific words of that non-English language.(One hears fonts are not easily extractable in the recent pdf versions)
__________________
rsintheatre is offline   Reply With Quote
Old 06-21-2007, 10:10 PM   #2 (permalink)
Junior Member
 
Join Date: Jun 2007
Posts: 1
Rep Power: 0 Ben S is on a distinguished road
XML is a meta-format, so there is no one 'correct' way to convert a PDF into XML. You'd first have to define an XML document schema, and then decide how the PDF was to be converted. So although technically it is possible, practically there is probably more work to be done. If all you need is the text, there are plenty of text extraction tools available for PDFs if you search the web. Then you can put that data into any XML format you want.
__________________
Ben S is offline   Reply With Quote
Reply

Bookmarks


Currently Active Users Viewing This Thread: 1 (0 members and 1 guests)

 
Thread Tools
Display Modes


Similar Threads

Thread Thread Starter Forum Replies Last Post
How can I convert excel file to XML file formate??? shiva XML and XSLT 1 11-23-2007 05:02 PM
How do i convert a video file which is in html format to wmv format? crsporty XHTML and CSS 1 06-25-2007 01:28 PM
how to find file path and infor mation of file using java program in local system.? rajeswa r JavaScript 1 05-21-2007 01:21 AM
how do you convert a word file to a .xml or .txt file? boobie XML and XSLT 1 05-12-2007 10:37 AM
how can i convert a .doc file to .xml file using Java? Kunal Shah XML and XSLT 0 04-20-2007 07:25 PM


All times are GMT -4. The time now is 12:56 PM.


Powered by vBulletin® Version 3.7.3
Copyright ©2000 - 2009, Jelsoft Enterprises Ltd.
SEO by vBSEO 3.2.0 RC5
Copyright ©2006 - 2008, AJAXwith.com