Opened 15 years ago
Last modified 15 years ago
#35 new enhancement
Parse XML Document Formats
|Reported by:||Joseph Connors||Owned by:|
With MS Office now defaulting to XML based documents, we should take advantage of that to read keywords out of the document for better classification. Though XML is just structured text, OOXML and OpenDoc are zipped. So mostly we just need to unzip the file. Our parser should deal with XML decently I think.
Change History (0)
Note: See TracTickets for help on using tickets.