XML parsing error during Search XML generation in WMS
Posted February 24th, 2012 by ananthan
Project: | RUcore Workflow Management System (WMS) |
Version: | 6.1.1 |
Component: | File Upload Module |
Category: | bug report |
Priority: | normal |
Assigned: | yuyang |
Status: | closed |
Jump to:
Description
Some PDF documents produce XML parsing error. This was tested with Ron's article on the development server and staging server, and the Search XML was created successfully and no XML parsing error was reported by WMS.
A fix was applied directly on the production server. After adding explicit argument for UTF-8 to pdftotext and htmlspecialchars(), the problem in production for OCR xml seems to have gone away.
This change has been added to the code and is being delivered in R6.1.1.
Comments
#1