XML parsing error during Search XML generation in WMS

Project:RUcore Workflow Management System (WMS)
Version:6.1.1
Component:File Upload Module
Category:bug report
Priority:normal
Assigned:yuyang
Status:closed
Description

Some PDF documents produce XML parsing error. This was tested with Ron's article on the development server and staging server, and the Search XML was created successfully and no XML parsing error was reported by WMS.

A fix was applied directly on the production server. After adding explicit argument for UTF-8 to pdftotext and htmlspecialchars(), the problem in production for OCR xml seems to have gone away.

This change has been added to the code and is being delivered in R6.1.1.

Comments

#1

Version:6.1» 6.1.1

Back to top