✔ Quality of HTML Conversions
Completed by Morgan M.
- Assigned to
-
Anil V.
Ryan K.
- Notes
-
There is an issue that we need to address concerning the quality and formatting of HTML documents that are created through the PDF to HTML converter. For some documents (particularly older PDF - see http://p3lg.tologix.com/Document/EditDocument?DocumentId=318&rand=1530037386217), there are a number of formatting issues that need to be resolved after the conversion is performed. We need to have a discussion on how we can improve the quality of the conversions to minimize the manual interventions before/after the conversion.
Further to Morgan's comments, I think the concern stems from the formatting pulled in from the document. How much control do we have over the formatting during ingest? Here are some examples of odd spacing, line breaks, etc.
Has someone been able to take a look at this?
Thanks!
Ryan
We haven't checked this yet. We will let you know the feedback soon.
To resolve above issue, it may require manual edits in PDF/HTML files to generate proper html. Please suggest.
Thanks,
Morgan
Thanks,
Morgan