✔ PDF quality test for document upload
Completed by Paul M.
- Assigned to
-
Guillaume M.
Harsh P.
Jitesh D.
Melissa C.
- Notes
-
As part of the document upload process within ISLG (and ILG) we need integrate a PDF quality test to ensure the PDF document meets certain quality thresholds before it is converted into an HTML document. The premise would be that if the PDF quality threshold isn't met, the HTML conversion is cancelled, and the admin user is prompted to send the document for manual HTML conversion.
It is proposed that we PDF quality test tool used earlier this year for this purposed (see Message Board - TOLOGIX - PDF to HTML Conversion).
Thanks,
Morgan
In addition to the question above, would it be possible for you to provide more detail on how the PDF Quality tool works, and what exactly it is testing to determine the quality score? The reason I ask is that while
BIT/0020 Canada - Venezuela BIT (1996)
Please see questions above.
We are checking PDF's font pixel in PDF quality tool. In above sample the PDF's quality is not good but the font appear in PDF is good like we can clearly read and bold.
How long does it take on average to assess a quality score in the PDF quality tool?
Mel
Its depend on PDF File's pages. But you can consider average 20 Seconds to assess a quality score.
If the documents above are getting a 100% quality score, but are in fact insufficient quality for automated conversion, there is a problem with the PDF quality tool. The whole purpose of the tool is to allow us to identify which documents are of sufficient quality for automated conversion.
Morgan
Currently, PDF quality tool does identify whether a document is sufficient to be converted into HTML or not by our conversion tools, it does not mean if quality of 100% document will convert 100% of document quality.
Quality tool is used to check whether it is capable to convert by our algorithm or not. Yes definitely most of case if document quality is 100% than our algorithm is capable to convert into HTML but in some cases it might be quality should be not 100%, the reason behind is content of PDF document and its format ( indentation, style, bullets, numbering ) etc.
Our main target of quality tools it was, The premise would be that if the PDF quality threshold isn't met, the HTML conversion is cancelled, and the admin user is prompted to send the document for manual HTML conversion.
Also we are continuously working on conversion tool to improve quality of document.
Thanks,
Jitesh
Thanks,
Morgan
Would it be possible to a response to my questions in my previous comment. Note that I'm adding this to the agenda for Thursday meeting.
Morgan
Current conversation algorithm is capable to identify whether document is converted to html or not, if not it will update status as a "conversion fail". Further we can sent it to manual conversion to html.
Our team is working on improving conversion algorithm to accomplish all our requirements and we are sure to achieve all your needs.
Still if you planning to integrate quality tool to display score of document quality please provide user story and updated wire frame in new ISLG and ILG so we will plan accordingly. Let's we will discuss more about this on Thursday call.
Thanks,
Jitesh
Morgan