TOLOGIX - ISLG App Rebuild

IC/0274/01 paragraph numbers not displayed under FTS

Assigned to
Martin Laporte, CTO at Tologix Martin L.

Comments & Events

Piyush Kanpariya, DevIT
Hi Radomir Mladenovic, Contegra Radomir ,

Please look into the above issue. The document's HTML file is below.

Gambrinus Corporation v. Bolivarian Republic of Venezuela, ICSID Case No. ARB/11/31, Award, 15 June 2015

File:


Request Body:
{
   "searchRequest":"challenge w/10 nationality",
   "FilterStatement":{
      "type":"boolean",
      "Operator":"and",
      "clauses":[
         {
            "type":"boolean",
            "Operator":"or",
            "clauses":[
               {
                  "type":"boolean",
                  "Operator":"and",
                  "clauses":[
                     {
                        "type":"match",
                        "field":"DocumentContentTypeId",
                        "values":[
                           "13"
                        ]
                     }
                  ]
               },
               {
                  "type":"boolean",
                  "Operator":"and",
                  "clauses":[
                     {
                        "type":"match",
                        "field":"DocumentContentTypeId",
                        "values":[
                           "37"
                        ]
                     },
                     {
                        "type":"match",
                        "field":"Field_DocumentTypeId",
                        "values":[
                           "1080",
                           "1078",
                           "1074",
                           "1071",
                           "1070",
                           "1068",
                           "1066"
                        ]
                     }
                  ]
               },
               {
                  "type":"boolean",
                  "Operator":"and",
                  "clauses":[
                     {
                        "type":"match",
                        "field":"DocumentContentTypeId",
                        "values":[
                           "12"
                        ]
                     }
                  ]
               }
            ]
         }
      ]
   },
   "SearchType":"Boolean",
   "Stemming":true,
   "Synonyms":false,
   "Fuzzy":true,
   "Fuzziness":"1",
   "SortField":"hits",
   "SortOrder":"desc",
   "PageNum":0,
   "PageSize":"20"
}

cc: Harsh Parikh, Tech Lead at DevIT Harsh; Martin Laporte, CTO at Tologix Martin
Radomir Mladenovic, Contegra
Hi Piyush Kanpariya, DevIT Piyush , I checked this document and paragraphs. I don't see a case where 
"challenge" and "nationality" appear in the same paragraph. That's why you don't get paragraphs back. 
Harsh Parikh, Tech Lead at DevIT
Hi Radomir Mladenovic, Contegra Radomir ,

As per the video by Paul Moon Paul, the paragraph is not listed for one particular document. If the keywords do not match, why is the document returned by the API without a paragraph number?

Please see Paul Moon Paul's video above: the word is highlighted in the HTML document's paragraphs, but those paragraphs of the same document are not fetched in FTS.

Cc : Martin Laporte, CTO at Tologix Martin  
Radomir Mladenovic, Contegra
Harsh Parikh, Tech Lead at DevIT Harsh, the document is returned because the document itself matches: it probably has the keywords in paragraphs close to each other, but not in the same paragraph.
I saw the video, but what's asked for, returning paragraphs 119 and 120, is not doable because paragraphs are indexed separately. If you prefer, we could return paragraphs containing ANY keyword used in the proximity query. That may return many paragraphs where one keyword appears but is totally unrelated to the other one, so you would get a lot of noise.
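To illustrate the behaviour described above, here is a minimal Python sketch. It is not the actual search engine: the paragraph texts, the `tokenize` and `within` helpers, and the exact-word matching (no stemming or fuzziness) are all assumptions made up for this example. It only shows why a `w/10` proximity query can match at document level while no single paragraph matches, and what the "ANY keyword" alternative would return.

```python
import re

def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())

def within(tokens, a, b, distance):
    """True if words a and b occur within `distance` tokens of each other."""
    pos_a = [i for i, t in enumerate(tokens) if t == a]
    pos_b = [i for i, t in enumerate(tokens) if t == b]
    return any(abs(i - j) <= distance for i in pos_a for j in pos_b)

# Hypothetical paragraph texts; the real paragraphs 119 and 120 are not quoted here.
paragraphs = {
    119: "The Respondent raised a challenge to jurisdiction.",
    120: "Its objection concerned the nationality of the Claimant.",
}

# Document-level index: all paragraphs are treated as one token stream,
# so a proximity match may span a paragraph boundary.
doc_tokens = [t for text in paragraphs.values() for t in tokenize(text)]
doc_match = within(doc_tokens, "challenge", "nationality", 10)

# Paragraph-level index: each paragraph is checked on its own,
# so both keywords must fall inside the same paragraph.
para_matches = [
    pid for pid, text in paragraphs.items()
    if within(tokenize(text), "challenge", "nationality", 10)
]

# Alternative discussed above: return paragraphs containing ANY keyword.
keywords = {"challenge", "nationality"}
any_keyword = [
    pid for pid, text in paragraphs.items()
    if keywords & set(tokenize(text))
]

print(doc_match, para_matches, any_keyword)
```

In this sketch the document matches, the paragraph-level list is empty, and the ANY-keyword list contains both paragraphs. In a long document the last list would also pick up every unrelated paragraph that mentions only one of the two words, which is the noise mentioned above.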
Paul Moon
Hi Martin Laporte, CTO at Tologix Martin :

We need to discuss how we can address this type of full text search result, as it will be problematic for other full text searches. What Radomir Mladenovic, Contegra Radomir is suggesting above does not sound ideal to me, as it does not sound very accurate. Let me know when you can discuss this.

Thanks,

Paul
Paul Moon
Thanks, Martin Laporte, CTO at Tologix Martin. Is there an interim solution in the meantime (e.g., display para. 119 only)?
Martin Laporte, CTO at Tologix
I'm not sure if we can find a reliable interim solution.

Harsh Parikh, Tech Lead at DevIT Harsh and Radomir Mladenovic, Contegra Radomir : could the document-level index return the list of paragraph IDs where the matches were found?
Radomir Mladenovic, Contegra
Martin Laporte, CTO at Tologix Martin   the document-level info does not have information about paragraphs so it cannot return that info.
Paul Moon
Then, is "Click on Full Case & Analysis" an option we can display instead of nothing?
Martin Laporte, CTO at Tologix
Hi Paul Moon Paul ,

I think your suggestion might be our best approach.

Harsh Parikh, Tech Lead at DevIT Harsh : could you implement Paul's suggestion above, where if there are no paragraphs listed, we put a generic link "Click on Full Case & Analysis"?

Thanks,
-Martin
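A minimal sketch of the fallback requested above, assuming each search result is a dictionary with a hypothetical `paragraphIds` field (the real API response shape is not shown in this thread, and the `Para. N` label format is also an assumption):

```python
FALLBACK_LABEL = "Click on Full Case & Analysis"

def paragraph_links(result):
    """Return the link labels to show under one search result.

    `result` is assumed to be a dict with an optional "paragraphIds" list
    (hypothetical field name). When no paragraphs are listed, a single
    generic link is returned instead of nothing.
    """
    paragraph_ids = result.get("paragraphIds") or []
    if paragraph_ids:
        return [f"Para. {pid}" for pid in paragraph_ids]
    return [FALLBACK_LABEL]

print(paragraph_links({"paragraphIds": [12, 45]}))
print(paragraph_links({"paragraphIds": []}))
```

The second call covers the case in this ticket: the document matches the query but no paragraph does, so the generic link is shown.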
Harsh Parikh, Tech Lead at DevIT
Nafiseh Arghandehpour, Legal Content Manager at ISLG Nafiseh, please ignore my comment above in this thread. I posted it in the wrong thread.
Paul Moon
Hi Martin Laporte, CTO at Tologix Martin and Harsh Parikh, Tech Lead at DevIT Harsh :

I see DevOps 1615 is a research item, but do we need a separate item to incorporate an interim solution below?

Thanks,

Paul
Paul Moon
This looks good. Thanks, Piyush Kanpariya, DevIT Piyush .

Paul
Paul Moon
Paul Moon completed this to-do.