Validation of a high precision semantic search tool using a curated dataset containing related and unrelated reports of clinically relevant search terms (RPS 1005b)

In: Academic ,<a href="https://www.caring-research.com/category/publications/artificial-intelligence/">Artificial Intelligence , Conference Oral ,<a href="https://www.caring-research.com/category/publications/new-tech-validation/">New Tech Validation , News , Publications
Comments Of: <a href="https://www.caring-research.com/validation-of-a-high-precision-semantic-search-tool-using-a-curated-dataset-containing-related-and-unrelated-reports-of-clinically-relevant-search-terms-rps-1005b/">Jun 1, 2021
By: Caring-Research

Purpose:

To validate a sematic search tool by testing the search results for complex terms.

Methods and materials:

The tool consists of two pipelines: an offline indexing pipeline and a querying pipeline. The raw text from both reports and queries were first passed through a set of pre-processing steps; sentence tokenisation, spelling correction, negation detection, and word sense disambiguation. It was transformed into a concept plane followed by indexing or querying. During querying, additional concepts were added using a query expansion technique to include nearby related concepts. The validation was done on a set of 30 search queries, carefully curated by two radiologists. The reports that are related to the search queries were randomly selected with the help of keyword search and the text was re-read to determine its suitability to the queries. These reports formed the "related" group. Similarly, the reports that were not exactly satisfying the context of the search queries were categorised as the "not related" group. A set of 5 search queries and 250 reports were used for tuning the model initially. A total of 500 reports of the 10 search queries formed the corpus of the test set. The search results for each test query were evaluated and appropriate statistical analysis was performed.

Results:

The average precision and recall rates on 10 unseen queries on a small corpus for respective queries containing related and unrelated reports were 0.54 and 0.42. On a larger corpus containing 60 K reports, the average precision for these 15 queries was 0.6.

Conclusion:

We describe a method to clinically validate a sematic search tool with high precision.