The idea is to maximize the similarity between the query and the relevant documents (C_r), while minimizing the similarity between the query and the irrelevant documents (C_nr).
There are two ways to generate a new query: 1. add new terms. 2. reweight the existing query terms.
q_{opt} = \arg\max_q [sim(q, C_r) - sim(q, C_{nr})]
The Rocchio algorithm is an example of incorporating feedback into a modified query. Its effect is to move the query towards the centroid of the relevant documents and away from the centroid of the irrelevant documents.
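To make the centroid picture concrete, here is a minimal sketch of a Rocchio-style update in Python. It assumes the common textbook form q_m = alpha*q + beta*centroid(C_r) - gamma*centroid(C_nr), with queries and documents as term-weight vectors over the same vocabulary. The names `centroid` and `rocchio`, the default weights, and the clipping of negative weights to zero are illustrative choices, not something stated in the notes above.

```python
from typing import List, Sequence

Vector = List[float]


def centroid(docs: Sequence[Sequence[float]], dim: int) -> Vector:
    """Mean of the document vectors; the zero vector if there are none."""
    if not docs:
        return [0.0] * dim
    return [sum(d[i] for d in docs) / len(docs) for i in range(dim)]


def rocchio(
    query: Sequence[float],
    relevant: Sequence[Sequence[float]],
    nonrelevant: Sequence[Sequence[float]],
    alpha: float = 1.0,   # weight of the original query (assumed default)
    beta: float = 0.75,   # pull towards the relevant centroid (assumed default)
    gamma: float = 0.15,  # push away from the non-relevant centroid (assumed default)
) -> Vector:
    """Return the modified query vector."""
    dim = len(query)
    c_r = centroid(relevant, dim)
    c_nr = centroid(nonrelevant, dim)
    modified = [alpha * q + beta * r - gamma * n
                for q, r, n in zip(query, c_r, c_nr)]
    # Clipping negative term weights to zero is a common convention, assumed here.
    return [max(0.0, w) for w in modified]


# Toy vocabulary of three terms; the original query uses only the first term.
new_query = rocchio(
    query=[1.0, 0.0, 0.0],
    relevant=[[1.0, 1.0, 0.0], [0.0, 1.0, 0.0]],
    nonrelevant=[[0.0, 0.0, 1.0]],
    alpha=1.0, beta=0.5, gamma=0.25,
)
print(new_query)  # [1.25, 0.5, 0.0]
```

In this toy run the first term is reweighted (1.0 becomes 1.25), the second term is added to the query because it occurs in the relevant documents, and the third term, which occurs only in the irrelevant document, stays at zero after clipping.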
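Users are often reluctant to provide feedback or to prolong the search interaction. Pseudo-feedback addresses this by assuming that the top-ranked documents are relevant and using them as feedback automatically.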