Variational bayes for modeling score distributions |
| |
Authors: | Keshi Dai Evangelos Kanoulas Virgil Pavlu Javed A Aslam |
| |
Institution: | (1) College of Computer and Information Science, Northeastern University, 360 Huntington Ave, #202 WVH, Boston, MA 02115, USA;(2) Department of Information Studies, University of Sheffield, Regent Court, 211 Portobello Street, Sheffield, S1 4DP, UK |
| |
Abstract: | Empirical modeling of the score distributions associated with retrieved documents is an essential task for many retrieval
applications. In this work, we propose modeling the relevant documents’ scores by a mixture of Gaussians and the non-relevant
scores by a Gamma distribution. Applying Variational Bayes we automatically trade-off the goodness-of-fit with the complexity
of the model. We test our model on traditional retrieval functions and actual search engines submitted to TREC. We demonstrate
the utility of our model in inferring precision-recall curves. In all experiments our model outperforms the dominant exponential-Gaussian
model. |
| |
Keywords: | |
本文献已被 SpringerLink 等数据库收录! |
|