首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Crowdsourcing interactions: using crowdsourcing for evaluating interactive information retrieval systems
Authors:Guido Zuccon  Teerapong Leelanupab  Stewart Whiting  Emine Yilmaz  Joemon M Jose  Leif Azzopardi
Institution:1. Australian e-Health Research Centre, CSIRO, Brisbane, QLD, Australia
2. King Mongkut’s Institute of Technology Ladkrabang, Bangkok, Thailand
3. School of Computing Science, University of Glasgow, Glasgow, UK
4. Microsoft Research, Cambridge, UK
Abstract:In the field of information retrieval (IR), researchers and practitioners are often faced with a demand for valid approaches to evaluate the performance of retrieval systems. The Cranfield experiment paradigm has been dominant for the in-vitro evaluation of IR systems. Alternative to this paradigm, laboratory-based user studies have been widely used to evaluate interactive information retrieval (IIR) systems, and at the same time investigate users’ information searching behaviours. Major drawbacks of laboratory-based user studies for evaluating IIR systems include the high monetary and temporal costs involved in setting up and running those experiments, the lack of heterogeneity amongst the user population and the limited scale of the experiments, which usually involve a relatively restricted set of users. In this paper, we propose an alternative experimental methodology to laboratory-based user studies. Our novel experimental methodology uses a crowdsourcing platform as a means of engaging study participants. Through crowdsourcing, our experimental methodology can capture user interactions and searching behaviours at a lower cost, with more data, and within a shorter period than traditional laboratory-based user studies, and therefore can be used to assess the performances of IIR systems. In this article, we show the characteristic differences of our approach with respect to traditional IIR experimental and evaluation procedures. We also perform a use case study comparing crowdsourcing-based evaluation with laboratory-based evaluation of IIR systems, which can serve as a tutorial for setting up crowdsourcing-based IIR evaluations.
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号