Foundations and Trends in Information Retrieval ( IF 8.3 ) Pub Date : 2024-6-12 , DOI: 10.1561/1500000098 Krisztian Balog , ChengXiang Zhai
Information access systems, such as search engines, recommender systems, and conversational assistants, have become integral to our daily lives as they help us satisfy our information needs. However, evaluating the effectiveness of these systems presents a long-standing and complex scientific challenge. This challenge is rooted in the difficulty of assessing a system’s overall effectiveness in assisting users to complete tasks through interactive support, and further exacerbated by the substantial variation in user behaviour and preferences. To address this challenge, user simulation emerges as a promising solution.
This monograph focuses on providing a thorough understanding of user simulation techniques designed specifically for evaluation purposes. We begin with a background of information access system evaluation and explore the diverse applications of user simulation. Subsequently, we systematically review the major research progress in user simulation, covering both general frameworks for designing user simulators, utilizing user simulation for evaluation, and specific models and algorithms for simulating user interactions with search engines, recommender systems, and conversational assistants. Realizing that user simulation is an interdisciplinary research topic, whenever possible, we attempt to establish connections with related fields, including machine learning, dialogue systems, user modeling, and economics. We end the monograph with a broad discussion of important future research directions, many of which extend beyond the evaluation of information access systems and are expected to have broader impact on how to evaluate interactive intelligent systems in general.
中文翻译:
用于评估信息访问系统的用户模拟
信息访问系统,例如搜索引擎、推荐系统和会话助理,已经成为我们日常生活中不可或缺的一部分,因为它们帮助我们满足信息需求。然而,评估这些系统的有效性提出了长期且复杂的科学挑战。这一挑战的根源在于评估系统通过交互式支持协助用户完成任务的整体有效性的困难,并且由于用户行为和偏好的巨大变化而进一步加剧。为了应对这一挑战,用户模拟成为一种有前景的解决方案。
本专着的重点是提供对专为评估目的而设计的用户模拟技术的透彻理解。我们从信息访问系统评估的背景开始,探索用户模拟的多样化应用。随后,我们系统地回顾了用户模拟的主要研究进展,涵盖设计用户模拟器、利用用户模拟进行评估的通用框架,以及模拟用户与搜索引擎、推荐系统和会话助手交互的具体模型和算法。认识到用户模拟是一个跨学科的研究课题,只要有可能,我们就会尝试与相关领域建立联系,包括机器学习、对话系统、用户建模和经济学。我们以对未来重要研究方向的广泛讨论来结束本专着,其中许多方向超出了信息访问系统的评估范围,预计将对如何评估一般交互式智能系统产生更广泛的影响。