Breaking the Filter Bubble: A Semantic Pareto-DQN Framework for Multi-Objective Recommendation
Breaking the Filter Bubble: A Semantic Pareto-DQN Framework for Multi-Objective Recommendation
打破信息茧房:用于多目标推荐的语义 Pareto-DQN 框架
Abstract: Recommender systems often induce filter bubbles and semantic homogenization by monolithically optimizing for immediate user engagement. Standard single-objective models, including traditional Deep Q-Networks, are ill-equipped to navigate the trade-offs between platform retention and critical societal values like information diversity and provider fairness.
摘要: 推荐系统往往通过单一地优化即时用户参与度,从而导致信息茧房(filter bubbles)和语义同质化。标准的单目标模型(包括传统的深度 Q 网络)难以在平台留存率与信息多样性、提供者公平性等关键社会价值之间取得平衡。
To address these limitations, we introduce a multi-objective reinforcement learning framework that formalizes recommendation as a semantic multi-objective Markov decision process. By integrating high-fidelity semantic embeddings with a Pareto-DQN agent, our architecture treats engagement, diversity, and fairness as distinct, non-aggregable reward signals, avoiding the pitfalls of static reward scalarization.
为了解决这些局限性,我们引入了一种多目标强化学习框架,将推荐过程形式化为语义多目标马尔可夫决策过程。通过将高保真语义嵌入与 Pareto-DQN 智能体相结合,我们的架构将参与度、多样性和公平性视为独立且不可聚合的奖励信号,从而避免了静态奖励标量化带来的缺陷。
Empirical evaluations on the MovieLens small dataset shows that our hypervolume based action selection disrupts the feedback loops responsible for semantic collapse. By sustaining high state-trajectory variance, the Pareto-DQN effectively maps the Pareto frontier, achieving gains in auxiliary societal objectives with only marginal impacts on engagement. This work provides a path toward intrinsically aligned, responsible recommender systems.
在 MovieLens 小型数据集上的实证评估表明,我们基于超体积(hypervolume)的动作选择机制打破了导致语义崩溃的反馈循环。通过维持高状态轨迹方差,Pareto-DQN 有效地映射了 Pareto 前沿,在仅对参与度产生微小影响的情况下,实现了辅助社会目标的提升。这项工作为构建内在对齐、负责任的推荐系统提供了一条路径。
Paper Details:
- Authors: Cláudio Lúcio Do Val Lopes, Lucca Machado da Silva, André de Oliveira Brandão
- Subject: Artificial Intelligence (cs.AI)
- Date: 23 Jun 2026
- arXiv ID: 2606.24042
论文详情:
- 作者: Cláudio Lúcio Do Val Lopes, Lucca Machado da Silva, André de Oliveira Brandão
- 学科: 人工智能 (cs.AI)
- 日期: 2026年6月23日
- arXiv ID: 2606.24042