DUET: Joint Exploration of User–Item Profiles

Anonymous Authors
Paper under double-blind review

Overview


Figure: Overview of DUET. Raw user histories and item metadata are transformed into textual profiles within a shared semantic space, enabling interpretability and alignment with large language models.

DUET aligns raw user and item data by transforming them into textual profiles within a shared semantic space. Traditional recommendation systems use hidden vectors, but we advocate a shift from vectors to text: representing users and items as textual profiles that are directly compatible with LLMs and offer interpretability for downstream agentic systems.

Abstract


Traditional recommendation systems represent users and items as hidden vectors, learning to align them in a shared latent space for relevance estimation. With the advent of large language models (LLMs), we advocate a shift from vectors to text: representing users and items as textual profiles and aligning them in a shared semantic space. Textual profiles are directly compatible with LLMs and offer interpretability for downstream agentic systems.

A key challenge, however, is that the optimal profile format is unknown, and handcrafted templates often misalign with task objectives. We propose DUET, a framework for joint exploration of user–item profile generation in text. The framework operates in three stages: First, raw histories and metadata are distilled into simple cues that capture minimal but informative signals. Second, during a single sequence-to-sequence inference pass, these cues are expanded into richer prompts and then into textual profiles, allowing for the exploration of multiple formats rather than a single creation.

Finally, profiles are optimized jointly via reinforcement learning, where downstream recommendation performance provides feedback to refine and align them. Experiments on three real-world datasets demonstrate that DUET outperforms strong baselines, validating the effectiveness of joint textual profile alignment and the utility of prompt-driven exploration.

Method: DUET Framework


Figure: The DUET framework operates in three stages: (1) Cue-based initialization, (2) Self-prompt construction in a single seq-to-seq pass, and (3) Joint optimization with reinforcement learning (GRPO) using downstream recommendation feedback.

Experimental Results

Figure: Performance comparison across Yelp, Amazon Music, and Amazon Books datasets using Qwen3-8B and LLaMA3-8B. DUET achieves the best accuracy (avg. 64.52%) and F1 score (60.80%), consistently outperforming strong baselines (see Table 1 in the paper).
Figure: Ablation study showing the contribution of each DUET component. Profile alignment yields the largest performance gains (+5.4% on Yelp, +5.95% on Amazon Books, +9.05% on Amazon Music). See Table 2 for details.
Figure: Impact of interaction history length (30, 50, 70 records). Longer histories do not always improve profile quality; results highlight the trade-off between semantic richness and noise (see Table 3).

Case Study: Profile Alignment in Action


Figure: Case study of DUET profile alignment. User preferences (funk, soul, progressive rock, emphasis on technical mastery) semantically correspond to item attributes (funk-rock fusion, polished production, genre-blurring creativity), enabling improved prediction accuracy.