April 2026Qihang Yu

ThinkRec: Thinking-based Recommendation via LLM

LLMRecommendation SystemReasoning ModelWWW 2026

We are excited to share ThinkRec, our work on thinking-based recommendation with large language models, accepted to The Web Conference (WWW) 2026.

Modern recommender systems excel at predicting what users may prefer next, but they often struggle to answer a fundamental question:

Why is this recommendation made?

Despite the recent success of LLM-based recommendation, most existing approaches still rely heavily on implicit pattern matching. They learn correlations from user interactions but rarely perform explicit reasoning about user preferences, limiting both recommendation quality and user trust.

ThinkRec takes a different perspective: instead of treating recommendation as a pure matching problem, we frame it as a reasoning problem.

From matching to thinking

Current LLM recommenders largely resemble System 1 thinking—fast, intuitive, and driven by surface similarities. However, users with similar behaviors may have very different underlying motivations and preferences.

ThinkRec enables LLMs to think before they recommend through two key components:

  • Thinking Activation — constructing reasoning traces that connect user history, preference analysis, and recommendation decisions, encouraging explicit reasoning during training.
  • Instance-wise Expert Fusion — dynamically combining personalized LoRA experts based on user characteristics, balancing global knowledge with fine-grained personalization.

Together, these components allow ThinkRec to move beyond simple behavior matching and generate recommendations that are both more accurate and more interpretable.

Figure 1. Overview of ThinkRec, including thinking activation and instance-wise expert fusion.

What do the experiments show?

MethodsML1M AUCML1M UAUCML1M NDCG@5ML1M MAP@5Yelp AUCYelp UAUCYelp NDCG@5Yelp MAP@5Book AUCBook UAUCBook NDCG@5Book MAP@5
MF0.64010.60790.72860.45200.58380.53890.81200.25520.65920.55270.68050.2887
LightGCN0.61400.62300.73330.46000.53600.51790.80760.25200.56220.49850.64060.2598
SASRec0.69560.66870.76630.47470.61840.60960.85640.27850.54110.51970.65500.2701
gSASRec0.70430.66900.76500.46930.62000.60510.85860.27920.61870.55460.68580.2964
Prompt4NR0.69360.64330.76650.46520.62720.60340.83480.27050.67640.56990.70230.3048
TALLRec0.68720.65530.76830.47060.53340.52060.79880.25380.66320.55680.70230.3049
CoLLM0.71410.66720.75850.46470.63730.59610.84200.27340.78300.56720.69170.2968
Ours0.77640.67750.77470.47740.69550.60650.85850.28260.83020.57050.68580.2977

We evaluate ThinkRec on three widely used recommendation benchmarks: MovieLens-1M, Yelp, and BookCrossing, comparing against strong traditional and LLM-based baselines.

ThinkRec consistently achieves the best overall performance across ranking metrics such as AUC, UAUC, NDCG@5, and MAP@5. In particular, it improves upon the previous state-of-the-art CoLLM by:

  • +8.72% AUC on MovieLens-1M
  • +9.13% AUC on Yelp

Beyond recommendation accuracy, ThinkRec also produces higher-quality explanations. Compared with existing LLM-based methods, it achieves average relative improvements of:

  • +56.54% on METEOR
  • +23.35% on BLEURT

These results suggest that explicit reasoning not only improves recommendation performance but also enhances transparency and user trust.

MethodsML1M METEORML1M BLEURTYelp METEORYelp BLEURTBook METEORBook BLEURT
Prompt4NR0.00100.20130.02050.16750.00030.1957
TALLRec0.02750.26070.03790.24200.03010.1931
CoLLM0.00030.16260.00010.17850.00970.1636
Ours0.03330.31040.06160.26830.05460.2828

Further analysis reveals that reasoning and personalization are highly complementary. Removing either the thinking component or the expert fusion mechanism leads to noticeable performance degradation, highlighting the importance of combining explicit reasoning with personalized modeling.

Why does it matter?

We believe recommendation systems are evolving from:

predicting what users may click

to:

understanding why users make choices.

ThinkRec represents a step toward this transition by integrating reasoning directly into the recommendation process. More broadly, the idea of combining explicit reasoning with dynamic expert selection may extend beyond recommendation to areas such as expert routing, personalized AI assistants, and multi-agent systems.

The complete implementation, including data preprocessing, reasoning construction, expert fusion, training scripts, and evaluation code, is available on GitHub.

If you are interested in LLM-based recommendation, personalized reasoning, or expert routing, we would love to hear your feedback.

Further reading