Meituan Merchant Business Diagnosis via Policy-Guided Dual-Process User Simulation

Ziyang Chen, Renbing Chen, Daowei Li, Jinzhi Liao, Jiashen Sun et al.|April 16, 2026arXiv

Key Takeaway

Combining reasoning-based and learning-based simulation through a shared policy layer reduces errors by ~45%, showing that hybrid approaches work better than either method alone for predicting real-world user behavior.

Summary

This paper presents a system for simulating how groups of users behave on a food delivery platform (Meituan) to test merchant strategies without real experiments. It combines two approaches—one that reasons through decisions logically and another that learns statistical patterns—using shared decision policies as a bridge between them.

agents reasoning applications

Key Terms

user-simulator counterfactual-evaluation dual-process policy-mining group-level-simulation