Building trustworthy personal assistants requires more than good GUI navigation—agents must actively learn user preferences through dialogue and make smart decisions about when to intervene, which current models struggle with even at the frontier.
KnowU-Bench is a new benchmark for evaluating mobile agents that must learn user preferences through interaction and decide when to proactively help.