Towards Direct Latent-Space Synthesis for Parallel Branches in LLM-Agent Workflows

Shikun Liu, Mufei Li, Dongqi Fu, Haoyu Wang, Yinglong Xia et al. | June 12, 2026 | arXiv

Key Takeaway

Direct cache-based synthesis lets LLM agents combine the outputs of parallel branches without redundant computation, making multi-agent workflows faster and better aligned with how modern systems actually work.

Summary

This paper introduces Parallel-Synthesis, a framework that lets LLM agents directly process cached outputs from multiple parallel worker branches instead of concatenating text. By working with KV caches directly, it reduces computation time by 2.5-11x while maintaining or improving performance across math, code, and reasoning tasks.
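The saving described above can be pictured with a rough token-count sketch. It is not the paper's implementation: the `BranchCache` class, both function names, and all token counts are hypothetical. It only counts how many tokens the synthesizer would have to prefill, assuming branch caches can be reused as-is, and ignores whatever mapping or adaptation the framework applies to the caches. Prefill token counts are not wall-clock time, so the numbers here should not be read against the reported 2.5-11x figure.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class BranchCache:
    """Hypothetical stand-in for one worker branch's KV cache."""
    name: str
    num_tokens: int  # tokens whose keys/values are already cached


def prefill_tokens_text_concat(branches: List[BranchCache], prompt_tokens: int) -> int:
    # Baseline: branch outputs are turned back into text, concatenated,
    # and the synthesizer re-encodes every token from scratch.
    return sum(b.num_tokens for b in branches) + prompt_tokens


def prefill_tokens_cache_reuse(branches: List[BranchCache], prompt_tokens: int) -> int:
    # Assumed cache reuse: branch keys/values are consumed directly, so only
    # the synthesizer's own prompt still needs a prefill pass.
    # `branches` is kept in the signature only for symmetry with the baseline.
    return prompt_tokens


# Illustrative numbers only, not taken from the paper.
branches = [
    BranchCache("worker_0", 1200),
    BranchCache("worker_1", 900),
    BranchCache("worker_2", 900),
]
baseline = prefill_tokens_text_concat(branches, prompt_tokens=100)
reused = prefill_tokens_cache_reuse(branches, prompt_tokens=100)
print(f"text concatenation: {baseline} prefill tokens")
print(f"cache reuse:        {reused} prefill tokens")
```

In this toy setting the baseline cost grows with the total length of all branches, while the cache-reuse cost depends only on the synthesizer's prompt.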

agents, efficiency, architecture

Key Terms

kv-cache, multi-agent-framework, cache-mapper, synthesizer-adapter, prefill-computation