UltraQuant: 4-bit KV Caching for Context-Heavy Agents — ThinkLLM