MAS-PromptBench: When Does Prompt Optimization Improve Multi-Agent LLM Systems? — ThinkLLM