RevengeBench: Reverse Engineering Code-Space Policies from Behavioral Experiments — ThinkLLM