SpecRLBench: A Benchmark for Generalization in Specification-Guided Reinforcement Learning — ThinkLLM