Scalable Training of Spatially Grounded 2D Vision-Language Models for Radiology — ThinkLLM