From Layers to Submodules: Rethinking Granularity in Replacement-Based LLM Compression — ThinkLLM