Automated reproducibility assessments in the social and behavioral sciences using large language models — ThinkLLM