Self-Refining Agentic Reinforcement Learning for Vision-Conditioned UAV Navigation — ThinkLLM