Marginal Advantage Accumulation for Memory-Driven Agent Self-Evolution — ThinkLLM