Gradient Boosting within a Single Attention Layer — ThinkLLM