Internal memory structures in transformers that store computed representations to speed up inference and enable agent communication.