The amount of time it takes for a model to process input and generate output after it has been trained.