The ability to accept and process multiple types of input data simultaneously, such as both images and text in the same request.