Training models to process and align two different types of input data (like RGB and infrared images) simultaneously.