Time is a learnable visual concept: models can be trained to perceive temporal changes in videos and to use that understanding to generate or transform videos with precise control over playback speed and temporal detail.
This paper teaches AI models to understand and control the passage of time in videos. The researchers develop self-supervised models that detect when videos are sped up or slowed down, then use this capability to build a large dataset of slow-motion videos.
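One common way to set up self-supervised speed detection is to resample a video at a known stride and use that stride as a free training label. The sketch below illustrates this general idea only. The summary does not describe the paper's actual training procedure, so the function name `make_speed_example`, the speed set `(1, 2, 4)`, and the clip length are all hypothetical choices made for illustration.

```python
import random

def make_speed_example(frames, speeds=(1, 2, 4), clip_len=8, rng=None):
    """Build one (clip, label) pair for a speed-classification pretext task.

    frames   : sequence of frames (any objects, e.g. arrays or file paths)
    speeds   : candidate speed-up factors (hypothetical choice, not from the paper)
    clip_len : number of frames in the returned clip
    label    : index into `speeds` telling which factor was applied
    """
    rng = rng or random.Random()
    label = rng.randrange(len(speeds))
    stride = speeds[label]

    # Number of source frames the clip covers at this stride.
    span = (clip_len - 1) * stride + 1
    if span > len(frames):
        raise ValueError("video too short for the chosen speed and clip length")

    # Pick a random start, then keep every `stride`-th frame.
    start = rng.randrange(len(frames) - span + 1)
    clip = [frames[start + i * stride] for i in range(clip_len)]
    return clip, label


# Usage: integers stand in for frames so the sampling pattern is visible.
frames = list(range(100))
clip, label = make_speed_example(frames, rng=random.Random(0))
print(clip, "speed factor:", (1, 2, 4)[label])
```

A classifier trained to predict `label` from `clip` needs no human annotation, because the label comes from the sampling step itself. Such a classifier could in principle then be run over unlabeled footage to flag clips whose playback speed differs from real time, which is the kind of capability the summary describes using to collect slow-motion videos.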