This framework solves two critical problems in motion-controlled video generation: disentangling camera from object motion, and modeling causal interactions between objects so actions produce realistic consequences rather than just pixel displacement.
MoRight enables users to generate videos where objects move realistically and interact with each other, while freely choosing the camera angle. It separates object motion control from camera control and learns how objects causally affect each other—so when you push one object, others react naturally rather than just shifting pixels around.