Reinforcement learning with adaptive rewards can significantly improve chart understanding in vision-language models, enabling smaller models to outperform larger ones while reducing inference time by 3x.
Chart-RL uses reinforcement learning to improve how vision-language models understand and answer questions about charts.