A heatmap is a graphical representation of data using colors that represent different values. It's often used to demonstrate user behavior on a particular web page.
Abstract: Multi-modal large language models have demonstrated impressive performances on most vision-language tasks. However, the model generally lacks the understanding capabilities for specific ...