Tracking eye movements to uncover the nature of visual-linguistic interaction in static and dynamic scenes