Experimental stimuli. a, b) The videos comprised a diverse set of face stimuli (still, speaking, singing), with the natural voice sound included, from two different models. c) Areas of interest (AOIs) used for analysis. d) Aggregated gaze heatmap of data from all infants across all stimuli, superimposed on one stimulus.