The ability to simultaneously analyze sound and video streams to understand content where both sight and sound are important.