Human interaction rests on a shared language, a shared context, and shared world knowledge. As a Voice AI company, we know that emotion is the key factor. Emotional expression moves us and drives a collective response. It plays a central role in society and is the basis for the decisions we make. In creating virtual reality, new dimensions, and augmented experiences, emotion cannot be missing.
Data collection is an important issue in machine learning research. If we use YouTube as a resource, do we really know what we are getting? More fundamentally, do we really know what’s in our audio data?