The Audio Classification model is pre-trained to detect more than 500 commonly occurring sounds, such as a door opening, a car moving, or a dog barking. The detected audio types include:
Speech
Noise
Music
Hold Sound
Silence
500+ other sounds (contact [email protected] for more info)
This is a Beta API and is undergoing further development. Please reach us at [email protected]
Input Type Supported: Audio
| Fields | Description |
| | Starting time of the chunk in milliseconds |
| | Ending time of the chunk in milliseconds |
| | The transcribed sentence from Marsview STT |
| | Audio type label for the sentence/chunk |
| | Confidence of the audio type label (ranges from 0 to 1). Higher is better |
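As a rough illustration of how the fields described above might be consumed, the sketch below sums the duration of each audio type across chunks and skips low-confidence labels. The key names (`start_time`, `end_time`, `sentence`, `audio_type`, `confidence`), the helper `duration_by_audio_type`, and the sample chunks are all hypothetical placeholders chosen for this example; they are not taken from the Marsview response schema, so adjust them to match the actual output.

```python
from collections import defaultdict

# Made-up sample chunks. The key names are assumptions for illustration only
# and may differ from the field names in the real API response.
sample_chunks = [
    {"start_time": 0, "end_time": 1500, "sentence": "Hello, thanks for calling.",
     "audio_type": "Speech", "confidence": 0.94},
    {"start_time": 1500, "end_time": 4000, "sentence": "",
     "audio_type": "Music", "confidence": 0.81},
    {"start_time": 4000, "end_time": 4600, "sentence": "",
     "audio_type": "Noise", "confidence": 0.35},
    {"start_time": 4600, "end_time": 6000, "sentence": "How can I help?",
     "audio_type": "Speech", "confidence": 0.88},
]


def duration_by_audio_type(chunks, min_confidence=0.5):
    """Return total duration in milliseconds per audio type label.

    Chunks whose confidence is below min_confidence are ignored.
    """
    totals = defaultdict(int)
    for chunk in chunks:
        if chunk["confidence"] < min_confidence:
            continue  # skip labels the model is unsure about
        totals[chunk["audio_type"]] += chunk["end_time"] - chunk["start_time"]
    return dict(totals)


print(duration_by_audio_type(sample_chunks))
# {'Speech': 2900, 'Music': 2500}
```

The 0.5 threshold is arbitrary. Because confidence ranges from 0 to 1 and higher is better, a stricter cutoff trades coverage for fewer mislabeled chunks.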