Audio Classification

This API classifies the sounds in an audio file into one of more than 500 available classes.

Overview
The Audio Classification model is pre-trained to detect more than 500 commonly occurring sounds, such as a door opening, a car moving, or a dog barking.
Audio Classification Types
Speech
Noise
Music
Hold Sound
Silence
500+ other sounds (contact [email protected] for more info)
This is a Beta API and is undergoing further development. Please reach us at [email protected]
Input Type Supported: Audio
Response Object Fields
Fields	Description
`start_time`	Start time of the chunk in milliseconds
`end_time`	End time of the chunk in milliseconds
`sentence`	The transcribed sentence from Marsview STT
`audio_type`	Audio type label for the sentence/chunk
`confidence`	Confidence of the audio type label (ranges from 0 to 1). Higher is better
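As a rough illustration of how the response fields above might be consumed, the sketch below totals the duration of each audio type and ignores low-confidence chunks. The shape of the payload (a list of objects, one per chunk), the sample values, the `AudioChunk` class, the `duration_by_type` helper, and the 0.5 threshold are all assumptions made for this example; only the five field names come from the table.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class AudioChunk:
    """One classified chunk, mirroring the documented response fields."""
    start_time: int    # milliseconds
    end_time: int      # milliseconds
    sentence: str      # transcript text, may be empty for non-speech chunks
    audio_type: str    # e.g. "Speech", "Music", "Silence"
    confidence: float  # 0 to 1, higher is better


def duration_by_type(chunks, min_confidence=0.5):
    """Sum chunk durations (ms) per audio type, skipping low-confidence chunks.

    The threshold is a hypothetical choice, not a documented default.
    """
    totals = defaultdict(int)
    for chunk in chunks:
        if chunk.confidence >= min_confidence:
            totals[chunk.audio_type] += chunk.end_time - chunk.start_time
    return dict(totals)


# Hypothetical sample payload: the list-of-objects layout is assumed.
sample = [
    {"start_time": 0, "end_time": 1500, "sentence": "Hello there",
     "audio_type": "Speech", "confidence": 0.92},
    {"start_time": 1500, "end_time": 4000, "sentence": "",
     "audio_type": "Music", "confidence": 0.81},
    {"start_time": 4000, "end_time": 5000, "sentence": "",
     "audio_type": "Silence", "confidence": 0.30},
    {"start_time": 5000, "end_time": 6000, "sentence": "Thanks",
     "audio_type": "Speech", "confidence": 0.75},
]

chunks = [AudioChunk(**item) for item in sample]
totals = duration_by_type(chunks)
print(totals)  # {'Speech': 2500, 'Music': 2500}
```

Filtering on `confidence` before aggregating keeps uncertain labels (the `Silence` chunk here) out of the totals; pass `min_confidence=0.0` to include every chunk.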


