Marsview Speech Analytics is a cloud-hosted or containerized API service that helps you accurately transcribe a conversation and discover insights. It is packed with models for automatic speech recognition (ASR), Tone Analyzer, Natural Language Classifiers to uncover topics, keywords, entities and sentiments.
Get API Key View API DocsSpeech analytics software helps mine and analyze audio data, detecting things like emotion, tone and stress in a customer's voice; the reason for the call; satisfaction; the products mentioned; and more. Speech analytics tools can also identify if a customer is getting upset or frustrated.
Detect
things like emotion,
tone and stress
in
a customer's voice; the reason for the call;
satisfaction; the
products mentioned
Adapt to
customer’s
sentiments in real
time
or improve after the fact
Identify
customers at risk of
churning
and
retain them
Gather
insights to improve
NPS, CSAT and
CES
scores
Use call
transcripts for
compliance and
documentation
Listen to
your customers - it
pays!
Marsview conversation self-service API platform
offers a
comprehensive suite of proprietary APIs and
developer tools for
automatic speech recognition, speaker
separation, multi-modal
emotion and sentiment recognition, intent
recognition,
time-sequenced visual recognition, and more.
Designed for the demanding Call Center
environments (CCAI) that
handle millions of outbound and inbound sales
and support calls.
Marsview APIs provide end-to-end workflows from
call listening,
recording, insights generation, and Voice of
Customer Insights.
Conversation APIs are also used in one-on-one to
many-to-many
conversations and meetings to automatically
generate rich contextual
feedback, key topics, moments, actions, Q&A;, and
summaries.
We support all audio and video file types without any transcoding.
We support live streaming for select APIs and models. Please review the documentation for details.
API output data is delivered in JavaScript Object Notation (JSON) format.
Easily export your transcription as SRT or VTT format to be directly plugged into video players for subtitles and captions.
Convert speech into readable text from a live stream or from audio and/or video recordings in minutes.
Automatically add punctuation and casing in the transcription text.
Automatically recognize and separate speakers in a group conversation. Attribute speaker names within a group of enrolled speakers.
Identify the type of speech based on context and tone such as a statement, question, command and so on.
Detect and list action items and tasks in the transcription text.
Detect questions and related responses in the transcription text.
Automatically determine the topics, entities, concepts discussed in the transcription text.
Extract keywords and key phrases in the transcription text.
Generate a concise paraphrased summary from the transcription text.
Reduce the transcription text into a short summary preserving the keywords and phrases.
Use the acoustic voice of the speaker to determine the tone such as neutral, calm, happy, sad, angry, fearful, disgust, surprised.
Detect the emotion such as anger, anticipation, disgust, fear, joy, love, optimism, pessimism, sadness, surprise, trust from the language of the speaker.
Determine the sensitivity of the topic in the transcription text to classify the sentiment as positive, negative, neutral.
Detect the type of on-screen activity as interaction, slide share, motion graphics etc.
Capture key frames and slides to chapterize a visual presentation.
Automatically detect text in a typed, handwritten, or a display form into machine encoded text in a visual presentation.
Automatically detect people and objects in a visual presentation.
Add new words to the base vocabulary or train your own language model to generate more accurate transcriptions for domain-specific words and phrases like product names, technical terminology, or names of individuals.
99.9% uptime with always-on support via email, chat and web call.
Marsview encrypts all data via 256-bit SSL encryption and complies with GDPR and CCPA standards.
Optimized for public cloud and private on-premise deployments.