The Voice Isolation Telephony Agentic NC (Documentation Index
Fetch the complete documentation index at: https://developer.sanas.ai/llms.txt
Use this file to discover all available pages before exploring further.
AGENTIC_VI_GT_NC) model is a telephony-optimized variant of the AGENTIC_VI_G_NC model, purpose-built for 8kHz telephony audio. It improves Automatic Speech Recognition (ASR) accuracy by isolating the primary speaker and removing background noise and voices from narrowband telephony streams.
Which model should you use?
- 16kHz input: Use
AGENTIC_VI_G_NC(General) - 8kHz input: Use
AGENTIC_VI_GT_NC(Telephony)
Hear the Difference
Removes environmental background noise and distant human voices from telephony audio for complete voice isolation, ensuring ASR processes only the primary speaker’s input.Before · Primary speaker + background noise
After · Primary speaker only
Performance Benchmarks
The following transcripts are generated from the audio samples above. The Oracle Transcript is the ground truth — the actual words spoken. The Source Transcript is the raw audio passed directly through the ASR system. The Sanas Transcript is the audio processed through the Sanas model first, then passed through the same ASR system. The Word Error Rate (WER) is calculated relative to the Oracle Transcript (0% WER), based on insertion, deletion, and substitution errors — bolded words below indicate these errors. WER percentage for each is shown at the bottom of the table.Test Environment
- Background: Contact center with background office chatter and ambient noise
- ASR System: Deepgram Nova3 Streaming
| Oracle Transcript | Source Transcript | Sanas Transcript |
|---|---|---|
| good morning welcome to sun bank fraud prevention desk this is partha speaking how can i assist i completely understand let’s check your account immediately uh for verification may i have the last four digits of your card thank you i will review your account i see an o t p multiple request attempted today but no transaction went through did anyone recently ask for your o t p or card details good please remember we will never ask for your o t p to keep your account safe i recommend blocking the card immediately and issuing a replacement would you like me to proceed done your card has been blocked successfully your replacement card will reach your uh reach your registered address in three to five business days okay yes i’m also enabling security alerts for your account you are welcome anything else i can help you with today | good morning welcome to sunbank fraud prevention desk this is speaking how can i assist i completely understand let’s check your account immediately for verification may i have the last four digits of your card thank you i will remove the account i’ll see a particular request your replacement card will reach your reach your registered address three to five business days enabling security alerts for your account will you write me the change this to you you were welcome anything else i can help you with changes to your account | good morning welcome to sun bank fraud prevention desk this is pasa speaking how can i assist i completely understand there’s check your account in the details for verification may i have the last digits of your card thank you i will mhmm your account attempted today the new transaction went through did anyone recently ask for the otp of paralytus good remember we will never ask for the o t p to keep your account safe i recommend blocking the card immediately and issue you had a placement would you like me to give sir your card has been blocked successfully your replacement card has reached your reached your registered address in three to 5% i’m also enabling security alert for your account you are welcome anything else i can help you with sir |
| — | 60.7% WER | 30.7% WER |
Key Features
Telephony Optimized
Purpose-built for 8kHz narrowband telephony audio, delivering optimal performance on real-world call center streams.
Improved ASR Accuracy
Relative Word Error Rate Reduction (RWERR) across multiple ASR systems on noisy telephony data with no degradation on clean audio.
ASR-Agnostic Design
Works seamlessly with any ASR pipeline, open-source or commercial, without requiring retraining or modification.
Enhanced Turn-Taking
Reduces false triggers from ambient sounds (background chatter, environmental noise) that cause agents to interrupt.
Specifications
Model ID:AGENTIC_VI_GT_NCCategory: Agentic Noise Cancellation
Type: Human ↔ Machine
~100ms
Streaming latency (end-to-end processing time)
8kHz
Sample rate
5-30%
RWERR in noisy conditions (average)
Use Cases
Telephony Voice Agents
Improves speech recognition for voice agents operating on narrowband telephony audio.
IVR Systems
Enhances recognition accuracy for automated phone systems on 8kHz audio.
Contact Centers
Single-speaker isolation for telephony-based contact center environments.
ASR Preprocessing
Purpose-built enhancement for downstream ASR systems consuming telephony audio.
Code Example
Create an audio processor with the Voice Isolation Telephony Agentic NC model:Sample rate: 8 kHz for telephony audio. For 16kHz input, use
AGENTIC_VI_G_NC instead.Next Steps
Quick Start
Get up and running with Sanas SDK in under 5 minutes.
API Reference
Full SDK documentation for classes, enums, and callbacks.
Processing Multiple Streams
Handle multiple concurrent audio streams.