The Voice Isolation Telephony Agentic NC (AGENTIC_VI_GT_NC) model is a telephony-optimized variant of the AGENTIC_VI_G_NC model, purpose-built for 8kHz telephony audio. It improves Automatic Speech Recognition (ASR) accuracy by isolating the primary speaker and removing background noise and voices from narrowband telephony streams.
Which model should you use?
  • 16kHz input: Use AGENTIC_VI_G_NC (General)
  • 8kHz input: Use AGENTIC_VI_GT_NC (Telephony)
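The selection rule above can be written as a small lookup. This is only a sketch: the two model IDs come from this page, while `MODEL_BY_SAMPLE_RATE` and `select_model` are hypothetical helper names and not part of the Sanas SDK.

```python
# Model IDs are taken from this page; the helper itself is hypothetical
# and not part of the Sanas SDK.
MODEL_BY_SAMPLE_RATE = {
    16000: "AGENTIC_VI_G_NC",   # General (16kHz input)
    8000: "AGENTIC_VI_GT_NC",   # Telephony (8kHz input)
}


def select_model(sample_rate_hz: int) -> str:
    """Return the model ID this page recommends for the given sample rate."""
    try:
        return MODEL_BY_SAMPLE_RATE[sample_rate_hz]
    except KeyError:
        raise ValueError(
            f"This page only covers 8000 Hz and 16000 Hz input, got {sample_rate_hz} Hz"
        ) from None
```

For example, `select_model(8000)` returns `"AGENTIC_VI_GT_NC"`, and any rate other than 8000 or 16000 raises a `ValueError` rather than guessing.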

Hear the Difference

Removes environmental background noise and distant human voices from telephony audio for complete voice isolation, ensuring ASR processes only the primary speaker’s input.
Before · Primary speaker + background noise
After · Primary speaker only

Performance Benchmarks

The following transcripts are generated from the audio samples above. The Oracle Transcript is the ground truth: the actual words spoken. The Source Transcript is the raw audio passed directly through the ASR system. The Sanas Transcript is the audio processed through the Sanas model first, then passed through the same ASR system. The Word Error Rate (WER) is calculated relative to the Oracle Transcript (0% WER), based on insertion, deletion, and substitution errors; bolded words below indicate these errors. The WER percentage for each transcript is shown at the bottom of the table.
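As an illustration of the WER definition above, the sketch below computes word-level edit distance (substitutions, deletions, and insertions) and divides it by the number of reference words. It is a generic textbook implementation, not the scoring tool used for these benchmarks. The function name `word_error_rate` is hypothetical, and tokenizing on whitespace is an assumption, so it will not necessarily reproduce the published percentages.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / reference word count."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] holds the edit distance between the reference words processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)
```

Because insertions count as errors, WER can exceed 100% when the hypothesis is much longer than the reference.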

Test Environment

  • Background: Contact center with background office chatter and ambient noise
  • ASR System: Deepgram Nova3 Streaming
| Oracle Transcript | Source Transcript | Sanas Transcript |
| --- | --- | --- |
| good morning welcome to sun bank fraud prevention desk this is partha speaking how can i assist i completely understand let’s check your account immediately uh for verification may i have the last four digits of your card thank you i will review your account i see an o t p multiple request attempted today but no transaction went through did anyone recently ask for your o t p or card details good please remember we will never ask for your o t p to keep your account safe i recommend blocking the card immediately and issuing a replacement would you like me to proceed done your card has been blocked successfully your replacement card will reach your uh reach your registered address in three to five business days okay yes i’m also enabling security alerts for your account you are welcome anything else i can help you with today | good morning welcome to sunbank fraud prevention desk this is speaking how can i assist i completely understand let’s check your account immediately for verification may i have the last four digits of your card thank you i will remove the account i’ll see a particular request your replacement card will reach your reach your registered address three to five business days enabling security alerts for your account will you write me the change this to you you were welcome anything else i can help you with changes to your account | good morning welcome to sun bank fraud prevention desk this is pasa speaking how can i assist i completely understand there’s check your account in the details for verification may i have the last digits of your card thank you i will mhmm your account attempted today the new transaction went through did anyone recently ask for the otp of paralytus good remember we will never ask for the o t p to keep your account safe i recommend blocking the card immediately and issue you had a placement would you like me to give sir your card has been blocked successfully your replacement card has reached your reached your registered address in three to 5% i’m also enabling security alert for your account you are welcome anything else i can help you with sir |
| 0% WER | 60.7% WER | 30.7% WER |
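The two WER figures reported for this sample (60.7% for the Source Transcript and 30.7% for the Sanas Transcript) can be turned into a relative reduction. The sketch below assumes the usual definition of relative WER reduction, (baseline WER - processed WER) / baseline WER. This page does not spell out the formula, and `relative_wer_reduction` is a hypothetical helper name.

```python
def relative_wer_reduction(baseline_wer: float, processed_wer: float) -> float:
    """Relative WER reduction, assuming (baseline - processed) / baseline."""
    if baseline_wer <= 0:
        raise ValueError("baseline WER must be positive")
    return (baseline_wer - processed_wer) / baseline_wer


# WER values (in percent) reported for the sample on this page.
rwerr = relative_wer_reduction(60.7, 30.7)
print(f"{rwerr:.1%}")  # prints "49.4%"
```

This is the result for one audio sample only, so it need not match averages computed over a larger test set.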

Key Features

Telephony Optimized

Purpose-built for 8kHz narrowband telephony audio, delivering optimal performance on real-world call center streams.

Improved ASR Accuracy

Delivers Relative Word Error Rate Reduction (RWERR) across multiple ASR systems on noisy telephony data, with no degradation on clean audio.

ASR-Agnostic Design

Works seamlessly with any ASR pipeline, open-source or commercial, without requiring retraining or modification.

Enhanced Turn-Taking

Reduces false triggers from ambient sounds (background chatter, environmental noise) that cause agents to interrupt.

Specifications

Model ID: AGENTIC_VI_GT_NC
Category: Agentic Noise Cancellation
Type: Human ↔ Machine

Streaming latency (end-to-end processing time): ~100ms
Sample rate: 8kHz
RWERR in noisy conditions (average): 5-30%

Use Cases

Telephony Voice Agents

Improves speech recognition for voice agents operating on narrowband telephony audio.

IVR Systems

Enhances recognition accuracy for automated phone systems on 8kHz audio.

Contact Centers

Isolates the primary speaker in telephony-based contact center environments.

ASR Preprocessing

Purpose-built enhancement for downstream ASR systems consuming telephony audio.

Code Example

Create an audio processor with the Voice Isolation Telephony Agentic NC model:
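import sanas_remote_sdk

# `sdk` is assumed to be the SDK instance initialized as described in the Quick Start.
audio_params = sanas_remote_sdk.AudioParams()
audio_params.modelName = "AGENTIC_VI_GT_NC"  # telephony model
audio_params.sampleRate = 8000               # 8kHz narrowband input

# Returns the audio processor together with a creation result.
processor, create_result = sdk.CreateAudioProcessor(audio_params)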
Sample rate: 8kHz for telephony audio. For 16kHz input, use AGENTIC_VI_G_NC instead.
For full setup and initialization, see the Quick Start.

Next Steps

Quick Start

Get up and running with Sanas SDK in under 5 minutes.

API Reference

Full SDK documentation for classes, enums, and callbacks.

Processing Multiple Streams

Handle multiple concurrent audio streams.