Overview

Introduction to Sanas SDK and how it works

What is Sanas SDK?

Sanas SDK is a lightweight connector that enables real-time audio streaming via AI models hosted on the Sanas Cloud. The Sanas ecosystem consists of two components:

Connector (local Python library) - Installed on endpoint devices/servers, handles SIP authentication and RTC audio streaming
Sanas Cloud (hosted service) - Hosts AI models and provides real-time audio processing, enhances quality, and returns clean audio

Available AI Models

Sanas SDK provides two specialized models optimized for different audio processing needs:

AI Voice Agent
Noise Cancellation

Optimized for: Speech recognition systems (ASR)Use this model when:

Building voice assistants or chatbots
Running ASR in noisy environments
Need to improve Word Error Rate (WER)
Optimizing turn-taking in conversational AI

Key Benefits:

5-30% WER improvement in noisy conditions
Works with any ASR (Deepgram, Google, Whisper, Azure, AWS)
Optimized for machine processing

Model Name: AGENTIC_NCConfiguration:

    audio_params.modelName = "AGENTIC_NC"
    audio_params.sampleRate = 16000

Learn more →

Optimized for: Human conversationsUse this model when:

Processing human-to-human calls
Need clean audio for human listeners
Video conferencing or meetings
Gaming voice chat
Contact center applications

Key Benefits:

Preserves primary speaker up to ~1m
Removes background voices and noise
Natural voice quality preservation
Low latency (~40ms)

Model Name: VI_G_NC3.0Configuration:

    audio_params.modelName = "VI_G_NC3.0"
    audio_params.sampleRate = 16000  # Or up to 24kHz

Learn more →

How It Works

Sanas SDK operates as a cloud-hosted real-time streaming service that processes audio with low latency:

Technical Architecture

Connection Protocol:

SIP (Session Initiation Protocol): Establishes and manages the connection
RTP (Real-Time Transport Protocol): Transmits audio streams

Processing Mode:

Real-time streaming
Audio processed in chunks as it arrives

How Audio Flows

Take audio input

Take an audio stream from your voice pipeline.

Establish SIP connection

Establish a secure SIP connection to the Sanas Cloud via the SDK connector.

Stream audio via RTP

Transmit audio chunks in real time over RTP.

Process with AI

Process each 20ms audio chunk with the selected AI model.

Receive clean audio stream

Receive enhanced audio in real time via RTP.

Use enhanced audio

Send enhanced audio to the next stage based on your use case.

Deployment Options

Sanas Cloud (Recommended)
Self-Hosted

Cloud-Hosted Deployment

What it is:

Sanas manages all infrastructure
Hosted on enterprise-grade cloud infrastructure
Automatic scaling and high availability

Available Regions:

South Asia — Mumbai, India (AWS ap-south-1)
United States — Ohio, USA (AWS us-east-2)
More regions coming soon

Connection:

SIP-based connection establishment
RTP for audio streaming
Secure authentication with Account ID + Secret
Encrypted media transmission

Best for:

Fast time to market
Real-time voice applications
Scaled deployment

Pricing: Usage-based (contact sales)Get started with Sanas Cloud →

System Requirements

Supported Platforms

Component	Requirement
Operating System	Ubuntu 22.04 x86-64
Python Version	Python 3.10 or higher
Network	Outbound connectivity to Sanas Cloud (SIP/RTC ports)
Dependencies	Dependencies are handled automatically during installation.

Network Requirements

Required ports and protocols:

SIP: For connection establishment
RTP: For audio streaming

Firewall configuration: Ensure your firewall allows outbound SIP and RTP traffic to Sanas Cloud endpoints.

Key Features

Real-Time Streaming

Low-latency processingLive audio processing with SIP/RTC for minimal delay

High Concurrency

Scalable solutionProcess multiple audio streams simultaneously

Session Management

SIP protocolReliable connection establishment and management

ASR-Agnostic

Works with any ASRDeepgram, Google, Whisper, Azure, AWS, etc.

Easy Integration

Simple APIInitialize, create a processor, stream audio

Secure

Enterprise securityEncrypted transmission, secure authentication

Next Steps

Ready to start? Follow the quickstart guide to install the SDK and process your first live audio stream in under 10 minutes.

Request SDK Access

Get your credentials to start building.

Quickstart Guide

Install the SDK and stream your first audio.

Need Help?

Email Support

support@sanas.aiResponse time: 1 business day

Support Portal

Raise a support ticketFor urgent issues

Email Support

Get Started

AI Models

Tutorials

What is Sanas SDK?

Available AI Models

How It Works

Technical Architecture

How Audio Flows

Deployment Options

Cloud-Hosted Deployment

Self-Hosted Deployment

System Requirements

Supported Platforms

Network Requirements

Key Features

Real-Time Streaming

High Concurrency

Session Management

ASR-Agnostic

Easy Integration

Secure

Next Steps

Request SDK Access

Quickstart Guide

Need Help?

Support Portal

Get Started

AI Models

Tutorials

​What is Sanas SDK?

​Available AI Models

​How It Works

​Technical Architecture

​How Audio Flows

​Deployment Options

​Cloud-Hosted Deployment

​Self-Hosted Deployment

​System Requirements

​Supported Platforms

​Network Requirements

​Key Features

Real-Time Streaming

High Concurrency

Session Management

ASR-Agnostic

Easy Integration

Secure

​Next Steps

Request SDK Access

Quickstart Guide

​Need Help?

Email Support

Support Portal

What is Sanas SDK?

Available AI Models

How It Works

Technical Architecture

How Audio Flows

Deployment Options

Cloud-Hosted Deployment

Self-Hosted Deployment

System Requirements

Supported Platforms

Network Requirements

Key Features

Next Steps

Need Help?