What is Sanas SDK?
Sanas SDK is a lightweight connector that enables real-time audio streaming via AI models hosted on the Sanas Cloud. The Sanas ecosystem consists of two components:- Connector (local Python library) - Installed on endpoint devices/servers, handles SIP authentication and RTC audio streaming
- Sanas Cloud (hosted service) - Hosts AI models and provides real-time audio processing, enhances quality, and returns clean audio
Available AI Models
Sanas SDK provides two specialized models optimized for different audio processing needs:- AI Voice Agent
- Noise Cancellation
Optimized for: Speech recognition systems (ASR)Use this model when:Learn more →
- Building voice assistants or chatbots
- Running ASR in noisy environments
- Need to improve Word Error Rate (WER)
- Optimizing turn-taking in conversational AI
- 5-30% WER improvement in noisy conditions
- Works with any ASR (Deepgram, Google, Whisper, Azure, AWS)
- Optimized for machine processing
AGENTIC_NCConfiguration:How It Works
Sanas SDK operates as a cloud-hosted real-time streaming service that processes audio with low latency:Technical Architecture
Connection Protocol:- SIP (Session Initiation Protocol): Establishes and manages the connection
- RTP (Real-Time Transport Protocol): Transmits audio streams
- Real-time streaming
- Audio processed in chunks as it arrives
How Audio Flows
Establish SIP connection
Establish a secure SIP connection to the Sanas Cloud via the SDK connector.
Deployment Options
- Sanas Cloud (Recommended)
- Self-Hosted
Cloud-Hosted Deployment
What it is:- Sanas manages all infrastructure
- Hosted on enterprise-grade cloud infrastructure
- Automatic scaling and high availability
- South Asia — Mumbai, India (AWS ap-south-1)
- United States — Ohio, USA (AWS us-east-2)
- More regions coming soon
- SIP-based connection establishment
- RTP for audio streaming
- Secure authentication with Account ID + Secret
- Encrypted media transmission
- Fast time to market
- Real-time voice applications
- Scaled deployment
System Requirements
Supported Platforms
| Component | Requirement |
|---|---|
| Operating System | Ubuntu 22.04 x86-64 |
| Python Version | Python 3.10 or higher |
| Network | Outbound connectivity to Sanas Cloud (SIP/RTC ports) |
| Dependencies | Dependencies are handled automatically during installation. |
Network Requirements
Required ports and protocols:- SIP: For connection establishment
- RTP: For audio streaming
Firewall configuration: Ensure your firewall allows outbound SIP and RTP traffic to Sanas Cloud endpoints.
Key Features
Real-Time Streaming
Low-latency processingLive audio processing with SIP/RTC for minimal delay
High Concurrency
Scalable solutionProcess multiple audio streams simultaneously
Session Management
SIP protocolReliable connection establishment and management
ASR-Agnostic
Works with any ASRDeepgram, Google, Whisper, Azure, AWS, etc.
Easy Integration
Simple APIInitialize, create a processor, stream audio
Secure
Enterprise securityEncrypted transmission, secure authentication
Next Steps
Ready to start? Follow the quickstart guide to install the SDK and process your first live audio stream in under 10 minutes.
Request SDK Access
Get your credentials to start building.
Quickstart Guide
Install the SDK and stream your first audio.