What is Sanas
Sanas is a speech and audio AI platform that provides a suite of real-time AI models accessible through a single SDK. Engineered for enterprise scale and built for teams of every size, from startups to large global enterprises. Enterprise-ready with security, compliance, and support.How It Works
The Sanas SDK Connector sits between your application and Sanas Models, managing the connection to the inference engine. It sends input, receives processed output, and streams data in real-time using SIP (Session Initiation Protocol) and RTP (Real-Time Transport Protocol). The Sanas SDK can be deployed via Sanas Cloud (recommended) or self-hosted (coming soon) on your own infrastructure.Core Capabilities
Noise Cancellation
Isolate intended speech by removing background noise and voices — no quality degradation.
Agentic Noise Cancellation
Clean audio before ASR/STT. Isolate primary speakers and reduce WER.
Speech Enhancement
Reconstruct and restore speech quality degraded by compression and network conditions.Coming Soon
Live Language Translation
Real-time cross-language voice communication.Coming Soon
Accent Translation
Convert accents while preserving voice identity.Coming Soon
Sanas currently offers noise cancellation capabilities. Accent Translation, Language Translation, Speech Enhancement, and Speech Intelligence coming soon.
Available Models
Noise Cancellation · Voice Isolation (General)
VI_G_NC3.0 Human ↔ HumanIsolates intended speech by removing background noise and voices. Optimized for human listeners.Agentic Noise Cancellation · Voice Isolation (General)
AGENTIC_VI_G_NC Human ↔ MachineRemoves background noise and distant voices for complete voice isolation of the primary speaker’s audio stream.Agentic Noise Cancellation · Standard
AGENTIC_ST_NC Human ↔ MachineRemoves background noise while preserving all human speech for multi-speaker environments.What You Can Build
Voice Agents
Enhance voice agent pipelines with real-time speech and audio processing for clearer, more accurate interactions.
Contact Centers
Power contact center audio at scale with concurrent stream processing and enterprise-grade reliability.
Conferencing & Gaming
Deliver high-quality voice experiences across communication and interactive platforms.
STT Pipelines
Improve speech-to-text accuracy by processing audio before it reaches your transcription engine.
Key Features
Real-Time Streaming
Low-latency processingLive audio processing with SIP/RTP
High Concurrency
Scalable solutionProcess multiple audio streams simultaneously
Session Management
SIP protocolReliable connection establishment and management
Seamless Communication
Enable clearer human-machine interactions across any environment
Easy Integration
Simple APIInitialize, create a processor, stream audio
Secure
Enterprise securityEncrypted transmission, secure authentication
Ready to Build?
Request your API keys and integration credentials, then use the Quick Start to start streaming clean audio in minutes.Request SDK Access
Get your credentials to start building.
Quick Start
Get up and running with Sanas SDK in under 5 minutes.
Resources
Pricing
Usage-based pricing.
Enterprise
Data residency, compliance, security, and support.
API Reference
Complete API documentation for the Sanas SDK.
Changelog
Latest updates, releases, and fixes.