Skip to main content

Documentation Index

Fetch the complete documentation index at: https://developer.sanas.ai/llms.txt

Use this file to discover all available pages before exploring further.

What is Sanas

Sanas is a speech and audio AI platform that provides a suite of real-time AI models accessible through a single SDK. Engineered for enterprise scale and built for teams of every size, from startups to large global enterprises. Enterprise-ready with security, compliance, and support.

How It Works

The Sanas SDK Connector sits between your application and Sanas Models, managing the connection to the inference engine. It sends input, receives processed output, and streams data in real-time using SIP (Session Initiation Protocol) and RTP (Real-Time Transport Protocol). The Sanas SDK can be deployed via Sanas Cloud (recommended) or self-hosted (coming soon) on your own infrastructure.

Core Capabilities

Noise Cancellation

Isolate intended speech by removing background noise and voices — no quality degradation.

Agentic Noise Cancellation

Clean audio before ASR/STT. Isolate primary speakers and reduce Relative Word Error Rate (RWERR).

Speech Enhancement

Reconstruct and restore speech quality degraded by compression and network conditions.

Live Language Translation

Real-time cross-language voice communication.Coming Soon

Accent Translation

Convert accents while preserving voice identity.Coming Soon
Sanas currently offers Noise Cancellation and Speech Enhancement capabilities. Accent Translation, Language Translation, and Speech Intelligence coming soon.

Available Models

Noise Cancellation · Voice Isolation (General)

VI_G_NC3.0 Human ↔ HumanIsolates intended speech by removing background noise and voices. Optimized for human listeners.

Speech Enhancement · Standard

SE2.1 Human ↔ HumanRestores and enhances voice quality for telephony audio. Low CPU footprint.

Speech Enhancement · Ultra

SE2.2 Human ↔ HumanFull-fidelity speech enhancement with bandwidth extension to ultra-fidelity 24kHz.

Agentic Noise Cancellation · Voice Isolation (General)

AGENTIC_VI_G_NC Human ↔ MachineRemoves background noise and distant voices for complete voice isolation of the primary speaker’s audio stream.

Agentic Noise Cancellation · Voice Isolation (Telephony)

AGENTIC_VI_GT_NC Human ↔ MachineTelephony-optimized variant of Voice Isolation for 8kHz narrowband audio.

Agentic Noise Cancellation · Standard

AGENTIC_ST_NC Human ↔ MachineRemoves background noise while preserving all human speech for multi-speaker environments.
See the full model comparison →

What You Can Build

Voice Agents

Enhance voice agent pipelines with real-time speech and audio processing for clearer, more accurate interactions.

Contact Centers

Power contact center audio at scale with concurrent stream processing and enterprise-grade reliability.

Conferencing & Gaming

Deliver high-quality voice experiences across communication and interactive platforms.

STT Pipelines

Improve speech-to-text accuracy by processing audio before it reaches your transcription engine.

Key Features

Real-Time Streaming

Low-latency processingLive audio processing with SIP/RTP

High Concurrency

Scalable solutionProcess multiple audio streams simultaneously

Session Management

SIP protocolReliable connection establishment and management

Seamless Communication

Enable clearer human-machine interactions across any environment

Easy Integration

Simple APIInitialize, create a processor, stream audio

Secure

Enterprise securityEncrypted transmission, secure authentication

Ready to Build?

Request your API keys and integration credentials, then use the Quick Start to start streaming clean audio in minutes.

Request SDK Access

Get your credentials to start building.

Quick Start

Get up and running with Sanas SDK in under 5 minutes.

Resources

Pricing

Usage-based pricing.

Enterprise

Data residency, compliance, security, and support.

API Reference

Complete API documentation for the Sanas SDK.

Changelog

Latest updates, releases, and fixes.