Skip to main content

What is Sanas

Sanas is a speech and audio AI platform that provides a suite of real-time AI models accessible through a single SDK. Engineered for enterprise scale and built for teams of every size, from startups to large global enterprises. Enterprise-ready with security, compliance, and support.

How It Works

The Sanas SDK Connector sits between your application and Sanas Models, managing the connection to the inference engine. It sends input, receives processed output, and streams data in real-time using SIP (Session Initiation Protocol) and RTP (Real-Time Transport Protocol). The Sanas SDK can be deployed via Sanas Cloud (recommended) or self-hosted (coming soon) on your own infrastructure.

Core Capabilities

Noise Cancellation

Isolate intended speech by removing background noise and voices — no quality degradation.

Agentic Noise Cancellation

Clean audio before ASR/STT. Isolate primary speakers and reduce WER.

Speech Enhancement

Reconstruct and restore speech quality degraded by compression and network conditions.Coming Soon

Live Language Translation

Real-time cross-language voice communication.Coming Soon

Accent Translation

Convert accents while preserving voice identity.Coming Soon
Sanas currently offers noise cancellation capabilities. Accent Translation, Language Translation, Speech Enhancement, and Speech Intelligence coming soon.

Available Models

Noise Cancellation · Voice Isolation (General)

VI_G_NC3.0 Human ↔ HumanIsolates intended speech by removing background noise and voices. Optimized for human listeners.

Agentic Noise Cancellation · Voice Isolation (General)

AGENTIC_VI_G_NC Human ↔ MachineRemoves background noise and distant voices for complete voice isolation of the primary speaker’s audio stream.

Agentic Noise Cancellation · Standard

AGENTIC_ST_NC Human ↔ MachineRemoves background noise while preserving all human speech for multi-speaker environments.
See the full model comparison →

What You Can Build

Voice Agents

Enhance voice agent pipelines with real-time speech and audio processing for clearer, more accurate interactions.

Contact Centers

Power contact center audio at scale with concurrent stream processing and enterprise-grade reliability.

Conferencing & Gaming

Deliver high-quality voice experiences across communication and interactive platforms.

STT Pipelines

Improve speech-to-text accuracy by processing audio before it reaches your transcription engine.

Key Features

Real-Time Streaming

Low-latency processingLive audio processing with SIP/RTP

High Concurrency

Scalable solutionProcess multiple audio streams simultaneously

Session Management

SIP protocolReliable connection establishment and management

Seamless Communication

Enable clearer human-machine interactions across any environment

Easy Integration

Simple APIInitialize, create a processor, stream audio

Secure

Enterprise securityEncrypted transmission, secure authentication

Ready to Build?

Request your API keys and integration credentials, then use the Quick Start to start streaming clean audio in minutes.

Request SDK Access

Get your credentials to start building.

Quick Start

Get up and running with Sanas SDK in under 5 minutes.

Resources

Pricing

Usage-based pricing.

Enterprise

Data residency, compliance, security, and support.

API Reference

Complete API documentation for the Sanas SDK.

Changelog

Latest updates, releases, and fixes.