Blog

Introducing AI Audio Generation

Changelog

API

Dashboard

Cosmic

February 23, 2026

Hero image

0:00

-3:01

Listen to this article · 3:01

Cosmic AI now includes text-to-speech generation, powered by OpenAI's TTS models. Convert any text into natural-sounding audio and save it directly to your media library as an MP3, ready for CDN delivery.

Whether you're creating audio versions of blog posts, podcast intros, product walkthroughs, or accessibility-focused content, audio generation is available in the dashboard, API, and SDK.

9 Natural-Sounding Voices

Choose from a range of voices to match your content's tone:

Feminine voices:

Nova (default): Warm and bright, great for friendly narration
Shimmer: Soft and intimate, ideal for meditation or bedtime stories
Coral: Clear and polished, professional tone for product demos
Sage: Calm and steady, thoughtful pacing for education and tutorials
Alloy: Neutral and balanced, versatile for general-purpose use

Masculine voices:

Echo: Deep and authoritative, great for announcements and news
Onyx: Bold and commanding, strong presence for intros and branding
Fable: Animated and expressive, a natural storyteller for audiobooks
Ash: Warm and approachable, conversational for interviews

Two Quality Tiers

Standard (tts-1): Fast, low-latency generation. Recommended for most use cases.
HD (tts-1-hd): Higher quality audio with richer detail. 2x token cost.

Long Text Support

Texts over 4,096 characters are automatically split at paragraph boundaries and concatenated into a single seamless audio file. No manual chunking required.

How to Use in the Dashboard

Navigate to Media in your project
Click Create and select Audio
Select a voice from the dropdown
Paste or type the text you want to convert
Click Generate

The audio file is saved to your media library in MP3 format, available instantly via CDN.

API and SDK Access

Generate audio programmatically using the API or JavaScript SDK:

Or with cURL:

Optional parameters include ( or ), , , and .

Pricing

Audio generation tokens scale with text length:

Model	Cost per 1,000 characters
TTS Standard	3,600 tokens
TTS HD	7,200 tokens

Get Started

Audio generation is available now on all Cosmic plans. Open your project, head to Media, and try it out. For full API documentation, visit the AI API reference.

For dashboard usage details, see the AI dashboard docs.

Continue Learning

Documentation

Articles

Comparisons

Back to blog

Hero image

You might also like

GitHub Actions Pricing Changes and the Future of CI/CD: What Developers Need to Know

GitHub's new Actions pricing has developers reconsidering CI/CD infrastructure. From surveillance te...

Cosmic AI

December 16, 2025

Skills-Based AI Integration: What OpenAI's Latest Move Means for CMS Platforms

OpenAI quietly adopts Skills just months after Anthropic's launch, while Apple enables AI cluster co...

Cosmic AI

December 12, 2025

Web Dev Rundown: The C3 Programming Language, Popular HN Blogs, and POSSE Publishing

A new systems programming language challenges C, analysis reveals what makes technical blogs succeed...

Cosmic AI

January 3, 2026