Choose Your Perfect Plan
Flexible pricing plans to meet your different needs
Subscription Plans
Choose the subscription plan that best fits your business needs
🎉 Limited-time Early Bird Offer! Subscribe now and keep the discounted price for a full year
Free
$0.00
/month
- 1,000 credits/month
- Completely free
- 1 concurrent request
- Support all models
- Community support
Starter
$4.99
$3.50
/month
Save $1.49 (30% OFF)
- 20,000 credits/month
- $0.175 / 1,000 credits (regular price $0.250 / 1,000 credits)
- 1 concurrent request
- Support all models
- Community support
Most Popular
Basic
$29.99
$21.00
/month
Save $8.99 (30% OFF)
- 130,000 credits/month
- $0.162 / 1,000 credits (regular price $0.231 / 1,000 credits)
- 3 concurrent requests
- Support all models
- Email support
Pro
$99.99
$80.00
/month
Save $19.99 (20% OFF)
- 500,000 credits/month
- $0.160 / 1,000 credits (regular price $0.200 / 1,000 credits)
- 10 concurrent requests
- Support all models
- Priority support
Max
$199.99
$180.00
/month
Save $19.99 (10% OFF)
- 1,200,000 credits/month
- $0.150 / 1,000 credits (regular price $0.167 / 1,000 credits)
- 15 concurrent requests
- Support all models
- Dedicated customer service
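The per-1,000-credit figures listed for each plan follow from dividing the monthly price by the monthly credit allowance. A minimal sketch of that arithmetic is below; the function name `price_per_1k_credits` is hypothetical, and rounding half-up to three decimals is an assumption chosen because it reproduces the figures shown on this page.

```python
from decimal import Decimal, ROUND_HALF_UP


def price_per_1k_credits(monthly_price: str, monthly_credits: int) -> Decimal:
    """Return the plan price per 1,000 credits, rounded to three decimals.

    Assumption: half-up rounding to three decimal places, which matches
    the per-1,000-credit figures listed for each plan.
    """
    per_1k = Decimal(monthly_price) / Decimal(monthly_credits) * 1000
    return per_1k.quantize(Decimal("0.001"), rounding=ROUND_HALF_UP)


# Starter plan: regular and discounted monthly prices from the list above.
print(price_per_1k_credits("4.99", 20_000))
print(price_per_1k_credits("3.50", 20_000))
```

Passing prices as strings keeps the decimal arithmetic exact, which avoids binary floating-point surprises at the rounding boundary.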
Enterprise
Ultra
Contact Us for Custom Plan
Professional solution designed for higher usage demands and complex scenarios
More credits
Higher concurrency
Custom development
Dedicated support
We will contact you within 24 hours
TTS Model Pricing
Pay-as-you-go pricing across multiple leading TTS services
| Provider | Model | Description | Credits / 1K characters |
|---|---|---|---|
| | azure-tts | Microsoft Cognitive Services speech synthesis, supporting 80+ languages and 500+ voices | 200 |
| | speech-2.5-hd-preview | Latest HD model with excellent prosody and high replication similarity, supporting 40 languages: Chinese, Cantonese, English, Spanish, French, Russian, German, Portuguese, Arabic, Italian, Japanese, Korean, Indonesian, Vietnamese, Turkish, Dutch, Ukrainian, Thai, Polish, Romanian, Greek, Czech, Finnish, Hindi, Bulgarian, Danish, Hebrew, Malay, Persian, Slovak, Swedish, Croatian, Filipino, Hungarian, Norwegian, Slovenian, Catalan, Nynorsk, Tamil, Afrikaans | 300 |
| | speech-2.5-turbo-preview | Latest Turbo model, supporting 40 languages (same as speech-2.5-hd-preview) | 150 |
| | speech-02-hd | Superior prosody, stability, and replication similarity with outstanding audio quality, supporting 40 languages (same as speech-2.5-hd-preview) | 300 |
| | speech-02-turbo | Superior prosody and stability, improved support for less common languages, and excellent performance, supporting 40 languages (same as speech-2.5-hd-preview) | 150 |
| | speech-01-hd | Ultra-high replication similarity and outstanding audio quality (old version, not recommended) | 300 |
| | speech-01-turbo | Faster generation while maintaining excellent quality (old version, not recommended) | 150 |
| | cosyvoice-v1 | CosyVoice speech synthesis v1, covering 20+ voices in Chinese, English, Japanese, and Korean | 200 |
| | cosyvoice-v2 | CosyVoice speech synthesis v2, covering 80+ voices in Chinese, English, Japanese, and Korean | 200 |
| | sambert-v1 | Sambert speech synthesis, including 40+ voices in Chinese, English, Italian, Spanish, Indonesian, French, German, and Thai | 100 |
| ElevenLabs | eleven_multilingual_v2 | ElevenLabs' most lifelike model with rich emotional expression, supporting 29 languages: English, Japanese, Chinese, German, Hindi, French, Korean, Portuguese, Italian, Spanish, Indonesian, Dutch, Turkish, Filipino, Polish, Swedish, Bulgarian, Romanian, Arabic, Czech, Greek, Finnish, Croatian, Malay, Slovak, Danish, Tamil, Ukrainian, Russian | 900 |
| ElevenLabs | eleven_flash_v2_5 | Ultra-fast model optimized for real-time use (~75 ms latency), supporting 32 languages: all eleven_multilingual_v2 languages plus Hungarian, Norwegian, and Vietnamese | 900 |
| ElevenLabs | eleven_flash_v2 | Ultra-fast model optimized for real-time use (~75 ms latency, English only) | 900 |
| ElevenLabs | eleven_turbo_v2_5 | High-quality, low-latency model balancing quality and speed (~250-300 ms latency), supporting 32 languages: all eleven_multilingual_v2 languages plus Hungarian, Norwegian, and Vietnamese | 900 |
| ElevenLabs | eleven_turbo_v2 | High-quality, low-latency model balancing quality and speed (~250-300 ms latency, English only) | 900 |
| | edge-tts | Edge browser speech service, supporting 80+ languages and 500+ voices | 0 |
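To make the table concrete, the sketch below estimates the credits a single request might consume from a model's per-1K-character rate. The rates are copied from the table above; the function name `estimate_credits` and the round-up behavior for partial credits are assumptions, since this page does not state how fractional credits are rounded. The one-credit minimum per call comes from the Notes section.

```python
# Credits per 1,000 billable characters, copied from the pricing table
# (a subset of models, for illustration).
CREDITS_PER_1K = {
    "azure-tts": 200,
    "speech-02-hd": 300,
    "speech-02-turbo": 150,
    "sambert-v1": 100,
    "eleven_flash_v2_5": 900,
    "edge-tts": 0,
}


def estimate_credits(model: str, billable_chars: int) -> int:
    """Estimate the credits one API call consumes.

    Assumption: fractional credits are rounded up. The page only states
    that each call consumes at least 1 credit.
    """
    rate = CREDITS_PER_1K[model]
    # Integer ceiling division avoids floating-point rounding issues.
    credits = (billable_chars * rate + 999) // 1000
    return max(1, credits)


print(estimate_credits("azure-tts", 2500))  # 2,500 billable characters
print(estimate_credits("edge-tts", 5000))   # free model, minimum still applies
```

Under these assumptions, a 2,500-character request to azure-tts costs 500 credits, while any edge-tts request costs the 1-credit minimum.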
Notes
- Each API call consumes at least 1 credit
- Higher-priced models generally offer better audio quality and more natural-sounding speech
- edge-tts may be unstable due to external factors. For business-critical use, Azure TTS is recommended, or add a fallback strategy to ensure reliability
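One way to implement the fallback strategy mentioned in the last note is to try backends in order and return the first successful result. The sketch below is illustrative only: `synthesize_with_fallback` is a hypothetical helper, and each backend is assumed to be a caller-supplied function that takes text and returns audio bytes (for example, a wrapper around an edge-tts request followed by a wrapper around an azure-tts request). It does not use any real client API of this service.

```python
from typing import Callable, Sequence

# Hypothetical backend type: takes text, returns synthesized audio bytes.
Backend = Callable[[str], bytes]


def synthesize_with_fallback(text: str, backends: Sequence[Backend]) -> bytes:
    """Try each backend in order and return the first successful result."""
    last_error = None
    for backend in backends:
        try:
            return backend(text)
        except Exception as exc:  # broad on purpose: any failure triggers fallback
            last_error = exc
    raise RuntimeError("all TTS backends failed") from last_error


# Stand-in backends for demonstration; real ones would call the TTS service.
def unstable_backend(text: str) -> bytes:
    raise ConnectionError("simulated outage")


def stable_backend(text: str) -> bytes:
    return b"audio:" + text.encode("utf-8")


print(synthesize_with_fallback("hello", [unstable_backend, stable_backend]))
```

Ordering the free backend first keeps costs low in the common case, while the paid backend only runs when the first one fails.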
Frequently Asked Questions
Character counting rules: 1 Chinese character counts as 2 characters; 1 English letter, 1 punctuation mark, or 1 space counts as 1 character.
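The counting rule above can be sketched as a small function. Two points are assumptions not stated on this page: a "Chinese character" is taken to mean a code point in the CJK Unified Ideographs block (U+4E00 to U+9FFF), and any character the rule does not mention (such as digits) is counted as 1. The function name `billable_characters` is hypothetical.

```python
def billable_characters(text: str) -> int:
    """Count billable characters: 2 per Chinese character, 1 otherwise."""
    total = 0
    for ch in text:
        # Assumption: "Chinese character" means CJK Unified Ideographs.
        if "\u4e00" <= ch <= "\u9fff":
            total += 2
        else:
            # Letters, punctuation, spaces, and (by assumption) everything else.
            total += 1
    return total


print(billable_characters("Hello, world."))  # 13 single-width characters
print(billable_characters("你好"))            # 2 Chinese characters, counted as 4
```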
Yes, you can upgrade or downgrade your subscription plan at any time. Note, however, that remaining credits are not carried over to the new plan's quota. To make full use of your credits, change plans when they are about to run out.
Credits in subscription plans reset monthly and do not accumulate to the next month.