Frequently Asked Questions

Find answers to common questions about Vox AI Studio

What is Vox AI Studio?

Vox AI Studio is an AI-powered Text-to-Speech platform that converts written text into natural-sounding, professional-quality speech in seconds.

How many languages do you support?

We support 23+ languages, including English, Turkish, Spanish, French, German, Arabic, and Japanese, with multiple accents available.

Is there a free plan?

Yes! New users get a 7-day Free Trial with 10,000 credits, and no credit card is required. The trial covers the Flash voice only, in single-speaker mode, with a maximum of 500 characters per request.

How do credits work?

Credits are consumed per character of text you generate. The base rate is 1 credit = 1 character for Flash single-speaker. Multipliers apply for higher-quality modes:

  • Flash single-speaker (base)
  • Pro single-speaker
  • Flash multi-speaker
  • Pro multi-speaker

Credits on paid plans never expire — they stay in your account until you use them.
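
As a rough illustration of the per-character pricing described above, here is a minimal Python sketch for estimating credit usage. It is not an official calculator. The function names are hypothetical, and three things are assumptions: that every character (spaces and punctuation included) counts, that fractional totals round up, and that the multiplier is one you look up yourself. Only the base rate of 1 credit per character for Flash single-speaker comes from this FAQ.

```python
import math


def estimate_credits(text: str, multiplier: float = 1.0) -> int:
    """Estimate the credits one request would consume.

    multiplier=1.0 is the Flash single-speaker base rate (1 credit = 1 character).
    Any other multiplier is a placeholder: take the real value from the pricing page.
    Assumption: every character counts, and fractional totals round up.
    """
    if multiplier <= 0:
        raise ValueError("multiplier must be positive")
    return math.ceil(len(text) * multiplier)


def requests_affordable(balance: int, chars_per_request: int, multiplier: float = 1.0) -> int:
    """Return how many requests of a given length a credit balance would cover."""
    if chars_per_request <= 0:
        raise ValueError("chars_per_request must be positive")
    cost_per_request = math.ceil(chars_per_request * multiplier)
    return balance // cost_per_request


# Base rate: a 13-character sentence costs 13 credits.
print(estimate_credits("Hello, world!"))

# Free Trial: 10,000 credits at 500 characters per request (the trial maximum).
print(requests_affordable(10_000, 500))
```

At the base rate, the 10,000 trial credits cover 20 requests at the 500-character maximum, since 10,000 / 500 = 20.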

What is Multi-Speaker?

Multi-Speaker lets you create a full conversation between two or more voices. You assign each speaker a name and a voice, then add lines one by one — just like writing a script. When you generate, Vox AI Studio combines all the lines into a single, seamless audio file where each speaker sounds distinctly different.
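
If it helps to picture the structure, a Multi-Speaker script can be thought of as a list of speakers plus an ordered list of lines. The Python sketch below only models that idea. The class, its methods, and the voice names are hypothetical and are not part of any Vox AI Studio API.

```python
from dataclasses import dataclass, field


@dataclass
class Script:
    """A conversation: named speakers, each with a voice, plus ordered lines."""

    # Maps speaker name -> voice name (voice names here are made up).
    speakers: dict[str, str] = field(default_factory=dict)
    # Ordered (speaker name, text) pairs, in the order they will be spoken.
    lines: list[tuple[str, str]] = field(default_factory=list)

    def add_speaker(self, name: str, voice: str) -> None:
        self.speakers[name] = voice

    def add_line(self, speaker: str, text: str) -> None:
        # Every line must belong to a speaker that was assigned a voice first.
        if speaker not in self.speakers:
            raise KeyError(f"unknown speaker: {speaker}")
        self.lines.append((speaker, text))

    def total_characters(self) -> int:
        # Useful for estimating credits before generating.
        return sum(len(text) for _, text in self.lines)


script = Script()
script.add_speaker("Host", "voice-a")
script.add_speaker("Guest", "voice-b")
script.add_line("Host", "Welcome to the show.")
script.add_line("Guest", "Glad to be here.")
print(script.total_characters())
```

Keeping the lines in one ordered list mirrors how the script is written line by line, and the character total gives a starting point for a credit estimate.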

Can I use the generated audio commercially?

Yes, you own the audio files generated from your text and can use them for personal or commercial purposes.

Still have questions?

Our support team is here to help you.