19-12-2024 , 19-12-2024
Smart caching in Speechgen is an advanced feature designed to save time and costs during text-to-speech synthesis. By storing and reusing previously voiced sentences, it ensures efficiency and minimizes redundant processing.
Reuse of Voiced Sentences:
Speechgen remembers each sentence you synthesize.
If you make changes to the text, only new or modified sentences are processed, while the unchanged ones are retrieved from memory.
Efficient Combination:
The system seamlessly merges new and cached sentences into a single audio file, eliminating the need for complete revoicing.
Time Efficiency: Spend less time on repeated voiceovers.
Cost Savings: Pay only for new content rather than the entire text.
When voicing an educational course, adding a short introduction to each lesson with other services might mean revoicing all lessons. With Speechgen, only the new introductions are voiced, while the original content remains intact and cost-free.
Cache Capacity:
The cache applies to texts of up to 100,000 characters.
For longer texts, Speechgen switches to a specialized mode for large blocks, accommodating up to 2,000,000 characters.
Storage Time:
Cached sentences remain available for 7 days.
Full voiceover history is accessible in your profile for 30 days.
Caching Rules:
Only exact matches (character by character) are reused.
Minor edits, such as adding or removing punctuation, mark sentences as new and require revoicing.
Content Edits: Any modification to a sentence, whether it’s changing a word, punctuation, or adding tags like <break>, results in revoicing.
Voice Settings: Adjusting the speed, tone, or speaker triggers a complete revoicing, as these parameters redefine the audio output.
Pauses: You can modify pauses between sentences or paragraphs without revoicing.
Format Changes: Switching audio formats (e.g., ogg, wav) or adjusting the sample rate does not incur additional costs.
With smart caching, Speechgen offers unmatched efficiency:
Lower Costs: Avoid paying for unchanged sentences.
Speed: Revoicing is faster and more streamlined.
Flexibility: Edit and refine your projects without worrying about repeated charges.
For a detailed explanation with practical examples, visit this blog post.
Speechgen’s caching technology redefines TTS by optimizing costs and workflow. It’s the ideal solution for anyone looking to produce high-quality voiceovers efficiently and economically.