Which languages are supported by this plugin?
English (American), English (Australian), English (British), English (Indian), English (Welsh), Welsh, Danish, Dutch, French, French (Canadian), German, Icelandic, Italian, Japanese, Korean, Polish, Portuguese, Portuguese (Brazilian), Romanian, Russian, Spanish (Castilian), Spanish (American), Swedish, Turkish, Norwegian
Amazon Polly Pricing
With Amazon Polly, you only pay for what you use. There is no setup cost and no minimum fee.
With Amazon Polly, you are charged based on the number of characters of text that you convert either to speech or to Speech Marks metadata. You can cache and replay Amazon Polly’s generated speech at no additional cost. You can also cache and reuse Amazon Polly’s generated Speech Marks at no additional cost.
The Amazon Polly free tier includes 5 million characters per month for speech or Speech Marks requests, for the first 12 months, starting from the first request for speech.
Pay-as-you-go $4.00 per 1 million characters for speech requests (when outside the free tier). Pay-as-you-go $4.00 per 1 million characters for Speech Marks requests (when outside the free tier).
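The pricing rule above reduces to simple arithmetic, sketched below. The function name and the `in_free_tier` flag are illustrative, and the sketch assumes that usage beyond the 5 million free characters in a month is billed at the normal pay-as-you-go rate. Speech and Speech Marks requests are priced identically, so their character counts can simply be added together before calling it.

```python
from decimal import Decimal

PRICE_PER_MILLION = Decimal("4.00")  # USD per 1 million characters, from the text above
FREE_TIER_CHARS = 5_000_000          # free characters per month (first 12 months)


def estimate_monthly_cost(characters: int, in_free_tier: bool = False) -> Decimal:
    """Estimate one month's charge in USD for `characters` of converted text.

    Assumption: inside the free tier, only characters above the monthly
    allowance are billed, at the normal pay-as-you-go rate.
    """
    if characters < 0:
        raise ValueError("characters must be non-negative")
    billable = characters
    if in_free_tier:
        billable = max(0, characters - FREE_TIER_CHARS)
    cost = PRICE_PER_MILLION * billable / Decimal(1_000_000)
    return cost.quantize(Decimal("0.01"))  # round to whole cents


if __name__ == "__main__":
    print(estimate_monthly_cost(165_000))                        # 0.66
    print(estimate_monthly_cost(6_000_000, in_free_tier=True))   # 4.00
```

Very small jobs round to $0.00 or $0.01 here because the result is quantized to whole cents.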
Pricing Examples (when outside the free tier)
|Example||Text Length||Speech Duration||Cost|
|1,000 requests, 1,000 characters per request||1 million characters||~23 hours, 8 min||$4.00|
|10,000 requests, 100 characters per request||1 million characters||~23 hours, 8 min||$4.00|
|2016 Amazon Shareholders Letter||1.3k characters, single page||~1 min. 40 sec||$0.005|
|Average email message||~3.1k characters||~4 min||$0.02|
|Typical news article||~6.5k characters, three pages||~9 min||$0.03|
|“A Christmas Carol” by Charles Dickens||~165k characters, 64 pages||~3 hours 50 min||$0.66|
|“Adventures of Huckleberry Finn” by Mark Twain||~600k characters, 224 pages||~13 hours 50 min||$2.40|
– Average length of a single narration text: 100 characters
– Number of narration texts per animated production: 25
|2.5k characters per animation||~3.5 min||$0.01|
– Average spoken response length: 100 characters
– Requests per user per month: 300
|30k characters per user per month||~42 min||$0.12|
– Average length of single phrase from avatar: 100 characters
– Number of phrases produced by avatar: 25
– Need for Speech Marks to synchronize lips
|2.5k characters of synthesized speech, 2.5k characters of Speech Marks data|
|Storytelling with highlighted text for children:
– Length of text for the story: 10k characters
– Need for Speech Marks to synchronize highlighted text
|10k characters of synthesized speech, 10k characters of Speech Marks data|
Does Amazon Polly participate in the AWS Free Tier?
Yes. As part of the AWS Free Usage Tier, you can get started with Amazon Polly for free. Upon sign-up, new Amazon Polly customers can synthesize up to 5 million characters for free each month for the first 12 months.
Does the plugin delete my audio files if I delete the plugin?
No. All audio files are preserved. Depending on your configuration, they are stored on your WordPress server or in your Amazon S3 bucket.
How do I view my Amazon PollyCast feed?
Append ‘/amazon-pollycast/’ to any page URL.
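As a small illustration, the feed address can be derived from a page URL as sketched below. The helper name is hypothetical and not part of the plugin, and the sketch assumes path-style page URLs (for example `https://example.com/blog/`).

```python
from urllib.parse import urlsplit, urlunsplit

FEED_SUFFIX = "/amazon-pollycast/"  # suffix given in the answer above


def pollycast_feed_url(page_url: str) -> str:
    """Return the PollyCast feed URL for a page URL (hypothetical helper)."""
    parts = urlsplit(page_url)
    # Drop any trailing slash so the result never contains a double slash.
    path = parts.path.rstrip("/") + FEED_SUFFIX
    return urlunsplit((parts.scheme, parts.netloc, path, parts.query, parts.fragment))


if __name__ == "__main__":
    print(pollycast_feed_url("https://example.com/blog/"))
    # https://example.com/blog/amazon-pollycast/
```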
How do I publish my podcast with iTunes?
Submit your Amazon PollyCast feed to iTunes Connect: https://itunesconnect.apple.com
Amazon Polly Product Details
Amazon Polly provides an API that enables you to quickly integrate speech synthesis into your application. You simply send the text you want converted into speech to the Amazon Polly API, and Amazon Polly immediately returns the audio stream to your application so your application can begin streaming it directly or store it in a standard audio file format, such as MP3.
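|Sampling rate||Sample Code|
|"Hi. My name is Joanna."||from boto3 import client
# Create a Polly client; the region shown is only an example.
polly = client("polly", region_name="us-east-1")
# The original sample was cut off after Text. OutputFormat and VoiceId are
# required by the API; the values below are assumed for illustration.
response = polly.synthesize_speech(
    Text="Hi. My name is Joanna.",
    OutputFormat="mp3",
    VoiceId="Joanna")|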
Wide Selection of Voices and Languages
Amazon Polly includes dozens of lifelike voices and support for a variety of languages, so you can select the ideal voice and distribute your speech-enabled applications in many countries.
|Portuguese – Iberic||Inês||Cristiano|
|Spanish – Castilian||Conchita||Enrique|
Amazon Polly makes it easy to request an additional stream of metadata that provides information about when particular sentences, words and sounds are being pronounced. Using this metadata stream alongside the synthesized speech audio stream, you can now build your applications with an enhanced visual experience, such as speech-synchronized facial animation or karaoke-style word highlighting.
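To make the metadata stream concrete: Speech Marks are returned as line-delimited JSON objects with `time` (milliseconds from the start of the audio), `type`, `start`, `end`, and `value` fields. The sketch below parses such a stream and looks up the word being spoken at a given moment, which is the core of karaoke-style highlighting. The sample data is made up for illustration and was not captured from a real request, and the helper names are hypothetical.

```python
import json

# Illustrative sample only; the timings are invented, not real Polly output.
SAMPLE_SPEECH_MARKS = "\n".join([
    '{"time":0,"type":"word","start":0,"end":2,"value":"Hi"}',
    '{"time":370,"type":"word","start":4,"end":6,"value":"My"}',
    '{"time":520,"type":"word","start":7,"end":11,"value":"name"}',
    '{"time":760,"type":"word","start":12,"end":14,"value":"is"}',
    '{"time":890,"type":"word","start":15,"end":21,"value":"Joanna"}',
])


def parse_speech_marks(stream_text: str) -> list:
    """Parse a line-delimited JSON Speech Marks stream into a list of dicts."""
    return [json.loads(line) for line in stream_text.splitlines() if line.strip()]


def word_at(marks: list, ms: int):
    """Return the word being spoken at `ms`, or None before the first word.

    Assumes `marks` is ordered by time, as in the stream.
    """
    current = None
    for mark in marks:
        if mark["type"] != "word":
            continue
        if mark["time"] > ms:
            break
        current = mark["value"]
    return current


if __name__ == "__main__":
    marks = parse_speech_marks(SAMPLE_SPEECH_MARKS)
    print(word_at(marks, 400))  # My
```

With boto3, a stream like this is requested by calling `synthesize_speech` with `OutputFormat="json"` and a `SpeechMarkTypes` list such as `["word"]`.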
With Amazon Polly, you can stream all kinds of information through your application to users in near real time. You can also choose from various sampling rates to optimize bandwidth and audio quality for your application. Amazon Polly supports MP3, Vorbis, and raw PCM audio stream formats.
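A sketch of how the format and sampling-rate choice might be expressed in code follows. It only builds and validates the request parameters; the helper name is hypothetical, and the table of allowed rates is an assumption that should be checked against the current Amazon Polly documentation. `ogg_vorbis` is the API's identifier for the Vorbis format.

```python
# Assumed sample rates (Hz) per output format; verify against the Polly docs.
SUPPORTED_RATES = {
    "mp3": {"8000", "16000", "22050"},
    "ogg_vorbis": {"8000", "16000", "22050"},
    "pcm": {"8000", "16000"},
}


def build_synthesis_params(text, voice_id, output_format="mp3", sample_rate=None):
    """Build keyword arguments for a synthesize_speech call (hypothetical helper)."""
    if output_format not in SUPPORTED_RATES:
        raise ValueError(f"unsupported output format: {output_format!r}")
    params = {"Text": text, "VoiceId": voice_id, "OutputFormat": output_format}
    if sample_rate is not None:
        rate = str(sample_rate)  # the API expects the rate as a string
        if rate not in SUPPORTED_RATES[output_format]:
            raise ValueError(f"{rate} Hz is not listed for {output_format}")
        params["SampleRate"] = rate
    return params


if __name__ == "__main__":
    # A low sampling rate trades audio quality for bandwidth.
    print(build_synthesis_params("Hello", "Joanna", "pcm", 8000))
```

The resulting dictionary can be passed to a boto3 Polly client as `polly.synthesize_speech(**params)`.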
Amazon Polly supports Speech Synthesis Markup Language (SSML), a W3C standard, XML-based markup language for speech synthesis applications, and supports common SSML tags for phrasing, emphasis, and intonation. This flexibility helps you create lifelike speech that will attract and hold the attention of your audience.
To learn more, visit the Amazon Polly documentation on SSML tags.
|This is how I speak normally.||(none)|
|I can speak in a higher pitched voice, or I can speak in a lower pitched voice.||<speak>I can speak in a <prosody pitch="high">higher pitched voice</prosody>, or I can speak <prosody pitch="low">in a lower pitched voice</prosody></speak>|
|I can speak really slowly, or I can speak really fast.||<speak>I can speak <prosody rate="x-slow">really slowly</prosody>, or I can speak <prosody rate="x-fast">really fast</prosody></speak>|
|I can also speak very loudly, or I can speak very quietly.||<speak>I can also speak <prosody volume="x-loud">very loudly</prosody>, or I can speak <prosody volume="x-soft">very quietly</prosody>.</speak>|
|I can whisper.||<speak>I have a secret to tell you, I will whisper it to you.<amazon:effect name="whispered"><prosody rate="x-slow"><prosody volume="loud">I am not human.</prosody></prosody></amazon:effect>Can you believe it?</speak>|
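When SSML like the rows above is generated from user-supplied text, the text has to be XML-escaped and the attribute values quoted with straight quotes. The following is a minimal sketch of two hypothetical helpers that do this; it covers only the `<prosody>` tag.

```python
from xml.sax.saxutils import escape, quoteattr


def prosody(text: str, **attrs: str) -> str:
    """Wrap escaped text in a <prosody> tag with the given attributes."""
    attr_text = "".join(f" {name}={quoteattr(value)}" for name, value in attrs.items())
    return f"<prosody{attr_text}>{escape(text)}</prosody>"


def speak(*parts: str) -> str:
    """Join already-escaped SSML fragments inside a <speak> root element."""
    return "<speak>" + "".join(parts) + "</speak>"


if __name__ == "__main__":
    ssml = speak(
        escape("I can speak "),
        prosody("really slowly", rate="x-slow"),
        escape(", or I can speak "),
        prosody("really fast", rate="x-fast"),
    )
    print(ssml)
```

To have the markup interpreted rather than read aloud, the string is sent with `TextType="ssml"` in the `synthesize_speech` call.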
Amazon Polly supports all the programming languages included in the AWS SDK (Java, Node.js, .NET, PHP, Python, Ruby, Go, and C++) and AWS Mobile SDK (iOS/Android). Polly also supports an HTTP API so you can implement your own access layer.
Amazon Polly can be accessed via the Polly API (and various language-specific SDKs), AWS Management Console, and the AWS command-line interface (CLI). You have full control over all the capabilities of Amazon Polly, whether you use the service through the console, the API, or the CLI.
With Amazon Polly’s custom lexicons, or vocabularies, you can modify the pronunciation of particular words, such as company names, acronyms, foreign words and neologisms (e.g., “ROTFL”, “C’est la vie” when spoken in a non-French voice). To customize these pronunciations, you upload an XML file with lexical entries. For example, you can customize the pronunciation of Nguyen by providing a phoneme using this XML: