Wednesday, November 22, 2023
HomeCloud ComputingNew – Lengthy-Kind voices for Amazon Polly

New – Lengthy-Kind voices for Amazon Polly


Voiced by Polly

We’re launching three new voices for Polly. Powered by a brand new long-form engine, the voices are pure and expressive, with applicable pauses, emphasis, and tone.

New Voices
The brand new long-form voices are excellent for weblog posts, information articles, coaching movies, and advertising and marketing content material. The underlying Machine Studying mannequin extracts which means from the textual content, studying about speech segments, prosody (the sample of rhythm and pauses), intonation, and different facets of expressive speech, permitting the synthesized audio to specific feelings, particularly in dialogs. The brand new long-form engine makes use of a deep studying text-to-speech (TTS) mannequin skilled to accumulate a contextual understanding of the textual content that permits it to specific prosody in an applicable method. This permits the intention of the story to drive the vocal efficiency and create the proper emphasis, pauses, and tones of a practical human voice.

Listed here are the brand new voices:

Title Locale Gender Language Pattern
Danielle en_US Feminine English (US)
Gregory en_US Male English (US)
Ruth en_US Feminine English (US)

Utilizing the New Voices
You’ll be able to entry the brand new voices utilizing the AWS Administration Console, AWS Command Line Interface (AWS CLI), or the AWS SDKs. Utilizing the CLI, I begin by itemizing the voices that use the brand new long-form engine:

$ aws --region us-east-1 polly describe-voices --output json 
  | jq -r '.Voices[] | choose(.SupportedEngines | index("long-form")) | .Title'
Danielle
Gregory
Ruth

I can decide one, or I can attempt all of them:

for v in `aws polly describe-voices --output json 
          | jq -r '.Voices[] | choose(.SupportedEngines | index("long-form")) | .Title'`; do
    Textual content="Hiya my identify is $v and I can learn weblog posts, articles, 
and different long-form content material for you. I'm the very best!"
    aws polly synthesize-speech --output-format 'mp3' 
    --text "$Textual content" --voice-id $v $v.mp3 --engine long-form; 
    aws s3 cp $v.mp3 s3://jbarr-voices; 
carried out

My shell script had a small quoting bug, however the ensuing audio was too humorous to not embody!

Programmatically, you may reproduce my instance by writing code that calls the DescribeVoices and SynthesizeSpeech capabilities.

Issues to Know
Listed here are some attention-grabbing issues that you must know concerning the new voices:

Pricing – Lengthy-form voices are priced at $100 per million characters or Speech Marks requests. Try the Amazon Polly Pricing web page to study extra.

Engines & Voices – A number of the voices that I listed above can be utilized with a couple of engine. For instance, the Danielle voice can be utilized with the brand new long-form engine and the prevailing neural engine.

Areas – The brand new engine and voices can be found within the US East (N. Virginia) Area.

Try the brand new voices, construct one thing superior, and let me know what you suppose!

Jeff;





Supply hyperlink

RELATED ARTICLES

LEAVE A REPLY

Please enter your comment!
Please enter your name here

- Advertisment -
Google search engine

Most Popular

Recent Comments