{"id":27046,"date":"2016-12-04T20:05:32","date_gmt":"2016-12-04T18:05:32","guid":{"rendered":"https:\/\/mamchenkov.net\/wordpress\/?p=27046"},"modified":"2019-02-25T12:02:07","modified_gmt":"2019-02-25T10:02:07","slug":"amazon-polly-text-to-speech-in-47-voices-and-24-languages","status":"publish","type":"post","link":"https:\/\/mamchenkov.net\/wordpress\/2016\/12\/04\/amazon-polly-text-to-speech-in-47-voices-and-24-languages\/","title":{"rendered":"Amazon Polly \u2013 Text to Speech in 47 Voices and 24 Languages"},"content":{"rendered":"<!-- google_ad_section_start -->\n<p><a href=\"https:\/\/aws.amazon.com\/blogs\/aws\/polly-text-to-speech-in-47-voices-and-24-languages\/\">Amazon announced<\/a> a new service &#8211; <a href=\"http:\/\/aws.amazon.com\/polly\">Amazon Polly<\/a> &#8211;&nbsp;text to speech in 47 voices and 24 languages. &nbsp;This part got me intrigued:<\/p>\n<blockquote><p><span title=\"\">Polly<\/span> was designed to address many of the more challenging aspects of speech generation. For example, consider the difference in pronunciation of the word \u201clive\u201d in the phrases \u201cI live in Seattle\u201d and \u201cLive from New York.\u201d <span title=\"\">Polly<\/span> knows that this pair of homographs are spelled the same but are pronounced quite differently. Or, what about the \u201cSt.\u201d Depending on the language and the context, this could mean (and should be pronounced) as either \u201cstreet\u201d or \u201csaint.\u201d Again, <span title=\"\">Polly<\/span> knows what to do here. <span title=\"\">Polly<\/span> can also deal with units, fractions, abbreviations, currencies, dates, times, and other speech components in sophisticated, language-specific fashion.<\/p><\/blockquote>\n<p>I am not much involved with text to speech these days, but I did experiments in this area a few years ago. &nbsp;Simple text to simple English has been around for a long time. &nbsp;But support for other languages was always limited, and even with English, the voices always sounded very robotic, and often failed to understand the simplest of native language constructs.<\/p>\n<p>I tried Amazon Polly and was blown away by the quality of the synthesis. &nbsp;Here are the English samples of the text from <a href=\"https:\/\/mamchenkov.net\/wordpress\/2016\/12\/02\/things-to-learn-about-linux\/\">this blog<\/a> post:<\/p>\n<p>US English, Kendra, female:<\/p>\n<audio class=\"wp-audio-shortcode\" id=\"audio-27046-1\" preload=\"none\" style=\"width: 100%;\" controls=\"controls\"><source type=\"audio\/mpeg\" src=\"https:\/\/mamchenkov.net\/wordpress\/wp-content\/uploads\/2016\/12\/english-us-kendra-linux.mp3?_=1\" \/><a href=\"https:\/\/mamchenkov.net\/wordpress\/wp-content\/uploads\/2016\/12\/english-us-kendra-linux.mp3\">https:\/\/mamchenkov.net\/wordpress\/wp-content\/uploads\/2016\/12\/english-us-kendra-linux.mp3<\/a><\/audio>\n<p>British English, Bryan, male:<\/p>\n<audio class=\"wp-audio-shortcode\" id=\"audio-27046-2\" preload=\"none\" style=\"width: 100%;\" controls=\"controls\"><source type=\"audio\/mpeg\" src=\"https:\/\/mamchenkov.net\/wordpress\/wp-content\/uploads\/2016\/12\/english-british-brian-linux.mp3?_=2\" \/><a href=\"https:\/\/mamchenkov.net\/wordpress\/wp-content\/uploads\/2016\/12\/english-british-brian-linux.mp3\">https:\/\/mamchenkov.net\/wordpress\/wp-content\/uploads\/2016\/12\/english-british-brian-linux.mp3<\/a><\/audio>\n<p>Welsh English, Geraint, male:<\/p>\n<audio class=\"wp-audio-shortcode\" id=\"audio-27046-3\" preload=\"none\" style=\"width: 100%;\" controls=\"controls\"><source type=\"audio\/mpeg\" src=\"https:\/\/mamchenkov.net\/wordpress\/wp-content\/uploads\/2016\/12\/english-welsh-geraint-linux.mp3?_=3\" \/><a href=\"https:\/\/mamchenkov.net\/wordpress\/wp-content\/uploads\/2016\/12\/english-welsh-geraint-linux.mp3\">https:\/\/mamchenkov.net\/wordpress\/wp-content\/uploads\/2016\/12\/english-welsh-geraint-linux.mp3<\/a><\/audio>\n<p>With that, I wanted to see what happens with other languages. &nbsp;The only other language I speak is Russian, so I pasted the <a href=\"https:\/\/mamchenkov.net\/wordpress\/category\/in-russian\/\">Russian category<\/a> description into <a href=\"https:\/\/console.aws.amazon.com\/polly\">the converter<\/a>, selected the Russian language, and got this:<\/p>\n<p>Russian, Maxim, male:<\/p>\n<audio class=\"wp-audio-shortcode\" id=\"audio-27046-4\" preload=\"none\" style=\"width: 100%;\" controls=\"controls\"><source type=\"audio\/mpeg\" src=\"https:\/\/mamchenkov.net\/wordpress\/wp-content\/uploads\/2016\/12\/russian-maxim-category.mp3?_=4\" \/><a href=\"https:\/\/mamchenkov.net\/wordpress\/wp-content\/uploads\/2016\/12\/russian-maxim-category.mp3\">https:\/\/mamchenkov.net\/wordpress\/wp-content\/uploads\/2016\/12\/russian-maxim-category.mp3<\/a><\/audio>\n<p>That is pretty good! &nbsp;Going further, I pasted the content of <a href=\"https:\/\/mamchenkov.net\/wordpress\/2013\/11\/03\/%D0%BF%D1%80%D0%BE-%D0%B1%D0%B0%D0%B1%D1%83%D1%88%D0%BA%D1%83\/\">this blog post<\/a>, which is a quoted story that somebody else wrote. &nbsp;It has a very informal flow to it and some weird punctuation. &nbsp;Listen to what it turned into:<\/p>\n<p>Russian, Maxim, male:<\/p>\n<audio class=\"wp-audio-shortcode\" id=\"audio-27046-5\" preload=\"none\" style=\"width: 100%;\" controls=\"controls\"><source type=\"audio\/mpeg\" src=\"https:\/\/mamchenkov.net\/wordpress\/wp-content\/uploads\/2016\/12\/russian-maxim-babushka.mp3?_=5\" \/><a href=\"https:\/\/mamchenkov.net\/wordpress\/wp-content\/uploads\/2016\/12\/russian-maxim-babushka.mp3\">https:\/\/mamchenkov.net\/wordpress\/wp-content\/uploads\/2016\/12\/russian-maxim-babushka.mp3<\/a><\/audio>\n<p>You can still make out that it&#8217;s a robot and not a human, but it&#8217;s way better than anything else I&#8217;ve heard so far. &nbsp;By far!<\/p>\n<p>So, how affordable is this technology now? &nbsp;The pricing page answer is very simple:<\/p>\n<blockquote><p>Pay-as-you-go $4.00 per 1 million characters (when outside the free tier).<\/p><\/blockquote>\n<p>It also provides some examples of how this pricing converts to real-life scenarios:<\/p>\n<p><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" data-attachment-id=\"27052\" data-permalink=\"https:\/\/mamchenkov.net\/wordpress\/2016\/12\/04\/amazon-polly-text-to-speech-in-47-voices-and-24-languages\/polly-pricing-examples\/\" data-orig-file=\"https:\/\/i0.wp.com\/mamchenkov.net\/wordpress\/wp-content\/uploads\/2016\/12\/polly-pricing-examples.png?fit=869%2C517&amp;ssl=1\" data-orig-size=\"869,517\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"polly-pricing-examples\" data-image-description=\"\" data-image-caption=\"\" data-large-file=\"https:\/\/i0.wp.com\/mamchenkov.net\/wordpress\/wp-content\/uploads\/2016\/12\/polly-pricing-examples.png?fit=660%2C393&amp;ssl=1\" class=\"aligncenter size-medium wp-image-27052\" src=\"https:\/\/i0.wp.com\/mamchenkov.net\/wordpress\/wp-content\/uploads\/2016\/12\/polly-pricing-examples-500x297.png?resize=500%2C297&#038;ssl=1\" alt=\"polly-pricing-examples\" width=\"500\" height=\"297\" srcset=\"https:\/\/i0.wp.com\/mamchenkov.net\/wordpress\/wp-content\/uploads\/2016\/12\/polly-pricing-examples.png?resize=500%2C297&amp;ssl=1 500w, https:\/\/i0.wp.com\/mamchenkov.net\/wordpress\/wp-content\/uploads\/2016\/12\/polly-pricing-examples.png?resize=768%2C457&amp;ssl=1 768w, https:\/\/i0.wp.com\/mamchenkov.net\/wordpress\/wp-content\/uploads\/2016\/12\/polly-pricing-examples.png?w=869&amp;ssl=1 869w\" sizes=\"auto, (max-width: 500px) 100vw, 500px\" \/><\/p>\n<p>I don&#8217;t know about you, but my mind is blown&#8230;<\/p>\n<!-- google_ad_section_end -->\n","protected":false},"excerpt":{"rendered":"<!-- google_ad_section_start -->\n<p>Amazon announced a new service &#8211; Amazon Polly &#8211;&nbsp;text to speech in 47 voices and 24 languages. &nbsp;This part got me intrigued: Polly was designed to address many of the more challenging aspects of speech generation. For example, consider the difference in pronunciation of the word \u201clive\u201d in the phrases \u201cI live in Seattle\u201d and &hellip; <a href=\"https:\/\/mamchenkov.net\/wordpress\/2016\/12\/04\/amazon-polly-text-to-speech-in-47-voices-and-24-languages\/\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">Amazon Polly \u2013 Text to Speech in 47 Voices and 24 Languages<\/span><\/a><\/p>\n<!-- google_ad_section_end -->\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"Amazon Polly \u2013 Text to Speech in 47 Voices and 24 Languages #Amazon #text2speech #voice #AWS","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2},"_links_to":"","_links_to_target":""},"categories":[1,62,1334],"tags":[3270,3462,2366,3463],"keyring_services":[],"class_list":["post-27046","post","type-post","status-publish","format-standard","hentry","category-general","category-technology","category-web-work","tag-amazon-aws","tag-amazon-polly","tag-cloud-computing","tag-text-to-speech"],"aioseo_notices":[],"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack-related-posts":[{"id":9444,"url":"https:\/\/mamchenkov.net\/wordpress\/2005\/09\/12\/along-came-polly\/","url_meta":{"origin":27046,"position":0},"title":"Along Came Polly","author":"Leonid Mamchenkov","date":"September 12, 2005","format":false,"excerpt":"I once said that I don't intentionally want to see another movie with Ben Stiller, but it was a long time ago and Jennifer Aniston was in it, so I decided to watch \"Along Came Polly\". Directed by: John Hamburg Genres: Comedy, Romance Cast: Ben Stiller, Jennifer Aniston, Philip Seymour\u2026","rel":"","context":"In &quot;All&quot;","block_context":{"text":"All","link":"https:\/\/mamchenkov.net\/wordpress\/category\/general\/"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":9569,"url":"https:\/\/mamchenkov.net\/wordpress\/2005\/10\/13\/d-tox\/","url_meta":{"origin":27046,"position":1},"title":"D-Tox","author":"Leonid Mamchenkov","date":"October 13, 2005","format":false,"excerpt":"I wasn't sure that I wanted to see \"D-Tox\" in the cinema. So I waited for it to come out on DVD. And than I totally forgot about it. Until today. Directed by: Jim Gillespie Genres: Drama, Thriller, Horror Cast: Sylvester Stallone, Charles S. Dutton, Polly Walker, Kris Kristofferson, Mif,\u2026","rel":"","context":"In &quot;All&quot;","block_context":{"text":"All","link":"https:\/\/mamchenkov.net\/wordpress\/category\/general\/"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":8954,"url":"https:\/\/mamchenkov.net\/wordpress\/2005\/05\/09\/gremlins\/","url_meta":{"origin":27046,"position":2},"title":"Gremlins","author":"Leonid Mamchenkov","date":"May 9, 2005","format":false,"excerpt":"This old classic called \"Gremlins\" was on TV, Maxim was sleeping, I had nothing better to do, so I watched it. Again. Directed by: Joe Dante Genres: Comedy, Fantasy, Horror, Action Cast: Zach Galligan, Phoebe Cates, Hoyt Axton, Frances Lee McCain, Polly Holliday, Glynn Turman, Dick Miller, Keye Luke, Scott\u2026","rel":"","context":"In &quot;All&quot;","block_context":{"text":"All","link":"https:\/\/mamchenkov.net\/wordpress\/category\/general\/"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":7469,"url":"https:\/\/mamchenkov.net\/wordpress\/2004\/04\/27\/a9-amazon-applied-to-searching-the-web\/","url_meta":{"origin":27046,"position":3},"title":"A9 &#8211; Amazon applied to searching the web","author":"Leonid Mamchenkov","date":"April 27, 2004","format":false,"excerpt":"I've came across a new search engine from Amazon.com today - A9.com. It looks and feels much like Google, which is used for web searches anyway, but there are a couple of interesting bits about it. First, if you'll create yourself an account, A9 will remember you previous search queries.\u2026","rel":"","context":"In &quot;All&quot;","block_context":{"text":"All","link":"https:\/\/mamchenkov.net\/wordpress\/category\/general\/"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":28928,"url":"https:\/\/mamchenkov.net\/wordpress\/2018\/11\/12\/awsome-day-athens-2018\/","url_meta":{"origin":27046,"position":4},"title":"AWSome Day Athens 2018","author":"Leonid Mamchenkov","date":"November 12, 2018","format":false,"excerpt":"Last week I've attended the AWSome Day Athens 2018\u00a0(huge thanks to Qobo for the opportunity).\u00a0 There aren't that many technology events in Cyprus, so I'm constantly on the lookout for events in Europe. AWSome Day Athens is part of the Amazon's AWSome Day Global Series, which are one day events\u2026","rel":"","context":"In &quot;All&quot;","block_context":{"text":"All","link":"https:\/\/mamchenkov.net\/wordpress\/category\/general\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/mamchenkov.net\/wordpress\/wp-content\/uploads\/2018\/11\/AWSome-Day-Athens-430x500.png?resize=350%2C200&ssl=1","width":350,"height":200},"classes":[]},{"id":24618,"url":"https:\/\/mamchenkov.net\/wordpress\/2015\/08\/21\/wordpress-benchmark-of-mysql-server-on-amazon-ec2\/","url_meta":{"origin":27046,"position":5},"title":"WordPress Benchmark of MySQL server on Amazon EC2","author":"Leonid Mamchenkov","date":"August 21, 2015","format":false,"excerpt":"I have a friend who is a newcomer to the world of WordPress. \u00a0Until recently, he was mostly working with custom-built systems and a PostgreSQL database engine, so there are many topics to cover. One of the topics that came up today was the performance of the database engine. \u00a0A\u2026","rel":"","context":"In &quot;All&quot;","block_context":{"text":"All","link":"https:\/\/mamchenkov.net\/wordpress\/category\/general\/"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"jetpack_sharing_enabled":true,"amp_enabled":true,"_links":{"self":[{"href":"https:\/\/mamchenkov.net\/wordpress\/wp-json\/wp\/v2\/posts\/27046","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/mamchenkov.net\/wordpress\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/mamchenkov.net\/wordpress\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/mamchenkov.net\/wordpress\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/mamchenkov.net\/wordpress\/wp-json\/wp\/v2\/comments?post=27046"}],"version-history":[{"count":0,"href":"https:\/\/mamchenkov.net\/wordpress\/wp-json\/wp\/v2\/posts\/27046\/revisions"}],"wp:attachment":[{"href":"https:\/\/mamchenkov.net\/wordpress\/wp-json\/wp\/v2\/media?parent=27046"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/mamchenkov.net\/wordpress\/wp-json\/wp\/v2\/categories?post=27046"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/mamchenkov.net\/wordpress\/wp-json\/wp\/v2\/tags?post=27046"},{"taxonomy":"keyring_services","embeddable":true,"href":"https:\/\/mamchenkov.net\/wordpress\/wp-json\/wp\/v2\/keyring_services?post=27046"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}