{"id":92952,"date":"2026-05-08T05:53:11","date_gmt":"2026-05-08T05:53:11","guid":{"rendered":"https:\/\/diyhaven858.wasmer.app\/index.php\/openai-is-tired-of-seeing-all-those-videos-of-people-clowning-on-its-voice-mode\/"},"modified":"2026-05-08T05:53:11","modified_gmt":"2026-05-08T05:53:11","slug":"openai-is-tired-of-seeing-all-those-videos-of-people-clowning-on-its-voice-mode","status":"publish","type":"post","link":"https:\/\/diyhaven858.wasmer.app\/index.php\/openai-is-tired-of-seeing-all-those-videos-of-people-clowning-on-its-voice-mode\/","title":{"rendered":"OpenAI Is Tired of Seeing All Those Videos of People Clowning on Its Voice Mode"},"content":{"rendered":"<p> <br \/>\n<br \/><img decoding=\"async\" src=\"https:\/\/gizmodo.com\/app\/uploads\/2024\/05\/72304a9885903a2bfcba79ca40608c30-1024x683.jpg\" \/><\/p>\n<div>\n<p>Earlier this year, Sam Altman was confronted directly with a video from what has become a viral trend: people showing off the significant shortcomings of OpenAI\u2019s voice model. It seems he didn\u2019t particularly enjoy that, because OpenAI is taking steps to save Altman from future embarrassment. On Thursday, the company announced three new voice models meant to open up the technology to developers who might be able to do groundbreaking things like program a functional timer.<\/p>\n<p>Per the company, it is releasing GPT-Realtime-2, its first voice model with \u201cGPT-5-class reasoning\u201d that can allegedly handle difficult prompts and better maintain conversations than its predecessors. It also introduced GPT-Realtime-Translate, which it claims can translate speech from more than 70 input languages into 13 output languages while \u201ckeeping pace with the speaker.\u201d The final model, GPT-Realtime-Whisper, is meant for live speech-to-text transcription.<\/p>\n<p>\u201cVoice is becoming one of the most natural ways for people to use software,\u201d the company said in a statement. 
\u201cBut building useful voice products takes more than fast turn-taking or a natural-sounding voice. A voice agent needs to understand what someone means, keep track of context, recover when a request changes, use tools while the conversation continues, and respond in a way that feels appropriate to the moment.\u201d<\/p>\n<p>The challenges of building AI models have become the subject of many a meme over the past year or so. TikTok user @huskistaken, aka Husk, is perhaps the master of the genre, regularly poking holes in the capabilities of OpenAI\u2019s previous voice models\u2014though instead of doing so as a red teamer preventing issues from making it into the final product, he primarily encourages OpenAI to make changes via embarrassment.<\/p>\n<p>It was one of Husk\u2019s videos that made its way to Altman earlier this year. The CEO was made to watch ChatGPT\u2019s voice model very obviously lie about starting a timer. Husk would ask the model to time how long it took him to run a mile, then immediately say he was done, only for the model to claim he finished his mile in 10 minutes. Altman, visibly annoyed about the whole thing, said it\u2019d be \u201cMaybe another year before something like that works well.\u201d<\/p>\n<p>The new models are meant to speed up solutions to this confounding problem. Per OpenAI\u2019s press release, the new releases are adept at \u201cvoice-to-action, where people can describe what they need and the system can reason through the request, use tools, and complete the task.\u201d The company offers the example of asking Zillow to \u201cfind me homes within my BuyAbility, avoid busy streets, and schedule a tour for Saturday.\u201d That certainly feels a bit more advanced than \u201cstart a timer,\u201d but it stands to reason that\u2019d fall under the same functionality.<\/p>\n<p>The real test of OpenAI\u2019s new models will be the jailbreakers like Husk. 
Earlier this year, OpenAI co-founder Andrej Karpathy argued that people simply haven\u2019t updated their priors on AI models, which he said are advancing all the time in ways that don\u2019t garner the same attention as videos of people messing with the voice model. But those videos aren\u2019t old\u2014Husk uploads new ones regularly. If he stops posting with the release of this new model, chalk up a win for the true believers like Karpathy.<\/p>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>Earlier this year, Sam Altman was confronted directly with a video from what has become a viral trend: people showing off the significant shortcomings of OpenAI\u2019s voice model. It seems he didn\u2019t particularly enjoy that, because OpenAI is taking steps to save Altman from future embarrassment. On Thursday, the company announced three new voice models [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":92953,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_daextam_enable_autolinks":"","jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2}},"categories":[11],"tags":[],"class_list":["post-92952","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-tech-news"],"jetpack_publicize_connections":[],"jetpack_featured_media_url":"https:\/\/diyhaven858.wasmer.app\/wp-content\/uploads\/2026\/05\/72304a9885903a2bfcba79ca40608c30-e1740080089739.jpg","jetpack_sharing_enabled":true,"
jetpack-related-posts":[],"_links":{"self":[{"href":"https:\/\/diyhaven858.wasmer.app\/index.php\/wp-json\/wp\/v2\/posts\/92952","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/diyhaven858.wasmer.app\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/diyhaven858.wasmer.app\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/diyhaven858.wasmer.app\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/diyhaven858.wasmer.app\/index.php\/wp-json\/wp\/v2\/comments?post=92952"}],"version-history":[{"count":0,"href":"https:\/\/diyhaven858.wasmer.app\/index.php\/wp-json\/wp\/v2\/posts\/92952\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/diyhaven858.wasmer.app\/index.php\/wp-json\/wp\/v2\/media\/92953"}],"wp:attachment":[{"href":"https:\/\/diyhaven858.wasmer.app\/index.php\/wp-json\/wp\/v2\/media?parent=92952"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/diyhaven858.wasmer.app\/index.php\/wp-json\/wp\/v2\/categories?post=92952"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/diyhaven858.wasmer.app\/index.php\/wp-json\/wp\/v2\/tags?post=92952"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}