OpenAI launches GPT-4o, flaunting ability of model to detect user emotions

GPT-4o, claims OpenAI, is faster and more adept at handling text, audio and video – and even detecting user emotions, a major source of controversy in AI research. 

By Greg Noone

OpenAI has launched a new LLM named GPT-4o and a desktop version of ChatGPT for macOS. The new model, said its creators, is faster and more efficient at parsing text, audio and video than its predecessors. GPT-4o also allows ChatGPT to process up to 50 languages at greater speeds. The new model is accessible to developers through OpenAI’s API and to free and premium users of ChatGPT.

“GPT-4o reasons across voice, text and vision,” said OpenAI’s Mira Murati. “With [its] incredible efficiencies, it also allows us to bring GPT-4-class intelligence to our free users. This is something that we’ve been trying to do for many, many months, and we’re very, very excited to finally bring GPT-4o to all of our users.”

ChatGPT, powered by OpenAI’s new model GPT-4o, interprets a line graph. OpenAI also promoted the model’s emotion recognition capabilities, a field many AI researchers and psychologists find controversial. (Photo by OpenAI)

New emotion recognition capabilities for GPT-4o version of ChatGPT

OpenAI also highlighted the new voice and vision perception capabilities of ChatGPT powered by GPT-4o. In a demonstration by OpenAI’s head of post-training, Barret Zoph, the chatbot was shown to successfully parse a linear equation on a sheet of paper and correctly answer questions about how to solve it – a notable advance in its own right, given the historical difficulty LLMs have had in solving mathematical conundrums. 

A large part of the presentation was also devoted to showing how these new capabilities might be used to enhance the ability of ChatGPT to detect emotion. In a slightly halting demonstration, the firm’s head of frontiers research Mark Chen showed how the model could be interrupted and could pick up on emotional cues from the user. “When I was breathing super-hard there, it could tell and it knew, ‘Hey, you might want to calm down a little bit,’” said Chen. “Not only that, though, the model is able to generate voice in a variety of different emotive styles [with] a wide dynamic range.”

ChatGPT’s visual perception has also been enhanced by GPT-4o, claims OpenAI. Another demonstration by Zoph highlighted how the model can be used to interpret facial expressions to divine insights about the user’s emotional state from photos or video footage. 

GPT-4o’s capabilities in this area should be welcomed, said Sean Betts, Omnicom Media Group’s Chief Product & Technology Officer. “GPT-4o is the first technology humanity has ever developed that not only understands emotions but can also respond with emotion,” said Betts. “A lot of talk around generative AI chat models has centred around intelligence, but I think we now also need to start thinking about the emotional intelligence of generative AI. This is new territory for us, but one that I think will completely change our relationships with technology from now on.”

GPT Store now open to free users

Live emotion recognition remains a hugely controversial subject among AI researchers and academics, with many disputing the premise that it is possible to fully interpret how an individual is feeling from visual data alone. The implications of promoting GPT-4o’s supposed capabilities in this area were only partially addressed by OpenAI during the presentation, with Murati explaining that the firm was committed to continually working with red teamers and representatives of civil society to root out and resolve problems and ethical dilemmas. 

OpenAI also announced that it would be making many premium features of ChatGPT available to free users, including access to its GPT Store, which lets users download generative AI applications powered by OpenAI’s LLMs and tailored to specific use cases. A desktop version of ChatGPT was also announced, capable of live interpretation of data on the user’s screen. The app is only available to users of macOS, however, with a version for Windows to be released “later this year.”
