OpenAI’s newest version of ChatGPT adds a bunch of new bells and whistles, including increased speed and a voice option.
On Monday, the company announced the launch its new artificial intelligence model and desktop version of ChatGPT with a new updated graphics user interface.
In a livestreamed event, technology chief Mira Murati announced that OpenAI is bringing GPT-4 to all users, including those on the free tier. She highlighted that the new model, GPT-4o, is “much faster” and boasts enhanced capabilities in text, video, and audio. OpenAI also revealed plans to eventually enable users to video chat with ChatGPT.
The “o” in GPT-4o stands for omni. The new model allows ChatGPT to handle 50 different languages with improved speed and quality. It will also be available via OpenAI’s API, enabling developers to start building applications with the new model today, according to Murati. She added that GPT-4o is twice as fast and half the cost of GPT-4 Turbo.
OpenAI team members demonstrated the new model’s audio capabilities, such as helping someone calm down before a public speech. OpenAI researcher Mark Chen explained that the model can “perceive your emotion” and can handle interruptions from users. The team also showcased its ability to analyze a user’s facial expressions and comment on the emotions the person might be experiencing.
Investors have valued OpenAI, backed by Microsoft, at over $80 billion. Founded in 2015, the company faces pressure to remain a leader in the generative AI market while seeking ways to generate revenue. This challenge comes as OpenAI spends massive sums on processors and infrastructure to build and train its models.
Read more: Verses AI raises CAD$10M in private placement and leans into AI product, Genius
Read more: Verses Technologies’ COSM adds another dimension to artificial intelligence
Say hello to GPT-4o, our new flagship model which can reason across audio, vision, and text in real time: https://t.co/MYHZB79UqN
Text and image input rolling out today in API and ChatGPT with voice and video in the coming weeks. pic.twitter.com/uuthKZyzYx
— OpenAI (@OpenAI) May 13, 2024
OpenAI to test voice mode in the coming weeks
In a blog post, OpenAI announced plans to test Voice Mode in the coming weeks complete with early access for ChatGPT Plus subscribers. The company said the new model can respond to users’ audio prompts “in as little as 232 milliseconds, with an average of 320 milliseconds, which is similar to human response time in a conversation.”
Chen demonstrated the model’s ability to tell a bedtime story and adjusted its voice to be more dramatic or robotic upon request. He even had it sing the story. Additionally, OpenAI’s new model can function as a translator, even in audio mode. Chen showcased the tool’s ability to listen to Murati speaking Italian while he spoke English, translating their conversation into their respective languages.
Team members also demonstrated the model’s capabilities in solving math equations and helping write code, positioning it as a stronger competitor to Microsoft’s GitHub Copilot.
For OpenAI, this launch marks one of the company’s most significant announcements since the August debut of ChatGPT Enterprise, the AI chatbot’s business tier. OpenAI Chief Operating Officer Brad Lightcap indicated the company developed the tool in under a year with the help of more than 20 companies from various sizes and industries.
The company rolled out the new model on Tuesday to customers of ChatGPT Plus and Team, and intends on extending it to Enterprise later, according to the blog post. It’s also available to free users with usage limits. Users will have five times more message capacity than free users, and clients using the Team and Enterprise option will enjoy even greater than that.
Read more: AI discovers distinct difference between male and female brain
Over 700 generative AI deals in 2023
According to financial data and software company, Pitchbook, there have been approximately USD$29 billion invested over 700 generative AI deals in 2023, which is a boost of over 260 per cent from 2022. The gold rush is spearheaded by OpenAI, Microsoft (NASDAQ: MSFT) and Google (NASDAQ: GOOG). Earlier this month, Anthropic extended its first-ever enterprise program, and announced that its AI, Claude, will now be offered via a free phone app.
Meanwhile, smaller companies like VERSES AI (CBOE: VERS) (OTCQB: VRSSF), Nextech3D.ai (OTCQX: NEXCF) (CSE: NTAR) (FSE: EP2), Diagnamed Holdings Corp (CSE: DMED) (OTCQB: DGNMF) have each incorporated artificial intelligence to perform various functions in diverse industries.
Some in the industry have raised concerns about the speed at which untested new services are coming to market, and academics and ethicists are distressed about the technology’s tendency to propagate bias.
After launching ChatGPT in November 2022, OpenAI saw it break records as the fastest-growing consumer app in history, now boasting about 100 million weekly active users. OpenAI indicates that more than 92 per cent of Fortune 500 companies are using the platform.
Verses AI is a sponsor of Mugglehead news coverage.
.
Follow Joseph Morton on Twitter
joseph@mugglehead.com
