Technology

views：1160

OpenAI announces GPT-4, claims it can beat 90% of humans on the SAT

LatLongInfo.com

2023-03-15 16:31

OpenAI announced the latest version of its primary large language model, GPT-4, on Tuesday, that it says exhibits “human-level performance” on many professional tests.

ChatGPT-4 is “larger” than previous versions, which means it has been trained on more data and has more weights in its model file, making it more expensive to run as well.

Currently, many researchers in the field believe many of the recent advancements in AI come from running ever-larger models on thousands of supercomputers in training processes that can cost tens of millions of dollars. GPT-4 is an example of an approach centering around “scaling up” to achieve better results.

OpenAI said it used Microsoft Azure to train the model; Microsoft has invested billions in the startup. OpenAI did not publish details about the specific model size or the hardware it used to train it, which could be used to recreate the model, citing “the competitive landscape.”

OpenAI’s GPT large language model powers many of the artificial intelligence demos that have been wowing people in the technology industry in the past six months, including Bing’s AI chat and ChatGPT, and the latest version is a preview of new advancements that could start filtering down to consumer products like chatbots in the coming weeks. Bing’s AI chatbot uses GPT-4, Microsoft said on Tuesday.

OpenAI says the new model will produce fewer factually incorrect answers, go off the rails and chat about forbidden topics less often, and even perform better than humans on many standardized tests.

GPT-4 performed at the 90th percentile on a simulated bar exam, the 93rd percentile on an SAT reading exam, and the 89th percentile on the SAT Math exam, OpenAI claimed.

However, OpenAI warns that the new software isn’t perfect yet and that it is less capable than humans in many scenarios. It still has a major problem with “hallucination,” or making stuff up, and isn’t factually reliable, the company said. It is still prone to insisting it is correct when it is wrong.

“GPT-4 still has many known limitations that we are working to address, such as social biases, hallucinations, and adversarial prompts,” the company said in a blog post.

“In a casual conversation, the distinction between GPT-3.5 and GPT-4 can be subtle. The difference comes out when the complexity of the task reaches a sufficient threshold—GPT-4 is more reliable, creative, and able to handle much more nuanced instructions than GPT-3.5,” OpenAI wrote in a blog post.

The new model will be available to paid ChatGPT subscribers and will also be available as part of an API which allows programmers to integrate the AI into their apps. OpenAI will charge about 3 cents for about 750 words of prompts and 6 cents for about 750 words in response.

Finance

Expectations of the Federal Reserve cutting interest rates have cooled down: The PPI "fire" has been burning continuously, and Jackson Hole has become the stage for hawks

Two weeks ago, US Treasury Secretary Janet Bessent was still making a high-profile prediction that the Federal Reserve would cut interest rates by 50 basis points in September and declared that the benchmark interest rate should be significantly reduced by 150 to 175 basis points.

Technology

OpenAI announces GPT-4, claims it can beat 90% of humans on the SAT

Washington's Military Reinforcement Turmoil: Power Struggles, Social Rifts, and Democratic Dilemmas

Three Republican states in the United States are deploying hundreds of guards to Washington

Trump's "Ceasefire Ultimatum" : Political Performance and Real Predicaments in Geopolitical Games

Trump raised the possibility of Zelensky attending the Alaska talks

Trump is considering allowing a "major lawsuit" against the chairperson of the Federal Reserve.

Trump's "Reclaim the Capital" Plan: Political Gaming and Social Division Under the Guise of Security Governance

ISC.AI 2025: China leads global AI governance and blueprint for the era of intelligent agents

Embodied Intelligence Platform Released: Initiating the New Era of Artificial Intelligence

The impact of weak market demand and changing trade patterns: the continuous decline of industrial profits in China

China discovers deep uranium deposits: a silent earth science revolution

What Do Jensen Huang's Frequent Visits to China Bring to China's Economy?

The EU-US trade statement has been put on hold due to digital regulations

Putin discussed the US-Russia summit with the leaders of Belarus and Kazakhstan

A large amount of personal information of tourists was stolen from an Italian hotel by hackers

The European Union has urged Israel to halt its "E1 Zone" settlement plan

Europe and the United States Reach Five Consensus on Negotiation Principles with Russia

Recommend

Expectations of the Federal Reserve cutting interest rates have cooled down: The PPI "fire" has been burning continuously, and Jackson Hole has become the stage for hawks

Fast food industry reflects the economic pressure of Americans

Tariffs "Fail to Save" American Domestic Industries

Behind the imprisonment of Yin Xiyue and his wife

Trump urges Kiev to reach agreement: Russia Ukraine war may come to an end

A $55 million buyout of search rights? Behind the lawsuit filed by Australian regulators against Google for suspected anti-competitive behavior

Latest

Expectations of the Federal Reserve cutting interest rates have cooled down: The PPI "fire" has been burning continuously, and Jackson Hole has become the stage for hawks

Fast food industry reflects the economic pressure of Americans

Tariffs "Fail to Save" American Domestic Industries

Behind the imprisonment of Yin Xiyue and his wife

Trump urges Kiev to reach agreement: Russia Ukraine war may come to an end

A $55 million buyout of search rights? Behind the lawsuit filed by Australian regulators against Google for suspected anti-competitive behavior

News categories

Area categories

services