时间:2026-05-23 15:04:03 来源:网络整理编辑:百科
Just when you started coming to terms with ChatGPT's eerie capabilities, OpenAI dropped a new versio
Just when you started coming to terms with ChatGPT's eerie capabilities, OpenAI dropped a new version of its AI language model.
OpenAI says GPT-4 is much more advanced than GPT-3, which powers ChatGPT. And to prove it, they made GPT-4 sit down for a bunch of exams. OpenAI tested GPT-4 with a variety of standardized tests from high school to graduate to professional level and spanning across mathematics, science, coding, history, literature, and even the one you take to become a sommelier. The exams were comprised of multiple choice and free-response question and GPT-4 was scored using the standard methodology for each exam.
SEE ALSO:How to get access to GPT-4 right nowPut your pencil down, GPT-4, it's time to see check your scores.
GPT-4 didn't just get into law school, it passed the bar. The AI language model scored in the 88th percentile on the LSATs (Law School Admission Test) and did even better on the Bar (Uniform Bar Exam) by scoring in the 90th percentile. By comparison, GPT-3 was in the bottom 40 percent of the LSATs and 10 percent on the Bar.

GPT-4 took both the math and reading/writing sections of the SATs and all three sections of the GREs which are broken down into quantitative, verbal, and writing skills. It scored in the 80th or 90th percentile of all sections except for the writing section of the GREs... which it kind of bombed in the 54th percentile.
The quintessential overachiever, GPT-4 also took allthe AP (Advanced Placement) high school exams. It aced most of them, scoring between the 84th and 100th, except for a few outliers.
GPT-4 scored 44th in AP English Language and a measly 22nd in AP English Literature. So all you wordsmiths out there might have some more time before GPT-4 replaces you. GPT-4 didn't do so hot on AP Calculus BC scoring between 43rd and 59th, proving that even for a supercomputer, calculus is not easy. But that still earns GPT-4 a four, so it might still place out of college calculus.
GPT-4 still has some work to do with its coding skills, which is curious since one of its marketed uses is for helping developers. Its rating for Codeforces, which hosts competitive programming events, is 392, which puts it way down in the Newbie category of anything below 1199.
It did pretty well on the easy level of the Leetcode (31 out of 41 problems solved) but struggled when it came to medium or hard level of difficulty (21/80 and 3/45 respectively). As we saw in the developer demo livestream, GPT-4 is fully capable of writing Python, but required some manual tweaking to set the right parameters, which might explain some these test scores. Or maybe it didn't eat breakfast that morning.
GPT-4 passed the sommelier exams with flying colors. It placed lowest (77th percentile) in the most advanced sommelier exam. But for a non-human entity that's never tasted wine, we'll let that one slide.
OpenAI has released a full breakdown of how GPT-4 performed. GPT-4 might not write the next great American novel...yet, but GPT-4's future as a mathematically brilliant lawyer and wine connoisseur looks pretty bright.
TopicsArtificial IntelligenceChatGPT
17 questions you can answer if you're a good communicator2026-05-23 14:37
利物浦前瞻 :薩拉赫力爭五連殺 衝26場不敗神跡2026-05-23 14:36
裏皮談中國足球:一切都縮水至20年前 包括我們的年薪2026-05-23 14:26
模仿C羅 ?姆巴佩死球後膝撞對手 精彩表現因此蒙塵2026-05-23 13:39
Wikipedia co2026-05-23 13:35
泰山12次晉級足協杯決賽衝擊第7冠 若遇海港大打對攻 ?2026-05-23 13:29
巴薩極有可能今天官宣哈維入主 已準備好砸解約金2026-05-23 13:08
取勝之匙:曼聯啃老那個男人 藍月太子反客為主2026-05-23 13:00
Make money or go to Stanford? Katie Ledecky is left with an unfair choice.2026-05-23 12:59
官方 :巴雷拉與國米續約兩年 新合同2026年到期2026-05-23 12:21
Despite IOC ban, Rio crowds get their political messages across2026-05-23 14:50
巴薩極有可能今天官宣哈維入主 已準備好砸解約金2026-05-23 14:47
贏得艱難!囧叔取斑馬生涯200勝 替補絕殺+對手染紅2026-05-23 14:36
皇馬前瞻:戰艦複仇之戰 若平局創52年尷尬曆史2026-05-23 14:32
This weird squid looks like it has googly eyes, guys2026-05-23 14:08
國足12強賽開球時間仍在協調 將爭取延後比賽時間2026-05-23 14:04
巴薩首發:德佩法蒂先發 德容布斯克茨出戰2026-05-23 13:14
尤文前瞻 :斑馬軍望結束兩連敗 或迎曆史最差開局2026-05-23 13:09
This German startup wants to be your bank (without being a bank)2026-05-23 12:59
西班牙人前瞻:大腿複出武磊繼續替補 中資德比上演2026-05-23 12:40