Time: 2025-10-08 01:47:33 Source: compiled from the web Editor: Encyclopedia
Just when you started coming to terms with ChatGPT's eerie capabilities, OpenAI dropped a new version of its AI language model.
OpenAI says GPT-4 is much more advanced than GPT-3.5, which powers ChatGPT. And to prove it, they made GPT-4 sit down for a bunch of exams. OpenAI tested GPT-4 on a variety of standardized tests, from high school to graduate to professional level, spanning mathematics, science, coding, history, literature, and even the one you take to become a sommelier. The exams included multiple-choice and free-response questions, and GPT-4 was scored using the standard methodology for each exam.
Put your pencil down, GPT-4, it's time to check your scores.
GPT-4 didn't just get into law school, it passed the bar. The AI language model scored in the 88th percentile on the LSAT (Law School Admission Test) and did even better on the bar (Uniform Bar Exam), scoring in the 90th percentile. By comparison, GPT-3.5 landed in the bottom 40 percent on the LSAT and the bottom 10 percent on the bar.
GPT-4 took both the math and reading/writing sections of the SAT and all three sections of the GRE, which is broken down into quantitative, verbal, and writing skills. It scored in the 80th percentile or higher on every section except the writing section of the GRE... which it kind of bombed, landing in the 54th percentile.
The quintessential overachiever, GPT-4 also took all the AP (Advanced Placement) high school exams. It aced most of them, scoring between the 84th and 100th percentiles, except for a few outliers.
GPT-4 scored in the 44th percentile in AP English Language and a measly 22nd percentile in AP English Literature. So all you wordsmiths out there might have some more time before GPT-4 replaces you. GPT-4 didn't do so hot on AP Calculus BC, scoring between the 43rd and 59th percentiles, proving that even for a supercomputer, calculus is not easy. But that still earns GPT-4 a four, so it might still place out of college calculus.
GPT-4 still has some work to do with its coding skills, which is curious since one of its marketed uses is helping developers. Its rating on Codeforces, which hosts competitive programming events, is 392, which puts it way down in the Newbie category, the tier for any rating below 1200.
It did pretty well on the easy level of LeetCode (31 out of 41 problems solved) but struggled at the medium and hard levels of difficulty (21/80 and 3/45 respectively). As we saw in the developer demo livestream, GPT-4 is fully capable of writing Python, but it required some manual tweaking to set the right parameters, which might explain some of these test scores. Or maybe it didn't eat breakfast that morning.
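For a rough sense of scale, those LeetCode tallies can be turned into solve rates. The snippet below is only an illustrative sketch: the counts are the ones quoted above, while the names `leetcode_results` and `solve_rates` are made up for this example and come from neither OpenAI nor LeetCode.

```python
# Solved/attempted counts as quoted in the article.
# The dictionary layout and function name are hypothetical, chosen for illustration.
leetcode_results = {
    "easy": (31, 41),
    "medium": (21, 80),
    "hard": (3, 45),
}


def solve_rates(results):
    """Return the share of problems solved per difficulty, as a whole-number percent."""
    return {
        level: round(100 * solved / attempted)
        for level, (solved, attempted) in results.items()
    }


rates = solve_rates(leetcode_results)

# Overall rate across all three difficulty levels combined.
total_solved = sum(solved for solved, _ in leetcode_results.values())
total_attempted = sum(attempted for _, attempted in leetcode_results.values())
overall = round(100 * total_solved / total_attempted)

for level, pct in rates.items():
    print(f"{level}: {pct}%")
print(f"overall: {overall}%")
```

Put that way, GPT-4 solved roughly three quarters of the easy problems, about a quarter of the medium ones, and well under a tenth of the hard ones.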
GPT-4 passed the sommelier exams with flying colors. It placed lowest (77th percentile) in the most advanced sommelier exam. But for a non-human entity that's never tasted wine, we'll let that one slide.
OpenAI has released a full breakdown of how GPT-4 performed. GPT-4 might not write the next great American novel...yet, but GPT-4's future as a mathematically brilliant lawyer and wine connoisseur looks pretty bright.