Time: 2025-06-17 10:58:19 Source: compiled from the web Editor: Fashion
Just when you started coming to terms with ChatGPT's eerie capabilities, OpenAI dropped a new version of its AI language model.
OpenAI says GPT-4 is much more advanced than GPT-3.5, which powers ChatGPT. And to prove it, they made GPT-4 sit down for a bunch of exams. OpenAI tested GPT-4 with a variety of standardized tests, from high school to graduate to professional level, spanning mathematics, science, coding, history, literature, and even the one you take to become a sommelier. The exams comprised multiple-choice and free-response questions, and GPT-4 was scored using the standard methodology for each exam.
Put your pencil down, GPT-4, it's time to check your scores.
GPT-4 didn't just get into law school, it passed the bar. The AI language model scored in the 88th percentile on the LSAT (Law School Admission Test) and did even better on the bar (Uniform Bar Exam), scoring in the 90th percentile. By comparison, GPT-3.5 landed around the 40th percentile on the LSAT and the 10th percentile on the bar.
GPT-4 took both the math and reading/writing sections of the SAT and all three sections of the GRE, which are broken down into quantitative, verbal, and writing skills. It scored in the 80th or 90th percentile on every section except the writing section of the GRE... which it kind of bombed, landing in the 54th percentile.
The quintessential overachiever, GPT-4 also took all the AP (Advanced Placement) high school exams. It aced most of them, scoring between the 84th and 100th percentiles, except for a few outliers.
GPT-4 scored in the 44th percentile in AP English Language and a measly 22nd percentile in AP English Literature. So all you wordsmiths out there might have some more time before GPT-4 replaces you. GPT-4 didn't do so hot on AP Calculus BC either, scoring between the 43rd and 59th percentiles, proving that even for a supercomputer, calculus is not easy. But that still earns GPT-4 a four, so it might still place out of college calculus.
GPT-4 still has some work to do on its coding skills, which is curious since one of its marketed uses is helping developers. Its rating on Codeforces, which hosts competitive programming events, is 392, which puts it way down in the Newbie category, which covers anything below 1199.
It did pretty well on the easy level of LeetCode (31 out of 41 problems solved) but struggled with the medium and hard levels of difficulty (21/80 and 3/45 respectively). As we saw in the developer demo livestream, GPT-4 is fully capable of writing Python, but it required some manual tweaking to set the right parameters, which might explain some of these test scores. Or maybe it didn't eat breakfast that morning.
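To put those LeetCode counts side by side, here is a minimal Python sketch that turns the solved/total figures quoted above into solve rates. The names `leetcode_results` and `solve_rate` are made up for illustration and are not part of OpenAI's published results.

```python
# Solved/total counts as quoted in the article (easy, medium, hard).
leetcode_results = {
    "easy": (31, 41),
    "medium": (21, 80),
    "hard": (3, 45),
}


def solve_rate(solved: int, total: int) -> float:
    """Return the share of problems solved, as a percentage."""
    return 100.0 * solved / total


# Hypothetical helper mapping: difficulty level -> percentage solved.
rates = {level: solve_rate(s, t) for level, (s, t) in leetcode_results.items()}

for level, rate in rates.items():
    print(f"{level}: {rate:.0f}%")
```

That works out to roughly 76 percent of the easy problems, 26 percent of the medium ones, and 7 percent of the hard ones.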
GPT-4 passed the sommelier exams with flying colors. It placed lowest (77th percentile) in the most advanced sommelier exam. But for a non-human entity that's never tasted wine, we'll let that one slide.
OpenAI has released a full breakdown of how GPT-4 performed. GPT-4 might not write the next great American novel...yet, but GPT-4's future as a mathematically brilliant lawyer and wine connoisseur looks pretty bright.