Time: 2026-02-22 03:04:24 Source: compiled from the web Editor: Encyclopedia
Just when you started coming to terms with ChatGPT's eerie capabilities, OpenAI dropped a new version of its AI language model.
OpenAI says GPT-4 is much more advanced than GPT-3.5, which powers ChatGPT. And to prove it, they made GPT-4 sit down for a bunch of exams. OpenAI tested GPT-4 with a variety of standardized tests, from high school to graduate to professional level, spanning mathematics, science, coding, history, literature, and even the one you take to become a sommelier. The exams consisted of multiple-choice and free-response questions, and GPT-4 was scored using the standard methodology for each exam.
Put your pencil down, GPT-4, it's time to check your scores.
GPT-4 didn't just get into law school, it passed the bar. The AI language model scored in the 88th percentile on the LSAT (Law School Admission Test) and did even better on the Bar (Uniform Bar Exam) by scoring in the 90th percentile. By comparison, GPT-3.5 scored in the 40th percentile on the LSAT and the 10th percentile on the Bar.

GPT-4 took both the math and reading/writing sections of the SAT and all three sections of the GRE, which is broken down into quantitative, verbal, and writing skills. It scored in the 80th or 90th percentile on all sections except for the writing section of the GRE... which it kind of bombed in the 54th percentile.
The quintessential overachiever, GPT-4 also took all the AP (Advanced Placement) high school exams. It aced most of them, scoring between the 84th and 100th percentiles, except for a few outliers.
GPT-4 scored in the 44th percentile in AP English Language and a measly 22nd in AP English Literature. So all you wordsmiths out there might have some more time before GPT-4 replaces you. GPT-4 didn't do so hot on AP Calculus BC, scoring between the 43rd and 59th percentiles, proving that even for a supercomputer, calculus is not easy. But that still earns GPT-4 a four, so it might still place out of college calculus.
GPT-4 still has some work to do with its coding skills, which is curious since one of its marketed uses is helping developers. Its rating on Codeforces, which hosts competitive programming events, is 392, which puts it way down in the Newbie category of anything below 1200.
It did pretty well on the easy level of LeetCode (31 out of 41 problems solved) but struggled when it came to the medium and hard levels of difficulty (21/80 and 3/45 respectively). As we saw in the developer demo livestream, GPT-4 is fully capable of writing Python, but required some manual tweaking to set the right parameters, which might explain some of these test scores. Or maybe it didn't eat breakfast that morning.
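To put those LeetCode tallies on a common scale, here is a minimal Python sketch that turns the solved/attempted counts quoted above into solve rates. The counts come from the article; the names `leetcode_results` and `solve_rate` are made up for illustration and are not part of any OpenAI or LeetCode tooling.

```python
# Solved/attempted counts as quoted in the article (easy, medium, hard).
# The dictionary and function names are illustrative assumptions.
leetcode_results = {
    "easy": (31, 41),
    "medium": (21, 80),
    "hard": (3, 45),
}


def solve_rate(solved, attempted):
    """Return the share of problems solved, as a percentage."""
    return 100 * solved / attempted


# Per-difficulty solve rates.
rates = {level: solve_rate(*counts) for level, counts in leetcode_results.items()}

# Overall rate across all three difficulty levels.
total_solved = sum(solved for solved, _ in leetcode_results.values())
total_attempted = sum(attempted for _, attempted in leetcode_results.values())
overall = solve_rate(total_solved, total_attempted)

for level, rate in rates.items():
    print(f"{level}: {rate:.1f}% solved")
print(f"overall: {total_solved}/{total_attempted} ({overall:.1f}%)")
```

Seen this way, the drop-off is steep: roughly three in four easy problems solved, about one in four medium, and fewer than one in ten hard.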
GPT-4 passed the sommelier exams with flying colors. It placed lowest (77th percentile) in the most advanced sommelier exam. But for a non-human entity that's never tasted wine, we'll let that one slide.
OpenAI has released a full breakdown of how GPT-4 performed. GPT-4 might not write the next great American novel...yet, but GPT-4's future as a mathematically brilliant lawyer and wine connoisseur looks pretty bright.