{"id":54748,"date":"2025-08-08T00:44:42","date_gmt":"2025-08-07T19:14:42","guid":{"rendered":"https:\/\/officechai.com\/?p=54748"},"modified":"2025-08-08T00:44:44","modified_gmt":"2025-08-07T19:14:44","slug":"grok-4-remains-top-model-on-arc-agi-2-gpt-5-comes-in-distant-second","status":"publish","type":"post","link":"https:\/\/officechai.com\/ai\/grok-4-remains-top-model-on-arc-agi-2-gpt-5-comes-in-distant-second\/","title":{"rendered":"Grok 4 Remains Top Model On ARC-AGI 2, GPT-5 Comes In Distant Second"},"content":{"rendered":"\n<p>GPT-5 might currently be the best AI model in the world, but it seems to have gotten no closer to achieving AGI.<\/p>\n\n\n\n<p>xAI&#8217;s Grok 4 remains the best-performing model on the ARC-AGI 2 index, which tracks the models&#8217; general intelligence. On the index, Grok 4 had earlier <a href=\"https:\/\/officechai.com\/ai\/xai-releases-grok4-beats-openai-and-google-on-many-benchmarks\/\">blown away<\/a> the competition with a score of 15.9%. GPT-5, released a month later today, scored just 9.9%. GPT-5, however, became the second-best performing model on the index.<\/p>\n\n\n\n<figure class=\"wp-block-embed is-type-rich is-provider-twitter wp-block-embed-twitter\"><div class=\"wp-block-embed__wrapper\">\n<div class=\"embed-twitter\"><blockquote class=\"twitter-tweet\" data-width=\"550\" data-dnt=\"true\"><p lang=\"en\" dir=\"ltr\">Grok 4 is still state-of-the-art on ARC-AGI-2 among frontier models. <br><br>15.9% for Grok 4 vs 9.9% for GPT-5. <a href=\"https:\/\/t.co\/wSezrsZsjw\">pic.twitter.com\/wSezrsZsjw<\/a><\/p>&mdash; Fran\u00e7ois Chollet (@fchollet) <a href=\"https:\/\/twitter.com\/fchollet\/status\/1953511631054680085?ref_src=twsrc%5Etfw\">August 7, 2025<\/a><\/blockquote><script async src=\"https:\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><\/div>\n<\/div><\/figure>\n\n\n\n<p>&#8220;Grok 4 is still state-of-the-art on ARC-AGI-2 among frontier models. 15.9% for Grok 4 vs 9.9% for GPT-5,&#8221; said Francois Chollet, founder of the Arc Prize. This didn&#8217;t go unnoticed by Elon Musk, who has <a href=\"https:\/\/officechai.com\/ai\/elon-musk-sam-altman-trade-barbs-openais-announcement-of-500-billion-funding\/\">no love lost<\/a> for OpenAI or Sam Altman. &#8220;Grok 4 beats GPT-5 on ARC-AGI,&#8221; he declared.<\/p>\n\n\n\n<figure class=\"wp-block-embed is-type-rich is-provider-twitter wp-block-embed-twitter\"><div class=\"wp-block-embed__wrapper\">\n<div class=\"embed-twitter\"><blockquote class=\"twitter-tweet\" data-width=\"550\" data-dnt=\"true\"><p lang=\"en\" dir=\"ltr\">Grok 4 beats GPT-5 on ARC-AGI <a href=\"https:\/\/t.co\/FwpF3kfeLk\">pic.twitter.com\/FwpF3kfeLk<\/a><\/p>&mdash; Elon Musk (@elonmusk) <a href=\"https:\/\/twitter.com\/elonmusk\/status\/1953512163571904671?ref_src=twsrc%5Etfw\">August 7, 2025<\/a><\/blockquote><script async src=\"https:\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><\/div>\n<\/div><\/figure>\n\n\n\n<p>The ARC-AGI Prize is a $1 million+ open competition created to advance progress toward Artificial General Intelligence (AGI). It incentivizes teams to solve the ARC-AGI benchmark, which is a set of reasoning tasks designed to evaluate how well AI systems can generalize and solve problems they have never seen before, which is a core aspect of human-like intelligence. The ARC-AGI benchmark, first released in 2019, consists of IQ-test-like puzzles using colored grids to test abstract reasoning from minimal examples, without requiring prior domain knowledge or language. Performance on the ARC-AGI index is measured as the percentage of correct solutions on a private evaluation set, acting as a rigorous metric and milestone for AGI research. <\/p>\n\n\n\n<p>While the other benchmarks have practical purposes &#8212; there are benchmarks for coding, math, science, and host of other fields &#8212; ARC-AGI gets the models to solve puzzles which seemingly have no real-life value. These puzzles can be easily solved by most humans, but AI models struggle with it, and as such, these puzzles are a good test of how close AI is getting to generalized human intelligence. On the latest version of the test, the best AI model thus far, Grok 4, could only solve 15.9 percent of the puzzles. And the fact that GPT-5 can solve only 9.9 percent of the puzzles shows that while GPT-5 might&#8217;ve topped the practical benchmarks, it&#8217;s likely taken us no closer to achieving AGI.<\/p>\n\n\n\n<p><a href=\"https:\/\/x.com\/elonmusk\/status\/1953512163571904671\/photo\/1\"><\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>GPT-5 might currently be the best AI model in the world, but it seems to have gotten no closer to achieving AGI. xAI&#8217;s&#8230;<\/p>\n","protected":false},"author":1,"featured_media":54751,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[1029],"tags":[],"class_list":["post-54748","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ai"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v25.5 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Grok 4 Remains Top Model On ARC-AGI 2, GPT-5 Comes In Distant Second<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/officechai.com\/ai\/grok-4-remains-top-model-on-arc-agi-2-gpt-5-comes-in-distant-second\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Grok 4 Remains Top Model On ARC-AGI 2, GPT-5 Comes In Distant Second\" \/>\n<meta property=\"og:description\" content=\"GPT-5 might currently be the best AI model in the world, but it seems to have gotten no closer to achieving AGI. xAI&#8217;s...\" \/>\n<meta property=\"og:url\" content=\"https:\/\/officechai.com\/ai\/grok-4-remains-top-model-on-arc-agi-2-gpt-5-comes-in-distant-second\/\" \/>\n<meta property=\"og:site_name\" content=\"OfficeChai\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/OfficeChai\/\" \/>\n<meta property=\"article:published_time\" content=\"2025-08-07T19:14:42+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-08-07T19:14:44+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2025\/08\/MixCollage-08-Aug-2025-12-42-AM-1358.jpg?fit=1200%2C630&ssl=1\" \/>\n\t<meta property=\"og:image:width\" content=\"1200\" \/>\n\t<meta property=\"og:image:height\" content=\"630\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"OfficeChai Team\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@OfficeChai\" \/>\n<meta name=\"twitter:site\" content=\"@OfficeChai\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"OfficeChai Team\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"2 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/officechai.com\/ai\/grok-4-remains-top-model-on-arc-agi-2-gpt-5-comes-in-distant-second\/\",\"url\":\"https:\/\/officechai.com\/ai\/grok-4-remains-top-model-on-arc-agi-2-gpt-5-comes-in-distant-second\/\",\"name\":\"Grok 4 Remains Top Model On ARC-AGI 2, GPT-5 Comes In Distant Second\",\"isPartOf\":{\"@id\":\"https:\/\/officechai.com\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/officechai.com\/ai\/grok-4-remains-top-model-on-arc-agi-2-gpt-5-comes-in-distant-second\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/officechai.com\/ai\/grok-4-remains-top-model-on-arc-agi-2-gpt-5-comes-in-distant-second\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2025\/08\/MixCollage-08-Aug-2025-12-42-AM-1358.jpg?fit=1200%2C630&ssl=1\",\"datePublished\":\"2025-08-07T19:14:42+00:00\",\"dateModified\":\"2025-08-07T19:14:44+00:00\",\"author\":{\"@id\":\"https:\/\/officechai.com\/#\/schema\/person\/5861f1134993293cc28905de7624d6b2\"},\"breadcrumb\":{\"@id\":\"https:\/\/officechai.com\/ai\/grok-4-remains-top-model-on-arc-agi-2-gpt-5-comes-in-distant-second\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/officechai.com\/ai\/grok-4-remains-top-model-on-arc-agi-2-gpt-5-comes-in-distant-second\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/officechai.com\/ai\/grok-4-remains-top-model-on-arc-agi-2-gpt-5-comes-in-distant-second\/#primaryimage\",\"url\":\"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2025\/08\/MixCollage-08-Aug-2025-12-42-AM-1358.jpg?fit=1200%2C630&ssl=1\",\"contentUrl\":\"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2025\/08\/MixCollage-08-Aug-2025-12-42-AM-1358.jpg?fit=1200%2C630&ssl=1\",\"width\":1200,\"height\":630},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/officechai.com\/ai\/grok-4-remains-top-model-on-arc-agi-2-gpt-5-comes-in-distant-second\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/officechai.com\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Grok 4 Remains Top Model On ARC-AGI 2, GPT-5 Comes In Distant Second\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/officechai.com\/#website\",\"url\":\"https:\/\/officechai.com\/\",\"name\":\"OfficeChai\",\"description\":\"Startups, Businesses And Careers\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/officechai.com\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/officechai.com\/#\/schema\/person\/5861f1134993293cc28905de7624d6b2\",\"name\":\"OfficeChai Team\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/officechai.com\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/61d744733248dc647d505d0676bb425323413132ee5447e86aa8eecbbb7b27d5?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/61d744733248dc647d505d0676bb425323413132ee5447e86aa8eecbbb7b27d5?s=96&d=mm&r=g\",\"caption\":\"OfficeChai Team\"},\"description\":\"Dotting the i's, crossing the t's.\",\"url\":\"https:\/\/officechai.com\/author\/admin\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Grok 4 Remains Top Model On ARC-AGI 2, GPT-5 Comes In Distant Second","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/officechai.com\/ai\/grok-4-remains-top-model-on-arc-agi-2-gpt-5-comes-in-distant-second\/","og_locale":"en_US","og_type":"article","og_title":"Grok 4 Remains Top Model On ARC-AGI 2, GPT-5 Comes In Distant Second","og_description":"GPT-5 might currently be the best AI model in the world, but it seems to have gotten no closer to achieving AGI. xAI&#8217;s...","og_url":"https:\/\/officechai.com\/ai\/grok-4-remains-top-model-on-arc-agi-2-gpt-5-comes-in-distant-second\/","og_site_name":"OfficeChai","article_publisher":"https:\/\/www.facebook.com\/OfficeChai\/","article_published_time":"2025-08-07T19:14:42+00:00","article_modified_time":"2025-08-07T19:14:44+00:00","og_image":[{"width":1200,"height":630,"url":"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2025\/08\/MixCollage-08-Aug-2025-12-42-AM-1358.jpg?fit=1200%2C630&ssl=1","type":"image\/jpeg"}],"author":"OfficeChai Team","twitter_card":"summary_large_image","twitter_creator":"@OfficeChai","twitter_site":"@OfficeChai","twitter_misc":{"Written by":"OfficeChai Team","Est. reading time":"2 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/officechai.com\/ai\/grok-4-remains-top-model-on-arc-agi-2-gpt-5-comes-in-distant-second\/","url":"https:\/\/officechai.com\/ai\/grok-4-remains-top-model-on-arc-agi-2-gpt-5-comes-in-distant-second\/","name":"Grok 4 Remains Top Model On ARC-AGI 2, GPT-5 Comes In Distant Second","isPartOf":{"@id":"https:\/\/officechai.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/officechai.com\/ai\/grok-4-remains-top-model-on-arc-agi-2-gpt-5-comes-in-distant-second\/#primaryimage"},"image":{"@id":"https:\/\/officechai.com\/ai\/grok-4-remains-top-model-on-arc-agi-2-gpt-5-comes-in-distant-second\/#primaryimage"},"thumbnailUrl":"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2025\/08\/MixCollage-08-Aug-2025-12-42-AM-1358.jpg?fit=1200%2C630&ssl=1","datePublished":"2025-08-07T19:14:42+00:00","dateModified":"2025-08-07T19:14:44+00:00","author":{"@id":"https:\/\/officechai.com\/#\/schema\/person\/5861f1134993293cc28905de7624d6b2"},"breadcrumb":{"@id":"https:\/\/officechai.com\/ai\/grok-4-remains-top-model-on-arc-agi-2-gpt-5-comes-in-distant-second\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/officechai.com\/ai\/grok-4-remains-top-model-on-arc-agi-2-gpt-5-comes-in-distant-second\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/officechai.com\/ai\/grok-4-remains-top-model-on-arc-agi-2-gpt-5-comes-in-distant-second\/#primaryimage","url":"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2025\/08\/MixCollage-08-Aug-2025-12-42-AM-1358.jpg?fit=1200%2C630&ssl=1","contentUrl":"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2025\/08\/MixCollage-08-Aug-2025-12-42-AM-1358.jpg?fit=1200%2C630&ssl=1","width":1200,"height":630},{"@type":"BreadcrumbList","@id":"https:\/\/officechai.com\/ai\/grok-4-remains-top-model-on-arc-agi-2-gpt-5-comes-in-distant-second\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/officechai.com\/"},{"@type":"ListItem","position":2,"name":"Grok 4 Remains Top Model On ARC-AGI 2, GPT-5 Comes In Distant Second"}]},{"@type":"WebSite","@id":"https:\/\/officechai.com\/#website","url":"https:\/\/officechai.com\/","name":"OfficeChai","description":"Startups, Businesses And Careers","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/officechai.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/officechai.com\/#\/schema\/person\/5861f1134993293cc28905de7624d6b2","name":"OfficeChai Team","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/officechai.com\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/61d744733248dc647d505d0676bb425323413132ee5447e86aa8eecbbb7b27d5?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/61d744733248dc647d505d0676bb425323413132ee5447e86aa8eecbbb7b27d5?s=96&d=mm&r=g","caption":"OfficeChai Team"},"description":"Dotting the i's, crossing the t's.","url":"https:\/\/officechai.com\/author\/admin\/"}]}},"jetpack_featured_media_url":"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2025\/08\/MixCollage-08-Aug-2025-12-42-AM-1358.jpg?fit=1200%2C630&ssl=1","jetpack_shortlink":"https:\/\/wp.me\/p685C6-ef2","jetpack_likes_enabled":true,"jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/officechai.com\/wp-json\/wp\/v2\/posts\/54748","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/officechai.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/officechai.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/officechai.com\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/officechai.com\/wp-json\/wp\/v2\/comments?post=54748"}],"version-history":[{"count":1,"href":"https:\/\/officechai.com\/wp-json\/wp\/v2\/posts\/54748\/revisions"}],"predecessor-version":[{"id":54752,"href":"https:\/\/officechai.com\/wp-json\/wp\/v2\/posts\/54748\/revisions\/54752"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/officechai.com\/wp-json\/wp\/v2\/media\/54751"}],"wp:attachment":[{"href":"https:\/\/officechai.com\/wp-json\/wp\/v2\/media?parent=54748"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/officechai.com\/wp-json\/wp\/v2\/categories?post=54748"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/officechai.com\/wp-json\/wp\/v2\/tags?post=54748"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}