{"id":58886,"date":"2026-02-12T22:30:53","date_gmt":"2026-02-12T17:00:53","guid":{"rendered":"https:\/\/officechai.com\/?p=58886"},"modified":"2026-02-12T23:25:12","modified_gmt":"2026-02-12T17:55:12","slug":"gemini-3-deep-think-benchmarks-arc-agi","status":"publish","type":"post","link":"https:\/\/officechai.com\/ai\/gemini-3-deep-think-benchmarks-arc-agi\/","title":{"rendered":"Google Releases Gemini 3 Deep Think, Tops ARC-AGI 2 Benchmark With 84.6%"},"content":{"rendered":"\n<p><a href=\"https:\/\/officechai.com\/ai\/gpt-5-2-pro-creates-new-record-of-54-2-on-arc-agi-2-beats-gemini-3-deep-think-preview-at-45-1\/\">ARC-AGI 2<\/a> &#8212; an iteration on the original ARC-AGI benchmark which was designed to test for AGI &#8212; appears to be close to getting saturated.<\/p>\n\n\n\n<p>Google DeepMind has unveiled a major upgrade to its<a href=\"https:\/\/officechai.com\/ai\/gemini-3-benchmarks\/\"> Gemini 3<\/a> family with the enhanced Gemini 3 Deep Think mode, positioning it as a breakthrough in advanced AI reasoning capabilities. This specialized mode, designed for tackling the most demanding scientific, research, and engineering challenges, delivers unprecedented performance across several key benchmarks.<\/p>\n\n\n\n<p>&#8220;We\u2019ve upgraded our specialized reasoning mode Gemini 3 Deep Think to help solve modern science, research, and engineering challenges \u2013 pushing the frontier of intelligence,&#8221; Google DeepMind said on X.<\/p>\n\n\n\n<p>On ARC-AGI-2 \u2014 a challenging benchmark emphasizing abstract reasoning, adaptability, and core intelligence without relying on memorized patterns \u2014 Gemini 3 Deep Think achieves a verified score of <strong>84.6%<\/strong>. This significantly outperforms competitors, including Gemini 3 Pro Preview at 31.1%, Claude Opus 4.6 (Thinking Max) at 68.8%, and GPT-5.2 (Thinking xhigh) at 52.9%. The result, verified by the ARC Prize Foundation, highlights substantial progress toward saturating this once-formidable test of general intelligence.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img data-recalc-dims=\"1\" fetchpriority=\"high\" decoding=\"async\" width=\"640\" height=\"800\" src=\"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-20-819x1024.png?resize=640%2C800&#038;ssl=1\" alt=\"\" class=\"wp-image-58887\" srcset=\"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-20-scaled.png?resize=819%2C1024&amp;ssl=1 819w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-20-scaled.png?resize=240%2C300&amp;ssl=1 240w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-20-scaled.png?resize=768%2C960&amp;ssl=1 768w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-20-scaled.png?resize=1229%2C1536&amp;ssl=1 1229w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-20-scaled.png?resize=1638%2C2048&amp;ssl=1 1638w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-20-scaled.png?w=2048&amp;ssl=1 2048w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-20-scaled.png?w=1280 1280w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-20-scaled.png?w=1920 1920w\" sizes=\"(max-width: 640px) 100vw, 640px\" \/><\/figure>\n\n\n\n<p>Gemini 3 Deep Think&#8217;s results were verified by ARC-AGI. &#8220;New SOTA result on ARC-AGI 2,&#8221; it <a href=\"https:\/\/x.com\/arcprize\/status\/2021985585066652039?s=20\">posted <\/a>on X. Gemini 3 Deep Think (2\/26) Semi Private Eval &#8211; ARC-AGI-1: 96.0%, $7.17\/task &#8211; ARC-AGI-2: 84.6% $13.62\/task,&#8221; it added.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img data-recalc-dims=\"1\" decoding=\"async\" width=\"640\" height=\"398\" data-src=\"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-25.png?resize=640%2C398&#038;ssl=1\" alt=\"\" class=\"wp-image-58895 lazyload\" data-srcset=\"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-25.png?w=937&amp;ssl=1 937w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-25.png?resize=300%2C187&amp;ssl=1 300w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-25.png?resize=768%2C478&amp;ssl=1 768w\" data-sizes=\"(max-width: 640px) 100vw, 640px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 640px; --smush-placeholder-aspect-ratio: 640\/398;\" \/><\/figure>\n\n\n\n<p>On the original ARC-AGI 1 benchmark, Gemini 3 Deep Think did even better, scoring 96% and essentially all but saturating the benchmark.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img data-recalc-dims=\"1\" decoding=\"async\" width=\"640\" height=\"404\" data-src=\"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-26.png?resize=640%2C404&#038;ssl=1\" alt=\"\" class=\"wp-image-58896 lazyload\" data-srcset=\"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-26.png?w=938&amp;ssl=1 938w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-26.png?resize=300%2C189&amp;ssl=1 300w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-26.png?resize=768%2C485&amp;ssl=1 768w\" data-sizes=\"(max-width: 640px) 100vw, 640px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 640px; --smush-placeholder-aspect-ratio: 640\/404;\" \/><\/figure>\n\n\n\n<p>In academic reasoning, Gemini 3 Deep Think scores <strong>48.4%<\/strong> on <strong>Humanity\u2019s Last Exam<\/strong> (no tools), surpassing Gemini 3 Pro Preview (37.5%), Claude Opus 4.6 (40.0%), and GPT-5.2 (34.5%). This benchmark, often described as one of the toughest evaluations of PhD-level knowledge across disciplines, underscores the model&#8217;s potential as a powerful assistant for researchers handling complex, interdisciplinary problems.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img data-recalc-dims=\"1\" decoding=\"async\" width=\"640\" height=\"800\" data-src=\"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-21-819x1024.png?resize=640%2C800&#038;ssl=1\" alt=\"\" class=\"wp-image-58888 lazyload\" data-srcset=\"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-21-scaled.png?resize=819%2C1024&amp;ssl=1 819w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-21-scaled.png?resize=240%2C300&amp;ssl=1 240w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-21-scaled.png?resize=768%2C960&amp;ssl=1 768w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-21-scaled.png?resize=1229%2C1536&amp;ssl=1 1229w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-21-scaled.png?resize=1638%2C2048&amp;ssl=1 1638w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-21-scaled.png?w=2048&amp;ssl=1 2048w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-21-scaled.png?w=1280 1280w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-21-scaled.png?w=1920 1920w\" data-sizes=\"(max-width: 640px) 100vw, 640px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 640px; --smush-placeholder-aspect-ratio: 640\/800;\" \/><\/figure>\n\n\n\n<p>For coding and algorithmic prowess, Gemini 3 Deep Think attains an impressive Elo rating of <strong>3455<\/strong> on <strong>Codeforces<\/strong> (no tools), well ahead of Gemini 3 Pro Preview (2512) and Claude Opus 4.6 (2352). This demonstrates elite-level performance in competitive programming, where solving novel, time-constrained algorithmic challenges is essential.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img data-recalc-dims=\"1\" decoding=\"async\" width=\"640\" height=\"800\" data-src=\"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-22-819x1024.png?resize=640%2C800&#038;ssl=1\" alt=\"\" class=\"wp-image-58889 lazyload\" data-srcset=\"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-22-scaled.png?resize=819%2C1024&amp;ssl=1 819w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-22-scaled.png?resize=240%2C300&amp;ssl=1 240w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-22-scaled.png?resize=768%2C960&amp;ssl=1 768w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-22-scaled.png?resize=1229%2C1536&amp;ssl=1 1229w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-22-scaled.png?resize=1638%2C2048&amp;ssl=1 1638w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-22-scaled.png?w=2048&amp;ssl=1 2048w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-22-scaled.png?w=1280 1280w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-22-scaled.png?w=1920 1920w\" data-sizes=\"(max-width: 640px) 100vw, 640px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 640px; --smush-placeholder-aspect-ratio: 640\/800;\" \/><\/figure>\n\n\n\n<p>In multimodal understanding, Gemini 3 Deep Think leads on <strong>MMMMU-Pro<\/strong> with <strong>81.5%<\/strong>, edging out Gemini 3 Pro Preview (81.0%), Claude Opus 4.6 (73.9%), and GPT-5.2 (79.5%). This reflects strong capabilities in reasoning across text, images, and other modalities \u2014 crucial for real-world applications like scientific analysis and engineering design.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img data-recalc-dims=\"1\" decoding=\"async\" width=\"640\" height=\"800\" data-src=\"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-23-819x1024.png?resize=640%2C800&#038;ssl=1\" alt=\"\" class=\"wp-image-58890 lazyload\" data-srcset=\"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-23-scaled.png?resize=819%2C1024&amp;ssl=1 819w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-23-scaled.png?resize=240%2C300&amp;ssl=1 240w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-23-scaled.png?resize=768%2C960&amp;ssl=1 768w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-23-scaled.png?resize=1229%2C1536&amp;ssl=1 1229w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-23-scaled.png?resize=1638%2C2048&amp;ssl=1 1638w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-23-scaled.png?w=2048&amp;ssl=1 2048w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-23-scaled.png?w=1280 1280w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-23-scaled.png?w=1920 1920w\" data-sizes=\"(max-width: 640px) 100vw, 640px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 640px; --smush-placeholder-aspect-ratio: 640\/800;\" \/><\/figure>\n\n\n\n<figure class=\"wp-block-image size-large\"><img data-recalc-dims=\"1\" decoding=\"async\" width=\"640\" height=\"398\" data-src=\"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/HA-LooqbsAMPPS8-1-1024x637.jpg?resize=640%2C398&#038;ssl=1\" alt=\"\" class=\"wp-image-58902 lazyload\" data-srcset=\"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/HA-LooqbsAMPPS8-1.jpg?resize=1024%2C637&amp;ssl=1 1024w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/HA-LooqbsAMPPS8-1.jpg?resize=300%2C187&amp;ssl=1 300w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/HA-LooqbsAMPPS8-1.jpg?resize=768%2C478&amp;ssl=1 768w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/HA-LooqbsAMPPS8-1.jpg?resize=1536%2C956&amp;ssl=1 1536w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/HA-LooqbsAMPPS8-1.jpg?resize=2048%2C1274&amp;ssl=1 2048w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/HA-LooqbsAMPPS8-1.jpg?w=1280 1280w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/HA-LooqbsAMPPS8-1.jpg?w=1920 1920w\" data-sizes=\"(max-width: 640px) 100vw, 640px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 640px; --smush-placeholder-aspect-ratio: 640\/398;\" \/><figcaption class=\"wp-element-caption\">Detailaed Gemini 3 Deep Think benchmarks<\/figcaption><\/figure>\n\n\n\n<p>These results stem from DeepMind&#8217;s methodology, which emphasizes enhanced reasoning chains, parallel hypothesis exploration, and inference-time optimizations in Deep Think mode. The mode excels in scenarios requiring deep, iterative thought rather than quick pattern matching.<\/p>\n\n\n\n<p>The rollout targets high-end users and enterprises. Google AI Ultra subscribers can access the upgraded Deep Think directly in the Gemini app. For broader experimentation in research and development, an early access program via Vertex AI is now available, allowing qualified users to integrate the model through the Gemini API.<\/p>\n\n\n\n<p>With Gemini 3 Deep Think, Google DeepMind reinforces its push toward AI systems that not only match but exceed human-level performance in specialized reasoning domains, paving the way for accelerated discovery in science and technology. As benchmarks like ARC-AGI-2 approach saturation, the focus shifts to practical, real-world impact \u2014 an area where this release aims to deliver immediate value.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>ARC-AGI 2 &#8212; an iteration on the original ARC-AGI benchmark which was designed to test for AGI &#8212; appears to be close to&#8230;<\/p>\n","protected":false},"author":1,"featured_media":58891,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[1029],"tags":[],"class_list":["post-58886","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ai"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v25.5 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Google Releases Gemini 3 Deep Think, Tops ARC-AGI 2 Benchmark With 84.6%<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/officechai.com\/ai\/gemini-3-deep-think-benchmarks-arc-agi\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Google Releases Gemini 3 Deep Think, Tops ARC-AGI 2 Benchmark With 84.6%\" \/>\n<meta property=\"og:description\" content=\"ARC-AGI 2 &#8212; an iteration on the original ARC-AGI benchmark which was designed to test for AGI &#8212; appears to be close to...\" \/>\n<meta property=\"og:url\" content=\"https:\/\/officechai.com\/ai\/gemini-3-deep-think-benchmarks-arc-agi\/\" \/>\n<meta property=\"og:site_name\" content=\"OfficeChai\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/OfficeChai\/\" \/>\n<meta property=\"article:published_time\" content=\"2026-02-12T17:00:53+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2026-02-12T17:55:12+00:00\" \/>\n<meta property=\"og:image\" content=\"http:\/\/officechai.com\/wp-content\/uploads\/2026\/02\/unnamed-6.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"1000\" \/>\n\t<meta property=\"og:image:height\" content=\"562\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"OfficeChai Team\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@OfficeChai\" \/>\n<meta name=\"twitter:site\" content=\"@OfficeChai\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"OfficeChai Team\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"4 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/officechai.com\/ai\/gemini-3-deep-think-benchmarks-arc-agi\/\",\"url\":\"https:\/\/officechai.com\/ai\/gemini-3-deep-think-benchmarks-arc-agi\/\",\"name\":\"Google Releases Gemini 3 Deep Think, Tops ARC-AGI 2 Benchmark With 84.6%\",\"isPartOf\":{\"@id\":\"https:\/\/officechai.com\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/officechai.com\/ai\/gemini-3-deep-think-benchmarks-arc-agi\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/officechai.com\/ai\/gemini-3-deep-think-benchmarks-arc-agi\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/unnamed-6.jpg?fit=1000%2C562&ssl=1\",\"datePublished\":\"2026-02-12T17:00:53+00:00\",\"dateModified\":\"2026-02-12T17:55:12+00:00\",\"author\":{\"@id\":\"https:\/\/officechai.com\/#\/schema\/person\/5861f1134993293cc28905de7624d6b2\"},\"breadcrumb\":{\"@id\":\"https:\/\/officechai.com\/ai\/gemini-3-deep-think-benchmarks-arc-agi\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/officechai.com\/ai\/gemini-3-deep-think-benchmarks-arc-agi\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/officechai.com\/ai\/gemini-3-deep-think-benchmarks-arc-agi\/#primaryimage\",\"url\":\"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/unnamed-6.jpg?fit=1000%2C562&ssl=1\",\"contentUrl\":\"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/unnamed-6.jpg?fit=1000%2C562&ssl=1\",\"width\":1000,\"height\":562},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/officechai.com\/ai\/gemini-3-deep-think-benchmarks-arc-agi\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/officechai.com\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Google Releases Gemini 3 Deep Think, Tops ARC-AGI 2 Benchmark With 84.6%\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/officechai.com\/#website\",\"url\":\"https:\/\/officechai.com\/\",\"name\":\"OfficeChai\",\"description\":\"Startups, Businesses And Careers\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/officechai.com\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/officechai.com\/#\/schema\/person\/5861f1134993293cc28905de7624d6b2\",\"name\":\"OfficeChai Team\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/officechai.com\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/61d744733248dc647d505d0676bb425323413132ee5447e86aa8eecbbb7b27d5?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/61d744733248dc647d505d0676bb425323413132ee5447e86aa8eecbbb7b27d5?s=96&d=mm&r=g\",\"caption\":\"OfficeChai Team\"},\"description\":\"Dotting the i's, crossing the t's.\",\"url\":\"https:\/\/officechai.com\/author\/admin\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Google Releases Gemini 3 Deep Think, Tops ARC-AGI 2 Benchmark With 84.6%","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/officechai.com\/ai\/gemini-3-deep-think-benchmarks-arc-agi\/","og_locale":"en_US","og_type":"article","og_title":"Google Releases Gemini 3 Deep Think, Tops ARC-AGI 2 Benchmark With 84.6%","og_description":"ARC-AGI 2 &#8212; an iteration on the original ARC-AGI benchmark which was designed to test for AGI &#8212; appears to be close to...","og_url":"https:\/\/officechai.com\/ai\/gemini-3-deep-think-benchmarks-arc-agi\/","og_site_name":"OfficeChai","article_publisher":"https:\/\/www.facebook.com\/OfficeChai\/","article_published_time":"2026-02-12T17:00:53+00:00","article_modified_time":"2026-02-12T17:55:12+00:00","og_image":[{"width":1000,"height":562,"url":"http:\/\/officechai.com\/wp-content\/uploads\/2026\/02\/unnamed-6.jpg","type":"image\/jpeg"}],"author":"OfficeChai Team","twitter_card":"summary_large_image","twitter_creator":"@OfficeChai","twitter_site":"@OfficeChai","twitter_misc":{"Written by":"OfficeChai Team","Est. reading time":"4 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/officechai.com\/ai\/gemini-3-deep-think-benchmarks-arc-agi\/","url":"https:\/\/officechai.com\/ai\/gemini-3-deep-think-benchmarks-arc-agi\/","name":"Google Releases Gemini 3 Deep Think, Tops ARC-AGI 2 Benchmark With 84.6%","isPartOf":{"@id":"https:\/\/officechai.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/officechai.com\/ai\/gemini-3-deep-think-benchmarks-arc-agi\/#primaryimage"},"image":{"@id":"https:\/\/officechai.com\/ai\/gemini-3-deep-think-benchmarks-arc-agi\/#primaryimage"},"thumbnailUrl":"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/unnamed-6.jpg?fit=1000%2C562&ssl=1","datePublished":"2026-02-12T17:00:53+00:00","dateModified":"2026-02-12T17:55:12+00:00","author":{"@id":"https:\/\/officechai.com\/#\/schema\/person\/5861f1134993293cc28905de7624d6b2"},"breadcrumb":{"@id":"https:\/\/officechai.com\/ai\/gemini-3-deep-think-benchmarks-arc-agi\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/officechai.com\/ai\/gemini-3-deep-think-benchmarks-arc-agi\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/officechai.com\/ai\/gemini-3-deep-think-benchmarks-arc-agi\/#primaryimage","url":"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/unnamed-6.jpg?fit=1000%2C562&ssl=1","contentUrl":"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/unnamed-6.jpg?fit=1000%2C562&ssl=1","width":1000,"height":562},{"@type":"BreadcrumbList","@id":"https:\/\/officechai.com\/ai\/gemini-3-deep-think-benchmarks-arc-agi\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/officechai.com\/"},{"@type":"ListItem","position":2,"name":"Google Releases Gemini 3 Deep Think, Tops ARC-AGI 2 Benchmark With 84.6%"}]},{"@type":"WebSite","@id":"https:\/\/officechai.com\/#website","url":"https:\/\/officechai.com\/","name":"OfficeChai","description":"Startups, Businesses And Careers","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/officechai.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/officechai.com\/#\/schema\/person\/5861f1134993293cc28905de7624d6b2","name":"OfficeChai Team","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/officechai.com\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/61d744733248dc647d505d0676bb425323413132ee5447e86aa8eecbbb7b27d5?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/61d744733248dc647d505d0676bb425323413132ee5447e86aa8eecbbb7b27d5?s=96&d=mm&r=g","caption":"OfficeChai Team"},"description":"Dotting the i's, crossing the t's.","url":"https:\/\/officechai.com\/author\/admin\/"}]}},"jetpack_featured_media_url":"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/unnamed-6.jpg?fit=1000%2C562&ssl=1","jetpack_shortlink":"https:\/\/wp.me\/p685C6-fjM","jetpack_likes_enabled":true,"jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/officechai.com\/wp-json\/wp\/v2\/posts\/58886","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/officechai.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/officechai.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/officechai.com\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/officechai.com\/wp-json\/wp\/v2\/comments?post=58886"}],"version-history":[{"count":3,"href":"https:\/\/officechai.com\/wp-json\/wp\/v2\/posts\/58886\/revisions"}],"predecessor-version":[{"id":58904,"href":"https:\/\/officechai.com\/wp-json\/wp\/v2\/posts\/58886\/revisions\/58904"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/officechai.com\/wp-json\/wp\/v2\/media\/58891"}],"wp:attachment":[{"href":"https:\/\/officechai.com\/wp-json\/wp\/v2\/media?parent=58886"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/officechai.com\/wp-json\/wp\/v2\/categories?post=58886"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/officechai.com\/wp-json\/wp\/v2\/tags?post=58886"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}