Grok 3：Elon Musk的Xai AI模型优于GPT-4O和双子座

Grok 3的性能与GPT-4O和双子座相比如何

由Elon Musk的Xai推出的Grok 3旨在与OpenAI的GPT-4O和Google的Gemini [3] [4]等其他AI模型竞争。 Xai声称Grok 3是“地球上最聪明的AI” [1]。

Grok 3对GPT-4O：
*基准：与GPT-4O相比，Grok 3在几个基准上表现出了出色的性能[1] [4]。其中包括数学(Aime 24)，科学(GPQA)和编码(LCB OCT-FEB)[1]。 Grok 3在数学上得分52，科学的得分为75，在这些领域的编码中得分超过GPT-4O [1] [4]。
*语言理解：Grok 3在语言理解测试方面达到了94.2％的精度，略超过Chatgpt的92.8％[2]。
*编码：据报道，Grok 3的代码生成速度比Chatgpt快1.2倍，平均响应时间为0.8秒[2]。
*竞技场得分：Grok 3的早期版本，称为“巧克力”，是第一个在LMSYS Chatbot Arena中超过1400分的AI，表现优于GPT-4O [4]。
*推理和实时数据：Grok 3显示了数学推理，编码任务，实时数据分析和时事讨论的强度[2]。
*培训：Grok 3是使用X(以前为Twitter)的实时数据培训的，为其提供了最新信息[2]。它接受了Xai的巨人超级集团的培训，配备了100,000 GPU [2]。

Grok 3对双子座：

*基准：Grok 3优于Google DeepMind的Gemini-2 Pro在各种基准测试方面[1]。
*聊天机器人竞技场(LMSYS)：Grok 3的早期版本优于诸如Gemini-2.0 Flash在聊天机器人体育馆上思考[1]。
*数学(Aimeâ24)：在数学(Aimeâ24)基准上，Grok 3得分为52，而Gemini-2 Pro得分为39 [4]。
*科学(GPQA)：在科学(GPQA)中，Grok 3得分75，表现优于Gemini-2 Pro，得分为65 [4]。

引用：
[1] https://www.outlookbusiness.com/start-up/news/elon-musk-unveils-grok-3-how-it-performs-performs-against-openais-popenais-gpt-4o-deepseek
[2] https://9meters.com/technology/ai/grok-3-vs-chatgpt-a-head-to-head-comparparison
[3] https://opentools.ai/news/elon-musks-xai-unveils-grok-3-a-a- game-changer-in-ai--ai-performance and-performance and-capabilitys and-performance and-capabilities
[4] https://felloai.com/2025/02/xais-grok-3-is-here-and-might-be-be-be-the-smartest-ai-on-on-earth/
[5] https://www.youtube.com/watch?v=wxqhhcgnbzs
[6] https://www.gurufocus.com/news/2701835/musks-xai-unveils-grok-3-says-it-beats-it-beats-openais-popenais-gpt4o-key-neke-benchmarks?r=caf6fe6fe6fe6fe6fe6fee0e0e0e0e0e0e0e0db70d93603333333da33da5a5546141414141414141414141414141414141414141414141461414614146146.614614614614614614611.61461461414146141461141461年
[7] https://www.reddit.com/r/singularity/comments/1h8ox94/how_does_gemini_grok_or_or_or_llama_llama_llama_compare_to_gpt_to_gpt_or/
[8] https://www.digitaltrends.com/computing/xai-grok-3-ai-model-think-think-deep-search-gemini-chatgpt-competition/
[9] https://blog.getbind.co/2025/02/18/grok-3-chatbot-vs-vs-chatgpt-is-is-rok-better-better-than-than-chatgpt/