Content
Gemini 2.5 Pro vs. Claude 3.7 Sonnet (thinking) vs. Grok 3 (think)
In this blog post, I will compare two of the best models, Gemini 2.5 Pro and Claude 3.7 Sonnet, on coding tasks.
DeepSeek has quietly released a bombshell update to the DeepSeek V3 base model. Surprisingly, it went under the radar amid the ChatGPT image generation launch. It has improved over its predecessor in reasoning and coding. The current coding champion (in raw output) is Claude 3.7 Sonnet. I was
DeepSeek V3 0324, a new checkpoint, has been released by DeepSeek in silence, with no marketing or hype, just a tweet, a model card and a 641GB MIT-licensed open-weight base model, which is very DeepSeek-like. This, paired with Gemini 2.5 and ChatGPT image generation, caused the entire launch to go under
If you’re closely following the AI scene, you know xAI and OpenAI are currently each other’s arch-nemeses. And there’s no need to guess why: it’s the infamous feud between Musk and Altman. This led Elon Musk to build the largest GPU cluster in the world,
What is MCP? Loosely defined, MCP is an open protocol that standardises the integration of AI (e.g., large language models) with external data sources and services. It is basically a bridge that lets AI step outside its training data and interact with real-world data in real time. What exactly
Big AI models are powerful but expensive. Smaller Chain-of-Thought (CoT) models like Gemini 2.0 Flash Thinking, OpenAI’s o3-mini, and DeepSeek R1 offer a cheaper way to handle reasoning tasks. The real question is whether they are just as good. Each model has a different pricing style. Gemini 2.
A comprehensive analysis of o3-mini-high vs. Claude 3.7 Sonnet Thinking vs. Grok 3 Think vs. DeepSeek R1 on multiple reasoning, math, coding, and writing questions. Which one gives the most bang for your buck in 2025? Motivation It’s been a fascinating few months in the AI landscape with the
After a long wait, OpenAI has finally unveiled GPT-4.5, its biggest-ever base model. The initial vibe checks from taste testers have been outstanding. The model has a very different vibe from the previous corporate-drone-sounding models. It is more expressive, feels natural, and generates excellent greentext. Summarizing Karpathy's vibe
So, Anthropic has finally broken the silence and released Claude 3.7 Sonnet, a hybrid model that can think step-by-step like a thinking model for complex reasoning tasks and answer instantly like a base model. On the ARC-AGI benchmarks, Claude 3.7 Sonnet with thinking has scored on par with
Just a week after Grok’s release, we now have Claude 3.7 Sonnet, which has certainly eaten into Grok’s hype pie. Grok was definitely one of the best models for coding, and now, with the new Claude, the equation might change. Anthropic has been clear about where
After much anticipation, xAI has finally released the third iteration of Grok. It is apparently the smartest LLM in the world, scoring above 1400 in the Chatbot Arena, the first model to do so. But is it the new SOTA? Apparently, yes. But how good is it compared to the
AI and crypto are two revolutionary technologies of the 21st century. They are redefining how we process information and transact value, and both possess immense potential for value creation. Crypto-Kit is a step in that direction. We’re making it easy for agents to interact with Web3 platforms to build