November AI news

Length:

6 min

Published:

December 3, 2025

November belonged to frontier models and agentic tools for developers. It is no longer about higher benchmark scores. It is about how well a model handles real work in a repository, a long conversation, and an automated workflow.

Claude Opus 4.5

Claude Opus 4.5 is so far the most capable model from Anthropic. It improves clearly at coding, working with autonomous agents, data analysis, spreadsheets, and preparing presentations. It handles multi-step workflows, long context, and complex tasks reliably, while spending fewer tokens, so it runs more efficiently. That makes it a good fit for enterprise deployment, process automation, and large agentic scenarios.

Useful resources: Claude Opus 4.5

Gemini 3 Pro

Gemini 3 Pro is the first model we can say raises the bar across almost every benchmark. In the Artificial Analysis Index it stays above GPT-5.1, by roughly three points according to some sources. On ARC-AGI 2 it doubled the previous state-of-the-art result. Some sources describe it as a genuinely huge model, around 2–3x larger than other proprietary models. It shows that scaling still works, it is just getting harder to reach.

Useful resources: Gemini 3 Pro

OpenAI GPT-5.1

OpenAI released a new model, GPT-5.1. The main upgrade is in speed and runtime efficiency, while intelligence itself changed only a little. The model has two modes: Instant for quick answers and Thinking for harder tasks where longer reasoning pays off. It comes in several sizes, from Mini to Pro. It also adds a much larger context window, so you can comfortably work with bigger codebases or documentation in a single pass.

Useful resources: OpenAI: GPT-5.1 overview and Instant / Thinking modes

OpenAI GPT-5.1 Codex Max

GPT-5.1-Codex-Max is a new frontier model from OpenAI aimed purely at programming and agentic work. It combines chain-of-thought, generating intermediate reasoning steps, with a context-compaction technique. That lets it carry long, project-scale tasks such as refactors, large debugging sessions, or generating complex systems, without overloading the context window.

Useful resources: OpenAI 5.1 Codex Max

Moonshot Kimi K2

Kimi K2 is an open-source model with one trillion parameters, of which about 32 billion activate during inference. It suits teams that want control over their data and at the same time need top performance in agentic and automation tasks. Because of its size, though, it needs robust infrastructure. Running it usually means several GPUs, for example high-end cards or specialized clusters, since full operation calls for a lot of memory, VRAM, and compute.

Useful resources: Moonshot Kimi K2

Grok 4.1

Grok 4.1 works well with emotion and interpersonal context. The Fast variant handles up to 2 million tokens, so it can carry a large codebase or a long conversation. Thanks to the Agent Tools API it fits production agents and demanding tool-calling. It did, however, show a problem with overpraising Musk and a strong slant, which points to possible bias. On sensitive topics such as history, politics, or verified facts, its answers may therefore not be neutral or reliable enough.

Useful resources: Grok 4.1, TechCrunch

Google Antigravity

Antigravity is Google's new agent-first IDE built around Gemini 3 Pro. In practice it is a development environment where agents get direct access to the editor, terminal, and browser, so they can write, run, and verify code on their own. Right after launch, though, some users reported the model being unavailable under heavy load. Serious security vulnerabilities also surfaced: with default settings, agents can read sensitive files and run arbitrary commands.

Useful resources: Google Antigravity, Techradar

November shows nicely that the goal is not the biggest, smartest model, but the one that fits the team's specific work and stack. Specialized code models, fast small models, and strong open-source alternatives give you far more room to tune performance, cost, and control over your data. And it holds more and more true that real value appears once you wire AI well into the IDE, chat, and internal tools, where it genuinely speeds up development.

If you want more AI news and trends:

October AI news - a new month is here, and with it the October AI news.
September AI news - as always, a selection of the most important things from the world of artificial intelligence.
AI: Assistant or threat to juniors? - AI in development through a junior's eyes.
How to get started with GitHub Copilot - GitHub Copilot step by step.

Back to insights

Want to stay one step ahead?

Don't miss our best insights. No spam, just practical analyses, invitations to exclusive events, and podcast summaries delivered straight to your inbox.