I dictated a concept to Gemini and asked it to scan a 200-page PDF on Incan fortifications at the same time.
It held both. Context intact, no complaint, no hallucination spiral. It produced a structured draft that had all the right information—zigzag walls as micro-segmentation, Chasqui runners as out-of-band management, Pizarro's Cajamarca move as the social engineering root capture. Accurate. Organized. Technically solid.
The voice was flat. Listicle formatting. Numbered sections. "For the love of the Sun God" as an attempt at humor in the conclusion.
I brought it to Claude. Claude stripped the numbers, opened in the stone, made Cajamarca the climax, and turned the whole thing into something I'd actually publish. You can read it here: The Sacsayhuamán Protocol.
That's not a complaint about Gemini. That's the workflow.
April 2026 Is Not a Single-Model World
Here's the reality nobody wants to say plainly because everyone has a preferred tool and a preferred allegiance.
No model does everything best. Not Claude. Not Gemini. Not GPT-4o. Not Grok. They are all shaped by the business structures and competitive pressures of the companies that built them—and those structures guarantee that each model will be optimized for different things, limited in different ways, and genuinely excellent in lanes that the others are merely competent in.
Anthropic built Claude around safety, coherence, and writing quality. The warmth is real. The literary instinct is real. The context management is real. The model has a personality that comes through in the prose, which is either a feature or a problem depending on what you need.
Google built Gemini around scale, retrieval, and multimodal context. It can hold a very long conversation without losing the thread. It can process a PDF the way a researcher processes a source—as raw material to be organized, not a prompt to respond to. It is less warm. More structured. Better at operating as a processing layer than as a creative partner.
OpenAI built GPT-4o to be versatile and deployable at enterprise scale, with tool integrations that make it the most plugged-in of the major models for complex agentic workflows.
These aren't personality quirks. They're design choices made by companies operating under different constraints, serving different markets, with different investors asking for different things. The competitive pressure will keep them differentiated. A world where all models do everything equally well is a world where none of them have a business model.
That means the gap between the models is structural, not temporary. And that means the skill worth developing is knowing which model to put in which seat.
The Recipe Book
Think of it less as a toolkit and more as a recipe book.
A recipe doesn't ask which ingredient is best. A recipe asks what the dish needs and puts the right thing in at the right moment. Flour does not do what an egg does. Both are essential to the same result. Arguing about which one is better misses the point by the full length of the kitchen.
In practice: