Machines Do It Better
  • Home
  • Investigation
  • Comics
  • Reviews
  • About
Sign in Subscribe
Comics

Intelligent...ish #3

John

16 Jan 2026
Intelligent...ish #3

Read more

Same Questions. New Models. Mixed Results.

Same Questions. New Models. Mixed Results.

Large language models are often judged on complex benchmarks, but some of their most interesting failures show up on questions that seem trivial at first glance. In this article, we test a range of OpenAI models using a small set of deliberately easy questions, the kind that have a track

By John 18 Jan 2026
Claude Opus 4.5 vs Shaders

Claude Opus 4.5 vs Shaders

Claude Opus 4.5 has been drawing attention for its coding skills, so I decided to put it to the test on a problem I’ve struggled with before: getting an AI to write shaders that actually work. I was curious to see whether it could handle the challenge better

By John 15 Jan 2026
What Samsung’s TRM does and why it matters

What Samsung’s TRM does and why it matters

In a recent development out of Samsung’s AI lab in Montreal, researchers have introduced a new “Tiny Recursive Model” (TRM) that challenges the prevailing notion that more parameters = more intelligence.) Key ideas & claims * Tiny footprint: TRM has only ~7 million parameters—orders of magnitude smaller than typical large

By John 10 Jan 2026
Intelligent...ish #2

Intelligent...ish #2

By John 02 Jan 2026
Machines Do It Better
  • Sign up
Powered by Ghost

Machines Do It Better

For everything AI - news, reviews, comics, and more.