From GPT-4 to GPT-5 in Medical Language Understanding

Thu Aug 21 2025•GPT-5 Healthcare

If you’ve been following the progress of large language models in healthcare, you know things move fast. There’s always a new benchmark, a new claim, a new set of numbers. This month OpenAI released their latest model: GPT-5. Unlike the GPT-4 era models, GPT-5 exposes a single interface to its users withouth distinction between fast models and reasoning models. The model itself decides when to think more carefully.

I recently worked in evaluating the perfoamnce of GPT-5 in healtcare. More specifically, I performed an apples-to-apples comparisson between GPT-4 era models and GPT-5 in the Stanford MedHelm benchmark. The Resulsts are enlighting.

Here's the link to the study.

Studies like this are important for society. It let's us know the objective progress in the AI field beyond a public image.