Source: FT 06.30.25
I believe they're also doing surgeries now.
Microsoft has built an artificial intelligence-powered medical tool it claims is four times more successful than human doctors at diagnosing complex ailments, as the tech giant unveils research it believes could speed up treatment.
Microsoft’s new system is underpinned by a so-called “orchestrator” that creates virtual panels of five AI agents acting as “doctors” — each with a distinct role, such as coming up with hypotheses or choosing diagnostic tests — which interact and “debate” together to choose a course of action.To test its capabilities, “MAI-DxO” was fed 304 studies from the New England Journal of Medicine (NEJM) that describe how some of the most complicated cases were solved by doctors.
Microsoft used leading large language models from OpenAI, Meta, Anthropic, Google, xAI and DeepSeek. The orchestrator made all LLMs perform better, but worked best with OpenAI’s o3 reasoning model to correctly solve 85.5 per cent of the NEJM cases.That compared with about 20 per cent by experienced human doctors, but those physicians were not allowed access to textbooks or to ask colleagues in the trial, which could have increased their success rate.
Suleyman said Microsoft is nearing “AI models that are not just a little bit better, but dramatically better, than human performance: faster, cheaper and four times more accurate”.“That is going to be truly transformative,” he added.
The AI models were also prompted to be cost-conscious, which significantly cut the number of tests required to get to a correct diagnosis in the trial, saving hundreds of thousands of dollars in some cases, he said. However, King stressed that the technology was still in its early stages, had not been peer reviewed and was not yet ready for a clinical environment. “This is a landmark study,” said Eric Topol, a cardiologist and founder and director of the Scripps Research Translational Institute. “While this work was not done in the setting of real world medical practice, it is the first to provide evidence for the efficiency potential of generative AI in medicine — accuracy and cost savings.”
I believe they're also doing surgeries now.
Microsoft claims AI diagnostic tool can outperform doctors
Research is first initiative from Big Tech group’s AI health unit formed by ex-DeepMind co-founder Mustafa SuleymanMicrosoft has built an artificial intelligence-powered medical tool it claims is four times more successful than human doctors at diagnosing complex ailments, as the tech giant unveils research it believes could speed up treatment.
Microsoft’s new system is underpinned by a so-called “orchestrator” that creates virtual panels of five AI agents acting as “doctors” — each with a distinct role, such as coming up with hypotheses or choosing diagnostic tests — which interact and “debate” together to choose a course of action.To test its capabilities, “MAI-DxO” was fed 304 studies from the New England Journal of Medicine (NEJM) that describe how some of the most complicated cases were solved by doctors.
Microsoft used leading large language models from OpenAI, Meta, Anthropic, Google, xAI and DeepSeek. The orchestrator made all LLMs perform better, but worked best with OpenAI’s o3 reasoning model to correctly solve 85.5 per cent of the NEJM cases.That compared with about 20 per cent by experienced human doctors, but those physicians were not allowed access to textbooks or to ask colleagues in the trial, which could have increased their success rate.
Suleyman said Microsoft is nearing “AI models that are not just a little bit better, but dramatically better, than human performance: faster, cheaper and four times more accurate”.“That is going to be truly transformative,” he added.
The AI models were also prompted to be cost-conscious, which significantly cut the number of tests required to get to a correct diagnosis in the trial, saving hundreds of thousands of dollars in some cases, he said. However, King stressed that the technology was still in its early stages, had not been peer reviewed and was not yet ready for a clinical environment. “This is a landmark study,” said Eric Topol, a cardiologist and founder and director of the Scripps Research Translational Institute. “While this work was not done in the setting of real world medical practice, it is the first to provide evidence for the efficiency potential of generative AI in medicine — accuracy and cost savings.”