Skip to content Skip to sidebar Skip to footer

Microsoft introduces rStar-Math, an SLM for math reasoning and problem solving

A team of math and AI researchers at Microsoft Asia has designed and developed a small language model (SLM) that can be used to solve math problems. The group has posted a paper on the arXiv preprint server outlining the technology and math behind the new tool and how well it has performed on standard benchmarks.

Over the past several years, multiple tech giants have been working hard to steadily improve their LLMs, resulting in AI products that have in a very short time become mainstream. Unfortunately, such tools require massive amounts of computer power, which means they consume a lot of electricity, making them expensive to maintain.

Because of that, some in the field have been turning to SLMs, which as their name implies, are smaller and thus far less resource intensive. Some are small enough to run on a local device. One of the main ways AI researchers make the best use of SLMs is by narrowing their focus—instead of trying to answer any question about anything, they are designed to answer questions about something much more specific—like math. In this new effort, Microsoft has focused its efforts on not just solving math problems, but also in teaching an SLM how to reason its way through a problem.

In developing its model, Microsoft made it in a way that allows for its use by other, larger models. An overall strategy that could be the wave of the future. New LLMs could soon be nothing more than an amalgam of many SLMs. Notably, the announcement by Microsoft came not long after the debut of its Phi-4 SLM, which also serves to solve math problems.

rStar-Math does its work differently than Phi-4, the researchers note, by making use of Monte Carlo Tree Search—a reasoning method developed to mimic the way humans attack problems in a step-by-by process. They note that by using such an approach, their new SLM can break down a problem into its smaller parts as a way to figure out how to solve a particular problem. They also note that rStar-Math shows its work by outputting its thought process in both Python code and natural language.

The team also noted that rStar-Math has already scored well on several benchmarks. And according to a post on Hugging Face, the team plans to make the code and data publicly available on GitHub.

We provide only
quality games content

Newsletter Signup
Menu
Say Hello

© Red Studio Limited.
All third party content, brands, names, and logos are used under license and remain property of their respective owners. All rights reserved.