UseJournal

One of the best language models for mathematics

One of the best language models for mathematics

This is a large language model for mathematics. On the MATH benchmark Llemma outperforms all known open base models, as well as the unreleased Minerva model suite on an equi-parameter basis. Moreover, Llemma is capable of tool use and formal theorem proving without any further finetuning.

The resulting models show improved mathematical capabilities, and can be adapted to various tasks through prompting or additional fine-tuning.

These models are particularly strong at chain-of-thought mathematical reasoning and using computational tools for mathematics, such as Python and formal theorem provers.

Link:

Read the full story

Sign up now to read the full story and get access to all posts for subscribers only.

Subscribe
Already have an account? Sign in

UseJournal

We're back and better than ever!

UseJournal

Great! You’ve successfully signed up.

Welcome back! You've successfully signed in.

You've successfully subscribed to UseJournal.

Success! Check your email for magic link to sign-in.

Success! Your billing info has been updated.

Your billing was not updated.