☆ Yσɠƚԋσʂ ☆@lemmygrad.ml to

Technology@lemmygrad.ml • 5 months ago

By using the same techniques Google used to solve Go (MCTS and backprop), Llama 8B gets 96.7% on the math benchmark GSM8K. That’s better than GPT-4, Claude and Gemini, with 200x fewer parameters!

cross-posted to:
machinelearning@lemmy.ml



