DeepScaleR-1.5B-PreviewDeepscaleR-1.5b 是在 DeepSeekR1-distilled-Qwen1.5b 上仅使用 3800 A100h(~$4500) 进行 RL 微调的 LLM该模型在 AIME 2024 上获得了 43.1％@1 的准确性，较基底（28.8％）提高 14％，在 1.5B 参数下超过了 o1-preview（Arena Math 中 R1>Gemini 2 Thinking>o1p>Gemini 2 Pro）Open sourced dataset

Garyの梦呓

DeepScaleR-1.5B-Preview

DeepscaleR-1.5b 是在 DeepSeekR1-distilled-Qwen1.5b 上仅使用 3800 A100h(~$4500) 进行 RL 微调的 LLM

该模型在 AIME 2024 上获得了 43.1％@1 的准确性，较基底（28.8％）提高 14％，在 1.5B 参数下超过了 o1-preview
（Arena Math 中 R1>Gemini 2 Thinking>o1p>Gemini 2 Pro）

Open sourced dataset, code, training logs and models
Github: Github.com/agentica-project/deepscaler
Inference GGUF
#AI

www.tg-me.com/us/telegram/com.GaryNoHeya/2920

2.5K viewsFeb 17 at 15:23

tg-me.com/GaryNoHeya/2920

Create: 2025-02-17
Last Update: 2025-06-29 12:04:40

BY Garyの梦呓

Share with your friend now:
tg-me.com/GaryNoHeya/2920

telegram Telegram | DID YOU KNOW?

Date: 2025-06-29| telegram

Telegram is riding high, adding tens of million of users this year. Now the bill is coming due.Telegram is one of the few significant social-media challengers to Facebook Inc., FB -1.90% on a trajectory toward one billion users active each month by the end of 2022, up from roughly 550 million today.

Find Channels On Telegram?

Telegram is an aspiring new messaging app that’s taking the world by storm. The app is free, fast, and claims to be one of the safest messengers around. It allows people to connect easily, without any boundaries.You can use channels on Telegram, which are similar to Facebook pages. If you’re wondering how to find channels on Telegram, you’re in the right place. Keep reading and you’ll find out how. Also, you’ll learn more about channels, creating channels yourself, and the difference between private and public Telegram channels.