☆ Yσɠƚԋσʂ ☆@lemmygrad.ml to Technology@lemmygrad.ml · English · edited 24 days ago
QwQ-32B is a 32-billion-parameter language model that achieves performance comparable to the 671-billion-parameter DeepSeek-R1, using reinforcement learning for scaling
qwenlm.github.io
cross-posted to: technology@lemmy.ml
marl_karx@lemmygrad.ml · 2 days ago
Isn't DeepSeek based on Qwen? At least the distilled models?
☆ Yσɠƚԋσʂ ☆@lemmygrad.ml (OP) · 2 days ago
I think so, but this looks like an update of Qwen with some new tricks.