You must log in or register to comment.
deleted by creator
insane, absolutely insane
Why insane? For quality, speed, size? I find the coder 1.5b and 3b light and good
It matches R1 in the given benchmarks. R1 has 671B params (36 activated) while this only has 32
GGUF quants are already out: https://huggingface.co/bartowski/Qwen_QwQ-32B-GGUF
Yay! let’s try
ollama run hf.co/bartowski/Qwen_QwQ-32B-GGUF:Q4_K_M
/set parameter num_ctx 32768