Jesus to Political [email protected] • 3 days ago
What could possibly go wrong
@[email protected] • 3 days ago
Seems like the model you mentioned is more like a fine-tuned Llama? Specifically, these are fine-tuned versions of Qwen and Llama, on a dataset of 800k samples generated by DeepSeek R1. https://github.com/Emericen/deepseek-r1-distilled
@[email protected] • 3 days ago (edited)
Yeah, it's distilled from DeepSeek and abliterated. The non-abliterated ones give you the same responses as DeepSeek R1.
deleted by creator