Hellfire103 to Not The [email protected]English • 2 days agoOpenAI Furious DeepSeek Might Have Stolen All the Data OpenAI Stole From Uswww.404media.comessage-square121fedilinkarrow-up11.13K
arrow-up11.13Kexternal-linkOpenAI Furious DeepSeek Might Have Stolen All the Data OpenAI Stole From Uswww.404media.coHellfire103 to Not The [email protected]English • 2 days agomessage-square121fedilink
minus-squareAvid AmoebalinkfedilinkEnglish23•1 day agoIs there evidence that DeepSeek is an OpenAI distillate other than OpenAI and Co’s protestations?
minus-square@[email protected]linkfedilinkEnglish36•edit-21 day agoIt’s literally impossible. I tried to explain it here: https://lemmy.world/comment/14763233 But the short version is OpenAI doesn’t even offer access to the data you need for a “distillation,” as the term is used in the LLM community. Of course there’s some OpenAI data in the base model, but that’s partially because it’s splattered all over the internet now.
minus-square@[email protected]linkfedilinkEnglish6•1 day agoNot distillate, they just trained on the outputs of openai
Is there evidence that DeepSeek is an OpenAI distillate other than OpenAI and Co’s protestations?
It’s literally impossible. I tried to explain it here: https://lemmy.world/comment/14763233
But the short version is OpenAI doesn’t even offer access to the data you need for a “distillation,” as the term is used in the LLM community.
Of course there’s some OpenAI data in the base model, but that’s partially because it’s splattered all over the internet now.
Thank you 🙏
Not distillate, they just trained on the outputs of openai