I wouldn't rule it out (and it's not a binary question), but I am not confident that DeepSeek v3’s performance primarily, or largely, stems from distillation.
Is it highly likely DeepSeek was distilled…