• 0 Posts
  • 47 Comments
Joined 2 years ago
cake
Cake day: August 2nd, 2023

help-circle
  • lily33@lemm.eetoOpen Source@lemmy.mlProton's biased article on Deepseek
    link
    fedilink
    arrow-up
    4
    arrow-down
    1
    ·
    edit-2
    1 day ago

    What makes these consumer-oriented models different is that that rather than being trained on raw data, they are trained on synthetic data from pre-existing models. That’s what the “Qwen” or “Llama” parts mean in the name. The 7B model is trained on synthetic data produced by Qwen, so it is effectively a compressed version of Qen. However, neither Qwen nor Llama can “reason,” they do not have an internal monologue.

    You got that backwards. They’re other models - qwen or llama - fine-tuned on synthetic data generated by Deepseek-R1. Specifically, reasoning data, so that they can learn some of its reasoning ability.

    But the base model - and so the base capability there - is that of the corresponding qwen or llama model. Calling them “Deepseek-R1-something” doesn’t change what they fundamentally are, it’s just marketing.













  • Because FOSS shouldn’t add burdens. You publish your work and let everyone else use it. That shouldn’t add extra obligations on you. Usually, you’d also write some docs - after all, without them nobody will know how to use your program, so why bother publishing - but it shouldn’t be an obligation. Make it easy for people to open up their code without this attaching strings.

    Documentation is nice, but it’s kind of different thing that open source: a program can be open and undocumented, or closed but well documented - and I don’t see why we’d want it different for models.




  • Because it’s not a very easy case. In fact, there is no real case.

    1. It’s not just a stretch, but a huge leap, to claim that using “he” or “she” counts as “instruction […] on sexual orientation or gender identity”.
    2. And even if you did manage that, you also have to argue that it’s also “not age appropriate”.
    3. And if you managed that as well somehow, you have the problem that judges can take into account things like the intent of the lawmakers, and what’s reasonable, not just the raw text of the law.