@lily33

lily33@lemm.ee · edit-2 1 day ago

What makes these consumer-oriented models different is that that rather than being trained on raw data, they are trained on synthetic data from pre-existing models. That’s what the “Qwen” or “Llama” parts mean in the name. The 7B model is trained on synthetic data produced by Qwen, so it is effectively a compressed version of Qen. However, neither Qwen nor Llama can “reason,” they do not have an internal monologue.

You got that backwards. They’re other models - qwen or llama - fine-tuned on synthetic data generated by Deepseek-R1. Specifically, reasoning data, so that they can learn some of its reasoning ability.

But the base model - and so the base capability there - is that of the corresponding qwen or llama model. Calling them “Deepseek-R1-something” doesn’t change what they fundamentally are, it’s just marketing.

lily33@lemm.ee · 2 days ago

There are already other providers like Deepinfra offering DeepSeek. So while the the average person (like me) couldn’t run it themselves, they do have alternative options.

lily33@lemm.ee · 2 days ago

A server grade CPU with a lot of RAM and memory bandwidth would work reasonable well, and cost “only” ~$10k rather than 100k+…

lily33@lemm.ee · 2 days ago

To be fair, most people can’t actually self-host Deepseek, but there already are other providers offering API access to it.

lily33@lemm.ee · edit-2 1 month ago

It’s almost sure to be the case, but nobody has managed to prove it yet.

Simply being infinite and non-repeating doesn’t guarantee that all finite sequences will appear. For example, you could have an infinite non-repeating number that doesn’t have any 9s in it. But, as far as numbers go, exceptions like that are very rare, and in almost all (infinite, non-repeating) numbers you’ll have all finite sequences appearing.

lily33@lemm.ee · 2 months ago

Well, he didn’t even buy the original (I guess it has spoiled by then), but a DIY replica and a certificate.

lily33@lemm.ee · edit-2 3 months ago

I guess technically that makes them “not in Ukraine”, but it is the same war in the end. At least for me that’s the important part, not where exactly on the front line they are.

lily33@lemm.ee · 3 months ago

Well, NK and Russia have a defense treaty which obliges NK to sent military assistance to Kursk. So if they aren’t, they’re breaking their obligations.

lily33@lemm.ee · 4 months ago

My bet is, it’ll be Saturday that goes, finally achieving a 6-day work week.

lily33@lemm.ee · 5 months ago

Technically, “enforced pay it forward” is called credit. Your debt would then be “the amount you still have to pay forward”.

Of course, this defeats both the spirit and the purpose of a pay it forward scheme.

lily33@lemm.ee · 7 months ago

The biggest issue is that there isn’t a universal agreement on what causes harm. There is agreement on the basics - murder, violence, etc - but they’re already illegal anyways, no need to ban them by license.

lily33@lemm.ee · 8 months ago

upcoming EU AI Act that regulates open source systems differently, creating an urgent need for practical openness assessment

So when they say “openness” they do put it in the context of open source rather accessibility.

lily33@lemm.ee · 8 months ago

Because FOSS shouldn’t add burdens. You publish your work and let everyone else use it. That shouldn’t add extra obligations on you. Usually, you’d also write some docs - after all, without them nobody will know how to use your program, so why bother publishing - but it shouldn’t be an obligation. Make it easy for people to open up their code without this attaching strings.

Documentation is nice, but it’s kind of different thing that open source: a program can be open and undocumented, or closed but well documented - and I don’t see why we’d want it different for models.

lily33@lemm.ee · edit-2 8 months ago

A bunch of these columns are outright absurd TBH, to the extend I’m not sure the author really knows what FOSS is about. What’s open API access even supposed to be - API access is closed by definition.

Also there has never been a requirement that open source software needs to be documented - and for good reason - so I’m not a fan of the documentation column as well.

lily33@lemm.ee · 9 months ago

How do you declaratively apply the configuration? Is that a feature of Kvaesitso?

lily33@lemm.ee · edit-2 1 year ago

Because it’s not a very easy case. In fact, there is no real case.

It’s not just a stretch, but a huge leap, to claim that using “he” or “she” counts as “instruction […] on sexual orientation or gender identity”.
And even if you did manage that, you also have to argue that it’s also “not age appropriate”.
And if you managed that as well somehow, you have the problem that judges can take into account things like the intent of the lawmakers, and what’s reasonable, not just the raw text of the law.

lily33@lemm.ee · 1 year ago

Because judges are people, not robots mindlessly applying legislation. To succeed in such case you need the judges on the trial and all appeals to all decide to maliciously comply with the law.

lily33@lemm.ee · 1 year ago

You obviously consented that your data can be shared with all Lemmy/Fediverse instances federated with yours, and they can distribute it to Lemmy/Fediverse users - because that’s the basic premise of Lemmy.

Now, I can host a Lemmy instances of my own and get all your posts that way. No need to bother buying them.

lily33@lemm.ee · 1 year ago

It is natural. Any particular individual’s actions are not natural - but the fact that, amongst a large, diverse group of people, there will be someone who would try to establish themselves or their group as rulers - is just a statistical property. So any anarchic system needs a mechanism to counter that.

lily33@lemm.ee · 1 year ago

Linux can totally do that. Even if your distro doesn’t package it, you can always install spyware from source.