218
submitted 1 month ago by yogthos@lemmy.ml to c/memes@lemmy.ml
top 50 comments
sorted by: hot top controversial new old
[-] SplashJackson@lemmy.ca 17 points 1 month ago

What's a deepseek? Sounds like a search engine?

[-] Karcinogen@discuss.tchncs.de 27 points 1 month ago

Deepseek is a Chinese AI company that released Deepseek R1, a direct competitor to ChatGPT.

[-] yogthos@lemmy.ml 29 points 1 month ago

You forgot to mention that it's open source.

load more comments (22 replies)
[-] SplashJackson@lemmy.ca 7 points 1 month ago

Nice! What are they competing for? I'm new to this AI business thing.

[-] MajorSauce@sh.itjust.works 22 points 1 month ago

So far, they are training models extremely efficiently while having US gatekeeping their GPUs and doing everything they can to slow their progress. Any innovation in having efficient models to operate and train is great for accessibility of the technology and to reduce the environment impacts of this (so far) very wasteful tech.

[-] mukt@lemmy.ml 4 points 1 month ago

Market share, in a speculated market to be in the future.

[-] ICastFist@programming.dev 15 points 1 month ago

Come on, OP, Altman is still a billionaire. If he got out of the game right now, with OpenAi still unprofitable, he'd still have enough wealth for a dozen generations.

[-] yogthos@lemmy.ml 12 points 1 month ago

He's a billionaire based on the valuation of OpenAI, if the company fizzles so does his wealth.

[-] Grapho@lemmy.ml 12 points 1 month ago

πŸ™πŸΎπŸ™πŸΎπŸ™πŸΎ

[-] Sabre363@sh.itjust.works 10 points 1 month ago

We doing paid promotions or something on Lemmy now? You sure seem to be pushing this DeepSeek thing pretty hard, op.

[-] yogthos@lemmy.ml 17 points 1 month ago

That's right I'm a huge open source shill.

[-] Sabre363@sh.itjust.works 8 points 1 month ago

None of this has anything to do with the model being open source or not, plenty of other people have already disputed that claim.

[-] Grapho@lemmy.ml 13 points 1 month ago

It's a model that outperforms the other ones in a bunch of areas with a smaller footprint and which was trained for less than a twentieth of the price, and then it was released as open source.

If it were European or US made nobody would deem it suspicious if somebody talked about it all month, but it's a Chinese breakthrough and god forbid you talk about it for three days

[-] yogthos@lemmy.ml 11 points 1 month ago

It has everything to do with the tech being open. You can dispute it all you like, but the fact is that all the code and research behind it is open. Anybody could build a new model from scratch using open data if they wanted to. That's what matters.

[-] Sabre363@sh.itjust.works 5 points 1 month ago

I'm commenting on the odd nature of the post and your behavior in the comments, pointing out that it comes across as more a shallow advertisement than a sincere endorsement, that is all. I don't know enough about DeepSeek to discuss it meaningfully, nor do I have enough evidence to decide upon its open source status.

load more comments (8 replies)
[-] sem@lemmy.ml 9 points 1 month ago

Deepseek collects and process all the data you sent to their LLN even from API calls. It is a no-go for most of businesses applications. For example, OpenAI and Anyhropic do not collect or process anyhow data sent via API and there is an opy-ouy button in their settings that allows to avoid processing of the data sent via UI.

[-] fl42v@lemmy.ml 31 points 1 month ago

You can run 'em locally, tho, if their gh page is to be believed. And this way you can make sure nothing gets even sent to their servers, and not just believe nothing is processed.

[-] yogthos@lemmy.ml 17 points 1 month ago

DeepSeek is an open source project that anybody can run, and it's performant enough that even running the full model is cheap enough for any company to do.

[-] shawn1122@lemm.ee 3 points 1 month ago* (last edited 1 month ago)

Since it's open source is there a way for companies to adjust so it doesn't intentionally avoid saying anything bad about China?

[-] yogthos@lemmy.ml 8 points 1 month ago

Anybody can adjust the weights any way they want.

[-] Ajen@sh.itjust.works 3 points 1 month ago

That doesn't mean it's straightforward, or even possible, to entirely remove the censorship that's baked into the model.

[-] Grapho@lemmy.ml 6 points 1 month ago

People saying truisms that confirm their biases about shit they clearly know nothing about? I thought I'd left reddit.

[-] yogthos@lemmy.ml 4 points 1 month ago

It doesn't mean it's easy, but it is certainly possible if somebody was dedicated enough. At the end of the day you could even use the open source code DeepSeek published and your own training data to train a whole new model with whatever biases you like.

load more comments (8 replies)
load more comments (1 replies)
load more comments (2 replies)
[-] jol@discuss.tchncs.de 9 points 1 month ago

Where do I find this opy ouy button? Sounds tasty

[-] haerrii@feddit.org 7 points 1 month ago

why are you so heavily and openly advertising Deepseek?

[-] yogthos@lemmy.ml 30 points 1 month ago

Because it's an open source project that's destroying the whole closed source subscription AI model.

[-] TheOctonaut@mander.xyz 11 points 1 month ago

I don't think you or that Medium writer understand what "open source" means. Being able to run a local stripped down version for free puts it on par with Llama, a Meta product. Privacy-first indeed. Unless you can train your own from scratch, it's not open source.

Here's the OSI's helpful definition for your reference https://opensource.org/ai/open-source-ai-definition

[-] yogthos@lemmy.ml 11 points 1 month ago

You can run the full version if you have the hardware, the weights are published, and importantly the research behind it is published as well. Go troll somewhere else.

[-] TheOctonaut@mander.xyz 6 points 1 month ago

All that is true of Meta's products too. It doesn't make them open source.

Do you disagree with the OSI?

[-] yogthos@lemmy.ml 10 points 1 month ago

What part of OSI are you claiming DeepSeek doesn't satisfy specifically?

[-] TheOctonaut@mander.xyz 9 points 1 month ago* (last edited 1 month ago)

The data part. ie the very first part of the OSI's definition.

It's not available from their articles https://arxiv.org/html/2501.12948v1 https://arxiv.org/html/2401.02954v1

Nor on their github https://github.com/deepseek-ai/DeepSeek-LLM

Note that the OSI only ask for transparency of what the dataset was - a name and the fee paid will do - not that full access to it to be free and Free.

It's worth mentioning too that they've used the MIT license for the "code" included with the model (a few YAML files to feed it to software) but they have created their own unrecognised non-free license for the model itself. Why they having this misleading label on their github page would only be speculation.

Without making the dataset available then nobody can accurately recreate, modify or learn from the model they've released. This is the only sane definition of open source available for an LLM model since it is not in itself code with a "source".

[-] yogthos@lemmy.ml 5 points 1 month ago

Uh yeah, that's because people publish data to huggingface. GitHub isn't made for huge data files in case you weren't aware. You can scroll down to datasets here https://huggingface.co/deepseek-ai

[-] TheOctonaut@mander.xyz 9 points 1 month ago

That's the "prover" dataset, ie the evaluation dataset mentioned in the articles I linked you to. It's for checking the output, it is not the training output.

It's also 20mb, which is miniscule not just for a training dataset but even as what you seem to think is a "huge data file" in general.

You really need to stop digging and admit this is one more thing you have surface-level understanding of.

[-] yogthos@lemmy.ml 4 points 1 month ago

Do show me a published data set of the kind you're demanding.

[-] TheOctonaut@mander.xyz 11 points 1 month ago* (last edited 1 month ago)

Since you're definitely asking this in good faith and not just downvoting and making nonsense sealion requests in an attempt to make me shut up, sure! Here's three.

https://commoncrawl.org/

https://github.com/togethercomputer/RedPajama-Data

https://huggingface.co/datasets/legacy-datasets/wikipedia/tree/main/

Oh, and it's not me demanding. It's the OSI defining what an open source AI model is. I'm sure once you've asked all your questions you'll circle back around to whether you disagree with their definition or not.

load more comments (13 replies)
[-] Grapho@lemmy.ml 7 points 1 month ago

What makes it open source is that the source code is open.

My grandma is as old as my great aunts, that doesn't transitively make her my great aunt.

load more comments (2 replies)
[-] haerrii@feddit.org 5 points 1 month ago

Thanks for clarification!

load more comments (2 replies)
[-] cmgvd3lw@discuss.tchncs.de 8 points 1 month ago

I think deepseek opens up new efficient ways for LLM training which in turn increases competition.

[-] uberstar@lemmy.ml 7 points 1 month ago

I tried DeepSeek, and immediately fell in love.. My only nitpick is that images have to have text on them, otherwise it complains, but for the price of free, I'm basically just asking for too much. Contemporaries be damned.

[-] geneva_convenience@lemmy.ml 3 points 1 month ago

The bubble must be repaired. Pump more cash in!

load more comments
view more: next β€Ί
this post was submitted on 26 Jan 2025
218 points (100.0% liked)

Memes

47459 readers
956 users here now

Rules:

  1. Be civil and nice.
  2. Try not to excessively repost, as a rule of thumb, wait at least 2 months to do it if you have to.

founded 5 years ago
MODERATORS