DeepSeek collects keystroke data and more, storing it in Chinese servers

restingboredface@sh.itjust.works · 1 year ago

Snot Flickerman@lemmy.blahaj.zone · edit-2 1 year ago

DeepSeek does the same things that OpenAI does, but it’s a foreign actor so OOooooOOWwwwooOOOO sCaRrRey!

TheFeatureCreature@lemmy.world · 1 year ago

Wait until they hear what data Instagram/Meta collects during use!

But they’re a US company so it’s ok.

Bleys@lemmy.world · 1 year ago

Realistically what is the worst thing China is doing with your private data? Selling it? If you’re not a Chinese National, at least you don’t fall under their jurisdiction.

If you’re a U.S. citizen, with all the tech oligarchs cozying up to the current administration, I’d be a lot more concerned with Facebook/Twitter/Etc collecting your data.

frozenspinach@lemmy.ml · 1 year ago

Realistically what is the worst thing China is doing with your private data?

Probably mapping out the extended support networks of democratic activists in Taiwan to prepare to throw them in jail after a forcible military takeover.

Grapho@lemmy.ml · 1 year ago

So democratic activists in Taiwan have extensive networks in the US?

I mean, you said it.

catsarebadpeople@sh.itjust.works · 1 year ago

Extensive networks with their close ally? My pearls must be clutched!!

Grapho@lemmy.ml · 1 year ago

Networks with a foreign actor undermining national sovereignty, which financed several massacres in your country

catsarebadpeople@sh.itjust.works · 1 year ago

My country? Not sure what you’re talking about but I know that Taiwan deserves sovereignty. You don’t? Surely you’re not pro imperialism…

Ulrich@feddit.org · 1 year ago

The CCP is significantly more oppressive, gives zero shits about human rights or trademarks or really anyone at all. The US at least pretends to care.

Sir_Kevin@lemmy.dbzer0.com · 1 year ago

Bro you can stop that narrative. The truth is out now.

frozenspinach@lemmy.ml · 1 year ago

The truth is out now.

Who talks like this and thinks it means something?

Sir_Kevin@lemmy.dbzer0.com · 1 year ago

For the past week the people of China and the United States, as well as other countries, have been comparing notes and debunking propaganda on both sides, realizing that much of what we’ve all been told for years or decades has been lies.

JohnnyCanuck@lemmy.ca · 1 year ago

I’m ootl. What debunks have come out?

Ulrich@feddit.org · edit-2 1 year ago

Removed by mod

TreeGhost@lemm.ee · 1 year ago

I’m not here to defend the Chinese government or anything, but there is an argument to be made that the US has an equivalency to each one of these things.

CCP officials at tech companies - NSA backdoors

Uyghur slaves - Prison labor aka war on drugs

Taiwan - Gaza/Literally any “3rd world” nation with oil

Censorship - Right wing media empires/red state bills targeted to downplay US atrocities taught in schools

Retaliation against protestors - Police brutality

Social media censorship - Oligarch-owned social media

I think a lot of people are less falling for Chinese propaganda and more overcoming US propaganda.

Grapho@lemmy.ml · 1 year ago

With the caveat that we have tons of actual evidence for the US equivalent, whereas the claims that China does those things are usually “We absolutely swear they do bro” from the people who swore Hamas was raping babies or whatever.

Ulrich@feddit.org · 1 year ago

If you think any of those are remotely the same, you’re simply delusional.

☆ Yσɠƚԋσʂ ☆@lemmy.ml · 1 year ago

Removed by mod

Ulrich@feddit.org · 1 year ago

You make a compelling argument.

Grapho@lemmy.ml · 1 year ago

I’d love to be wrong.

No you wouldn’t. If you would, you’d have listened to the many people that probably have corrected you on all those State Department talking points

Ulrich@feddit.org · 1 year ago

That’s never happened. And being that you haven’t either, I think it’s a fair guess that it won’t anytime soon.

Kras Mazov@lemmygrad.ml · edit-2 1 year ago

EDIT: I just realized feddit blocks both Lemmygrad and Hexbear, so this user cannot see my comment. If anyone wants to use/copy my comment or link directly to it, feel free to do so; I believe I provided enough evidence to debunk most of this user’s sourceless claims. It’s a shame some instances just block us, and it shows who they truly are.

E: any of you downvoters, feel free to correct me, I’d love to be wrong.

You throw out a bunch of claims with zero sources and want to be taken seriously. At least give us the bare minimum before just spewing this much US State Department propaganda.

That being said, I will address some of your points, since someone else might stumble upon this and need an actual answer.

They don’t have CCP officials required by law to work at tech companies and disclose any and all data they acquire?

Keeping a close eye on the companies in their country and keeping them on a short leash is good actually. China is not a capitalist hellhole like the US or most of the world, it is a socialist state where the rich do not control the government. Keeping them in check is the right thing to do given their current development level of socialism.

They’re not using Uyghur slaves in their factories?

That’s a new one, so far I have only heard about how they are being genocided. Which you can debunk with a little bit of research: Arab League’s visit to Xinjiang rejects Western accusations of ethnic genocide, religious persecution.

They’re not trying to literally erase Taiwan off the maps?

LMAO, no. Taiwan is part of China, why would China want to erase part of itself off the map? Even the US agrees. The only thing China wants is proper reunification with Taiwan.

They’re not still censoring information about their horrific pasts?

What “horrific past”? Be specific, this vague stance achieves nothing. If you’re talking about Tiananmen Square, here’s a good video about that: The Tiananmen Square “Massacre” Never Happened.

They’re not targeting, retaliating against and kidnapping protestors domestic and abroad?

Again, provide a damn source, I have no idea what you’re talking about and it is something I never saw anyone claim before.

What I can do tho is bring attention to the names of a few people like Huey P. Newton being killed by the US government and Snowden having to seek asylum abroad after blowing the whistle on the US surveillance state for the world to see. And if that’s not enough, how about Pro-Palestinian protesters clash with US police on second night of DNC and New Report Details How Pro-Palestinian Protests Are Suppressed in Democratic Countries.

They’re not censoring virtually every US social website entirely from the entire country?

No they are not, Microsoft operates in China. Not only that, but they do not explicitly want to simply ban US sites on there, it’s a simple matter of national sovereignty where companies like Facebook and Google refuse to abide by Chinese law, so China simply developed all their tools in-house. Not only that, but Chinese citizens have access to VPNs and can easily access websites abroad that are not usually allowed in China.

Meanwhile the US banned Huawei and tried to ban TikTok when it became apparent they could not control it and that the people were seeing the US for what it truly is, a genocidal state funding Israel in its attempt to genocide the Palestinian people.

The last link I posted is a proxy on 12ft.io since The Intercept won’t let you see the page without registering.

Grapho@lemmy.ml · 1 year ago

The US is in the process of deporting all its migrants and threatening invasions on half the world.

I get that gringos don’t want to own up to their complicity by inaction but you oughta stop pontificating about how other governments are worse. Unless they’re called Israel, they weren’t before and they sure as fuck aren’t now.

Ulrich@feddit.org · 1 year ago

Get fucked, racist.

Grapho@lemmy.ml · 1 year ago

Lmaooo hurting gringos feelings is being racist? Y’all have had concentration camps for longer than you’ve been without them, you know their fucking addresses and they’re still there.

Do forgive me for throwing y’all’s opinions on racism in the dustbin.

TheTetrapod@lemmy.world · 1 year ago

You cannot be a serious leftist and pretend to be offended by a little “anti-white” rhetoric.

BrainInABox@lemmy.ml · 1 year ago

Based on what? The US imprisons more people, kills more people, tortures more people. The only way to argue that China is more oppressive is basically to start with the assumption they are and then work backwards to justify it.

Ulrich@feddit.org · 1 year ago

I listed a handful of reasons above, none of which anyone has denied or refuted. Just downvoted.

BrainInABox@lemmy.ml · 1 year ago

Actually you didn’t. You listed a bunch of accusations against China (which were refuted, you just ignored that), but you didn’t even try to explain how that’s more oppressive than the USA. Even if all your accusations were true, the US is still more oppressive.

Ulrich@feddit.org · 1 year ago

I see you are sticking with the pack here and going with generic denial and ignoring my arguments rather than actually refuting them.

vfreire85@lemmy.ml · 1 year ago

now we’ve got another refutopolis warrior.

Ulrich@feddit.org · 1 year ago

What does that even mean?

Euphoma@lemmy.ml · 1 year ago

That doesn’t affect people not in china or not bordering china.

vfreire85@lemmy.ml · 1 year ago

that’s pure ideology.

St.Elsewhere@threads.net@sh.itjust.works · 1 year ago

Strawmanning the open source federated social media enthusiast crowd as unaware fans of meta?

mspencer712@programming.dev · edit-2 1 year ago

As a US citizen, I prefer services that US consumer protections could apply to. (While we still have them, ahem.) I know that Chinese laws will not protect me from things a Chinese business does in China.

(What’s with the rude replies? Did I fail to notice what instance I’m on or something?)

Grapho@lemmy.ml · 1 year ago

As a chauvinist

Ftfy

mspencer712@programming.dev · 1 year ago

This makes me sad, that we can’t engage in civil discussion about this. Why did you assume and not ask questions? Be curious, not judgmental.

To me it’s a question of laws. The laws of the U.S. at least somewhat constrain the people of my own country, and can prevent them from working against their own citizens. Like me.

Please be kind when replying.

m532@lemmygrad.ml · 1 year ago

Fuck civility, it’s a tool of oppression

Tangentism@lemmy.ml · 1 year ago

That sinophobia isn’t going to stoke itself!

sunzu2@thebrainbin.org · 1 year ago

Pathetic

Tangentism@lemmy.ml · 1 year ago

Western authorities have been harvesting data for a few decades from social media so any complaint that singles out Chinese apps doing the same is obviously rooted in sinophobia.

The fact you think my joking about racists doing that is pathetic shows which side of that assertion you fall on.

sunzu2@thebrainbin.org · 1 year ago

My content on here speaks for itself… Dear

rAyCIsM 🤡

zante@slrpnk.net · 1 year ago

The response to DeepSeek has been so transparent and clichéd.

I thought more of Mashable, but I suppose it’s good when they show you who they really are

jol@discuss.tchncs.de · 1 year ago

I’m not American so they are indeed a foreign actor.

index@sh.itjust.works · 1 year ago

Nope you can’t run chatgpt locally.

frozenspinach@lemmy.ml · edit-2 1 year ago

but it’s a foreign actor so OOooooOOWwwwooOOOO sCaRrRey!

I love that people think this is a solid own. Lest we forget Hong Kong, or an impending hot war in Taiwan or building out extradition systems with an expanding network of countries to forcibly repatriate and torture dissidents and human rights lawyers.

You used to not have to explain why authoritarianism was bad.

Edit: I would love to know the Pro side of what happened in Hong Kong, or the forced extradition regime, since evidently I’m clearly in the wrong in thinking those were bad. What am I missing?

Foni@lemm.ee · 1 year ago

It used to not be necessary because democracies used to have moral authority but since the revelations of Manning and Snowden non-Americans see no difference between giving our data to the USA or to China or any other. We also know from the reaction to the war in Ukraine and Gaza that human rights claims are only sometimes used.

Grapho@lemmy.ml · edit-2 1 year ago

Anti terrorism is good, actually. I don’t support people kicking seniors for speaking mandarin to try to bully a government into not prosecuting murderers in the mainland, which was the reason the protests happened (that and Washington money)

BrainInABox@lemmy.ml · 1 year ago

or an impending hot war in Taiwan

When you can’t even find things that China actually has done to complain about, so you have to start complaining about things they haven’t done.

circuitfarmer@lemmy.sdf.org · 1 year ago

This “China’s AI is taking your data and that’s bad” is shockingly similar to “TikTok is taking your data and that’s bad”. Lots of US counterparts do the same thing, but I don’t see (as much) media coverage about that.

Don Draper: “no no no, everyone else’s cigarettes are dangerous. Lucky Strikes are… toasted.”

chicken@lemmy.dbzer0.com · 1 year ago

The way I think of it is, I don’t live in China, so regardless of my objections to their values or human rights abuses, why would CCP or an affiliated company care about me or ruin my life on the basis of or by abusing my data? A big part of why I care about privacy is I don’t want to be filtering my every thought through consideration of whether the powers that be would approve, and US companies are way more relevant to that.

mystic-macaroni@lemmy.ml · 1 year ago

Sell to the highest bidder

shawn1122@lemm.ee · 1 year ago

These are the excuses you start to make when you’re losing. Not looking great for the US…

Jeena@piefed.jeena.net · 1 year ago

This is probably only a problem with the online version. In contrast to google and openAI they, like meta, let you download the model and run it offline, where they can’t access any of this data I presume.

0x01@lemmy.ml · 1 year ago

I’ve been running it locally using ollama, works completely offline, no keystroke data for anyone!
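For anyone who wants to try the same thing from a script, here is a rough sketch of talking to a local Ollama server over its HTTP API. It assumes Ollama is already running on its default port (11434) and that a DeepSeek model has been pulled; the model tag `deepseek-r1` and the helper names `build_request` and `ask` are illustrative assumptions, not something taken from this thread. The request only ever goes to `localhost`.

```python
import json
import urllib.request

# Ollama's default local endpoint; nothing here leaves your machine.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(prompt: str, model: str = "deepseek-r1") -> urllib.request.Request:
    """Build a non-streaming generate request for a local Ollama server.

    The model tag is an assumption; use whatever `ollama list` shows.
    """
    payload = json.dumps(
        {"model": model, "prompt": prompt, "stream": False}
    ).encode("utf-8")
    return urllib.request.Request(
        OLLAMA_URL,
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def ask(prompt: str, model: str = "deepseek-r1") -> str:
    """Send the prompt to the local server and return the generated text."""
    with urllib.request.urlopen(build_request(prompt, model)) as resp:
        return json.loads(resp.read())["response"]
```

Calling `ask("hello")` only works while the local server is up. Pulling the network cable first is a cheap way to confirm the whole round trip stays offline.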

sunzu2@thebrainbin.org · 1 year ago

Yeah I scan logs and so far nothing… I still don’t trust them but I can’t tell shit either

Jessica@discuss.tchncs.de · 1 year ago

Just use little snitch, open snitch or simple wall depending on your operating system and block the outbound connection if one ever occurs

sunzu2@thebrainbin.org · 1 year ago

portmaster?

Jessica@discuss.tchncs.de · 1 year ago

Seems legit! https://safing.io/blog/2022/04/11/portmaster-vs-simplewall/

sunzu2@thebrainbin.org · 1 year ago

I heard about little snitch, is there any benefit to it v portmaster in your opinion, off the cuff type thing?

Jessica@discuss.tchncs.de · 1 year ago

Oh little snitch was just what I used when macOS was my main operating system. When I switched to windows I started using simple wall and I just recently was poking around for a Linux solution and I found open snitch

Pennomi@lemmy.world · 1 year ago

Right, the offline version (if you have the hardware to run it) is completely under your control, and no one can take that away from you. Honestly nice to see that happen, I thought it would take several years.

AbouBenAdhem@lemmy.world · edit-2 1 year ago

Anyone using DeepSeek as a service the same way proprietary LLMs like ChatGPT are used is missing the point. The game-changer isn’t that a Chinese company like DeepSeek can compete with OpenAI and its ilk—it’s that, thanks to DeepSeek, any organization with a few million dollars to train and host their own model can now compete with OpenAI.

Snot Flickerman@lemmy.blahaj.zone · 1 year ago

On-prem vs. Cloud, basically. On-prem just magically got cheaper.

mac@lemm.ee · edit-2 1 year ago

Onprem has always been cheaper. Cloud compute was the most successful marketing campaign I can think of.

superkret@feddit.org · 1 year ago

Not when it’s about LLMs.

WalnutLum@lemmy.ml · 1 year ago

Or open source groups can make a fully open repro of it: https://github.com/huggingface/open-r1

naeap@sopuli.xyz · 1 year ago

I’d like to look into that, how can I train an existing model further?

I’m only playing around with ollama, but like to do a bit more - mostly just to fulfill my needs to understand things - but have no idea where to start

WalnutLum@lemmy.ml · 1 year ago

You’re going to have to learn python.

Here’s a good overview: https://huggingface.co/docs/transformers/training

naeap@sopuli.xyz · edit-2 1 year ago

Python is not a problem
SW Dev is my job. Just never had real contact with AI before, besides playing around a bit.

Thank you very much for the link!!

Edit: thank you very much again, that was pretty much exactly what I was looking for.
Don’t know how I missed checking out Hugging Face. Always thought of it as just a GitHub for models and didn’t bother checking for docs…
But that’s a great intro with simple tools/tutorials to get a grip on it, thanks!

sunzu2@thebrainbin.org · 1 year ago

They all do this…

Don’t use hosted models unless you pay for your own server space and it is encrypted.

Don’t be a fucking idiot.

frozenspinach@lemmy.ml · 1 year ago

They all store data on Chinese servers?

sunzu2@thebrainbin.org · 1 year ago

🤡

gitgud@lemmy.ml · 1 year ago

Unironically quite a lot of them probably do because it’s probably cheap and they have a fiduciary duty to the shareholders to gEt ThE bEsT dEaL!

deadcatbounce@reddthat.com · 1 year ago

No.

As opposed to Microsoft, Google, … NSA, or GCHQ servers. Or all of the above.

☆ Yσɠƚԋσʂ ☆@lemmy.ml · 1 year ago

Yeah, but these are wholesome domestic spy agencies that are just looking after you and protect you from yourself.

deadcatbounce@reddthat.com · 1 year ago

Fuck. They told me that they were storing my backups.

lemmyseizethemeans@lemmygrad.ml · 1 year ago

[stellar wind has never left the chat]

Treczoks@lemmy.world · 1 year ago

“We store the information we collect in secure servers located in the People’s Republic of China”

Now you Americans know how we Europeans feel when Google, Amazon and Facebook store our information on American servers. Hint: The protective wall between Chinese servers and their government is about as good as the one between American servers and their government - at least for non-US citizens. The last thin veil of privacy for Europeans has been ripped to shreds by Trump last week.

Ferk@lemmy.ml · edit-2 1 year ago

The last thin veil of privacy for Europeans has been ripped to shreds by Trump last week.

What did he do? I know Trump does not like the GDPR, but did he sign something affecting it last week?

Zip2@feddit.uk · 1 year ago

Did the American technology giants think they had the monopoly on capturing human input too?

SatansMaggotyCumFart@lemmy.world · 1 year ago

My gym sock captures human input too.

Zip2@feddit.uk · 1 year ago

That’s human output surely?

SatansMaggotyCumFart@lemmy.world · 1 year ago

I input it into the sock.

grey_maniac@lemmy.ca · 1 year ago

I’m confused. Isn’t “collecting keystroke data” just an alarmist way to describe text entry?

noisefree@lemmy.world · 1 year ago

Maybe. They could also be doing things like paying attention to input cadence and typos/pre-send typo corrections to use as part of a fingerprint associated with the identifying information a user gives them when creating an account so that they can then attempt to detect the user elsewhere on the web whether they are using an identifying account or not.

☆ Yσɠƚԋσʂ ☆@lemmy.ml · 1 year ago

This argument applies to literally every single web app you use.

ubergeek@lemmy.today · 1 year ago

So, basically using Facebook technology in their AI app?

noisefree@lemmy.world · 1 year ago

You’ll hear no arguments from me on that point, US tech companies are toxic af.

Melvin_Ferd@lemmy.world · 1 year ago

How far we’ve come

uis@lemm.ee · 1 year ago

Not exactly. Timing between key presses can be used to identify people.
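To make the idea concrete, here is a toy sketch of how timing between key presses could be turned into a crude fingerprint. It is an illustration under simple assumptions (mean and spread of the gaps between presses, compared with a plain absolute difference), not a description of what DeepSeek or any other company actually does; the function names and sample timestamps are made up.

```python
from statistics import mean, pstdev


def intervals(timestamps):
    """Gaps in seconds between consecutive key presses."""
    return [later - earlier for earlier, later in zip(timestamps, timestamps[1:])]


def profile(timestamps):
    """Summarize a typing sample as (mean gap, spread of gaps)."""
    gaps = intervals(timestamps)
    return (mean(gaps), pstdev(gaps))


def distance(p, q):
    """Smaller value means the two typing rhythms look more alike."""
    return abs(p[0] - q[0]) + abs(p[1] - q[1])


# Hypothetical key-press times (seconds) from three typing sessions.
session_a = [0.00, 0.12, 0.25, 0.36, 0.49]   # person 1, first visit
session_b = [0.00, 0.13, 0.24, 0.37, 0.48]   # person 1, later visit
session_c = [0.00, 0.30, 0.45, 0.90, 1.00]   # someone else

same_person = distance(profile(session_a), profile(session_b))
other_person = distance(profile(session_a), profile(session_c))
print(same_person < other_person)
```

Real systems would use far richer features (per-key-pair timings, hold durations, correction habits), but the principle is the same: the rhythm is stable enough across sessions to act as an identifier.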

grey_maniac@lemmy.ca · 1 year ago

I am literally so paranoid I regularly vary my keystroke rhythms and explore polyrhythmic techniques to create variations. Not even joking.

kekmacska@lemmy.zip · 1 year ago

lol no. only the sounds of the keys can identify the keyboard’s model

uis@lemm.ee · 1 year ago

The goal is not to identify the keyboard model. The goal is to identify the person. And people tend to have something called habits.

kekmacska@lemmy.zip · 1 year ago

the chance of this is almost zero. if you are a dangerous cybercriminal, they will track your device down by a networking solution, wait until you leave it unattended and install a hardware-based spy device and capture evidence. No fbi agent will fuck around with keyboard sounds or movie bs like that

uis@lemm.ee · 1 year ago

with keyboard sounds

Ok, I see you are intentionally going in circles.

vfreire85@lemmy.ml · 1 year ago

this. i mean, the session logs for the prompt are kept at least for your user, right?

ubergeek@lemmy.today · 1 year ago

Yes.

tux@lemmy.world · 1 year ago

Not usually. Keystroke info is different from text input: if you didn’t click into any field and typed, it would only be captured if all keystrokes are being grabbed. It’s especially scary if you keep the app running in the bg and then type something and it still captures it. Not saying they’re doing that, but the privacy policy says they might.

The rhythm part is annoying, it’s commonly used to ID people even through things like ad blocks and dns blocks. Could also (in theory) be used to capture what people are typing just by hearing how they type.

Ferk@lemmy.ml · 1 year ago

This is the full paragraph:

We collect certain device and network connection information when you access the Service. This information includes your device model, operating system, keystroke patterns or rhythms, IP address, and system language. We also collect service-related, diagnostic, and performance information, including crash reports and performance logs. We automatically assign you a device ID and user ID. Where you log-in from multiple devices, we use information such as your device ID and user ID to identify your activity across devices to give you a seamless log-in experience and for security purposes.

It looks to me that they are using it to identify the user uniquely, maybe also related to captcha to prevent bots (it’s common practice to capture mouse and keyboard while resolving captchas to see if the movement is human-like).

grey_maniac@lemmy.ca · 1 year ago

Looks like there are more things I need to start randomizing and injecting with noise.

Subverb@lemmy.world · 1 year ago

If you think the American companies do anything different you’re not paying attention and simply believing the propaganda.

ozoned@lemmy.world · 1 year ago

Chinese company does what American companies have done for 25+ years now!

Is it time for REAL data privacy laws or are we just gonna keep playing whack-a-mole with Chinese tech companies that get us nowhere?

Someonelol@lemmy.dbzer0.com · 1 year ago

Our data’s just too valuable for these parasites. Data privacy laws may eventually pass to compel software companies to store everything in US servers only.

ozoned@lemmy.world · 1 year ago

Excellent point. If that’s the case though, then wouldn’t other countries follow suit, which still limits big tech’s reach and makes them less profitable and less powerful? Idk. Guess we’ll see how it plays out. Either way, I’m staying as far from those ecosystems as possible to at least try to mitigate some of what they do. I’ll never be totally successful, the genie is out of the bottle, but we can at least attempt.

Fuck Work@slrpnk.net · 1 year ago

At least it’s not stored on American servers.

JonEFive@midwest.social · 1 year ago

I feel like Meta could do a ton more damage with my information than Tencent

mel ♀@jlai.lu · 1 year ago

Same as Chrome’s magic bar, or the Android keyboard, no? So in the end, is it good when the USA does it because “democracy” (never ever with napalm), while China is bad because of human rights violations (the USA never did anything like this)?

Dearth@lemmy.world · 1 year ago

Seriously this. Nothing that China is accused of doing is any worse than what i know America has done. If it’s the Chinese Communist Party stealing your data at least you know it won’t be used to inject ads everywhere you go on the internet

Max-P@lemmy.max-p.me · 1 year ago

At least they’re transparent about it, unlike american companies that hide behind convoluted terms of services and then sell the data behind your back but it’s technically legal.

China’s like “yeah we collect everything”. I can appreciate the honesty.

mavu@discuss.tchncs.de · 1 year ago

It’s a chinese company, where else would they store the data?

ShinkanTrain@lemmy.ml · 1 year ago

The balls.

Critical_Thinker@lemm.ee · 1 year ago

Antarctica, clearly.

smb@lemmy.ml · 1 year ago

I think it’s called a data lake, so they don’t “store” it, it’s rather floating around there 🤪

howrar@lemmy.ca · 1 year ago

These lakes are formed when the cloud is saturated and gives us data precipitation.

smb@lemmy.ml · edit-2 1 year ago

thanks for the great picture 👍

so here is the current cloud climate forecast:

The saturated clouds will rain into the data lakes that are already overspilling here and there into the ransomstreams, which already take all the soil in their way with them. During the day there will be security clouds that prevent only the visible rain, while during the night those same security clouds rain all their collected data into their home lake, whose security is already corrupted and spills over regularly.

As soon as the fort-cisc-pal-ocstricken-redm-ondams breach, there are gonna be floods with multi-exabyte wave heights, and the ripples of the release will be felt all the way to far-east China; the currents will circle the world multiple times, causing damage and devastation in their wake, and eventually even reach connected orbit.

The floods will have the potential to also wash away and/or drown or choke all the big tech dinosaurs. Only small foss mammals and deep sea amphibians will survive this historic event.

… you kinda asked for it 😉 same as “they” kinda asked for it too. 🤔

JOMusic@lemmy.ml · 1 year ago

This article is what US propaganda looks like folks. Mashable should be ashamed.

Literally all AI companies do this to run their services. Except you can actually download Deepseek and run it completely securely on your own devices. You know who doesn’t allow that security? OpenAI and the other US companies currently being screwed.

zeca@lemmy.eco.br · 1 year ago

every google site has been doing this for years too. every comment we write in youtube and discard before posting, it’s being recorded. this isn’t news at all.