Users experimenting with DeepSeek have seen the Chinese AI chatbot reply and then censor itself in real time, providing an arresting insight into its control of information and opinion.

  • observantTrapezium@lemmy.ca · 1 day ago

    I downloaded the 70B model and tried politically “naughty” questions. Even without the chatbot guardrails, it mostly says things that the CCP would approve of, but you can trick it into being more honest (not super easy!). One interesting thing is that while it usually spews these <think></think> blocks, for some politically sensitive questions (“is Taiwan part of China”) it just spits out the answer.

    • RedstoneValley@sh.itjust.works · 1 day ago

      I experimented with a local installation as well. The censored answers did not go through the chain-of-thought routine but came back as instant answers instead. Follow-up questions, however, made it spill the beans rather quickly, giving up even more juicy details than I had initially asked for.
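      The pattern both commenters describe is easy to check for programmatically: R1-style models wrap their chain-of-thought in <think></think> tags, so a reply that arrives with no such block at all stands out. A minimal sketch (the tag name comes from the comments above; exact output formatting varies by model and frontend):

      ```python
      import re

      # R1-style outputs put chain-of-thought inside <think>...</think>
      # before the final answer; formatting may differ across builds.
      THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)

      def split_reasoning(output: str) -> tuple[str | None, str]:
          """Return (reasoning, answer); reasoning is None if no think block."""
          m = THINK_RE.search(output)
          if m is None:
              # No chain-of-thought at all -- the "instant answer" case.
              return None, output.strip()
          return m.group(1).strip(), output[m.end():].strip()
      ```

      Logging which prompts produce a `None` reasoning field would make the "instant answer" behavior measurable rather than anecdotal.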

  • andrew0@lemmy.dbzer0.com · 1 day ago

    Wait until they learn that OpenAI does the exact same thing. Try to get advice on how to crack software and see how far you get.

  • The Snark Urge@lemmy.world · 1 day ago

    DeepSeek, and all LLMs, are massively overvalued, but also, isn’t it fun to watch corpo media turn on a dime in service of shareholders to rebuild sentiment in tech stocks? Show’s over, folks, back to line-goey-uppy.

    Tinfoil hat mode: Watch NVIDIA stock recover in a week like this wasn’t some manufactured rug pull. Probably by friends of that Intel exec who went on the news talking about how he bought the dip. Not being super serious, but it’d hardly be the most blatant manipulation we’ve seen.

  • bdonvrA · 1 day ago

    Lol, that’s not at all unique to DeepSeek. I remember recording my screen to catch outputs on other models before they censored the message.

    • Pennomi@lemmy.world · 1 day ago

      One thing that’s so weird to me is that while DeepSeek is somewhat too large to run on my PC, just the fact that it exists makes it easy to distill the reasoning functionality into other, smaller models. I’ve been running a 34B distill locally, and it’s been much better than any other local model I’ve tried so far.