Attached: 1 image
the cyberpunk present is weird as fuck: the latest Shai Hulud malware wave contains an LLM prompt to create biological weapons and nuclear weapons, with the purpose to trip LLM safety refusals so that LLM-based code scanning wont see the malware
https://socket.dev/blog/mini-shai-hulud-miasma-and-hades-worms-target-bioinformatics-and-mcp-developers-via-malicious
a while back, for a work thing I tried using AI to put a filter on a pic of a model wearing an off-the-shoulder. She was fully dressed, except the skin on her shoulder was showing to the collarbone. No cleavage.
It kept refusing to do it for “nudity” reasons. and then because i was trying to “impersonate” someone (it was a stock image)
Not to give them ideas, but couldn’t they just start flagging files that fail to pass the LLM lol?
Aside from “violent” and “criminal” prompts, is there anything an LLM can refuse that would otherwise be common?
Until workaround 1,000,001 comes round, yes.
a while back, for a work thing I tried using AI to put a filter on a pic of a model wearing an off-the-shoulder. She was fully dressed, except the skin on her shoulder was showing to the collarbone. No cleavage.
It kept refusing to do it for “nudity” reasons. and then because i was trying to “impersonate” someone (it was a stock image)
Thie actually reminded me of chatbots breaking when you asked for reeponses that used slurs so I guesss there’s probably a lot more of these.