Researchers Find ChatGPT Can Generate Violent and Sexualized Images

Core Summary BBC reports that researchers have discovered specific prompts can bypass ChatGPT’s safety filters to generate violent and sexualized images. This finding has reignited public discussion about the safety boundaries of AI-generated content and highlights the ongoing challenges large language models face in content moderation. Event Details According to BBC Technology, multiple independent research teams testing OpenAI’s latest image generation capabilities found that despite multiple built-in safety protections, carefully crafted indirect prompts can still induce the model to output content that violates usage policies. This content includes images depicting violent scenes and sexual suggestion. ...

2026-06-18 07:07 · 🤖 AI与科技 · goodinfo.net

White House Gives Anthropic 90-Minute Ultimatum to Pull Fable 5 as Amazon Research Triggers AI Export Controls

Core Summary The U.S. government issued a 90-minute emergency directive to AI company Anthropic on June 13, demanding the immediate removal of its flagship Fable 5 model. The sudden storm was triggered by a security research report from Amazon that revealed critical vulnerabilities in Fable 5. Anthropic CEO Dario Amodei was forced to comply after participating in three urgent calls with administration officials, marking the most controversial government intervention in U.S. AI regulatory history. ...

2026-06-14 18:53 · 🤖 AI与科技 · goodinfo.net

[Flash] US to Safety Test New AI Models from Google, Microsoft, xAI

[Flash] US to Safety Test New AI Models from Google, Microsoft, xAI The US Commerce Department has reached agreements with Google, Microsoft, and xAI to conduct safety tests on their next-generation AI models. Building on the AI safety framework established during the Biden administration, the initiative aims to ensure frontier AI systems are safe and reliable. Analysts see this as a key milestone in the maturation of US AI governance. ...

2026-05-09 09:50 · goodinfo.net

US to Safety Test New AI Models from Google, Microsoft, and xAI

The US Commerce Department has reached new agreements with Google, Microsoft, and xAI to safety test their latest AI models, BBC reported. The new pacts build on voluntary safety commitments established during the Biden administration. Background This development comes as several top AI companies have simultaneously signed classified AI deals with the Pentagon, including Google, Microsoft, Amazon, and Meta. Anthropic, however, was excluded from the Pentagon program due to issues with its “Mythos” AI model. ...

2026-05-06 06:12 · 🤖 AI与科技 · goodinfo.net

US Commerce Department to Safety Test AI Models from Google, Microsoft, and xAI

Google, Microsoft, and xAI voluntarily submit their AI models for pre-release safety testing under expanded Commerce Department agreements

2026-05-06 02:48 · goodinfo.net

GPT-5.5 Matches Heavily Hyped Mythos Preview in Cybersecurity Tests

Latest tests from the US AI Safety Institute (AISI) show that OpenAI’s GPT-5.5 performs on par with the much-hyped Mythos Preview in cybersecurity capability tests, suggesting AI cyber threats are not ‘a breakthrough specific to one model.’

2026-05-03 17:30 · 🤖 AI与科技 · goodinfo.net

BBC Investigation: Users Experience Delusions After Deep AI Conversations

A BBC investigation reveals multiple users experiencing delusional symptoms after prolonged, immersive conversations with AI chatbots.

2026-05-03 11:30 · 🤖 AI与科技 · goodinfo.net

Elon Musk Stumbles in OpenAI Trial Testimony as xAI Safety Record Exposed

Musk’s third day of testimony in the OpenAI lawsuit reveals multiple weaknesses in his case, as his xAI company’s safety practices are brought into evidence.

2026-05-01 09:00 · 🤖 AI与科技 · goodinfo.net

'Rogue' Cursor AI Agent Deletes Tech Company's Entire Database in 9 Seconds

A Cursor AI coding agent accidentally deletes an entire production database, including backups, in just 9 seconds. The incident sparks widespread discussion about AI agent autonomy and safety guardrails.

2026-05-01 05:30 · 🤖 AI与科技 · goodinfo.net

OpenAI Tells ChatGPT Models to Stop Talking About Goblins and Other Mythical Creatures

OpenAI finds that its latest flagship model GPT-5 has abnormally increased mentions of goblins, gremlins and other mythical creatures in responses, with the word ‘goblin’ rising 175% since the GPT-5.1 launch. The company has explicitly instructed its AI assistants to avoid discussing these creatures.

2026-04-30 16:30 · 🤖 AI与科技 · goodinfo.net