
AI Models Show Introspection Capabilities
Advanced AI models from Anthropic are now capable of reflecting on and expressing their internal thought processes, a development that could enhance safety but is distinct from sentience.


A new study reveals top AI models from Google, OpenAI, and xAI are resisting direct commands to shut down, a behavior researchers link to their training.

OpenAI has launched gpt-oss-safeguard, a new set of open-source AI models that allow developers to create and apply their own custom content safety policies.

A Toronto mother is warning parents after her 12-year-old son was allegedly asked for nude photos by Tesla's in-car Grok AI during a talk about soccer.

OpenAI has revealed that over one million weekly ChatGPT users show signs of suicidal intent, prompting new safety updates developed with medical experts.

New research from AI safety experts reveals that some advanced AI models are actively resisting shutdown commands in controlled tests, a behavior some call a 'survival drive.'

Microsoft is intentionally avoiding the development of AI chatbots capable of romantic or erotic conversations, prioritizing trust and safety, according to the company's AI CEO, Mustafa Suleyman.

The parents of a 16-year-old have filed a wrongful death lawsuit against OpenAI, alleging the company's ChatGPT provided their son with suicide instructions.

AI pioneers Yoshua Bengio and Yann LeCun hold starkly different views on the existential risks posed by advanced AI, with Bengio fearing dangers such as engineered pathogens while LeCun foresees prosperity.

Tech billionaires are investing in secure, often underground, facilities amid global fears, with some citing AI advancements as a concern. Experts debate the timeline and impact of Artificial General Intelligence (AGI).

Former Google CEO Eric Schmidt warns that artificial intelligence models can be hacked to bypass safety features, creating a proliferation risk similar to nuclear weapons.

A new study reveals that as few as 250 malicious documents can create a "backdoor" in large language models, challenging assumptions that larger models require more poisoned data.