everythingAInews

AI moves fast. Keep up.

Safety

See all 30 topics →
Showing:Safety
Top stories today

Times of Israel

AI chatbots are telling Israeli voters exactly what they want to hear

With 1 in 4 Israelis likely to ask AI for voting guidance, a tech startup tests how leading models handle such queries, warns chatbots rely on biased sources and prioritize pleasing users over stating facts The post AI chatbots are telling Israeli voters exactly what they want to hear appeared…

IT Brief NZ · seanm@techday.com (Sean Mitchell)

Flux raises USD $5 million to track AI code output

The new capital will help the Boston startup expand sales and engineering as firms seek clearer oversight of AI-assisted coding and software risk.

IT Brief NZ · joseph@techday.com (Joseph Gabriel Lagonsin)

Field Effect launches AI detection & response tool

Businesses face growing shadow AI risks as Field Effect folds monitoring and controls into its managed detection and response platform.

Times of Israel

AI models are absorbing antisemitism from humans, study says

Peer-reviewed psychology paper finds that, despite efforts to rein in bias, large language models replicate antisemitic tropes, with possible implications for areas like hiring The post AI models are absorbing antisemitism from humans, study says appeared first on The Times of Israel.

The Verge AI · Robert Hart

Anthropic apologizes for invisible Claude Fable guardrails

Anthropic has apologized for stealthily throttling its new AI model, Claude Fable 5, with hidden guardrails that undermine both researchers and rivals using it to develop competing systems.

The Guardian AI · Eduardo Porter

After SpaceX’s huge IPO, Americans’ financial future will be bound to AI

They’re about to get more AI rammed down their throats, stuck into their pension plans and investment portfolios Americans are growing worried about what artificial intelligence portends for their futures.

IT Brief NZ · seanm@techday.com (Sean Mitchell)

Ivalua launches IVA Studio for procurement AI automation

Procurement teams will be able to handle sourcing, invoicing and supplier risk in one interface, as Ivalua adds AI agent IVA Studio.

MIT Technology Review · Will Douglas Heaven

Google DeepMind is worried about what happens when millions of agents start to interact

Google DeepMind is funding research into the potential dangers of situations where millions of different AI agents interact with each other online. According to Rohin Shah, who directs the company’s AGI safety and alignment research, the mass-market arrival of agents that can carry out tasks…

The Guardian AI · Manoush Zomorodi and Keith Diaz

Those tedious errands, tasks and chores that AI wants to replace? They help keep you fit | Manoush Zomorodi and Keith

There’s a downside to too much convenience: it harms our bodies There is a seductive fantasy being floated by AI executives that all the efficiency their products will bring us will lead to humans finally returning to their essential, best selves.

Bloomberg Technology

AI Sparks Job Loss Worries in China, Call for Protection

China’s rapid adoption of AI in the workplace has prompted an unusually blunt call from a state-run newspaper. The Workers' Daily, the official paper of China’s umbrella trade union organization, called on regulators to protect labor rights as officials consider how to contain risks posed by the…

The Guardian AI · Blake Montgomery and agency

Canadian mother sues OpenAI, alleging ChatGPT led her daughter to kill herself

Suit filed in US alleges chatbot told Alice Carrier, 24, ‘maybe this is just the end’ as she struggled with suicidal thoughts A Canadian mother sued OpenAI and its CEO, Sam Altman, in US court on Thursday, alleging that ChatGPT encouraged her daughter to commit suicide.

IT Brief NZ · karen@techday.com (Karen Joy Bacudo)

FSB consults on sound AI practices for financial firms

The proposals could shape how banks and insurers manage cyber and operational risks as AI adoption accelerates across the sector.

IT Brief NZ · mark@techday.com (Mark Tarre)

Braze report says data beats AI for personalisation

Media and entertainment groups risk wasted AI spend unless they first fix fragmented data and measurement, Braze's report says.

IT Brief NZ · seanm@techday.com (Sean Mitchell)

Zscaler launches zero-trust tools to secure AI agents

Enterprises face new risks as autonomous software agents spread through systems faster than older security tools can track or control.

Sponsored

Advertise here

Reach the people following, building and buying AI every day

everythingAInews is read by founders, engineers, researchers and anyone who wants to stay on top of the most important developments in AI. Sponsor the daily digest, place an in-feed ad or publish sponsored content.

IT Brief NZ · mark@techday.com (Mark Tarre)

Anthropic launches Claude Fable 5 with safety limits

Many harmless prompts will now be diverted to Claude Opus 4.8 as Anthropic tightens safeguards around its newest general-use model.

Bloomberg Technology

Inside Anthropic, the $965 Billion AI Titan

Emily Chang meets Anthropic co-founders Dario and Daniela Amodei for a rare, in-depth discussion of the startup's origin story, its battles with the Pentagon and how the company says it intends to put safety first in the high-stakes AI race. (Source: Bloomberg)

Bloomberg Technology · Charlie Zhu, Jing Li

AI Wave Sparks Alarm in China With Call to Protect Worker Rights

China’s rapid adoption of artificial intelligence in the workplace has prompted an unusually blunt call from a state-run newspaper to protect labor rights, as Beijing considers how to contain risks posed by the new technology.

Google News AI

xAI fired an engineer who raised alarms about Grok safety, new lawsuit claims - TechCrunch

xAI fired an engineer who raised alarms about Grok safety, new lawsuit claims TechCrunch

Axios · Ashley Gold

Anthropic CEO says government should block dangerous AI

The government should legally be able to block or deter dangerous AI deployments, Anthropic CEO Dario Amodei wrote in an essay Wednesday. Why it matters: Anthropic's ideas for tech regulation and economic disruption from AI go far beyond anything currently under serious consideration in Washington…

Axios · Sam Sabin

Anthropic and OpenAI spark new race for frontier AI access

Frontier AI labs are converging on a new strategy for controlling their most cyber-capable models while still commercializing them: selective access. Why it matters: OpenAI's trusted-access program and a pending program from Anthropic are creating a new power center in cybersecurity where AI…

The Verge AI · Nilay Patel

Microsoft’s AI chief says superintelligence is near, but won’t take your job

Today I’m talking with Mustafa Suleyman, the CEO of Microsoft AI. And I’m actually going to keep today’s intro short. I’m working from my wife’s family farm this week, as you’ll see in the video, but also this is a real burner of an episode.

Axios · Russell Contreras

AI is masking America's "post-literate" workforce

Millions of working Americans struggle to read at a functional level. and artificial intelligence may be helping hide it. Why it matters: Low literacy is quietly becoming a major economic drag, even as AI tools allow workers to complete tasks they may not fully understand.

The Verge AI · Robert Hart

Anthropic releases its first Mythos-class model Claude Fable

Anthropic just announced Claude Fable 5, a new AI model it said is the most powerful model it has ever made widely available. According to the company, Fable 5 "shows exceptional performance in software engineering, knowledge work, and vision," with its lead over other models growing as tasks…