AI Alignment Research

OpenAI and Microsoft sign up to UK’s AI ‘Alignment Project’

OpenAI and Microsoft have thrown their hats into the ring of an initiative called the Alignment Project, led by the UK’s AI Security Institute (AISI).

1 年

Exclusive: New Research Shows AI Strategically Lying

Experiments by Anthropic and Redwood Research show how Anthropic's model, Claude, is capable of strategic deceit ...

1 天

A 7-Step Leadership Framework To Implement AI At Scale And Speed

I've developed a seven-step framework grounded in my client work and interviews with thought leaders and informed by current ...

3 天

The 12 Research Papers That Influenced AI Development Over The Last 6 Years

Over the past six years, artificial intelligence has been significantly influenced by 12 foundational research papers. One ...

Computer Weekly

UK AI alignment project gets OpenAI and Microsoft boost

OpenAI and Microsoft are the latest companies to back the UK’s AI Security Institute (AISI). The two firms have pledged support for the Alignment Project, an international effort to work towards ...

2 年Opinion

An AI Pause Is Humanity’s Best Bet For Preventing Extinction

Constantly improving AI would create a positive feedback loop: an intelligence explosion. We would be no match for it.

Open Access Government

OpenAI and Microsoft back UK-led global push to make AI safer

The UK government sees the initiative as central to its ambition to lead global efforts in frontier AI research. By combining grant funding, computing infrastructure, and academic mentorship, the ...

ZDNet

AI models know when they're being tested - and change their behavior, research shows

Several frontier AI models show signs of scheming. Anti-scheming training reduced misbehavior in some models. Models know they're being tested, which complicates results. New joint safety testing from ...

Harvard Business Review

Where Senior Leaders Are Struggling with AI Adoption, According to Research

As AI becomes embedded across organizations, senior leaders are facing pressures that rarely surface in public forums. Drawing on in-depth interviews and focus groups with 35 executives across global ...

TechCrunch

OpenAI’s research on AI models deliberately lying is wild

Every now and then, researchers at the biggest tech companies drop a bombshell. There was the time Google said its latest quantum chip indicated multiple universes exist. Or when Anthropic gave its AI ...

Forbes

‘Mind The Gap’: Bridging AI Research And Real-World Application

In an era of AI “hype,” I sometimes find that something critical is lost in the conversation. Specifically, there’s a yawning gap between AI research and real-world application. Though many ...

13 小时on MSN

Meet Amanda Askell: The philosopher hired by Anthropic to teach AI how to behave well

A philosopher is at the forefront of shaping advanced AI like Anthropic's Claude, guiding its ethical behaviour. Amanda Askell's work focuses on instilling caution, honesty, and empathy, moving beyond ...

一些您可能无法访问的结果已被隐去。

显示无法访问的结果