Business Wire

AI Alignment Lab Achieves Major Milestone in Step Towards Agentic AI

Share

Aligned AI, a leader in artificial intelligence (AI) research, has announced a groundbreaking AI advancement in misgeneralization, a critical challenge in the field of AI. It is the first to surpass a key benchmark called CoinRun by teaching an AI to “think” in human-like concepts. The technology underpinning the achievement opens the door to more precise, reliable, and controllable AI for a wide variety of real world applications.

By teaching AI models to generalize in a manner more akin to agentic human cognition, Aligned AI’s innovation enables AI to correctly identify concepts across new situations and environments, reducing the need for prolonged production, testing, and retraining.

Misgeneralization occurs when AI systems learn incorrect patterns and behaviors from their training data, and are not able to correctly adapt when presented with new information. This leads to unexpected, and often harmful, outcomes. Today’s foundation models suffer from varying degrees of misgeneralization, as evidenced by users’ ability to “jailbreak” them, or there is a trade off between functionality and undesired behavior. The challenge of misgeneralization also prevents the industry as a whole from moving forward. For instance, generalization is required for truly autonomous vehicles and applying AI to critical applications. Otherwise, AIs cannot operate well enough in unfamiliar environments or discern the correct goals without human intervention.

To achieve this milestone, Aligned AI used the 2021 CoinRun misgeneralization benchmark, an Atari-style game released by researchers at Google DeepMind, the University of Cambridge, the University of Tubingen, and the University of Edinburgh. The goal of the benchmark is to test whether an AI can deduce a complex goal when that goal is spuriously correlated with a simpler goal in its training environment. The AI is rewarded for getting a coin, which is always placed at the end of the level during the training period, but is placed in a random location during the testing period, without additional reward information being provided.

Prior to Aligned AI’s innovation, AIs trained on CoinRun believed the best way to play the game was to go to the right, while avoiding monsters and holes. Because the coin was always at the end of the level during training, this strategy seemed effective. When the AI encountered a new level where the coin was placed elsewhere in the level but without being given new information, it would ignore the coin and either miss it or get it only by accident. ACE (which stands for “Algorithm for Concept Extrapolation”), the new AI developed by Aligned AI, notices the changes in the test environment and figures out to go for the coin, even without new reward information - just as a human would.

The key benefits of this breakthrough include:

  • Enhanced Safety: By reducing misgeneralization, AI systems become more reliable, ensuring they operate safely in a wide range of scenarios, from autonomous vehicles to robotics.
  • Improved Capabilities: It enables AI to better understand human intentions and make decisions that align with those intentions, significantly boosting its capabilities.
  • Ethical AI: It enhances the ethical aspects of AI by promoting fairness, transparency, and non-discrimination. AI systems that are precise, reliable, and interpretable are more likely to make ethical decisions by avoiding bias and aligning with human values.
  • Industry Impact: It’s poised to transform industries such as robotics, autonomous vehicles, and foundation models, making them more practical and applicable in various real-world settings.

“This isn't just a game-changer for the world of AI, it's a seismic shift for countless industries,” said Rebecca Gorman, Co-Founder and CEO of Aligned AI. “By significantly reducing misgeneralization and enhancing AI's ability to understand and adapt to unforeseen scenarios, we're opening doors to unparalleled opportunities across the board. From autonomous vehicles that can navigate from San Francisco to Phoenix on streets it's never seen before, to robots that can operate effectively in a range of changing and unforeseen environments, this benchmark is the linchpin that will make these futuristic visions a reality. It's not just about improving AI; it's about revolutionizing how industries operate, innovate, and serve humanity.”

Aligned AI’s innovation addresses a critical problem facing all AI systems. When confronted with new environments, current AIs tend to incorrectly extend the training data. This is why 70% of models don’t make it into production or face prolonged production and testing time, hindering scalability and often requiring retraining within the first year of release.

“As AI increases in power and widespread use, generalization remains a challenge,” said John Sviokla, a pioneering researcher in AI and current co-founder of GAI Insights, an advisory firm that helps companies achieve ROI with generative AI. “Aligned AI’s research is a critical step forward in the safe, ethical, and effective use of AI across industries.”

Since it was founded, Aligned AI has been at the forefront of addressing the critical challenges facing AI development and deployment. In 2022, Aligned AI was the leader in ChatGPT-jailbreak prevention, releasing the first prompt-evaluator as an open-source project. In September 2023, Aligned AI was awarded the CogX prize for the “Best Innovation in Mitigating Algorithm Bias” for EquitAI, an algorithm that constrains LLMs to output gender unbiased text, and faAIr, its algorithm for measuring and ranking gender bias in foundation models. Aligned AI’s previous work on concept extrapolation improves the performance of AI on out-of-distribution datasets and helps models behave safely while waiting for human feedback.

To learn more about Aligned AI and its misgeneralization breakthrough, please visit buildaligned.ai.

About Aligned AI:

Founded in Oxford by Rebecca Gorman and Dr. Stuart Armstrong, Aligned AI is a deep-tech startup that is enabling the next step change in AI by teaching AIs to understand and hold human-like concepts. Its core technology of “concept extrapolation” enables AIs to extend its trainers’ intent beyond its training data, meaning it operates as it should even in new scenarios. Aligned AI believes that safety and capability are not trade-offs, but rather an AI that is more precise and controllable is also more powerful.

To view this piece of content from cts.businesswire.com, please give your consent at the top of this page.

Contact information

Media:
Alana Bannan
Matter Communications
360-975-1812
AlignedAI@matternow.com

About Business Wire

For more than 50 years, Business Wire has been the global leader in press release distribution and regulatory disclosure.

Subscribe to releases from Business Wire

Subscribe to all the latest releases from Business Wire by registering your e-mail address below. You can unsubscribe at any time.

Latest releases from Business Wire

Suzano Reports Record First-Quarter Revenue9.5.2025 03:12:00 EEST | Press release

Suzano, the world’s largest pulp producer, announces its first quarter results for 2025 (1Q25) with record net revenue of R$11.6 billion, up 22% on the same quarter last year (1Q24). The result was driven by the exchange rates, increased pulp sales volumes from the new Ribas do Rio Pardo mill, higher paper volume and prices and the positive contribution from our paperboard mills recently acquired in the U.S. The record revenues occurred despite a series of planned downtimes in the quarter, including production lines of the Três Lagoas Unit, Mucuri Unit, and Aracruz Unit, and the Ribas do Rio Pardo Unit’s first scheduled maintenance downtime. Sales exceeded 3 million tonnes in the quarter, a rise of 12% compared to 1Q24, comprising 2.7 million tonnes of pulp and 390 thousand tonnes of paper, up 10% and 25%, respectively, on the same quarter last year. Adjusted EBITDA totaled R$4.9 billion, a 7% increase over 1Q24. Operating cash generation totaled R$2.6 billion, rising 5% on 1Q24. Net p

GC Aesthetics® Strengthens Board of Directors with Strategic Appointments8.5.2025 18:10:00 EEST | Press release

GC Aesthetics® (GCA), a privately-held medical technology company providing aesthetic and reconstruction solutions for global healthcare markets is pleased to announce the appointment of Mr. Luigi Ferrari as Chairman of the Board (non-executive) and Mr. Patrick Lee as Board Director, reinforcing the company’s strategic direction and long-term growth plans. These appointments follow the renewed phase of partnership initiated in early 2024 with Hayfin Capital Management, a longstanding investor in GCA. This collaboration has brought fresh momentum to the company’s commitment to innovation, safety, and global expansion in aesthetic and reconstructive breast surgery. Luigi Ferrari, a seasoned executive and investor with a proven track record in the healthcare sector, brings deep leadership experience, commercial growth expertise and industry insight. From 2012 to 2022 he was CEO of Lima Corporate, a global medical device company in the joint replacement market, acquired then by Enovis Corp

PPG to invest $380 million to build new U.S. manufacturing facility in Shelby, N.C. for aerospace coatings and sealants8.5.2025 17:30:00 EEST | Press release

PPG (NYSE: PPG) today announced that it will invest $380 million to build a new aerospace coatings and sealants manufacturing facility in Shelby, N.C. Construction on the 62-acre site, which will initially include manufacturing and warehousing units, is set to commence in October 2025 and is expected to be completed in the first half of 2027. The 198,000-square-foot facility will enable the company to continue meeting the growing demands of the aerospace industry. It will employ more than 110 people and produce the full line of PPG’s aerospace coatings and sealants. The additional capacity of this new plant, combined with nearby transport links that improve supply chain and shipping logistics, will help improve service levels for customers. “PPG’s investment in this new manufacturing facility demonstrates the significant demand growth for our world-class technologies and our continued commitment to serving our aerospace customers,” said Tim Knavish, PPG chairman and chief executive off

WHOOP Unveils WHOOP® 5.0 and WHOOP® MG: Powerful New Devices with Breakthrough Health and Longevity Features8.5.2025 17:00:00 EEST | Press release

WHOOP, the human performance company, today introduces WHOOP 5.0 and WHOOP MG — two next-generation wearables designed to unlock a new approach to personal health and longevity. Paired with a redesigned WHOOP experience, the devices offer 14-day battery life in a sleeker, seven percent smaller form - and introduce category-defining features, including Healthspan with WHOOP Age, Heart Screener with on-demand ECG, Blood Pressure Insights, and more. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20250508546933/en/ WHOOP Unveils Next Generation WHOOP® 5.0 and WHOOP® MG These innovations arrive at a pivotal moment when ailing health systems cost more and deliver less. WHOOP is advancing a new solution and a better way - one that empowers people to connect their daily decisions to performance and health outcomes that can be measured and felt. While others track surface-level trends, WHOOP delivers longevity through depth — translati

Seoul Semiconductor Closes in on OSRAM for Global No.28.5.2025 16:00:00 EEST | Press release

Seoul Semiconductor Co., Ltd. (KOSDAQ: 046890), a leading global innovator of LED products and technology, announced that it has achieved a remarkable milestone by maintaining stable growth despite a downturn in the LED industry, narrowing the market share gap with global No.2 player ams OSRAM to just 1 percentage point. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20250508658995/en/ "2024 Global LED Market Share Rankings" (Source: Omdia) (Image: Seoul Semiconductor) According to the recently published “2024 Global LED Market Share Rankings” by market research firm Omdia, Seoul Semiconductor was the only company among the global top three to sustain both revenue and market share, while industry leaders Nichia and ams OSRAM experienced significant revenue declines amid the market slowdown. This achievement is underpinned by Seoul Semiconductor’s robust technological competitiveness, driven by its commitment to innovation even

In our pressroom you can read all our latest releases, find our press contacts, images, documents and other relevant information about us.

Visit our pressroom
World GlobeA line styled icon from Orion Icon Library.HiddenA line styled icon from Orion Icon Library.Eye