AI Alignment Lab Achieves Major Milestone in Step Towards Agentic AI
Aligned AI, a leader in artificial intelligence (AI) research, has announced a groundbreaking AI advancement in misgeneralization, a critical challenge in the field of AI. It is the first to surpass a key benchmark called CoinRun by teaching an AI to “think” in human-like concepts. The technology underpinning the achievement opens the door to more precise, reliable, and controllable AI for a wide variety of real world applications.
By teaching AI models to generalize in a manner more akin to agentic human cognition, Aligned AI’s innovation enables AI to correctly identify concepts across new situations and environments, reducing the need for prolonged production, testing, and retraining.
Misgeneralization occurs when AI systems learn incorrect patterns and behaviors from their training data, and are not able to correctly adapt when presented with new information. This leads to unexpected, and often harmful, outcomes. Today’s foundation models suffer from varying degrees of misgeneralization, as evidenced by users’ ability to “jailbreak” them, or there is a trade off between functionality and undesired behavior. The challenge of misgeneralization also prevents the industry as a whole from moving forward. For instance, generalization is required for truly autonomous vehicles and applying AI to critical applications. Otherwise, AIs cannot operate well enough in unfamiliar environments or discern the correct goals without human intervention.
To achieve this milestone, Aligned AI used the 2021 CoinRun misgeneralization benchmark, an Atari-style game released by researchers at Google DeepMind, the University of Cambridge, the University of Tubingen, and the University of Edinburgh. The goal of the benchmark is to test whether an AI can deduce a complex goal when that goal is spuriously correlated with a simpler goal in its training environment. The AI is rewarded for getting a coin, which is always placed at the end of the level during the training period, but is placed in a random location during the testing period, without additional reward information being provided.
Prior to Aligned AI’s innovation, AIs trained on CoinRun believed the best way to play the game was to go to the right, while avoiding monsters and holes. Because the coin was always at the end of the level during training, this strategy seemed effective. When the AI encountered a new level where the coin was placed elsewhere in the level but without being given new information, it would ignore the coin and either miss it or get it only by accident. ACE (which stands for “Algorithm for Concept Extrapolation”), the new AI developed by Aligned AI, notices the changes in the test environment and figures out to go for the coin, even without new reward information - just as a human would.
The key benefits of this breakthrough include:
- Enhanced Safety: By reducing misgeneralization, AI systems become more reliable, ensuring they operate safely in a wide range of scenarios, from autonomous vehicles to robotics.
- Improved Capabilities: It enables AI to better understand human intentions and make decisions that align with those intentions, significantly boosting its capabilities.
- Ethical AI: It enhances the ethical aspects of AI by promoting fairness, transparency, and non-discrimination. AI systems that are precise, reliable, and interpretable are more likely to make ethical decisions by avoiding bias and aligning with human values.
- Industry Impact: It’s poised to transform industries such as robotics, autonomous vehicles, and foundation models, making them more practical and applicable in various real-world settings.
“This isn't just a game-changer for the world of AI, it's a seismic shift for countless industries,” said Rebecca Gorman, Co-Founder and CEO of Aligned AI. “By significantly reducing misgeneralization and enhancing AI's ability to understand and adapt to unforeseen scenarios, we're opening doors to unparalleled opportunities across the board. From autonomous vehicles that can navigate from San Francisco to Phoenix on streets it's never seen before, to robots that can operate effectively in a range of changing and unforeseen environments, this benchmark is the linchpin that will make these futuristic visions a reality. It's not just about improving AI; it's about revolutionizing how industries operate, innovate, and serve humanity.”
Aligned AI’s innovation addresses a critical problem facing all AI systems. When confronted with new environments, current AIs tend to incorrectly extend the training data. This is why 70% of models don’t make it into production or face prolonged production and testing time, hindering scalability and often requiring retraining within the first year of release.
“As AI increases in power and widespread use, generalization remains a challenge,” said John Sviokla, a pioneering researcher in AI and current co-founder of GAI Insights, an advisory firm that helps companies achieve ROI with generative AI. “Aligned AI’s research is a critical step forward in the safe, ethical, and effective use of AI across industries.”
Since it was founded, Aligned AI has been at the forefront of addressing the critical challenges facing AI development and deployment. In 2022, Aligned AI was the leader in ChatGPT-jailbreak prevention, releasing the first prompt-evaluator as an open-source project. In September 2023, Aligned AI was awarded the CogX prize for the “Best Innovation in Mitigating Algorithm Bias” for EquitAI, an algorithm that constrains LLMs to output gender unbiased text, and faAIr, its algorithm for measuring and ranking gender bias in foundation models. Aligned AI’s previous work on concept extrapolation improves the performance of AI on out-of-distribution datasets and helps models behave safely while waiting for human feedback.
To learn more about Aligned AI and its misgeneralization breakthrough, please visit buildaligned.ai.
About Aligned AI:
Founded in Oxford by Rebecca Gorman and Dr. Stuart Armstrong, Aligned AI is a deep-tech startup that is enabling the next step change in AI by teaching AIs to understand and hold human-like concepts. Its core technology of “concept extrapolation” enables AIs to extend its trainers’ intent beyond its training data, meaning it operates as it should even in new scenarios. Aligned AI believes that safety and capability are not trade-offs, but rather an AI that is more precise and controllable is also more powerful.
To view this piece of content from cts.businesswire.com, please give your consent at the top of this page.
View source version on businesswire.com: https://www.businesswire.com/news/home/20230927032399/en/
Contact information
Media:
Alana Bannan
Matter Communications
360-975-1812
AlignedAI@matternow.com
About Business Wire
For more than 50 years, Business Wire has been the global leader in press release distribution and regulatory disclosure.
Subscribe to releases from Business Wire
Subscribe to all the latest releases from Business Wire by registering your e-mail address below. You can unsubscribe at any time.
Latest releases from Business Wire
SCENTMATIC's AI "KAORIUM" Debuts at THAMEEN Fragrance Launch in London's Selfridges4.7.2025 12:13:00 EEST | Press release
SCENTMATIC Inc., a leader in scent digitalization, introduced its AI-powered scent-to-language system, KAORIUM, at the THAMEEN Fragrance new product launch event. This pivotal event took place from June 5 to 11, 2025, at Selfridges department store in London, UK. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20250703662207/en/ State of exhibition Global Expansion: KAORIUM Establishes UK Presence Europe leads the global fragrance market, with the UK projected to reach US$2.82 billion by 2033. Recognizing this, SCENTMATIC is rapidly expanding its international footprint. In May 2024, SCENTMATIC established its overseas subsidiary, KAORIUM, in London, appointing industry expert Ben Yanoushek as CEO. Official UK operations commenced on February 1, 2025, with the launch of its dedicated website: www.kaorium.com. KAORIUM Trialed at "Florentine Diamond" Launch Event The "Florentine Diamond" launch event for luxury brand THAMEEN Frag
Global Tourism Surging Ahead of Economic Growth, With Visits to Hit 30 Billion by 20344.7.2025 02:00:00 EEST | Press release
The World Economic Forum has today published a new report forecasting that the travel and tourism industry is projected to serve 30 billion tourist trips by 2034. Travel and Tourism at a Turning Point: Principles for Transformative Growth, produced in collaboration with Kearney and the Ministry of Tourism Saudi Arabia, reveals a projected $16 trillion contribution to global GDP by the same year—representing more than 11% of the total world economy, according to World Travel & Tourism Council estimates. The report also found that the sector is expanding 1.5 times faster than the global economy, generating significant commercial opportunities as long as the mounting challenges of climate change, labour shortages and infrastructure gaps are addressed. Inbound and outbound trips increasing fast Asia is on track to become the world’s fastest-growing tourism economy, with the direct travel and tourism GDP contribution expected to exceed 7% across the region by 2034. Notably, India and China
The 2025-2026 World Branding Awards Animalis Edition Honouring Leading Pet and Animal Brands Globally3.7.2025 22:00:00 EEST | Press release
The 2025-2026 World Branding Awards Animalis Edition marked its fifth instalment, bringing together leading pet and animal brands from all around the world. These brands were celebrated for their outstanding achievements, earning recognition as National, Regional, and Global Winners. The awards ceremony, held at Vienna's prestigious Hofburg Palace, welcomed winners across diverse categories, including pet food, retail, wellness, pet exhibitions, and aquatic products. Mounia Berrada-Gouzi expertly hosted the evening, which culminated in a grand celebration of brand excellence. “The Animalis Edition of the World Branding Awards recognises brands that have achieved the highest distinction—genuine recognition in the hearts and minds of consumers. Tonight, we honour those whose names resonate globally, whose values inspire loyalty, and whose presence defines excellence in the pet and animal industry,” said Richard Rowles, Chairman of the World Branding Forum. Out of over 950 brands nominate
Venture Global Announces 20-Year Sales and Purchase Agreement with PETRONAS3.7.2025 15:59:00 EEST | Press release
Today, Venture Global, Inc. (NYSE: VG) announced the execution of a new 20-year Sales and Purchase Agreement (SPA) with PETRONAS LNG Ltd. (PLL), a subsidiary of the Malaysian state-owned oil and gas company, PETRONAS. Under the terms of the SPA, PETRONAS will purchase 1 million tonnes per annum (MTPA) of liquefied natural gas (LNG) from Venture Global’s third facility, CP2 LNG, for 20 years. This builds upon Venture Global’s existing agreement with PETRONAS for 1 MTPA of LNG supply from Plaquemines LNG. PETRONAS, a world-class partner in the LNG industry, joins other CP2 LNG customers in Europe, Asia and the rest of the world in a strategically important project to global energy supply and security. To date, approximately 10.75 MTPA of the 14.4 MTPA nameplate capacity for CP2 Phase One has been sold. About Venture Global Venture Global is a long-term, low-cost provider of U.S. LNG sourced from resource rich North American natural gas basins. Venture Global’s business includes assets ac
Frost & Sullivan Recognizes Novotech as 2025 Global Biotech CRO Company of the Year3.7.2025 15:05:00 EEST | Press release
In recognition of its innovation, client-focused delivery, and global impact, Novotech has been awarded the 2025 Global Biotechnology Contract Research Organization (CRO) Company of the Year by Frost & Sullivan. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20250703950144/en/ Novotech Wins Global CRO Award Novotech is a globally recognized full-service clinical CRO and scientific advisory firm, trusted by biotech and small- to mid-sized pharmaceutical companies to advance their drug development programs at every phase. With a global footprint spanning Asia-Pacific, North America, and Europe, Novotech supports over 5,000 clinical trial sites and a distributed team of experts delivering seamless, end-to-end solutions across geographies. “Novotech is redefining biotech-focused clinical research through AI-driven innovation, global expansion, and a client-embedded partnership model. With a clear vision to be the CRO of choice for
In our pressroom you can read all our latest releases, find our press contacts, images, documents and other relevant information about us.
Visit our pressroom