UAE’s Falcon 40B Dominates Leaderboard: Ranks #1 Globally in Latest Hugging Face Independent Verification of Open-source AI Models
Falcon 40B, the UAE’s first large-scale open-source, 40-billion-parameter AI model launched by Abu Dhabi’s Technology Innovation Institute (TII) last week, soared to the top spot on Hugging Face’s latest Open Large Language Model (LLM) Leaderboard. Hugging Face, an American company seeking to democratize artificial intelligence through open-source and open science, is considered the world’s definitive independent verifier of AI models.
This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20230529005045/en/
Falcon 40B ranks 1st globally in Hugging Face Open LLM Leaderboard. (Graphic: AETOSWire)
Falcon 40B managed to beat back established models such as LLaMA from Meta (including its 65B model), StableLM from Stability AI, and RedPajama from Together to achieve the coveted ranking. The index utilizes four key benchmarks from the Eleuther AI Language Model Evaluation Harness, a consolidated framework that assesses generative language models on: the AI2 Reasoning Challenge (25-shot), a set of grade-school science questions; HellaSwag (10-shot), a test of common sense inference, which is easy for humans but challenging for SOTA models; MMLU (5-shot), a test to measure a text model’s multitask accuracy; and TruthfulQA (0-shot), a test to measure whether a language model is truthful in generating answers to questions.
Hugging Face’s Open LLM Leaderboard is an objective evaluation tool open to the AI community that tracks, ranks, and evaluates LLMs and chatbots as they are launched.
Trained on one trillion tokens, Falcon 40B marks a significant turning point for the UAE in its journey towards AI leadership, enabling widespread access to the model's weights for both research and commercial utilization. The new ranking confirms the model’s prowess in making AI more transparent, inclusive, and accessible for the greater good of humanity.
With this latest development, TII has managed to secure the UAE a seat at the table when it comes to generative AI models, allowing it to join an exclusive list of countries that are working to drive AI innovation and collaboration.
TII has already embarked work on its next version of Falcon - the 180B AI model. To learn more about the current open sourced Falcon 40B AI model, please visit: FalconLLM.TII.ae. The initial announcement on Falcon 40B can be found here: UAE's Technology Innovation Institute Launches Open-Source "Falcon 40B" Large Language Model for Research & Commercial Utilization.
For more information, visit www.tii.ae
*Source: AETOSWire
To view this piece of content from cts.businesswire.com, please give your consent at the top of this page.
View source version on businesswire.com: https://www.businesswire.com/news/home/20230529005045/en/
Contact information
Jennifer Dewan
Senior Director of Communications
jennifer.dewan@tii.ae
About Business Wire
For more than 50 years, Business Wire has been the global leader in press release distribution and regulatory disclosure.
Subscribe to releases from Business Wire
Subscribe to all the latest releases from Business Wire by registering your e-mail address below. You can unsubscribe at any time.
Latest releases from Business Wire
Biocytogen and Acepodia Expand Collaboration Through Option-based Evaluation Framework for First-in-Class Bispecific and Dual-Payload ADCs (BsAD2C)9.1.2026 14:00:00 EET | Press release
Biocytogen Pharmaceuticals (Beijing) Co., Ltd. (Biocytogen, SSE: 688796; HKEX: 02315) and Acepodia (6976:TT), today announced that the companies have entered into an option and license agreement designed to enable the structured evaluation of bispecific antibody-drug conjugate (BsADC) programs to further advance the development of dual-payload bispecific antibody-drug conjugates (BsAD2Cs). The agreement grants Acepodia an option to obtain an exclusive worldwide license from Biocytogen for two BsADC programs. Under the terms of the agreement, Biocytogen is eligible to receive an upfront option fee and, upon Acepodia’s exercise of the option, additional payments including option exercise fees, development, regulatory, and commercial milestone payments, as well as royalties on future product sales. The financial terms of the agreement were not disclosed. “This new agreement builds upon our recent co-development collaboration with Acepodia, which has focused on the evaluation and selection
Blockstream Capital Partners Announces Strategic Acquisition of Derivatives Trading Team from Numeus Group, Leveraging Strategic Partnership with Komainu9.1.2026 12:08:00 EET | Press release
Blockstream Capital Partners (“BCP”) today announced that it has entered into a strategic agreement to acquire a division within Numeus Group’s digital asset trading and investment business. The transaction includes the absorption of select Bitcoin focused trading strategies with a focus on yield generation as well as a ten person derivatives trading team led by Chief Investment Officer Deepak Gulati, a specialist in volatility and derivatives markets. Deepak Gulati, appointed Co-Chief Investment Officer of Blockstream Capital Management alongside Rodrigo Rodriguez, previously served as Global Head of Proprietary Trading at JPMorgan, before founding Argentiere Capital, a multibillion-dollar volatility-focused hedge fund. With a thesis that derivatives would drive Bitcoin and digital asset market maturity, he co-founded Numeus Group in 2021 to develop institutional-grade trading, risk management and market-structure capabilities. Komainu, an existing BCP strategic investment, has played
Autel Unveils Strategic Shift at CES 2026: Building the AI-Powered Infrastructure of Future Cities9.1.2026 11:44:00 EET | Press release
At the Consumer Electronics Show (CES), Autel Energy is presenting its latest AI-driven smart hardware and software systems for the future of intelligent urban operations. This year’s showcase highlights Autel’s shift from AI concept validation to scenario-based engineering deployment, emphasizing closed-loop execution across real-world infrastructure. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20260101511436/en/ Autel Energy New Product Debut at CES 2026 Focusing on two key domains — Smart Energy and Smart Inspection — the company is unveiling a range of new products. In Smart Energy, Autel is introducing V2G (Vehicle-to-Grid) AC charger for homes and DC charging solution for fleets, strengthening its multi-tier energy touchpoint strategy from households to commercial operations. Live demonstrations at the booth will feature a smart charging robot showcasing automated plug-in capabilities. In Smart Inspection, Autel will
Allianz and Anthropic Forge Global Partnership to Advance Responsible AI in Insurance9.1.2026 11:00:00 EET | Press release
Allianz SE and Anthropic today announced a global partnership to accelerate the adoption of responsible Artificial Intelligence (AI) at Allianz. The collaboration centers on three transformative projects within Allianz Group designed to empower Allianz employees and accelerate operations, while setting new benchmarks for accuracy. “With this partnership, Allianz is taking a decisive step to address critical AI challenges in insurance. Anthropic’s focus on safety and transparency complements our strong dedication to customer excellence and stakeholder trust. Together, we are building solutions that prioritize what matters most to our customers while setting new standards for innovation and resilience,” said Oliver Bäte, CEO of Allianz SE. “Insurance is an industry where the stakes of using AI are particularly high: the decisions can affect millions of people. Allianz and Anthropic both take that very seriously, and we look forward to working together to make insurance better for those w
MRM Health Secures FDA IND Clearance to Launch its Phase 2b Trial of MH002 in Mild-to-Moderate Ulcerative Colitis9.1.2026 08:00:00 EET | Press release
MRM Health NV, a clinical-stage biopharmaceutical company pioneering microbiome-based therapeutics for inflammatory diseases and immune-oncology, today announced that the U.S. Food and Drug Administration (FDA) has granted clearance of its Investigational New Drug (IND) application for its lead program MH002. This enables the initiation of the STARFISH-UC Phase 2b clinical trial in patients with mild-to-moderate ulcerative colitis, marking a significant step forward in the development of next-generation therapies for inflammatory bowel diseases (IBD). About the STARFISH-UC Phase 2b Trial The STARFISH-UC trial is a randomized, double-blind, placebo-controlled study designed to confirm the promising efficacy signals and favorable safety profile previously observed in MH002’s Phase 2a studies. MH002, a rationally designed live microbial consortium, is the most advanced Live Biotherapeutic Product (LBP) of its kind, targeting disease-specific mechanisms through a synergistic combination of
In our pressroom you can read all our latest releases, find our press contacts, images, documents and other relevant information about us.
Visit our pressroom
