UAE’s Falcon 40B Dominates Leaderboard: Ranks #1 Globally in Latest Hugging Face Independent Verification of Open-source AI Models
Falcon 40B, the UAE’s first large-scale open-source, 40-billion-parameter AI model launched by Abu Dhabi’s Technology Innovation Institute (TII) last week, soared to the top spot on Hugging Face’s latest Open Large Language Model (LLM) Leaderboard. Hugging Face, an American company seeking to democratize artificial intelligence through open-source and open science, is considered the world’s definitive independent verifier of AI models.
This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20230529005045/en/
Falcon 40B ranks 1st globally in Hugging Face Open LLM Leaderboard. (Graphic: AETOSWire)
Falcon 40B managed to beat back established models such as LLaMA from Meta (including its 65B model), StableLM from Stability AI, and RedPajama from Together to achieve the coveted ranking. The index utilizes four key benchmarks from the Eleuther AI Language Model Evaluation Harness, a consolidated framework that assesses generative language models on: the AI2 Reasoning Challenge (25-shot), a set of grade-school science questions; HellaSwag (10-shot), a test of common sense inference, which is easy for humans but challenging for SOTA models; MMLU (5-shot), a test to measure a text model’s multitask accuracy; and TruthfulQA (0-shot), a test to measure whether a language model is truthful in generating answers to questions.
Hugging Face’s Open LLM Leaderboard is an objective evaluation tool open to the AI community that tracks, ranks, and evaluates LLMs and chatbots as they are launched.
Trained on one trillion tokens, Falcon 40B marks a significant turning point for the UAE in its journey towards AI leadership, enabling widespread access to the model's weights for both research and commercial utilization. The new ranking confirms the model’s prowess in making AI more transparent, inclusive, and accessible for the greater good of humanity.
With this latest development, TII has managed to secure the UAE a seat at the table when it comes to generative AI models, allowing it to join an exclusive list of countries that are working to drive AI innovation and collaboration.
TII has already embarked work on its next version of Falcon - the 180B AI model. To learn more about the current open sourced Falcon 40B AI model, please visit: FalconLLM.TII.ae. The initial announcement on Falcon 40B can be found here: UAE's Technology Innovation Institute Launches Open-Source "Falcon 40B" Large Language Model for Research & Commercial Utilization.
For more information, visit www.tii.ae
*Source: AETOSWire
To view this piece of content from cts.businesswire.com, please give your consent at the top of this page.
View source version on businesswire.com: https://www.businesswire.com/news/home/20230529005045/en/
Contact information
Jennifer Dewan
Senior Director of Communications
jennifer.dewan@tii.ae
About Business Wire
For more than 50 years, Business Wire has been the global leader in press release distribution and regulatory disclosure.
Subscribe to releases from Business Wire
Subscribe to all the latest releases from Business Wire by registering your e-mail address below. You can unsubscribe at any time.
Latest releases from Business Wire
Transition Industries Signs Strategic Agreements for the Pacifico Mexinol Project, the Largest Standalone Ultra-Low Carbon Chemical Production Facility in the World30.6.2025 21:30:00 EEST | Press release
Transition Industries LLC, a developer of world-scale, net-zero carbon emissions methanol and green hydrogen projects in North America, held a signing event for an Engineering, Procurement, and Construction (EPC) contract with the consortium of Samsung E&A Co., Ltd. (Samsung E&A), Grupo Samsung E&A Mexico, S.A. de C.V., and Techint Engineering and Construction for the Pacifico Mexinol project located in Ahome, Sinaloa, Mexico, which is contingent upon the fulfillment of customary conditions precedent and obtainment of all required approvals. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20250630940954/en/ MAIRE group’s technology division NextChem, through its subsidiary KT TECH SpA, also signed a Basic Engineering, Critical and Proprietary Equipment Supply Agreement with Samsung E&A in connection with its proprietary NX AdWinMethanol®Zero technology supply to the project. Transition Industries is jointly developing the Pacif
Westinghouse and ITER Sign a $180M Contract to Advance Nuclear Fusion30.6.2025 16:45:00 EEST | Press release
Westinghouse Electric Company and ITER signed a contract for $180 million for the assembly of the vacuum vessel for the fusion reactor. This is a key milestone in the construction of the ITER reactor, leading the way toward the use of fusion as a practical future source of reliable carbon-free energy. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20250630497810/en/ The ITER Tokamak pit with the two vacuum vessel sector modules installed. Westinghouse has participated in the fabrication of the sectors of the vacuum vessel, as part of the Fusion for Energy (F4E) Consortium with its partners Ansaldo Nucleare and Walter Tosto. Westinghouse will be responsible for completing the vacuum vessel which is ITER’s most critical component: a hermetically sealed, double-walled steel container that will house the fusion plasma. When all the vacuum vessel sectors are in place, Westinghouse will start the most intensive stage of ITER assembl
Monetate Acquires SiteSpect to Deliver AI-Native Personalization and Testing at Enterprise Scale30.6.2025 16:00:00 EEST | Press release
Monetate, the leading AI-driven personalization platform, today announced it has acquired SiteSpect, a leader in A/B testing, to drive next-generation digital experience optimization. This acquisition accelerates Monetate’s vision to deliver intelligent, intentional, and individualized experiences at scale, powered by agentic AI and backed by the industry’s most advanced, enterprise-grade infrastructure. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20250630514541/en/ The combination of Monetate’s real-time personalization and SiteSpect’s zero-flicker testing will yield an industry-first solution for enterprise-grade personalization, testing, and optimization. For the first time, global ecommerce and digital experience leaders can access unified client-side and server-side experimentation with full personalization capabilities in a single solution, designed for scalability and regulatory compliance. Monetate is now the only e
SS&C Blue Prism Recognized as a Gartner® Magic Quadrant™ RPA Leader for the Seventh Consecutive Year30.6.2025 16:00:00 EEST | Press release
SS&C Technologies Holdings, Inc. (Nasdaq: SSNC) today announced that SS&C Blue Prism has been recognized as a Leader in the 2025 Gartner Magic Quadrant for Robotic Process Automation (RPA). “We’re delighted SS&C Blue Prism has been named a Leader in the Gartner Magic Quadrant for Robotic Process Automation for the seventh year running,” said Bill Stone, CEO and Chairman of SS&C Technologies. “SS&C Blue Prism combines market-leading RPA and orchestration technologies with the latest artificial intelligence so organizations can tackle more complex tasks and dynamic business processes. We’ve scaled to more than 2,700 digital workers and AI agents across our own operations, resulting in over $200 million in annual savings. With SS&C leading the charge on deployment, customers can be confident in rolling out SS&C’s automation solutions securely, effectively, and responsibly.” More than 2,800 companies worldwide leverage SS&C Blue Prism for AI-powered automation, helping organizations delive
Takeda Announces U.S. FDA Approval of GAMMAGARD LIQUID ERC, the Only Ready-to-Use Liquid Immunoglobulin Therapy with Low Immunoglobulin A (IgA) Content 130.6.2025 15:00:00 EEST | Press release
Takeda(TSE:4502/NYSE:TAK) today announced that the U.S. Food and Drug Administration (FDA) has approved GAMMAGARD LIQUID ERC [immune globulin infusion (human)] with less than or equal to 2 µg/mL IgA in a 10% solution, the only ready-to-use liquid immunoglobulin (IG) therapy with low immunoglobulin A (IgA) content, as replacement therapy for people two years of age and older with primary immunodeficiency (PI). As a ready-to-use liquid, GAMMAGARD LIQUID ERC may help ease the administration burden for patients and their health care providers by eliminating the need for reconstitution and can be administered intravenously or subcutaneously.1 “The approval of GAMMAGARD LIQUID ERC reinforces our commitment to supporting individualized treatment approaches for people with primary immunodeficiency, including a therapeutic option that has the lowest IgA content of any ready-to-use liquid immunoglobulin therapy, and can be administered intravenously or subcutaneously,” said Kristina Allikmets, s
In our pressroom you can read all our latest releases, find our press contacts, images, documents and other relevant information about us.
Visit our pressroom