Business Wire

UAE’s Falcon 40B Dominates Leaderboard: Ranks #1 Globally in Latest Hugging Face Independent Verification of Open-source AI Models

29.5.2023 11:52:00 EEST | Business Wire | Press release

Share

Falcon 40B, the UAE’s first large-scale open-source, 40-billion-parameter AI model launched by Abu Dhabi’s Technology Innovation Institute (TII) last week, soared to the top spot on Hugging Face’s latest Open Large Language Model (LLM) Leaderboard. Hugging Face, an American company seeking to democratize artificial intelligence through open-source and open science, is considered the world’s definitive independent verifier of AI models.

This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20230529005045/en/

To view this piece of content from mms.businesswire.com, please give your consent at the top of this page.

Falcon 40B ranks 1st globally in Hugging Face Open LLM Leaderboard. (Graphic: AETOSWire)

Falcon 40B managed to beat back established models such as LLaMA from Meta (including its 65B model), StableLM from Stability AI, and RedPajama from Together to achieve the coveted ranking. The index utilizes four key benchmarks from the Eleuther AI Language Model Evaluation Harness, a consolidated framework that assesses generative language models on: the AI2 Reasoning Challenge (25-shot), a set of grade-school science questions; HellaSwag (10-shot), a test of common sense inference, which is easy for humans but challenging for SOTA models; MMLU (5-shot), a test to measure a text model’s multitask accuracy; and TruthfulQA (0-shot), a test to measure whether a language model is truthful in generating answers to questions.

Hugging Face’s Open LLM Leaderboard is an objective evaluation tool open to the AI community that tracks, ranks, and evaluates LLMs and chatbots as they are launched.

Trained on one trillion tokens, Falcon 40B marks a significant turning point for the UAE in its journey towards AI leadership, enabling widespread access to the model's weights for both research and commercial utilization. The new ranking confirms the model’s prowess in making AI more transparent, inclusive, and accessible for the greater good of humanity.

With this latest development, TII has managed to secure the UAE a seat at the table when it comes to generative AI models, allowing it to join an exclusive list of countries that are working to drive AI innovation and collaboration.

TII has already embarked work on its next version of Falcon - the 180B AI model. To learn more about the current open sourced Falcon 40B AI model, please visit: FalconLLM.TII.ae. The initial announcement on Falcon 40B can be found here: UAE's Technology Innovation Institute Launches Open-Source "Falcon 40B" Large Language Model for Research & Commercial Utilization.

For more information, visit www.tii.ae

*Source: AETOSWire

To view this piece of content from cts.businesswire.com, please give your consent at the top of this page.

Contact information

Jennifer Dewan
Senior Director of Communications
jennifer.dewan@tii.ae

About Business Wire

For more than 50 years, Business Wire has been the global leader in press release distribution and regulatory disclosure.

Subscribe to releases from Business Wire

Subscribe to all the latest releases from Business Wire by registering your e-mail address below. You can unsubscribe at any time.

Latest releases from Business Wire

Vonage Expands Developer Ecosystem to Accelerate the Next Era of Enterprise Digital Transformation19.2.2026 11:08:00 EET | Press release

Vonage, part of Ericsson (NASDAQ: ERIC), today announced a number of new initiatives, to further expand and enable its global developer community. By growing its developer community and tools, giving developers and enterprises access to new mobile network insights and capabilities that can be embedded into existing applications and workflows, Vonage is accelerating its strategy to transform mobile networks into a programmable platform. Putting Vonage APIs in the Hands of 40 million+ More Developers Vonage is announcing the launch of its Verified Workspace on the Postman API Network, making Vonage APIs available to the 40+ million developers who use Postman. The new, verified workspace gives Postman’s community of global developer members certified access to Vonage’s portfolio of Network APIs, Communications APIs, and developer tools - enabling them to copy, or fork, collections into their own workspaces and build, test, and collaborate in real time with other developers and teams. This

Brenus Pharma Presented New Preclinical and Early Clinical Data in Colorectal Cancer During AACR Immuno-Oncology Conference 202619.2.2026 09:50:00 EET | Press release

Brenus Pharma, a clinical-stage biotechnology company pioneering in vivo “off-the-shelf” immunotherapies, announced the selection of an abstract for a poster presentation at the American Association for Cancer Research Immuno-Oncology (AACRIO) annual conference (February 18-21, 2026), in Los Angeles. The poster showcases new preclinical data and early clinical insights from the company's lead candidate, STC-1010, supporting it as a promising therapeutic option for MSS, immune-cold colorectal cancer (CRC)—a major unmet clinical need. In a syngeneic CT26 model, murine STC-1010 significantly inhibits tumor growth and extends survival by 40%. New mechanistic datas reveal expansion of CD4⁺ and CD8⁺ T cells in draining lymph nodes alongside a significant humoral response compared to control, demonstrating coordinated cellular and humoral immune activation in vivo. Other findings show that dendritic cells efficiently capture STC-1010, leading to their activation as demonstrated by transcripto

ENHERTU ® Type II Variation Application Validated in the EU as Post-Neoadjuvant Treatment for Patients with HER2 Positive Early Breast Cancer19.2.2026 09:30:00 EET | Press release

The European Medicines Agency (EMA) has validated the Type II Variation marketing authorization application for ENHERTU® (trastuzumab deruxtecan) as a monotherapy for adult patients with HER2 positive (immunohistochemistry [IHC] 3+ or in-situ hybridization [ISH]+) breast cancer who have residual invasive disease after neoadjuvant HER2 targeted treatment. ENHERTU is a specifically engineered HER2 directed DXd antibody drug conjugate (ADC) discovered by Daiichi Sankyo (TSE: 4568) and being jointly developed and commercialized by Daiichi Sankyo and AstraZeneca (LSE/STO/NYSE: AZN). The validation confirms the completion of the application and commences the scientific review process by the EMA’s Committee for Medicinal Products for Human Use (CHMP). The application is based on data from the DESTINY-Breast05 phase 3 trial presented at the 2025 European Society for Medical Oncology (#ESMO25) Congress and subsequently published in The New England Journal of Medicine. In the trial, ENHERTU demo

NHOA Energy Advances Delivery of the 51 MWh Broni Battery for Neoen in Italy19.2.2026 09:00:00 EET | Press release

NHOA Energy, global provider of utility-scale energy storage systems, is advancing the construction of the Broni Battery, a 10 MW/51 MWh project located in Broni (Pavia), Italy, executed for Neoen, one of the world’s leading independent producers of renewable energy. Civil works at the site are currently underway and advancing on schedule, with commissioning activities planned to start in late spring 2026. Once operational, the Broni Battery will provide essential flexibility services to the Italian electricity system, supporting grid stability and reliability as renewable energy penetration continues to increase. The project represents Neoen’s first battery storage asset in Italy and marks a strategic step in the company’s expansion, as it continues to grow its battery storage portfolio in key markets worldwide to support power system flexibility. The Broni Battery further strengthens NHOA Energy’s long-term commitment to the Italian market, where the company has deep industrial roots

Compass Pathways Announces Pricing of $150 Million Public Offering19.2.2026 03:18:00 EET | Press release

Compass Pathways plc (Nasdaq: CMPS), a biotechnology company dedicated to accelerating patient access to evidence-based innovation, announced today the pricing of its public offering of 17,500,000 American Depositary Shares (“ADSs”) at a public offering price of $8.00 per ADS, each representing one ordinary share, and in lieu of ADSs, to certain institutional investors, pre-funded warrants to purchase up to 1,250,000 ADSs at a public offering price of $7.9999 per pre-funded warrant. In addition, Compass Pathways has granted the underwriters a 30-day option to purchase up to an additional 2,812,500 ADSs at the public offering price, less the underwriting discounts and commissions. All of the securities are to be sold by Compass Pathways. The offering is expected to close on or about February 20, 2026, subject to the satisfaction of customary closing conditions. The gross proceeds to Compass Pathways from the offering, before deducting underwriting discounts and commissions and other est

In our pressroom you can read all our latest releases, find our press contacts, images, documents and other relevant information about us.

Visit our pressroom
World GlobeA line styled icon from Orion Icon Library.HiddenA line styled icon from Orion Icon Library.Eye