Business Wire

UAE’s Falcon 40B Dominates Leaderboard: Ranks #1 Globally in Latest Hugging Face Independent Verification of Open-source AI Models

29.5.2023 11:52:00 EEST | Business Wire | Press release

Falcon 40B, the UAE’s first large-scale open-source, 40-billion-parameter AI model launched by Abu Dhabi’s Technology Innovation Institute (TII) last week, soared to the top spot on Hugging Face’s latest Open Large Language Model (LLM) Leaderboard. Hugging Face, an American company seeking to democratize artificial intelligence through open-source and open science, is considered the world’s definitive independent verifier of AI models.

This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20230529005045/en/

Falcon 40B ranks 1st globally in Hugging Face Open LLM Leaderboard. (Graphic: AETOSWire)

Falcon 40B outperformed established models such as LLaMA from Meta (including its 65B model), StableLM from Stability AI, and RedPajama from Together to achieve the coveted ranking. The leaderboard uses four key benchmarks from the EleutherAI Language Model Evaluation Harness, a consolidated framework for assessing generative language models: the AI2 Reasoning Challenge (25-shot), a set of grade-school science questions; HellaSwag (10-shot), a test of commonsense inference that is easy for humans but challenging for state-of-the-art (SOTA) models; MMLU (5-shot), a test of a text model’s multitask accuracy; and TruthfulQA (0-shot), a test of whether a language model generates truthful answers to questions.

Hugging Face’s Open LLM Leaderboard is an objective evaluation tool open to the AI community that tracks, ranks, and evaluates LLMs and chatbots as they are launched.
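As an illustration of how four benchmark results could be combined into a single ranking, the sketch below records the few-shot settings named in this release and orders models by the unweighted mean of their four scores. The release does not state how the scores are combined, so the averaging rule is an assumption, and the model names and scores are placeholder values rather than actual leaderboard results.

```python
from statistics import mean

# Few-shot settings for the four benchmarks, as listed in the release.
FEW_SHOT = {
    "ARC": 25,        # AI2 Reasoning Challenge
    "HellaSwag": 10,
    "MMLU": 5,
    "TruthfulQA": 0,
}


def leaderboard_average(scores):
    """Return the unweighted mean of the four benchmark scores.

    Assumption: models are ranked by a simple average. This rule is
    illustrative and is not taken from the release.
    """
    missing = set(FEW_SHOT) - set(scores)
    if missing:
        raise ValueError(f"missing benchmark scores: {sorted(missing)}")
    return mean(scores[name] for name in FEW_SHOT)


def rank_models(results):
    """Return model names sorted from highest to lowest average score."""
    return sorted(
        results,
        key=lambda model: leaderboard_average(results[model]),
        reverse=True,
    )


# Placeholder scores for two hypothetical models (not real results).
example = {
    "model-a": {"ARC": 60.0, "HellaSwag": 80.0, "MMLU": 50.0, "TruthfulQA": 40.0},
    "model-b": {"ARC": 55.0, "HellaSwag": 75.0, "MMLU": 45.0, "TruthfulQA": 45.0},
}

if __name__ == "__main__":
    for name in rank_models(example):
        print(name, leaderboard_average(example[name]))
```

Requiring all four scores before averaging keeps a model with a missing benchmark from being ranked on partial results.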

Trained on one trillion tokens, Falcon 40B marks a significant turning point for the UAE in its journey towards AI leadership. The model’s weights are widely accessible for both research and commercial use. The new ranking confirms the model’s prowess in making AI more transparent, inclusive, and accessible for the greater good of humanity.

With this latest development, TII has managed to secure the UAE a seat at the table when it comes to generative AI models, allowing it to join an exclusive list of countries that are working to drive AI innovation and collaboration.

TII has already begun work on its next version of Falcon, the 180B AI model. To learn more about the current open-source Falcon 40B AI model, please visit: FalconLLM.TII.ae. The initial announcement on Falcon 40B can be found here: UAE's Technology Innovation Institute Launches Open-Source "Falcon 40B" Large Language Model for Research & Commercial Utilization.

For more information, visit www.tii.ae

*Source: AETOSWire

Contact information

Jennifer Dewan
Senior Director of Communications
jennifer.dewan@tii.ae

About Business Wire

For more than 50 years, Business Wire has been the global leader in press release distribution and regulatory disclosure.
