UAE’s Falcon 40B Dominates Leaderboard: Ranks #1 Globally in Latest Hugging Face Independent Verification of Open-source AI Models
29.5.2023 11:52:00 EEST | Business Wire | Press release
Falcon 40B, the UAE’s first large-scale open-source, 40-billion-parameter AI model launched by Abu Dhabi’s Technology Innovation Institute (TII) last week, soared to the top spot on Hugging Face’s latest Open Large Language Model (LLM) Leaderboard. Hugging Face, an American company seeking to democratize artificial intelligence through open-source and open science, is considered the world’s definitive independent verifier of AI models.
This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20230529005045/en/
Falcon 40B ranks 1st globally in Hugging Face Open LLM Leaderboard. (Graphic: AETOSWire)
Falcon 40B outperformed established models such as LLaMA from Meta (including its 65B model), StableLM from Stability AI, and RedPajama from Together to achieve the coveted ranking. The leaderboard uses four key benchmarks from the EleutherAI Language Model Evaluation Harness, a unified framework for assessing generative language models: the AI2 Reasoning Challenge (25-shot), a set of grade-school science questions; HellaSwag (10-shot), a test of commonsense inference that is easy for humans but challenging for state-of-the-art (SOTA) models; MMLU (5-shot), a test of a text model’s multitask accuracy; and TruthfulQA (0-shot), a test of whether a language model is truthful in generating answers to questions.
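For readers who want a concrete picture of how a four-benchmark leaderboard of this kind could be assembled, the following is a minimal sketch only. The release does not state how the four scores are combined, so the unweighted mean used here is an assumption, as are the function name `rank_models`, the model names, and every number in `example_scores`, which are placeholders and not real results.

```python
from statistics import mean

# Few-shot settings as listed in the release.
BENCHMARK_SHOTS = {
    "ARC": 25,         # AI2 Reasoning Challenge
    "HellaSwag": 10,
    "MMLU": 5,
    "TruthfulQA": 0,
}


def rank_models(scores):
    """Return (model, average) pairs sorted from best to worst.

    Assumption: models are ranked by the unweighted mean of the four
    benchmark scores. The release does not confirm this rule.
    """
    averages = {}
    for model, per_benchmark in scores.items():
        missing = set(BENCHMARK_SHOTS) - set(per_benchmark)
        if missing:
            raise ValueError(f"{model} is missing scores for {sorted(missing)}")
        averages[model] = mean(per_benchmark[name] for name in BENCHMARK_SHOTS)
    return sorted(averages.items(), key=lambda item: item[1], reverse=True)


# Hypothetical placeholder scores, NOT real leaderboard results.
example_scores = {
    "model_a": {"ARC": 60.0, "HellaSwag": 80.0, "MMLU": 50.0, "TruthfulQA": 40.0},
    "model_b": {"ARC": 55.0, "HellaSwag": 85.0, "MMLU": 55.0, "TruthfulQA": 45.0},
}

ranking = rank_models(example_scores)
for position, (model, average) in enumerate(ranking, start=1):
    print(f"{position}. {model}: {average:.1f}")
```

With these placeholder inputs, `model_b` is listed first because its mean across the four benchmarks is higher, even though `model_a` scores better on ARC alone.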
Hugging Face’s Open LLM Leaderboard is an objective evaluation tool open to the AI community that tracks, ranks, and evaluates LLMs and chatbots as they are launched.
Trained on one trillion tokens, Falcon 40B marks a significant turning point for the UAE in its journey towards AI leadership, enabling widespread access to the model's weights for both research and commercial utilization. The new ranking confirms the model’s prowess in making AI more transparent, inclusive, and accessible for the greater good of humanity.
With this latest development, TII has managed to secure the UAE a seat at the table when it comes to generative AI models, allowing it to join an exclusive list of countries that are working to drive AI innovation and collaboration.
TII has already begun work on the next version of Falcon, the 180B AI model. To learn more about the current open-source Falcon 40B AI model, please visit: FalconLLM.TII.ae. The initial announcement on Falcon 40B can be found here: UAE's Technology Innovation Institute Launches Open-Source "Falcon 40B" Large Language Model for Research & Commercial Utilization.
For more information, visit www.tii.ae
*Source: AETOSWire
View source version on businesswire.com: https://www.businesswire.com/news/home/20230529005045/en/
Contact information
Jennifer Dewan
Senior Director of Communications
jennifer.dewan@tii.ae
