UAE’s Falcon 40B Dominates Leaderboard: Ranks #1 Globally in Latest Hugging Face Independent Verification of Open-source AI Models
Falcon 40B, the UAE’s first large-scale open-source, 40-billion-parameter AI model launched by Abu Dhabi’s Technology Innovation Institute (TII) last week, soared to the top spot on Hugging Face’s latest Open Large Language Model (LLM) Leaderboard. Hugging Face, an American company seeking to democratize artificial intelligence through open-source and open science, is considered the world’s definitive independent verifier of AI models.
This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20230529005045/en/
Falcon 40B ranks 1st globally in Hugging Face Open LLM Leaderboard. (Graphic: AETOSWire)
Falcon 40B outperformed established models such as LLaMA from Meta (including its 65B model), StableLM from Stability AI, and RedPajama from Together to achieve the coveted ranking. The leaderboard uses four key benchmarks from the EleutherAI Language Model Evaluation Harness, a consolidated framework that assesses generative language models on: the AI2 Reasoning Challenge (25-shot), a set of grade-school science questions; HellaSwag (10-shot), a test of common sense inference, which is easy for humans but challenging for state-of-the-art (SOTA) models; MMLU (5-shot), a test to measure a text model’s multitask accuracy; and TruthfulQA (0-shot), a test to measure whether a language model is truthful in generating answers to questions.
Hugging Face’s Open LLM Leaderboard is an objective evaluation tool open to the AI community that tracks, ranks, and evaluates LLMs and chatbots as they are launched.
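As a rough illustration of how a leaderboard built on these four benchmarks could order models, the following Python sketch ranks entries by the unweighted mean of their four scores. The averaging rule is an assumption here, since the release does not state how the benchmarks are combined. The function names and model names are hypothetical, and the scores are made-up placeholders, not actual leaderboard results.

```python
from statistics import mean

# Few-shot settings for each benchmark, as listed in the release.
BENCHMARK_SHOTS = {
    "ARC": 25,
    "HellaSwag": 10,
    "MMLU": 5,
    "TruthfulQA": 0,
}


def average_score(scores):
    """Unweighted mean over the four benchmarks (assumed aggregation rule)."""
    missing = set(BENCHMARK_SHOTS) - set(scores)
    if missing:
        raise ValueError(f"missing benchmark scores: {sorted(missing)}")
    return mean(scores[name] for name in BENCHMARK_SHOTS)


def rank_models(results):
    """Return model names sorted from highest to lowest average score."""
    return sorted(results, key=lambda m: average_score(results[m]), reverse=True)


# Placeholder numbers for two hypothetical models; NOT real leaderboard data.
example = {
    "model-a": {"ARC": 60.0, "HellaSwag": 80.0, "MMLU": 50.0, "TruthfulQA": 40.0},
    "model-b": {"ARC": 55.0, "HellaSwag": 85.0, "MMLU": 55.0, "TruthfulQA": 45.0},
}

print(rank_models(example))
```

With these placeholder values, model-b averages 60.0 against 57.5 for model-a, so it is listed first even though it scores lower on ARC.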
Trained on one trillion tokens, Falcon 40B marks a significant turning point for the UAE in its journey towards AI leadership, enabling widespread access to the model's weights for both research and commercial utilization. The new ranking confirms the model’s prowess in making AI more transparent, inclusive, and accessible for the greater good of humanity.
With this latest development, TII has managed to secure the UAE a seat at the table when it comes to generative AI models, allowing it to join an exclusive list of countries that are working to drive AI innovation and collaboration.
TII has already begun work on the next version of Falcon, the 180B AI model. To learn more about the current open-source Falcon 40B AI model, please visit: FalconLLM.TII.ae. The initial announcement on Falcon 40B can be found here: UAE's Technology Innovation Institute Launches Open-Source "Falcon 40B" Large Language Model for Research & Commercial Utilization.
For more information, visit www.tii.ae
*Source: AETOSWire
View source version on businesswire.com: https://www.businesswire.com/news/home/20230529005045/en/
Contact information
Jennifer Dewan
Senior Director of Communications
jennifer.dewan@tii.ae
