Business Wire

UAE’s Falcon 40B Dominates Leaderboard: Ranks #1 Globally in Latest Hugging Face Independent Verification of Open-source AI Models

29.5.2023 11:52:00 EEST | Business Wire | Press release

Share

Falcon 40B, the UAE’s first large-scale open-source, 40-billion-parameter AI model launched by Abu Dhabi’s Technology Innovation Institute (TII) last week, soared to the top spot on Hugging Face’s latest Open Large Language Model (LLM) Leaderboard. Hugging Face, an American company seeking to democratize artificial intelligence through open-source and open science, is considered the world’s definitive independent verifier of AI models.

This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20230529005045/en/

To view this piece of content from mms.businesswire.com, please give your consent at the top of this page.

Falcon 40B ranks 1st globally in Hugging Face Open LLM Leaderboard. (Graphic: AETOSWire)

Falcon 40B managed to beat back established models such as LLaMA from Meta (including its 65B model), StableLM from Stability AI, and RedPajama from Together to achieve the coveted ranking. The index utilizes four key benchmarks from the Eleuther AI Language Model Evaluation Harness, a consolidated framework that assesses generative language models on: the AI2 Reasoning Challenge (25-shot), a set of grade-school science questions; HellaSwag (10-shot), a test of common sense inference, which is easy for humans but challenging for SOTA models; MMLU (5-shot), a test to measure a text model’s multitask accuracy; and TruthfulQA (0-shot), a test to measure whether a language model is truthful in generating answers to questions.

Hugging Face’s Open LLM Leaderboard is an objective evaluation tool open to the AI community that tracks, ranks, and evaluates LLMs and chatbots as they are launched.

Trained on one trillion tokens, Falcon 40B marks a significant turning point for the UAE in its journey towards AI leadership, enabling widespread access to the model's weights for both research and commercial utilization. The new ranking confirms the model’s prowess in making AI more transparent, inclusive, and accessible for the greater good of humanity.

With this latest development, TII has managed to secure the UAE a seat at the table when it comes to generative AI models, allowing it to join an exclusive list of countries that are working to drive AI innovation and collaboration.

TII has already embarked work on its next version of Falcon - the 180B AI model. To learn more about the current open sourced Falcon 40B AI model, please visit: FalconLLM.TII.ae. The initial announcement on Falcon 40B can be found here: UAE's Technology Innovation Institute Launches Open-Source "Falcon 40B" Large Language Model for Research & Commercial Utilization.

For more information, visit www.tii.ae

*Source: AETOSWire

To view this piece of content from cts.businesswire.com, please give your consent at the top of this page.

Contact information

Jennifer Dewan
Senior Director of Communications
jennifer.dewan@tii.ae

About Business Wire

For more than 50 years, Business Wire has been the global leader in press release distribution and regulatory disclosure.

Subscribe to releases from Business Wire

Subscribe to all the latest releases from Business Wire by registering your e-mail address below. You can unsubscribe at any time.

Latest releases from Business Wire

Andersen Consulting Strengthens Digital Transformation Capabilities with Weexa30.4.2026 16:30:00 EEST | Press release

Andersen Consulting enters into a Collaboration Agreement with Weexa, a global provider of digital transformation, B2B integration, and supply chain digitalization solutions. Headquartered in France, Weexa delivers end-to-end services that help organizations streamline, secure, and scale their digital ecosystems. The firm specializes in B2B data-flow management and digitalization, enabling seamless communication between applications both within and across organizations through technologies such as EDI, APIs, and e-invoicing. Weexa also provides SAP integration and supply chain solutions spanning warehouse and transport management, alongside strategic consulting, project delivery, and third-party application maintenance. Serving organizations across the food, retail, wholesale, logistics, transportation, automotive, healthcare, and media sectors, Weexa supports global businesses in optimizing performance while meeting evolving regulatory and digital-compliance requirements. “Collaborati

SINOVAC Files Annual Report on Form 20-F for the Fiscal Year 202430.4.2026 16:22:00 EEST | Press release

Sinovac Biotech Ltd. (Nasdaq: SVA) (“SINOVAC” or the “Company”), a leading provider of biopharmaceutical products in China, today announced that it filed its annual report on Form 20-F for the fiscal year ended December 31, 2024 (the “Annual Report”) with the U.S. Securities and Exchange Commission (“SEC”). An electronic copy of the Annual Report can be accessed on SINOVAC’s investor relations website at https://www.sinovac.com/en-us/Investors and on the SEC’s website at www.sec.gov. About SINOVAC Sinovac Biotech Ltd. (SINOVAC) is a China-based global biopharmaceutical company, with a mission of “supply vaccines to eliminate human diseases”, the company specializes in the research, development, manufacturing and commercialization of vaccines and related biological products that protect against human infectious diseases. The company’s diversified portfolio includes vaccines for influenza, viral hepatitis, varicella, Hand-Foot-Mouth disease (HFMD), poliomyelitis, pneumococcal disease, et

Experian Announces Agent Trust to Power Trusted AI Driven Commerce30.4.2026 16:00:00 EEST | Press release

Experian today announced Experian Agent Trust™, a first-of-its-kind framework that establishes a secure, verifiable link between consumers and AI agents, bringing identity, and accountability to AI-driven transactions. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20260430719198/en/ Experian Announces Agent Trust to Power Trusted AI Driven Commerce. As AI agents begin to search and transact autonomously, they introduce a fundamental challenge for businesses: how to trust an action when it is no longer driven by a human. Without a verified connection between humans and AI agents, autonomous commerce introduces new risks in fraud, misrepresentation, and unauthorized transactions. Experian Agent Trust addresses this challenge through a new “Know Your Agent” (KYA) framework, extending identity verification into the age of AI. The framework ensures that agent-initiated transactions are grounded in verified consumer identity. “Agen

Meet the AI-powered fan companion: TGR Haas F1 Team RaceMate powered by Infobip30.4.2026 16:00:00 EEST | Press release

Global AI-first cloud communications platform Infobip and TGR Haas F1 team are launching ‘TGR Haas F1 Team RaceMate powered by Infobip’, an AI-powered conversational fan companion on Apple Messages for Business and WhatsApp Business Platform. Always-on and always available, it delivers race intelligence for TGR Haas F1 Team: team race intelligence, grid positions, qualifying outcomes, sprint results, and full session schedules. The AI agent tracks drivers Ollie Bearman and Esteban Ocon with their individual championship standings and performance data in conversational format. The agent draws from a knowledge base covering driver biographies, team history, and circuit data, adapting to each user's knowledge level. Every interaction begins with a schedule check, ensuring fans always know what's happening right now. It proactively surfaces relevant information, suggests next actions, and maintains conversational flow to keep every interaction effortless and engaging. Delivered in short, s

Merck Announces First Dose in Phase 3 Study with Enpatoran for Lupus Patients with Active Skin Manifestations30.4.2026 15:05:00 EEST | Press release

Merck, a leading global science and technology company, today announced the first patient was dosed in the Phase 3 program, ELOWEN-1 (NCT07332481) and ELOWEN-2 (NCT07355218), evaluating enpatoran in people living with lupus who experience active skin manifestations. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20260430733656/en/ David Weinreich, Global Head of R&D and Chief Medical Officer “People living with lupus continue to face significant challenges in achieving disease control and are very often affected by itchy, painful and stigmatized skin manifestations,” said David Weinreich, Global Head of R&D, Merck. “With enpatoran, we aim to target the underlying drivers of lupus and redefine how to approach the disease by understanding both visible skin manifestations and systemic activity.” Enpatoran is an oral selective toll-like receptor (TLR) 7/8 inhibitor designed to modulate pathways central to lupus-related inflammatio

In our pressroom you can read all our latest releases, find our press contacts, images, documents and other relevant information about us.

Visit our pressroom
World GlobeA line styled icon from Orion Icon Library.HiddenA line styled icon from Orion Icon Library.Eye