UAE’s Falcon 40B Dominates Leaderboard: Ranks #1 Globally in Latest Hugging Face Independent Verification of Open-source AI Models
Falcon 40B, the UAE’s first large-scale open-source, 40-billion-parameter AI model launched by Abu Dhabi’s Technology Innovation Institute (TII) last week, soared to the top spot on Hugging Face’s latest Open Large Language Model (LLM) Leaderboard. Hugging Face, an American company seeking to democratize artificial intelligence through open-source and open science, is considered the world’s definitive independent verifier of AI models.
This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20230529005045/en/
Falcon 40B ranks 1st globally in Hugging Face Open LLM Leaderboard. (Graphic: AETOSWire)
Falcon 40B managed to beat back established models such as LLaMA from Meta (including its 65B model), StableLM from Stability AI, and RedPajama from Together to achieve the coveted ranking. The index utilizes four key benchmarks from the Eleuther AI Language Model Evaluation Harness, a consolidated framework that assesses generative language models on: the AI2 Reasoning Challenge (25-shot), a set of grade-school science questions; HellaSwag (10-shot), a test of common sense inference, which is easy for humans but challenging for SOTA models; MMLU (5-shot), a test to measure a text model’s multitask accuracy; and TruthfulQA (0-shot), a test to measure whether a language model is truthful in generating answers to questions.
Hugging Face’s Open LLM Leaderboard is an objective evaluation tool open to the AI community that tracks, ranks, and evaluates LLMs and chatbots as they are launched.
Trained on one trillion tokens, Falcon 40B marks a significant turning point for the UAE in its journey towards AI leadership, enabling widespread access to the model's weights for both research and commercial utilization. The new ranking confirms the model’s prowess in making AI more transparent, inclusive, and accessible for the greater good of humanity.
With this latest development, TII has managed to secure the UAE a seat at the table when it comes to generative AI models, allowing it to join an exclusive list of countries that are working to drive AI innovation and collaboration.
TII has already embarked work on its next version of Falcon - the 180B AI model. To learn more about the current open sourced Falcon 40B AI model, please visit: FalconLLM.TII.ae. The initial announcement on Falcon 40B can be found here: UAE's Technology Innovation Institute Launches Open-Source "Falcon 40B" Large Language Model for Research & Commercial Utilization.
For more information, visit www.tii.ae
*Source: AETOSWire
To view this piece of content from cts.businesswire.com, please give your consent at the top of this page.
View source version on businesswire.com: https://www.businesswire.com/news/home/20230529005045/en/
Contact information
Jennifer Dewan
        Senior Director of Communications
        jennifer.dewan@tii.ae
      
About Business Wire
For more than 50 years, Business Wire has been the global leader in press release distribution and regulatory disclosure.
Subscribe to releases from Business Wire
Subscribe to all the latest releases from Business Wire by registering your e-mail address below. You can unsubscribe at any time.
Latest releases from Business Wire
Norton Motorcycles Launches Bold New Product and Brand Resurgence at EICMA 20254.11.2025 15:38:00 EET | Press release
Norton Motorcycles has unveiled its Resurgence strategy at EICMA, a long-term approach underpinned by the substantial investment by TVS Motor Company (TVS) since it acquired Norton Motorcycles in April 2020. Five years in the making, this rebirth is based on a cascade of innovative new models defined by Norton’s core attributes – a combination of Design, Dynamics and Detail that will create the world’s most desirable motorcycles. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20251104808619/en/ L to R: Sharad Mohan Mishra, President, Group Strategy, TVSM; Sudarshan Venu, Chairman, TVSM; Richard Arnold, Executive Director, Norton Motorcycles; Nevijo Mance, Executive Director, Norton Motorcycles Benefiting from investments that have exceeded £200 million into a flagship manufacturing base and company headquarters in Solihull, Norton’s rebirth within the TVS family is a measured and strategic reconstruction of a brand. This Resur
Corpay and National Hockey League Announce Multiyear North American Partnership4.11.2025 15:30:00 EET | Press release
Corpay, Inc.* (NYSE: CPAY), a global leader in corporate payments and the National Hockey League (NHL®) today announced a multiyear North American partnership, naming Corpay’s Cross-Border business the Official Foreign Exchange (FX) Provider of the NHL. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20251104740136/en/ Through this partnership, the NHL will be able to use Corpay Cross-Border’s innovative solutions to help mitigate foreign exchange exposure in their daily operations. The NHL will also benefit from Corpay’s award-winning platform, which allows for seamless global payment management from a single point of access. As part of the partnership, Corpay will have brand presence on the NHL’s digitally enhanced dasherboards and with in-stadium signage at NHL Winter Classic®, NHL Stadium Series™, and NHL Heritage Classic™, in addition to other activation opportunities at NHL tentpole events in future seasons. Corpay and th
Esri Supports IUCN Global Initiative for Nature-Based Education4.11.2025 15:00:00 EET | Press release
At the 2025 IUCN World Conservation Congress, Esri, the global leader in geographic information system (GIS) technology, announced a joint initiative with the International Union for Conservation of Nature (IUCN) and its Commission on Education and Communication to expand access to data, maps, and GIS technology in support of Nature-based Education (NbE) worldwide. Esri is making a $10 million in-kind donation to enable the development of a new NbE geoportal that will allow students, teachers, and IUCN conservationists around the world to understand and explore our planet through maps and the science of geography, collaborate on critical conservation solutions, and develop critical technical skills for future careers in conservation. This new initiative, now in the pilot phase of consultation and design, will support IUCN’s NbE aspirations to promote environmental learning and action as adopted by the Members during the IUCN World Conservation Congress in Abu Dhabi this October. This c
Visa and Transcard Collaborate to Deliver Advanced Embedded Working Capital Solutions in Freight & Logistics4.11.2025 15:00:00 EET | Press release
Visa and Transcard today announced the launch of a next-generation embedded finance platform, designed to power payments and working capital solutions in the freight and logistics industry. As a part of their joint effort, Visa and Transcard are also collaborating on agentic initiatives for the B2B segment. This alliance puts embedded credit and working capital solutions in the hands of freight forwarders and airline carriers on WebCargo by Freightos (Nasdaq: CRGO), a leading digital booking and payments platform in the freight industry. By combining Visa’s global expertise in commercial solutions with Transcard’s leading-edge payment orchestration technology, WebCargo users can now gain access to flexible credit terms, seamless onboarding and automated reconciliation for air cargo transactions. The solution, built on Visa’s virtual card infrastructure, unlocks the cash flow bottlenecks between freight forwarders and airlines, and elevates the digital freight booking and payment experi
Agendia to Present Five Posters Demonstrating the Expanding Clinical Utility of MammaPrint + BluePrint at the 2025 San Antonio Breast Cancer Symposium4.11.2025 14:00:00 EET | Press release
Agendia®, Inc., a leader in precision oncology for breast cancer, today announced it will present new results from the ongoing real-world FLEX Study (NCT03053193) at the 2025 San Antonio Breast Cancer Symposium (SABCS), taking place December 9-12 in San Antonio, Texas. The company will present five posters, led by Agendia investigators and independent academic collaborators, that collectively highlight the broad clinical impact of MammaPrint® + BluePrint® genomic profiling in optimizing treatment decisions and improving outcomes for patients with hormone receptor positive, HER2-negative (HR+/HER2–) early breast cancer (EBC). “We look forward to sharing these new findings from the FLEX Study, which spans more than 20,000 participants across 100 global sites, making it the largest and most diverse real-world evidence cohort for early-stage breast cancer,” said William Audeh, M.D., Chief Medical Officer at Agendia. “These results underscore our commitment to generating robust clinical evi
In our pressroom you can read all our latest releases, find our press contacts, images, documents and other relevant information about us.
Visit our pressroom
