UAE’s Falcon 40B Dominates Leaderboard: Ranks #1 Globally in Latest Hugging Face Independent Verification of Open-source AI Models
29.5.2023 11:52:00 EEST | Business Wire | Press release
Falcon 40B, the UAE’s first large-scale open-source, 40-billion-parameter AI model launched by Abu Dhabi’s Technology Innovation Institute (TII) last week, soared to the top spot on Hugging Face’s latest Open Large Language Model (LLM) Leaderboard. Hugging Face, an American company seeking to democratize artificial intelligence through open-source and open science, is considered the world’s definitive independent verifier of AI models.
This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20230529005045/en/
Falcon 40B ranks 1st globally in Hugging Face Open LLM Leaderboard. (Graphic: AETOSWire)
Falcon 40B managed to beat back established models such as LLaMA from Meta (including its 65B model), StableLM from Stability AI, and RedPajama from Together to achieve the coveted ranking. The index utilizes four key benchmarks from the Eleuther AI Language Model Evaluation Harness, a consolidated framework that assesses generative language models on: the AI2 Reasoning Challenge (25-shot), a set of grade-school science questions; HellaSwag (10-shot), a test of common sense inference, which is easy for humans but challenging for SOTA models; MMLU (5-shot), a test to measure a text model’s multitask accuracy; and TruthfulQA (0-shot), a test to measure whether a language model is truthful in generating answers to questions.
Hugging Face’s Open LLM Leaderboard is an objective evaluation tool open to the AI community that tracks, ranks, and evaluates LLMs and chatbots as they are launched.
Trained on one trillion tokens, Falcon 40B marks a significant turning point for the UAE in its journey towards AI leadership, enabling widespread access to the model's weights for both research and commercial utilization. The new ranking confirms the model’s prowess in making AI more transparent, inclusive, and accessible for the greater good of humanity.
With this latest development, TII has managed to secure the UAE a seat at the table when it comes to generative AI models, allowing it to join an exclusive list of countries that are working to drive AI innovation and collaboration.
TII has already embarked work on its next version of Falcon - the 180B AI model. To learn more about the current open sourced Falcon 40B AI model, please visit: FalconLLM.TII.ae. The initial announcement on Falcon 40B can be found here: UAE's Technology Innovation Institute Launches Open-Source "Falcon 40B" Large Language Model for Research & Commercial Utilization.
For more information, visit www.tii.ae
*Source: AETOSWire
To view this piece of content from cts.businesswire.com, please give your consent at the top of this page.
View source version on businesswire.com: https://www.businesswire.com/news/home/20230529005045/en/
Contact information
Jennifer Dewan
Senior Director of Communications
jennifer.dewan@tii.ae
About Business Wire
For more than 50 years, Business Wire has been the global leader in press release distribution and regulatory disclosure.
Subscribe to releases from Business Wire
Subscribe to all the latest releases from Business Wire by registering your e-mail address below. You can unsubscribe at any time.
Latest releases from Business Wire
Caidya Announces Strategic Combination with Simbec-Orion Bridging Early Scientific Insight and Global Clinical Execution30.6.2026 18:00:00 EEST | Press release
Caidya today announced a strategic combination with Simbec-Orion designed to close the divide between early scientific insight and global clinical execution. The combination of Caidya and Simbec-Orion creates a differentiated specialty clinical CRO platform that enables programs to scale, maintaining focus, speed, and accountability. The strategic combination brings together complementary strengths to create a more complete development partner for innovative biopharma companies. With established operations across Europe, the Americas, APAC, and China, the combined organization provides meaningful expertise and execution capabilities in the regions that matter most. Simbec-Orion brings early-phase clinical pharmacology capabilities alongside deep therapeutic expertise for later stage complex oncology and rare disease trials, helping sponsors shape critical decisions early in the development lifecycle. Together, the organizations strengthen their ability to support complex, cross-border
Archer® Proves Purpose-Built AI Beats General-Purpose LLMs on Regulatory Change Management: 95% Verified Accuracy, 80x Faster, 92% Lower Cost30.6.2026 17:13:00 EEST | Press release
For enterprises deploying AI in compliance, a wrong date is a missed deadline. The more dangerous failure is a wrong answer the model returns with high confidence, one that flows silently into a compliance calendar and is only discovered after the window has passed. Archer® today released results showing purpose-built AI beats a general-purpose large language model (LLM) on regulatory work, and it’s not close. This head-to-head test compared Archer’s purpose-built, vertical-specific AI and proprietary data sets against a leading general-purpose LLM, on a core compliance task: determining the publication, effective and comment-close dates of regulatory documents across six jurisdictions. General-purpose models are a genuine breakthrough, and this is no referendum on their quality. The question Archer set out to answer is narrower and more practical: what it takes to make a specific, high-stakes determination reliable, fast and affordable at scale. A vertical, domain-focused process, gro
Altasciences Supports Key Development Milestone for Steel Therapeutics’ Lead Therapeutic Candidate, Fizurex™30.6.2026 17:08:00 EEST | Press release
Altasciences, a leading drug development organization, today announced a significant milestone in the development of Steel Therapeutics, Inc.’s pivotal toxicology study for its lead product candidate, Fizurex™, for the treatment of anal fissures. The successful completion of the study plays a significant role in the advancement of Fizurex™ toward first-in-human trials. The GLP-compliant study demonstrated a favorable safety profile, which has advanced Steel Therapeutics' plans to submit an Investigational New Drug (IND) application for Fizurex™ to the FDA in Q3 2026. Fizurex™, a patent-pending, single-use topical wipe, was designed to provide a standardized, accessible treatment option for a painful and often undertreated medical condition. The product builds on years of use through compounding pharmacy prescriptions and is now advancing toward clinical development and regulatory review. "We are proud to have supported Steel Therapeutics with the generation of the high-quality safety d
Interactive Brokers Expands Access to Korean Equities with Launch of Nextrade ATS30.6.2026 17:00:00 EEST | Press release
Interactive Brokers (Nasdaq: IBKR), an automated global broker, today announced the launch of select Korean equities through Nextrade, South Korea's first Alternative Trading System (ATS). The addition of Nextrade builds on Interactive Brokers' earlier launch of the Korea Exchange (KRX), through which it became the first major US-based broker to provide global investors with direct access to Korean equities. Clients trading on Nextrade benefit from significantly extended trading hours and access to additional liquidity. Interactive Brokers has enabled IB SmartRouting℠ across both the Korea Exchange (KRX) and Nextrade, automatically routing orders to the venue offering the best price. This helps clients achieve best execution while providing greater flexibility and more opportunities to participate in one of Asia's most dynamic equity markets. Korea's equity market ranks among the top global exchanges by market capitalization and is home to world-leading companies such as Samsung Electr
Andersen Global Adds Depth to Tax and Global Mobility Capabilities in Germany30.6.2026 16:30:00 EEST | Press release
Andersen Global strengthens its presence through a Collaboration Agreement with Lohr and Company (L+C), a senior-led tax advisory platform, providing practical, responsive solutions in tax compliance, cross-border tax, global mobility, and transfer pricing. Headquartered in Germany with a presence in Austria, L+C advises large multinationals and family-owned companies, family offices, foundations, and high-net-worth individuals. The firm, founded in 2001, specializes in areas such as global mobility, M&A, international tax law, country-by-country reporting, and transfer pricing, including Pillar 2 reporting. Additionally, L+C supports clients with trusts and foundations, tax compliance, payroll and financial accounting, and private financial advisory services. “Collaborating with Andersen Global represents an important step in expanding our international capabilities and strengthening the value we provide to clients navigating increasingly complex cross-border matters,” said Jörg-Andre
In our pressroom you can read all our latest releases, find our press contacts, images, documents and other relevant information about us.
Visit our pressroom
