Technology Innovation Institute Announces Launch of NOOR, the World’s Largest Arabic NLP Model
11.4.2022 14:16:00 EEST | Business Wire | Press release
Technology Innovation Institute (TII), a global research center and applied research pillar of Abu Dhabi’s Advanced Technology Research Council, today announced the launch of NOOR, the world’s largest Arabic natural language processing (NLP) model to date.
This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20220411005085/en/
Noor, the world's largest Arabic NLP Model - AI Cross-Center Unit, Technology Innovation Institute (Photo: AETOSWire)
TII’s team of advanced researchers and Artificial Intelligence (AI) specialists, has joined forces with LightOn, a technology company that unlocks extreme-scale machine intelligence for businesses, to transform the Arabic NLP model. The NOOR model has the capability to carry out tasks beyond the domain of language - offering end-to-end pipeline high quality data, including crawling, filtering, and curation at scale. The model facilitates extreme-scale distributed training and serving – to deliver applications with efficient inference and model specialization.
Dr. Ray O. Johnson, CEO, TII and ASPIRE, said: “With this development, we are well on track to enhance our research capabilities and credentials as well as elevate the status of Abu Dhabi and the UAE as a serious research ecosystem. Our expert teams have demonstrated yet again that this region can achieve breakthrough R&D outcomes to impact the world.”
Dr. Ebtesam Almazrouei, Director, AI Cross-Center Unit, TII, said: “Large language models have taken the world of natural language processing by storm, and we are proud to introduce this cutting-edge model with 10 billion parameters - the world’s largest Arabic NLP model. The uniquely large Arabic dataset collected to train the model is the result of months of work that included curating, scrapping, and filtering of varied sources. A special thank you to the entire team that worked on this project to make NOOR the go-to exploration model in Arabic for academicians and businesses everywhere.”
Speaking on the launch, Prof. Mérouane Debbah, Chief Researcher, Digital Science Research Center and AI Cross-Center Unit, TII, said: “With NOOR, TII has expanded the scope of the modern standard Arabic model by leveraging know-how in large language models to build cross-disciplinary, cutting-edge expertise in this new generation of AI research.”
To curate the world’s largest high-quality cross-domain Arabic datasets, NOOR’s unique dataset of more than 30 billion words combines web data with books, poetry, news articles, and technical information to significantly widen the applicability of the model.
Dr. Ebtesam Almazrouei said the NOOR model is based on the popular Transformer architecture. As a decoder-only model, similar in structure to GPT-3, it is programmed to tackle generative tasks with architecture upgraded to reflect the latest developments in the world of machine learning, including improvements such as better positional embeddings. To help ensure quality at scale in the NOOR dataset, the TII team designed an automated filtering pipeline based on machine learning techniques. These tools identify text like quality references and safeguard the model from exposure to spam content.
Leveraging state-of-the-art 3D parallelism, NOOR was trained on a High-Performance Computing resource with 128 A100 GPUs, allowing for the distribution of computations and ensuring efficient use of the available hardware resources.
The Director of the AI Cross-Center Unit noted that this was only the first step in the Unit’s efforts to contribute to the wider UAE Strategy for Artificial Intelligence.
Named for the Arabic word "light", the model has been so called to establish the correlation of the Arabic language model to enlightening the mind.
About Technology Innovation Institute (TII)
For more information, visit www.tii.ae
*Source: AETOSWire
To view this piece of content from cts.businesswire.com, please give your consent at the top of this page.
View source version on businesswire.com: https://www.businesswire.com/news/home/20220411005085/en/
Contact information
Technology Innovation Institute
Sneha Sivanand, sneha.sivanand@tii.ae
About Business Wire
For more than 50 years, Business Wire has been the global leader in press release distribution and regulatory disclosure.
Subscribe to releases from Business Wire
Subscribe to all the latest releases from Business Wire by registering your e-mail address below. You can unsubscribe at any time.
Latest releases from Business Wire
Askey and Canoga Perkins Announce Strategic Collaboration at MWC Barcelona to Deliver Rapid-Deploy 5G Critical Communications Solutions27.3.2026 13:00:00 EET | Press release
Askeyand Canoga Perkinsannounced at Mobile World Congress Barcelona a Global Partnership to Deliver SyncMetra® Network Connectivity Solution, combining Canoga Perkins’ software-defined, IT-operated private 5G network transport along with Askey’s carrier-grade 5G radio access technology. At MWC Barcelona 2026, Askey Computer Corporation and Canoga Perkins announced a strategic partnership to deploy Canoga Perkins' SyncMetra® Platform across enterprise and service provider markets with Askey. This partnership pairs Askey’s carrier-grade radio access capabilities with Canoga Perkins’ industry-leading time-sensitive networking (TSN) and synchronization technology, enabling customers to simplify deployment of ultra-low-latency, highly reliable network services for 5G, edge compute, industrial automation, and mission-critical enterprise applications. The partnership enables joint go-to-market efforts, integrated product offerings, and expanded access to SyncMetra through Askey’s sales channe
SBC Medical Announces Fourth Quarter and Full Year 2025 Financial Results27.3.2026 12:40:00 EET | Press release
SBC Medical Group Holdings Incorporated (Nasdaq: SBC) (“SBC Medical” or the “Company”), a Management Services Organization operating a wide range of franchise businesses across diverse medical fields, today announced its financial results for the fourth quarter of fiscal year 2025 (three months ended December 31, 2025) and for the full fiscal year 2025 (twelve months ended December 31, 2025). Fourth Quarter 2025 Highlights Total revenues were $40 million, representing an 11% year-over-year decrease. Net Income attributable to SBC Medical Group was $14 million, representing a 117% year-over-year increase. Earnings per share, which is defined as net income attributable to the Company divided by the weighted average number of outstanding shares, was $0.14 for the three months ended December 31, 2025, representing 133% year-over-year increase. EBITDA1, which is calculated by adding depreciation and amortization expense and impairment loss on intangible assets to income from operations was
NIQ Redefines Packaging Intelligence with Monthly, Harmonized Global Performance Visibility27.3.2026 12:00:00 EET | Press release
NielsenIQ (NYSE: NIQ), a global leader in consumer intelligence, today announced the launch of its Packaging Strategic Planner Global (SPG) Solution, the first harmonized global platform to deliver monthly visibility into packaging performance across materials, formats, and pack configurations. As packaging innovation accelerates, many organizations continue to rely on fragmented or annual data to inform packaging decisions. The Packaging SPG Solution closes this gap by providing real-time data delivered monthly across regions, enabling brands and packaging partners to uncover trends, grow revenue, and strengthen relationships with CPG and retail partners. Key Highlights: New monthly global packaging tracking capability Coverage across 200+ categories Visibility into 30+ package types and 20 package materials 10+ markets at launch, expanding to 30 by the end of 2026 Introduction of NIQ’s exclusive EQ2 metric, multiplying units by number in pack to reflect true consumption “The pace of
European DataWarehouse Claims Its First “Fintech Provider of the Year” Award and a Sixth “Data Provider of the Year” Title at GlobalCapital’s 2026 European Securitisation Awards27.3.2026 09:47:00 EET | Press release
European DataWarehouse (EDW) is pleased to announce that it has been named both “Data Provider of the Year” and “Fintech Provider of the Year” at the 2026 GlobalCapital European Securitization Awards in London. The award ceremony recognises outstanding achievements in European structured finance, with winners selected by popular vote from across the industry. As defined by GlobalCapital, the programme celebrates “the very best in the market, as chosen by the market.” This latest recognition marks the sixth time that EDW has received the prestigious Data Provider of the Year award, having previously been honoured in 2019, 2022, 2023, 2024 and 2025, reaffirming its long-standing commitment to transparency, data quality and innovation in European securitisation. Prof. José Manuel González-Páramo, Chairman of EDW, later commented: “ Winning this award for the sixth time highlights the continued trust the European securitisation market places in EDW. Transparency, data quality and reliabili
Biocytogen Announces FDA IND Clearance for Partner NEOK Bio’s NEOK002 Targeting Solid Tumors27.3.2026 02:00:00 EET | Press release
Biocytogen Pharmaceuticals (Beijing) Co., Ltd. (Biocytogen, SSE: 688796; HKEX: 02315), a global biotechnology company that drives the research and development of novel antibody-based drugs with innovative technologies, today announced that its partner NEOK Bio, Inc. recently received clearance from the U.S. Food and Drug Administration (FDA) of an investigational new drug (IND) application for NEOK002, an EGFR/MUC1-targeting ADC program for solid tumors. NEOK Bio plans to initiate a Phase 1 clinical study in the second quarter of 2026 and expects to report initial data in 2027. This IND clearance marks an important milestone for NEOK002, an EGFR/MUC1-targeting ADC candidate developed by NEOK Bio and built on a bispecific antibody originally developed by Biocytogen and licensed in 2024. According to NEOK Bio, NEOK002 is being advanced for solid tumors and may offer differentiated efficacy and safety compared with monospecific ADC approaches directed at either target alone. Dr. Yuelei Sh
In our pressroom you can read all our latest releases, find our press contacts, images, documents and other relevant information about us.
Visit our pressroom
