Business Wire

Technology Innovation Institute Announces Launch of NOOR, the World’s Largest Arabic NLP Model

11.4.2022 14:16:00 EEST | Business Wire | Press release

Share

Technology Innovation Institute (TII), a global research center and applied research pillar of Abu Dhabi’s Advanced Technology Research Council, today announced the launch of NOOR, the world’s largest Arabic natural language processing (NLP) model to date.

This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20220411005085/en/

To view this piece of content from mms.businesswire.com, please give your consent at the top of this page.

Noor, the world's largest Arabic NLP Model - AI Cross-Center Unit, Technology Innovation Institute (Photo: AETOSWire)

TII’s team of advanced researchers and Artificial Intelligence (AI) specialists, has joined forces with LightOn, a technology company that unlocks extreme-scale machine intelligence for businesses, to transform the Arabic NLP model. The NOOR model has the capability to carry out tasks beyond the domain of language - offering end-to-end pipeline high quality data, including crawling, filtering, and curation at scale. The model facilitates extreme-scale distributed training and serving – to deliver applications with efficient inference and model specialization.

Dr. Ray O. Johnson, CEO, TII and ASPIRE, said: “With this development, we are well on track to enhance our research capabilities and credentials as well as elevate the status of Abu Dhabi and the UAE as a serious research ecosystem. Our expert teams have demonstrated yet again that this region can achieve breakthrough R&D outcomes to impact the world.”

Dr. Ebtesam Almazrouei, Director, AI Cross-Center Unit, TII, said: “Large language models have taken the world of natural language processing by storm, and we are proud to introduce this cutting-edge model with 10 billion parameters - the world’s largest Arabic NLP model. The uniquely large Arabic dataset collected to train the model is the result of months of work that included curating, scrapping, and filtering of varied sources. A special thank you to the entire team that worked on this project to make NOOR the go-to exploration model in Arabic for academicians and businesses everywhere.”

Speaking on the launch, Prof. Mérouane Debbah, Chief Researcher, Digital Science Research Center and AI Cross-Center Unit, TII, said: “With NOOR, TII has expanded the scope of the modern standard Arabic model by leveraging know-how in large language models to build cross-disciplinary, cutting-edge expertise in this new generation of AI research.”

To curate the world’s largest high-quality cross-domain Arabic datasets, NOOR’s unique dataset of more than 30 billion words combines web data with books, poetry, news articles, and technical information to significantly widen the applicability of the model.

Dr. Ebtesam Almazrouei said the NOOR model is based on the popular Transformer architecture. As a decoder-only model, similar in structure to GPT-3, it is programmed to tackle generative tasks with architecture upgraded to reflect the latest developments in the world of machine learning, including improvements such as better positional embeddings. To help ensure quality at scale in the NOOR dataset, the TII team designed an automated filtering pipeline based on machine learning techniques. These tools identify text like quality references and safeguard the model from exposure to spam content.

Leveraging state-of-the-art 3D parallelism, NOOR was trained on a High-Performance Computing resource with 128 A100 GPUs, allowing for the distribution of computations and ensuring efficient use of the available hardware resources.

The Director of the AI Cross-Center Unit noted that this was only the first step in the Unit’s efforts to contribute to the wider UAE Strategy for Artificial Intelligence.

Named for the Arabic word "light", the model has been so called to establish the correlation of the Arabic language model to enlightening the mind.

About Technology Innovation Institute (TII)

For more information, visit www.tii.ae

*Source: AETOSWire

To view this piece of content from cts.businesswire.com, please give your consent at the top of this page.

Contact information

Technology Innovation Institute
Sneha Sivanand, sneha.sivanand@tii.ae

About Business Wire

For more than 50 years, Business Wire has been the global leader in press release distribution and regulatory disclosure.

Subscribe to releases from Business Wire

Subscribe to all the latest releases from Business Wire by registering your e-mail address below. You can unsubscribe at any time.

Latest releases from Business Wire

ICE Brent and ICE WTI Perpetual Futures to Launch on OKX22.5.2026 15:30:00 EEST | Press release

OKX, a blockchain technology and trading company serving more than 120 million customers globally,and Intercontinental Exchange (NYSE: ICE), one of the world's leading providers of financial market technology and data powering global capital markets including the New York Stock Exchange, today announced plans for OKX to launch perpetual futures based on ICE's Brent Crude and WTI Crude energy benchmarks. The products are expected to be available to trade on OKX’s platform in jurisdictions where OKX is licensed to offer perpetual futures products. The new OKX contracts represent a major step forward in expanding regulated access to global commodity markets through digital asset infrastructure. This first product collaboration between OKX and ICE comes after the companies established a strategic relationship in March 2026. ICE operates some of the world’s leading exchanges, clearing houses and market data services across energy, commodities, fixed income and equities markets. ICE’s future

Enhertu ® Recommended for Approval in the EU by CHMP for Patients with Previously Treated HER2 Positive Metastatic Solid Tumors22.5.2026 15:00:00 EEST | Press release

Enhertu® (trastuzumab deruxtecan) has been recommended for approval in the European Union (EU) as a monotherapy for the treatment of adult patients with unresectable or metastatic HER2 positive (immunohistochemistry [IHC] 3+) solid tumors who have received prior treatment and who have no satisfactory treatment options. Enhertu is a specifically engineered HER2 directed DXd antibody drug conjugate (ADC) discovered by Daiichi Sankyo (TSE: 4568) and being jointly developed and commercialized by Daiichi Sankyo and AstraZeneca (LSE/STO/NYSE: AZN). The Committee for Medicinal Products for Human Use (CHMP) of the European Medicines Agency (EMA) based its positive opinion on results from patients with HER2 positive (IHC 3+) tumors in three phase 2 trials including DESTINY-PanTumor02,DESTINY-Lung01 andDESTINY-CRC02 where Enhertu demonstrated clinically meaningful responses across a broad range of tumors. The recommendation will now be reviewed by the European Commission, which has the authority

Future Health Challenge Awards USD 300,000 to Early Detection and Population Health Sensing Tools on Sidelines of World Health Assembly22.5.2026 14:45:00 EEST | Press release

Three global teams developing early detection and real-time population health monitoring solutions have secured a total of USD 300,000 on the sidelines of the 79th World Health Assembly. The winning solutions address critical challenges in early detection, continuous population insight and more timely decision making, signalling a shift in health systems from late-stage treatment to earlier intervention. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20260522587414/en/ Selected from 393 submissions across 68 countries, the winning teams were recognised through the inaugural ‘Future Health Challenge 2026: Building Anticipatory Health Systems through Population Sensing’, delivered by Future Health – A Global Initiative by Abu Dhabi in collaboration with MIT Solve. Health systems globally are facing rising costs and persistent delays in diagnosis, with many conditions still identified only after symptoms become severe. At the sam

Pivotal Trial Data for EP0031 (A400), a Next-Generation Selective RET Inhibitor (SRI), in RET Positive Advanced NSCLC, to be Presented at ASCO 202622.5.2026 12:18:00 EEST | Press release

Ellipses Pharma (“Ellipses”), a global oncology drug development company with a pipeline of innovative programmes, announced today that its partner, Kelun-Biotech, is presenting pivotal trial data for EP0031/A400, for the potential treatment of RET-fusion positive Non-Small Cell Lung Cancer (NSCLC), at the 2026 American Society of Clinical Oncology (ASCO) Annual Meeting Chicago, May 29 to June 2. Efficacy and safety of lunbotinib (A400/EP0031), a next-generation selective RET inhibitor (SRI), from a pivotal phase Ⅱ study in patients with advanced RET-fusion positive non-small cell lung cancer (NSCLC), will be presented as an oral presentation scheduled on May 29, 2026, 14:36-14:48 local time (Abstract #8505: Lung Cancer – Non-Small Cell Metastatic). The oral presentation of these data at the prestigious ASCO annual meeting, represents another major milestone in the global development of EP0031/A400 as a next generation SRI. The data were generated in Kelun-Biotech’s Phase 2 study (NCT0

FPT Launches Flezi Foundry™, Advancing AI-Augmented Delivery for Global Enterprises22.5.2026 11:11:00 EEST | Press release

Global IT corporation FPT announced the launch of Flezi Foundry™ (FPT Digital Foundry™), an AI-augmented delivery platform for software development and IT operations. Built around a governed Service-as-a-Software model, the platform combines autonomous AI agents, human expert oversight, secure infrastructure, and outcome-based delivery mechanisms to help enterprises modernize technology delivery as AI agents become part of software engineering and IT operations. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20260521235556/en/ Flezi Foundry applies Agentic Engineering, a structured delivery approach for software development and IT operations. The model brings AI agents into delivery workflows with human supervision, governance, transparency, and performance measurement built into the process. Flezi Foundry operates through two service modes: Agentic Development Lifecycle (ADLC) supports software development by using specialize

In our pressroom you can read all our latest releases, find our press contacts, images, documents and other relevant information about us.

Visit our pressroom
World GlobeA line styled icon from Orion Icon Library.HiddenA line styled icon from Orion Icon Library.Eye