Business Wire

Technology Innovation Institute Announces Launch of NOOR, the World’s Largest Arabic NLP Model

11.4.2022 14:16:00 EEST | Business Wire | Press release

Share

Technology Innovation Institute (TII), a global research center and applied research pillar of Abu Dhabi’s Advanced Technology Research Council, today announced the launch of NOOR, the world’s largest Arabic natural language processing (NLP) model to date.

This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20220411005085/en/

To view this piece of content from mms.businesswire.com, please give your consent at the top of this page.

Noor, the world's largest Arabic NLP Model - AI Cross-Center Unit, Technology Innovation Institute (Photo: AETOSWire)

TII’s team of advanced researchers and Artificial Intelligence (AI) specialists, has joined forces with LightOn, a technology company that unlocks extreme-scale machine intelligence for businesses, to transform the Arabic NLP model. The NOOR model has the capability to carry out tasks beyond the domain of language - offering end-to-end pipeline high quality data, including crawling, filtering, and curation at scale. The model facilitates extreme-scale distributed training and serving – to deliver applications with efficient inference and model specialization.

Dr. Ray O. Johnson, CEO, TII and ASPIRE, said: “With this development, we are well on track to enhance our research capabilities and credentials as well as elevate the status of Abu Dhabi and the UAE as a serious research ecosystem. Our expert teams have demonstrated yet again that this region can achieve breakthrough R&D outcomes to impact the world.”

Dr. Ebtesam Almazrouei, Director, AI Cross-Center Unit, TII, said: “Large language models have taken the world of natural language processing by storm, and we are proud to introduce this cutting-edge model with 10 billion parameters - the world’s largest Arabic NLP model. The uniquely large Arabic dataset collected to train the model is the result of months of work that included curating, scrapping, and filtering of varied sources. A special thank you to the entire team that worked on this project to make NOOR the go-to exploration model in Arabic for academicians and businesses everywhere.”

Speaking on the launch, Prof. Mérouane Debbah, Chief Researcher, Digital Science Research Center and AI Cross-Center Unit, TII, said: “With NOOR, TII has expanded the scope of the modern standard Arabic model by leveraging know-how in large language models to build cross-disciplinary, cutting-edge expertise in this new generation of AI research.”

To curate the world’s largest high-quality cross-domain Arabic datasets, NOOR’s unique dataset of more than 30 billion words combines web data with books, poetry, news articles, and technical information to significantly widen the applicability of the model.

Dr. Ebtesam Almazrouei said the NOOR model is based on the popular Transformer architecture. As a decoder-only model, similar in structure to GPT-3, it is programmed to tackle generative tasks with architecture upgraded to reflect the latest developments in the world of machine learning, including improvements such as better positional embeddings. To help ensure quality at scale in the NOOR dataset, the TII team designed an automated filtering pipeline based on machine learning techniques. These tools identify text like quality references and safeguard the model from exposure to spam content.

Leveraging state-of-the-art 3D parallelism, NOOR was trained on a High-Performance Computing resource with 128 A100 GPUs, allowing for the distribution of computations and ensuring efficient use of the available hardware resources.

The Director of the AI Cross-Center Unit noted that this was only the first step in the Unit’s efforts to contribute to the wider UAE Strategy for Artificial Intelligence.

Named for the Arabic word "light", the model has been so called to establish the correlation of the Arabic language model to enlightening the mind.

About Technology Innovation Institute (TII)

For more information, visit www.tii.ae

*Source: AETOSWire

To view this piece of content from cts.businesswire.com, please give your consent at the top of this page.

Contact information

Technology Innovation Institute
Sneha Sivanand, sneha.sivanand@tii.ae

About Business Wire

For more than 50 years, Business Wire has been the global leader in press release distribution and regulatory disclosure.

Subscribe to releases from Business Wire

Subscribe to all the latest releases from Business Wire by registering your e-mail address below. You can unsubscribe at any time.

Latest releases from Business Wire

Digital Cooperation Organization Launches Global Expert Community to Accelerate International Digital Cooperation15.6.2026 19:18:00 EEST | Press release

The Digital Cooperation Organization (DCO), the world's first standalone international organization dedicated to inclusive and sustainable digital economy growth, today announced the launch of the Global Expert Community (GEC) — a new platform designed to mobilize expertise and advance international collaboration in support of high-impact digital initiatives across DCO Member States and beyond. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20260615565781/en/ Digital Cooperation Organization Launches Global Expert Community to Accelerate International Digital Cooperation (Graphic: AETOSWire) The GEC reflects the DCO's continued commitment to turning digital cooperation into action by expanding access to specialized expertise and strengthening collaboration across sectors and borders. As digital transformation reshapes economies and societies worldwide, the Community is designed to convert global perspectives and practical expe

New Pivotal Study Data Show Takeda’s Oveporexton Improved Daily Function, Cognition and Nighttime Sleep for People with Narcolepsy Type 115.6.2026 19:00:00 EEST | Press release

Takeda (TSE:4502/NYSE:TAK)today presented additional results from two pivotal studies at SLEEP 2026, showing oveporexton (TAK-861), an oral orexin receptor 2 (OX2R)-selective agonist, improved daily functioning as well as cognitive and sleep-related symptoms associated with narcolepsy type 1 (NT1).1,2,3 Oveporexton is designed to address the underlying orexin deficiency that causes NT1 by restoring orexin signaling. These data, along with previously disclosed Phase 3 results, demonstrated improvement across the broad disease spectrum, supporting the potential of oveporexton to redefine the standard of care for NT1.4 "Narcolepsy type 1 is a 24-hour disease driven by orexin deficiency, and while excessive daytime sleepiness and cataplexy are the most recognized symptoms, many people experience additional bothersome symptoms such as cognitive difficulties and disrupted nighttime sleep," said Emmanuel Mignot, M.D., Ph.D., principal investigator for the FirstLight (TAK-861-3001) Phase 3 stu

Boomi Named a Pioneer in June 2026 Gartner® Emerging Market Quadrant for No-Code Agent Builders15.6.2026 18:30:00 EEST | Press release

Boomi, the data activation company for AI, today announced it has been recognized as aPioneer in the Gartner® Emerging Market Quadrant for No-Code Agent Builders (NCAB). Gartner defines the NCABs market as SaaS-delivered products that offer an integrated design and runtime environment to build, publish and manage AI-powered agents without using coding. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20260615413216/en/ Boomi Named a Pioneer in June 2026 Gartner® Emerging Market Quadrant for No-Code Agent Builders Boomi sees this recognition as a reflection of the company's rapid evolution from an integration and automation powerhouse to a full-scale agentic infrastructure platform, expanding Boomi’s role in the emerging agentic AI market. A New Chapter in Enterprise Agentic AI According to Gartner, vendors recognized as a Pioneer in this quadrant have established themselves in enterprises for integration and automation workloads

Actiphy Inc. Unveils Actiphy ImageReplicator™15.6.2026 18:00:00 EEST | Press release

Actiphy Inc., a leading provider of backup, disaster recovery, and virtualization software, today announced the release of Actiphy ImageReplicator, a dedicated replication solution for ActiveImage Protector backup images. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20260615886945/en/ Actiphy ImageReplicator dashboard displaying centralized replication management, job status, replication history, and retention monitoring across protected backup images. As ransomware attacks, cyber threats, and infrastructure failures continue to grow in frequency and sophistication, organizations need reliable ways to protect backup data from loss, corruption, and unauthorized access. When primary systems are compromised, backup data becomes the final line of defense for maintaining business continuity and ensuring rapid recovery. Organizations increasingly rely on 3-2-1 backup strategies that include offsite and immutable copies of critical

Energy Dome and SRP to Add Long-Duration Energy Storage Project to the Grid, Expand Google Collaboration15.6.2026 16:30:00 EEST | Press release

Energy Dome, a leading provider of innovative capacity solutions for utilities and AI infrastructure, and Salt River Project (SRP), a not-for-profit public power utility serving the greater Phoenix metropolitan area, today announced an agreement to add a 19 megawatt (MW), 10-hour carbon dioxide-based (CO2) battery system to the grid. The project is planned to be co-located on the site of SRP’s Coronado Generating Station (CGS) in St. Johns, Arizona, and it will be developed under a 20-year tolling agreement, with Energy Dome owning and operating the facility and SRP dispatching its output. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20260615027901/en/ Image: Rendering of Energy Dome’s energy storage system located at the Coronado Generating Station site The project is also part of Google and SRP’s innovative collaboration to accelerate deployment of non-lithium-ion long-duration energy storage (LDES) technologies that suppo

In our pressroom you can read all our latest releases, find our press contacts, images, documents and other relevant information about us.

Visit our pressroom
World GlobeA line styled icon from Orion Icon Library.HiddenA line styled icon from Orion Icon Library.Eye