Technology Innovation Institute Announces Launch of NOOR, the World’s Largest Arabic NLP Model
11.4.2022 14:16:00 EEST | Business Wire | Press release
Technology Innovation Institute (TII), a global research center and applied research pillar of Abu Dhabi’s Advanced Technology Research Council, today announced the launch of NOOR, the world’s largest Arabic natural language processing (NLP) model to date.
This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20220411005085/en/
Noor, the world's largest Arabic NLP Model - AI Cross-Center Unit, Technology Innovation Institute (Photo: AETOSWire)
TII’s team of advanced researchers and Artificial Intelligence (AI) specialists, has joined forces with LightOn, a technology company that unlocks extreme-scale machine intelligence for businesses, to transform the Arabic NLP model. The NOOR model has the capability to carry out tasks beyond the domain of language - offering end-to-end pipeline high quality data, including crawling, filtering, and curation at scale. The model facilitates extreme-scale distributed training and serving – to deliver applications with efficient inference and model specialization.
Dr. Ray O. Johnson, CEO, TII and ASPIRE, said: “With this development, we are well on track to enhance our research capabilities and credentials as well as elevate the status of Abu Dhabi and the UAE as a serious research ecosystem. Our expert teams have demonstrated yet again that this region can achieve breakthrough R&D outcomes to impact the world.”
Dr. Ebtesam Almazrouei, Director, AI Cross-Center Unit, TII, said: “Large language models have taken the world of natural language processing by storm, and we are proud to introduce this cutting-edge model with 10 billion parameters - the world’s largest Arabic NLP model. The uniquely large Arabic dataset collected to train the model is the result of months of work that included curating, scrapping, and filtering of varied sources. A special thank you to the entire team that worked on this project to make NOOR the go-to exploration model in Arabic for academicians and businesses everywhere.”
Speaking on the launch, Prof. Mérouane Debbah, Chief Researcher, Digital Science Research Center and AI Cross-Center Unit, TII, said: “With NOOR, TII has expanded the scope of the modern standard Arabic model by leveraging know-how in large language models to build cross-disciplinary, cutting-edge expertise in this new generation of AI research.”
To curate the world’s largest high-quality cross-domain Arabic datasets, NOOR’s unique dataset of more than 30 billion words combines web data with books, poetry, news articles, and technical information to significantly widen the applicability of the model.
Dr. Ebtesam Almazrouei said the NOOR model is based on the popular Transformer architecture. As a decoder-only model, similar in structure to GPT-3, it is programmed to tackle generative tasks with architecture upgraded to reflect the latest developments in the world of machine learning, including improvements such as better positional embeddings. To help ensure quality at scale in the NOOR dataset, the TII team designed an automated filtering pipeline based on machine learning techniques. These tools identify text like quality references and safeguard the model from exposure to spam content.
Leveraging state-of-the-art 3D parallelism, NOOR was trained on a High-Performance Computing resource with 128 A100 GPUs, allowing for the distribution of computations and ensuring efficient use of the available hardware resources.
The Director of the AI Cross-Center Unit noted that this was only the first step in the Unit’s efforts to contribute to the wider UAE Strategy for Artificial Intelligence.
Named for the Arabic word "light", the model has been so called to establish the correlation of the Arabic language model to enlightening the mind.
About Technology Innovation Institute (TII)
For more information, visit www.tii.ae
*Source: AETOSWire
To view this piece of content from cts.businesswire.com, please give your consent at the top of this page.
View source version on businesswire.com: https://www.businesswire.com/news/home/20220411005085/en/
Contact information
Technology Innovation Institute
Sneha Sivanand, sneha.sivanand@tii.ae
About Business Wire
For more than 50 years, Business Wire has been the global leader in press release distribution and regulatory disclosure.
Subscribe to releases from Business Wire
Subscribe to all the latest releases from Business Wire by registering your e-mail address below. You can unsubscribe at any time.
Latest releases from Business Wire
Onera Announces Integration of the Onera hPSG® Solution With Somnoware15.6.2026 20:35:00 EEST | Press release
Onera Health, a leader in transforming sleep medicine, announces that its end-to-end home polysomnography solution, the Onera hPSG® solution, now integrates with Somnoware by ResMed sleep lab management software. This integration enables clinicians to conduct Polysomnography tests (PSGs) where patients sleep most comfortably, in their own home, while managing the entire workflow in Somnoware. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20260615106079/en/ Onera hPSG®, an end-to-end home polysomnography solution from Onera Health, is now integrated into Somnoware, enabling their shared customers to conduct Polysomnography tests (PSGs) in the patient's home while managing the entire workflow in Somnoware. “The integration with Somnoware is a welcomed enhancement that broadens access to the Onera hPSG® solution,” states Ruben de Francisco, Founder and CEO of Onera Health. “Many sleep centers are customers of both Onera and Somn
Digital Cooperation Organization Launches Global Expert Community to Accelerate International Digital Cooperation15.6.2026 19:18:00 EEST | Press release
The Digital Cooperation Organization (DCO), the world's first standalone international organization dedicated to inclusive and sustainable digital economy growth, today announced the launch of the Global Expert Community (GEC) — a new platform designed to mobilize expertise and advance international collaboration in support of high-impact digital initiatives across DCO Member States and beyond. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20260615565781/en/ Digital Cooperation Organization Launches Global Expert Community to Accelerate International Digital Cooperation (Graphic: AETOSWire) The GEC reflects the DCO's continued commitment to turning digital cooperation into action by expanding access to specialized expertise and strengthening collaboration across sectors and borders. As digital transformation reshapes economies and societies worldwide, the Community is designed to convert global perspectives and practical expe
New Pivotal Study Data Show Takeda’s Oveporexton Improved Daily Function, Cognition and Nighttime Sleep for People with Narcolepsy Type 115.6.2026 19:00:00 EEST | Press release
Takeda (TSE:4502/NYSE:TAK)today presented additional results from two pivotal studies at SLEEP 2026, showing oveporexton (TAK-861), an oral orexin receptor 2 (OX2R)-selective agonist, improved daily functioning as well as cognitive and sleep-related symptoms associated with narcolepsy type 1 (NT1).1,2,3 Oveporexton is designed to address the underlying orexin deficiency that causes NT1 by restoring orexin signaling. These data, along with previously disclosed Phase 3 results, demonstrated improvement across the broad disease spectrum, supporting the potential of oveporexton to redefine the standard of care for NT1.4 "Narcolepsy type 1 is a 24-hour disease driven by orexin deficiency, and while excessive daytime sleepiness and cataplexy are the most recognized symptoms, many people experience additional bothersome symptoms such as cognitive difficulties and disrupted nighttime sleep," said Emmanuel Mignot, M.D., Ph.D., principal investigator for the FirstLight (TAK-861-3001) Phase 3 stu
Boomi Named a Pioneer in June 2026 Gartner® Emerging Market Quadrant for No-Code Agent Builders15.6.2026 18:30:00 EEST | Press release
Boomi, the data activation company for AI, today announced it has been recognized as aPioneer in the Gartner® Emerging Market Quadrant for No-Code Agent Builders (NCAB). Gartner defines the NCABs market as SaaS-delivered products that offer an integrated design and runtime environment to build, publish and manage AI-powered agents without using coding. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20260615413216/en/ Boomi Named a Pioneer in June 2026 Gartner® Emerging Market Quadrant for No-Code Agent Builders Boomi sees this recognition as a reflection of the company's rapid evolution from an integration and automation powerhouse to a full-scale agentic infrastructure platform, expanding Boomi’s role in the emerging agentic AI market. A New Chapter in Enterprise Agentic AI According to Gartner, vendors recognized as a Pioneer in this quadrant have established themselves in enterprises for integration and automation workloads
Actiphy Inc. Unveils Actiphy ImageReplicator™15.6.2026 18:00:00 EEST | Press release
Actiphy Inc., a leading provider of backup, disaster recovery, and virtualization software, today announced the release of Actiphy ImageReplicator, a dedicated replication solution for ActiveImage Protector backup images. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20260615886945/en/ Actiphy ImageReplicator dashboard displaying centralized replication management, job status, replication history, and retention monitoring across protected backup images. As ransomware attacks, cyber threats, and infrastructure failures continue to grow in frequency and sophistication, organizations need reliable ways to protect backup data from loss, corruption, and unauthorized access. When primary systems are compromised, backup data becomes the final line of defense for maintaining business continuity and ensuring rapid recovery. Organizations increasingly rely on 3-2-1 backup strategies that include offsite and immutable copies of critical
In our pressroom you can read all our latest releases, find our press contacts, images, documents and other relevant information about us.
Visit our pressroom
