Technology Innovation Institute Announces Launch of NOOR, the World’s Largest Arabic NLP Model
11.4.2022 14:16:00 EEST | Business Wire | Press release
Technology Innovation Institute (TII), a global research center and applied research pillar of Abu Dhabi’s Advanced Technology Research Council, today announced the launch of NOOR, the world’s largest Arabic natural language processing (NLP) model to date.
This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20220411005085/en/
Noor, the world's largest Arabic NLP Model - AI Cross-Center Unit, Technology Innovation Institute (Photo: AETOSWire)
TII’s team of advanced researchers and Artificial Intelligence (AI) specialists has joined forces with LightOn, a technology company that unlocks extreme-scale machine intelligence for businesses, to transform the Arabic NLP model. The NOOR model has the capability to carry out tasks beyond the domain of language, offering an end-to-end pipeline for high-quality data that includes crawling, filtering, and curation at scale. The model facilitates extreme-scale distributed training and serving to deliver applications with efficient inference and model specialization.
Dr. Ray O. Johnson, CEO, TII and ASPIRE, said: “With this development, we are well on track to enhance our research capabilities and credentials as well as elevate the status of Abu Dhabi and the UAE as a serious research ecosystem. Our expert teams have demonstrated yet again that this region can achieve breakthrough R&D outcomes to impact the world.”
Dr. Ebtesam Almazrouei, Director, AI Cross-Center Unit, TII, said: “Large language models have taken the world of natural language processing by storm, and we are proud to introduce this cutting-edge model with 10 billion parameters - the world’s largest Arabic NLP model. The uniquely large Arabic dataset collected to train the model is the result of months of work that included curating, scraping, and filtering of varied sources. A special thank you to the entire team that worked on this project to make NOOR the go-to exploration model in Arabic for academicians and businesses everywhere.”
Speaking on the launch, Prof. Mérouane Debbah, Chief Researcher, Digital Science Research Center and AI Cross-Center Unit, TII, said: “With NOOR, TII has expanded the scope of the modern standard Arabic model by leveraging know-how in large language models to build cross-disciplinary, cutting-edge expertise in this new generation of AI research.”
Built as part of an effort to curate the world’s largest high-quality cross-domain Arabic datasets, NOOR’s unique dataset of more than 30 billion words combines web data with books, poetry, news articles, and technical information to significantly widen the applicability of the model.
Dr. Ebtesam Almazrouei said the NOOR model is based on the popular Transformer architecture. As a decoder-only model, similar in structure to GPT-3, it is designed to tackle generative tasks, with an architecture upgraded to reflect the latest developments in machine learning, including improvements such as better positional embeddings. To help ensure quality at scale in the NOOR dataset, the TII team designed an automated filtering pipeline based on machine learning techniques. These tools identify text that resembles quality references and safeguard the model from exposure to spam content.
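The release does not describe how the filtering pipeline works internally. As a rough illustration of the general idea only, the sketch below trains a toy Naive Bayes classifier on a few hand-written examples and uses it to keep text that resembles quality references and to drop spam. The class name, the English training sentences, and the choice of Naive Bayes are all assumptions made for illustration, not TII's method.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase whitespace tokenizer; a real pipeline would be far more careful."""
    return text.lower().split()


class ToyQualityFilter:
    """Hypothetical two-class Naive Bayes filter: 'quality' vs 'spam'."""

    def __init__(self):
        self.counts = {"quality": Counter(), "spam": Counter()}
        self.docs = {"quality": 0, "spam": 0}

    def fit(self, labelled_docs):
        for text, label in labelled_docs:
            self.counts[label].update(tokenize(text))
            self.docs[label] += 1

    def log_score(self, text, label):
        vocab = set(self.counts["quality"]) | set(self.counts["spam"])
        total = sum(self.counts[label].values())
        # Log prior for the class, then add smoothed log likelihoods per token.
        score = math.log(self.docs[label] / sum(self.docs.values()))
        for tok in tokenize(text):
            # Laplace (add-one) smoothing so unseen tokens do not zero the score.
            score += math.log((self.counts[label][tok] + 1) / (total + len(vocab)))
        return score

    def is_quality(self, text):
        return self.log_score(text, "quality") > self.log_score(text, "spam")


# Invented training examples, purely for illustration.
TRAINING = [
    ("the committee published its annual report on regional water policy", "quality"),
    ("researchers describe a new method for measuring soil moisture", "quality"),
    ("click here to win a free prize now", "spam"),
    ("buy cheap pills now click here free shipping", "spam"),
]

flt = ToyQualityFilter()
flt.fit(TRAINING)

crawled = ["a new report on water policy", "click here for a free prize"]
kept = [doc for doc in crawled if flt.is_quality(doc)]
print(kept)
```

In a production setting the classifier would be trained on large sets of trusted reference text and known low-quality pages, and would typically be combined with deduplication and language identification.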
Leveraging state-of-the-art 3D parallelism, NOOR was trained on a High-Performance Computing resource with 128 A100 GPUs, allowing for the distribution of computations and ensuring efficient use of the available hardware resources.
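"3D parallelism" usually means combining data, tensor, and pipeline parallelism, so that the product of the three degrees equals the number of GPUs. The release does not say how the 128 GPUs were divided, so the sketch below only enumerates the possible splits and shows how a flat GPU rank maps to a coordinate in one hypothetical layout (4 data x 4 pipeline x 8 tensor). The function names and the chosen layout are assumptions for illustration.

```python
from itertools import product

WORLD_SIZE = 128  # number of A100 GPUs stated in the release


def layouts(world_size):
    """All (data, tensor, pipeline) degrees whose product equals world_size."""
    divisors = [d for d in range(1, world_size + 1) if world_size % d == 0]
    return [
        (dp, tp, pp)
        for dp, tp, pp in product(divisors, repeat=3)
        if dp * tp * pp == world_size
    ]


def rank_to_coords(rank, dp, tp, pp):
    """Map a flat GPU rank to (data, pipeline, tensor) coordinates.

    The tensor index varies fastest, a common convention because
    tensor-parallel peers exchange the most data and benefit from
    being placed on the same node.
    """
    assert 0 <= rank < dp * tp * pp
    tensor = rank % tp
    pipeline = (rank // tp) % pp
    data = rank // (tp * pp)
    return data, pipeline, tensor


# Hypothetical layout; the actual NOOR configuration is not given in the release.
DP, TP, PP = 4, 8, 4

all_layouts = layouts(WORLD_SIZE)
coords = [rank_to_coords(r, DP, TP, PP) for r in range(WORLD_SIZE)]
print(len(all_layouts), coords[0], coords[-1])
```

Each GPU holds one tensor-parallel shard of one pipeline stage, and the data-parallel replicas process different batches of the training data.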
The Director of the AI Cross-Center Unit noted that this was only the first step in the Unit’s efforts to contribute to the wider UAE Strategy for Artificial Intelligence.
The model is named for the Arabic word for "light", a name chosen to link the Arabic language model with the idea of enlightening the mind.
About Technology Innovation Institute (TII)
For more information, visit www.tii.ae
*Source: AETOSWire
View source version on businesswire.com: https://www.businesswire.com/news/home/20220411005085/en/
Contact information
Technology Innovation Institute
Sneha Sivanand, sneha.sivanand@tii.ae
