Technology Innovation Institute Announces Launch of NOOR, the World’s Largest Arabic NLP Model
11.4.2022 14:16:00 EEST | Business Wire | Press release
Technology Innovation Institute (TII), a global research center and applied research pillar of Abu Dhabi’s Advanced Technology Research Council, today announced the launch of NOOR, the world’s largest Arabic natural language processing (NLP) model to date.
TII’s team of advanced researchers and artificial intelligence (AI) specialists has joined forces with LightOn, a technology company that unlocks extreme-scale machine intelligence for businesses, to transform Arabic NLP. The NOOR model's capabilities extend beyond the domain of language: it comes with an end-to-end pipeline for high-quality data, including crawling, filtering, and curation at scale. The model also supports extreme-scale distributed training and serving, delivering applications with efficient inference and model specialization.
Dr. Ray O. Johnson, CEO, TII and ASPIRE, said: “With this development, we are well on track to enhance our research capabilities and credentials as well as elevate the status of Abu Dhabi and the UAE as a serious research ecosystem. Our expert teams have demonstrated yet again that this region can achieve breakthrough R&D outcomes to impact the world.”
Dr. Ebtesam Almazrouei, Director, AI Cross-Center Unit, TII, said: “Large language models have taken the world of natural language processing by storm, and we are proud to introduce this cutting-edge model with 10 billion parameters - the world’s largest Arabic NLP model. The uniquely large Arabic dataset collected to train the model is the result of months of work that included scraping, filtering, and curating varied sources. A special thank you to the entire team that worked on this project to make NOOR the go-to exploration model in Arabic for academicians and businesses everywhere.”
Speaking on the launch, Prof. Mérouane Debbah, Chief Researcher, Digital Science Research Center and AI Cross-Center Unit, TII, said: “With NOOR, TII has expanded the scope of the modern standard Arabic model by leveraging know-how in large language models to build cross-disciplinary, cutting-edge expertise in this new generation of AI research.”
Curated as the world’s largest high-quality cross-domain Arabic dataset, NOOR’s unique training corpus of more than 30 billion words combines web data with books, poetry, news articles, and technical information, significantly widening the applicability of the model.
Dr. Ebtesam Almazrouei said the NOOR model is based on the popular Transformer architecture. As a decoder-only model, similar in structure to GPT-3, it is designed for generative tasks, with an architecture updated to reflect the latest developments in machine learning, including improvements such as better positional embeddings. To help ensure quality at scale in the NOOR dataset, the TII team designed an automated filtering pipeline based on machine learning techniques. These tools identify text that resembles high-quality references and safeguard the model from exposure to spam content.
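The release does not say which machine learning techniques the filtering pipeline uses. As a rough illustration only, the idea of keeping text that resembles quality references and dropping spam can be sketched with a small word-level naive Bayes classifier. The technique, the function names (`train`, `log_score`, `keep`), the labels, and the English toy samples are all assumptions made for this sketch, not details of TII's pipeline.

```python
import math
from collections import Counter

def train(docs_by_label):
    """Fit a word-level naive Bayes model from {label: [texts]}."""
    counts = {label: Counter(w for t in texts for w in t.lower().split())
              for label, texts in docs_by_label.items()}
    vocab = set().union(*counts.values())
    total_docs = sum(len(texts) for texts in docs_by_label.values())
    priors = {label: math.log(len(texts) / total_docs)
              for label, texts in docs_by_label.items()}
    return counts, vocab, priors

def log_score(text, label, model):
    """Log prior plus summed log word likelihoods for one label."""
    counts, vocab, priors = model
    total = sum(counts[label].values())
    score = priors[label]
    for w in text.lower().split():
        # Add-one (Laplace) smoothing so unseen words do not zero out a class.
        score += math.log((counts[label][w] + 1) / (total + len(vocab)))
    return score

def keep(text, model):
    """Keep a document only if it looks more like a reference than like spam."""
    return log_score(text, "reference", model) > log_score(text, "spam", model)

# Hypothetical toy training data; a real pipeline would use large Arabic samples.
model = train({
    "reference": ["the poem describes the desert at night",
                  "the article reports on new research in physics"],
    "spam": ["click here to win free prizes now",
             "buy cheap pills click now free shipping"],
})

crawled = ["the research article describes physics",
           "click now to win free pills"]
filtered = [doc for doc in crawled if keep(doc, model)]
```

In this toy run only the first crawled sentence survives the filter. A production pipeline would more likely combine several signals (language identification, deduplication, learned quality scores) rather than a single classifier.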
Leveraging state-of-the-art 3D parallelism, NOOR was trained on a High-Performance Computing resource with 128 A100 GPUs, allowing for the distribution of computations and ensuring efficient use of the available hardware resources.
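3D parallelism generally means splitting training across three axes at once: data parallelism (model replicas on different batches), tensor parallelism (splitting individual layers), and pipeline parallelism (splitting the stack of layers into stages). The product of the three degrees equals the number of GPUs. The release does not state the degrees used for NOOR, so the sketch below simply enumerates layouts that fit 128 GPUs and shows one common way to map a flat GPU rank to a position on the three axes. The function names, the cap of 8 on the tensor degree, and the (4, 8, 4) layout are assumptions for illustration.

```python
def parallel_layouts(world_size, max_tensor=8):
    """List (data, tensor, pipeline) degrees whose product equals world_size."""
    layouts = []
    for tp in range(1, max_tensor + 1):
        if world_size % tp:
            continue
        rest = world_size // tp
        for pp in range(1, rest + 1):
            if rest % pp == 0:
                layouts.append((rest // pp, tp, pp))
    return layouts

def rank_to_coords(rank, dp, tp, pp):
    """Map a flat GPU rank to (data, pipeline, tensor) indices.

    The tensor index varies fastest, so GPUs in the same tensor-parallel
    group get consecutive ranks (an assumed, commonly used ordering).
    """
    assert 0 <= rank < dp * tp * pp
    tensor_idx = rank % tp
    pipeline_idx = (rank // tp) % pp
    data_idx = rank // (tp * pp)
    return data_idx, pipeline_idx, tensor_idx

layouts = parallel_layouts(128)

# Hypothetical layout: 4 data replicas x 8-way tensor x 4 pipeline stages.
dp, tp, pp = 4, 8, 4
coords = [rank_to_coords(r, dp, tp, pp) for r in range(dp * tp * pp)]
```

Every rank receives a distinct coordinate, so each GPU holds one tensor shard of one pipeline stage of one model replica.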
The Director of the AI Cross-Center Unit noted that this was only the first step in the Unit’s efforts to contribute to the wider UAE Strategy for Artificial Intelligence.
NOOR takes its name from the Arabic word for "light", a choice meant to link the Arabic language model with enlightening the mind.
About Technology Innovation Institute (TII)
For more information, visit www.tii.ae
*Source: AETOSWire
View source version on businesswire.com: https://www.businesswire.com/news/home/20220411005085/en/
Contact information
Technology Innovation Institute
Sneha Sivanand, sneha.sivanand@tii.ae
