Technology Innovation Institute Announces Launch of NOOR, the World’s Largest Arabic NLP Model
Technology Innovation Institute (TII), a global research center and applied research pillar of Abu Dhabi’s Advanced Technology Research Council, today announced the launch of NOOR, the world’s largest Arabic natural language processing (NLP) model to date.
This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20220411005085/en/
Noor, the world's largest Arabic NLP Model - AI Cross-Center Unit, Technology Innovation Institute (Photo: AETOSWire)
TII’s team of advanced researchers and Artificial Intelligence (AI) specialists, has joined forces with LightOn, a technology company that unlocks extreme-scale machine intelligence for businesses, to transform the Arabic NLP model. The NOOR model has the capability to carry out tasks beyond the domain of language - offering end-to-end pipeline high quality data, including crawling, filtering, and curation at scale. The model facilitates extreme-scale distributed training and serving – to deliver applications with efficient inference and model specialization.
Dr. Ray O. Johnson, CEO, TII and ASPIRE, said: “With this development, we are well on track to enhance our research capabilities and credentials as well as elevate the status of Abu Dhabi and the UAE as a serious research ecosystem. Our expert teams have demonstrated yet again that this region can achieve breakthrough R&D outcomes to impact the world.”
Dr. Ebtesam Almazrouei, Director, AI Cross-Center Unit, TII, said: “Large language models have taken the world of natural language processing by storm, and we are proud to introduce this cutting-edge model with 10 billion parameters - the world’s largest Arabic NLP model. The uniquely large Arabic dataset collected to train the model is the result of months of work that included curating, scrapping, and filtering of varied sources. A special thank you to the entire team that worked on this project to make NOOR the go-to exploration model in Arabic for academicians and businesses everywhere.”
Speaking on the launch, Prof. Mérouane Debbah, Chief Researcher, Digital Science Research Center and AI Cross-Center Unit, TII, said: “With NOOR, TII has expanded the scope of the modern standard Arabic model by leveraging know-how in large language models to build cross-disciplinary, cutting-edge expertise in this new generation of AI research.”
To curate the world’s largest high-quality cross-domain Arabic datasets, NOOR’s unique dataset of more than 30 billion words combines web data with books, poetry, news articles, and technical information to significantly widen the applicability of the model.
Dr. Ebtesam Almazrouei said the NOOR model is based on the popular Transformer architecture. As a decoder-only model, similar in structure to GPT-3, it is programmed to tackle generative tasks with architecture upgraded to reflect the latest developments in the world of machine learning, including improvements such as better positional embeddings. To help ensure quality at scale in the NOOR dataset, the TII team designed an automated filtering pipeline based on machine learning techniques. These tools identify text like quality references and safeguard the model from exposure to spam content.
Leveraging state-of-the-art 3D parallelism, NOOR was trained on a High-Performance Computing resource with 128 A100 GPUs, allowing for the distribution of computations and ensuring efficient use of the available hardware resources.
The Director of the AI Cross-Center Unit noted that this was only the first step in the Unit’s efforts to contribute to the wider UAE Strategy for Artificial Intelligence.
Named for the Arabic word "light", the model has been so called to establish the correlation of the Arabic language model to enlightening the mind.
About Technology Innovation Institute (TII)
For more information, visit www.tii.ae
*Source: AETOSWire
To view this piece of content from cts.businesswire.com, please give your consent at the top of this page.
View source version on businesswire.com: https://www.businesswire.com/news/home/20220411005085/en/
Contact information
Technology Innovation Institute
Sneha Sivanand, sneha.sivanand@tii.ae
About Business Wire
For more than 50 years, Business Wire has been the global leader in press release distribution and regulatory disclosure.
Subscribe to releases from Business Wire
Subscribe to all the latest releases from Business Wire by registering your e-mail address below. You can unsubscribe at any time.
Latest releases from Business Wire
Original “Titanic Cherub” From James Cameron’s Epic Film Heads to Auction December 9 & 1024.11.2025 22:48:00 EET | Press release
One of the most recognizable and beloved set pieces from James Cameron’s Titanic heads to auction on December 9 &10 —the original Grand Staircase Cherub, seen in multiple scenes of the 1997 blockbuster, including the pivotal moment when Jack and Rose meet in front of the First Class Dining Room and the climactic moment when the Atlantic Ocean bursts through the skylight and floods the staircase, and cherub. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20251124056883/en/ The iconic “cherub” with Leonardo DiCaprio & Kate Winslet in James Cameron’s “TITANIC”. The ornate fixture—crafted for the full-scale recreation of Titanic’s Grand Staircase—was gifted by the production to Martin Biallas, CEO of SEE Global Entertainment, whose immersive exhibitions have brought the world’s most famous ship to millions of fans. It now resurfaces as a rare offering in Heritage Auctions’ Entertainment & Music Memorabilia Signature Auction (Sale
Access Advance Announces Major Growth in Its HEVC and VVC Patent Pools24.11.2025 17:10:00 EET | Press release
Access Advance LLC today announced significant expansions of both its HEVC Advance and VVC Advance Patent Pools during the second and third quarters of 2025, underscoring continued industry confidence in the company's balanced and transparent approach to video codec licensing. This growth follows the successful January 2025 launch of Access Advance's Video Distribution Patent ("VDP") Pool, demonstrating the company's expanding role in comprehensive video codec patent licensing solutions. Among the many highlights, Sharp Corporation joined the HEVC Advance Patent Pool as a Licensor, bringing valuable intellectual property assets to the pool's already extensive patent portfolio. Additionally, Huawei Technologies Co., Ltd., already an HEVC Advance Licensor and Licensee, expanded its collaboration with Access Advance by joining the VVC Advance Patent Pool as a Licensee. HP Inc. also expanded its license to include the VVC Advance Patent Pool after previously joining HEVC Advance in 2024, w
Andersen Global Strengthens Platform in Turkey with Addition of Member Firm24.11.2025 16:30:00 EET | Press release
Andersen Global enhances its presence in Turkey as Celen Corporate Property Valuation & Counseling Inc. becomes Andersen in Turkey, adding breadth to the capabilities provided under the Andersen brand in the country. Founded in 1995 and led by Managing Partner Guniz Celen, the Istanbul-based firm delivers a broad spectrum of services for domestic and international clients. With expertise in real estate corporate finance, tangible and intangible asset valuation, and asset management, Andersen in Turkey delivers solutions that support complex corporate finance decisions to clients in more than 18 countries. “Our mission has always been to provide solutions to the most complex challenges in the real estate and investment sectors,” said Guniz. “Joining the Andersen brand strengthens our capabilities as a trusted advisor and gives us access to global resources, enabling us to create even greater long-term value for our clients.” Global Chairman and CEO of Andersen Mark L. Vorsatz added, “Ce
Microsize and Schedio Group to Acquire Lonza’s Micro-Macinazione Site in Switzerland24.11.2025 16:05:00 EET | Press release
Microsize, a leading CDMO specializing in particle size reduction and control technologies, today announced it has signed an agreement to acquire Micro-Macinazione (Mic Mac), a dedicated micronization facility in Monteggio, Switzerland, from Lonza. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20251124545344/en/ The agreement represents Microsize’s second acquisition from Lonza, following the successful 2022 divestment of its Quakertown, Pennsylvania site. In this transaction, Schedio Group – a Swiss-based provider of jet mills, isolators, spray dryers, and engineering services – is investing alongside Microsize to strengthen and localize its operational base in Europe, advancing a shared vision to lead the next generation of integrated particle engineering solutions. With more than 30 years of experience, Mic Mac has served the pharmaceutical industry with proven GMP-compliant jet milling and micronization capabilities for b
Hytera to Debut S1 E at PMRExpo 202524.11.2025 15:17:00 EET | Press release
Hytera, a leading global provider of critical communications technologies and solutions, today introduced the S1 E, a business-ready, palm-sized two-way radio designed specifically for the retail sector, expanding the portfolio of S Series and providing one more option for retail users to choose for their daily operations. The S1 E will make its debut at PMRExpo, the Europe's premier trade fair for secure, mission- and business-critical communication, taking place from November 25th to 27th, 2025, at Koelnmesse in Cologne, Germany. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20251124768341/en/ Hytera New Released Licence-free Analogue Business Radio S1 E Adhering to the S Series’ signature design language, the S1 E combines a stylish, modern, and minimalist aesthetic with practical functionality. Weighing under 85g, the S1 E provides all-day wearing comfort without tugging or weighing down uniforms. Key enhancements and sta
In our pressroom you can read all our latest releases, find our press contacts, images, documents and other relevant information about us.
Visit our pressroom
