Technology Innovation Institute Announces Launch of NOOR, the World’s Largest Arabic NLP Model
11.4.2022 14:16:00 EEST | Business Wire | Press release
Technology Innovation Institute (TII), a global research center and applied research pillar of Abu Dhabi’s Advanced Technology Research Council, today announced the launch of NOOR, the world’s largest Arabic natural language processing (NLP) model to date.
This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20220411005085/en/
Noor, the world's largest Arabic NLP Model - AI Cross-Center Unit, Technology Innovation Institute (Photo: AETOSWire)
TII’s team of advanced researchers and Artificial Intelligence (AI) specialists, has joined forces with LightOn, a technology company that unlocks extreme-scale machine intelligence for businesses, to transform the Arabic NLP model. The NOOR model has the capability to carry out tasks beyond the domain of language - offering end-to-end pipeline high quality data, including crawling, filtering, and curation at scale. The model facilitates extreme-scale distributed training and serving – to deliver applications with efficient inference and model specialization.
Dr. Ray O. Johnson, CEO, TII and ASPIRE, said: “With this development, we are well on track to enhance our research capabilities and credentials as well as elevate the status of Abu Dhabi and the UAE as a serious research ecosystem. Our expert teams have demonstrated yet again that this region can achieve breakthrough R&D outcomes to impact the world.”
Dr. Ebtesam Almazrouei, Director, AI Cross-Center Unit, TII, said: “Large language models have taken the world of natural language processing by storm, and we are proud to introduce this cutting-edge model with 10 billion parameters - the world’s largest Arabic NLP model. The uniquely large Arabic dataset collected to train the model is the result of months of work that included curating, scrapping, and filtering of varied sources. A special thank you to the entire team that worked on this project to make NOOR the go-to exploration model in Arabic for academicians and businesses everywhere.”
Speaking on the launch, Prof. Mérouane Debbah, Chief Researcher, Digital Science Research Center and AI Cross-Center Unit, TII, said: “With NOOR, TII has expanded the scope of the modern standard Arabic model by leveraging know-how in large language models to build cross-disciplinary, cutting-edge expertise in this new generation of AI research.”
To curate the world’s largest high-quality cross-domain Arabic datasets, NOOR’s unique dataset of more than 30 billion words combines web data with books, poetry, news articles, and technical information to significantly widen the applicability of the model.
Dr. Ebtesam Almazrouei said the NOOR model is based on the popular Transformer architecture. As a decoder-only model, similar in structure to GPT-3, it is programmed to tackle generative tasks with architecture upgraded to reflect the latest developments in the world of machine learning, including improvements such as better positional embeddings. To help ensure quality at scale in the NOOR dataset, the TII team designed an automated filtering pipeline based on machine learning techniques. These tools identify text like quality references and safeguard the model from exposure to spam content.
Leveraging state-of-the-art 3D parallelism, NOOR was trained on a High-Performance Computing resource with 128 A100 GPUs, allowing for the distribution of computations and ensuring efficient use of the available hardware resources.
The Director of the AI Cross-Center Unit noted that this was only the first step in the Unit’s efforts to contribute to the wider UAE Strategy for Artificial Intelligence.
Named for the Arabic word "light", the model has been so called to establish the correlation of the Arabic language model to enlightening the mind.
About Technology Innovation Institute (TII)
For more information, visit www.tii.ae
*Source: AETOSWire
To view this piece of content from cts.businesswire.com, please give your consent at the top of this page.
View source version on businesswire.com: https://www.businesswire.com/news/home/20220411005085/en/
Contact information
Technology Innovation Institute
Sneha Sivanand, sneha.sivanand@tii.ae
About Business Wire
For more than 50 years, Business Wire has been the global leader in press release distribution and regulatory disclosure.
Subscribe to releases from Business Wire
Subscribe to all the latest releases from Business Wire by registering your e-mail address below. You can unsubscribe at any time.
Latest releases from Business Wire
Navan Unlocks Savings for Travelers with First SAS NDC Direct Connect28.5.2026 10:01:00 EEST | Press release
Navan (NASDAQ: NAVN), the global AI-powered business travel and expense platform, today announced a New Distribution Capability (NDC) integration with Scandinavian Airlines (SAS). By allowing the airline to share its fares, availability, and offers directly in real time, the integration provides an expanded portfolio of SAS fares and services to Navan customers. This makes Navan the first Travel Management Company (TMC) to access SAS NDC content via a direct connection, leveraging version 21.3 of the NDC API. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20260528812727/en/ Direct connection gives travelers access to lower fares and streamlined post-booking services “Our Modern Airline Retailing journey is centered on delivering more relevant offers, greater transparency, and better value for travelers,” said Edward Fotheringham, VP Sales & Distribution at SAS. “By connecting directly with Navan via our NDC channel, we’re expa
Navan Strengthens European Train Offering with Swedish Rail Integration28.5.2026 10:00:00 EEST | Press release
Navan (NASDAQ: NAVN), the global AI-powered business travel and expense platform, today announced the addition of more than 20 Swedish rail carriers to its platform, including Sweden’s largest operators, SJ and VR. Powered by SilverRail's global rail distribution platform, the API integration unlocks access for Navan customers to domestic rail routes in Sweden, as well as many popular cross-border routes in the region, such as between Stockholm and Copenhagen. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20260528696890/en/ Through a direct connection with SilverRail, Navan adds more than 20 Swedish rail carriers to its long list of European rail options “We’re seeing strong momentum in Sweden,” says Michael Riegel, Chief Customer Officer at Navan. “It’s a unique economy where you have this intersection of industrial companies, in manufacturing, maritime, and automotive, colliding with a world-class tech and AI scene. Our gro
KKR to Open New Office in Milan, Strengthening Long-Term Commitment to Italy28.5.2026 10:00:00 EEST | Press release
KKR, a leading global investment firm, today announced plans to open an office in Milan, further strengthening its long-term commitment to Italy and expanding its local presence in one of Europe’s largest economies. The office will support the firm’s investment activity across Private Equity, Real Assets, Credit and Insurance, while deepening client partnerships and advancing the continued development of KKR’s private wealth business in Italy. Italy has been an important market for KKR for over two decades, with over €10 billion of capital deployed since 2005 across Private Equity, Real Assets and Credit. The firm’s investments include FiberCop, Europe’s first wholesale-only, open-access fibre network, Enilive, a key player in advancing Italy’s energy transition, and CMC, a sustainable packaging leader using robotics to drive innovation. These investments reflect KKR’s focus on partnering with businesses in sectors critical to long-term economic growth and transformation, and on suppor
Merz Therapeutics Presents New Research at World Parkinson Congress 2026, Revealing the Hidden Burden of "OFF" Episodes in Parkinson’s Disease28.5.2026 10:00:00 EEST | Press release
Merz Therapeutics, a leading player in neurology-focused specialty pharma, today announced the presentation of new research at the World Parkinson Congress (WPC) 2026 that uncovers the multifaceted burden of "OFF" episodes in Parkinson's disease (PD). The qualitative literature review demonstrates that these episodes are not only a re-emergence of motor symptoms, but also a complex mix of debilitating motor and non-motor symptoms that impact the lives of people with Parkinson’s disease.1 Additional data presented at the congress also confirm the clinical profile of levodopa inhalation powder (INBRIJA®) as a reliable and well-tolerated treatment for these debilitating events. The new research moves beyond well-recognized physical signs to create a more comprehensive model for understanding the true patient experience of an OFF episode.1 The systematic review identified 132 distinct concepts, detailing the profound impact of "invisible" non-motor symptoms such as fatigue, memory problems
SMBC and Toshiba Jointly Develop New Equity Indices Using Advanced Quantum-Driven Technologies28.5.2026 04:00:00 EEST | Press release
Sumitomo Mitsui Banking Corporation (“SMBC”) and Toshiba Corporation (“Toshiba”) today announced the joint development of the SMBC/TOSHIBA Quantum Driven Diversified Japan Equity Index and the SMBC/TOSHIBA Quantum Driven Diversified U.S. Equity Index, new equity indices realized with advanced quantum-driven technologies. Collectively, the indices are referred to as “SMBC/TOSHIBA Quantum Diversified” (the “Indices”). This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20260519448161/en/ Toshiba’s Simulated Bifurcation Machine 1. Background and Objectives Equity investment is central to asset management, but it also carries the ever-present risk of abrupt and substantial market fluctuations driven by geopolitical developments, changes in economic policy, and other external factors. In uncertain markets, investors are constantly seeking innovations in risk diversification that can protect their assets from unexpected market shocks. SM
In our pressroom you can read all our latest releases, find our press contacts, images, documents and other relevant information about us.
Visit our pressroom
