Technology Innovation Institute Announces Launch of NOOR, the World’s Largest Arabic NLP Model
11.4.2022 14:16:00 EEST | Business Wire | Press release
Technology Innovation Institute (TII), a global research center and applied research pillar of Abu Dhabi’s Advanced Technology Research Council, today announced the launch of NOOR, the world’s largest Arabic natural language processing (NLP) model to date.
This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20220411005085/en/
Noor, the world's largest Arabic NLP Model - AI Cross-Center Unit, Technology Innovation Institute (Photo: AETOSWire)
TII’s team of advanced researchers and artificial intelligence (AI) specialists has joined forces with LightOn, a technology company that unlocks extreme-scale machine intelligence for businesses, to transform the Arabic NLP model. The NOOR model can carry out tasks beyond the domain of language, offering an end-to-end pipeline for high-quality data that includes crawling, filtering, and curation at scale. The model also supports extreme-scale distributed training and serving, delivering applications with efficient inference and model specialization.
Dr. Ray O. Johnson, CEO, TII and ASPIRE, said: “With this development, we are well on track to enhance our research capabilities and credentials as well as elevate the status of Abu Dhabi and the UAE as a serious research ecosystem. Our expert teams have demonstrated yet again that this region can achieve breakthrough R&D outcomes to impact the world.”
Dr. Ebtesam Almazrouei, Director, AI Cross-Center Unit, TII, said: “Large language models have taken the world of natural language processing by storm, and we are proud to introduce this cutting-edge model with 10 billion parameters, the world’s largest Arabic NLP model. The uniquely large Arabic dataset collected to train the model is the result of months of work that included curating, scraping, and filtering varied sources. A special thank you to the entire team that worked on this project to make NOOR the go-to exploration model in Arabic for academicians and businesses everywhere.”
Speaking on the launch, Prof. Mérouane Debbah, Chief Researcher, Digital Science Research Center and AI Cross-Center Unit, TII, said: “With NOOR, TII has expanded the scope of the modern standard Arabic model by leveraging know-how in large language models to build cross-disciplinary, cutting-edge expertise in this new generation of AI research.”
To build what it describes as the world’s largest high-quality cross-domain Arabic dataset, TII combined web data with books, poetry, news articles, and technical information into a unique corpus of more than 30 billion words, significantly widening the applicability of the model.
Dr. Ebtesam Almazrouei said the NOOR model is based on the popular Transformer architecture. As a decoder-only model, similar in structure to GPT-3, it is designed for generative tasks, with an architecture updated to reflect recent developments in machine learning, such as better positional embeddings. To help ensure quality at scale in the NOOR dataset, the TII team designed an automated filtering pipeline based on machine learning techniques. These tools identify text that resembles high-quality reference material and shield the model from exposure to spam content.
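The release does not describe how the filtering pipeline works internally. As a rough illustration only, the idea of scoring text against quality references and spam could be sketched as below. The word-count model, the function names (train, quality_score, filter_corpus), the threshold, and the English sample sentences are all assumptions made for this sketch and are not TII's method.

```python
import math
from collections import Counter


def train(reference_docs, spam_docs):
    """Count word frequencies per class (a toy stand-in for a real classifier)."""
    ref = Counter(w for d in reference_docs for w in d.lower().split())
    spam = Counter(w for d in spam_docs for w in d.lower().split())
    vocab = set(ref) | set(spam)
    return ref, spam, vocab


def quality_score(doc, model):
    """Average per-word log-odds of 'reference' versus 'spam', add-one smoothed."""
    ref, spam, vocab = model
    ref_total = sum(ref.values()) + len(vocab)
    spam_total = sum(spam.values()) + len(vocab)
    words = doc.lower().split()
    if not words:
        return float("-inf")  # empty documents never pass the filter
    log_odds = sum(
        math.log((ref[w] + 1) / ref_total) - math.log((spam[w] + 1) / spam_total)
        for w in words
    )
    return log_odds / len(words)


def filter_corpus(docs, model, threshold=0.0):
    """Keep only documents that look more like reference text than spam."""
    return [d for d in docs if quality_score(d, model) > threshold]


# Hypothetical training examples, in English for readability.
model = train(
    reference_docs=[
        "the study reports results on language models",
        "the poem describes the desert at night",
    ],
    spam_docs=[
        "click here to win free money now",
        "buy cheap pills click now",
    ],
)
kept = filter_corpus(
    ["the results describe language", "click now to win free pills"], model
)
```

A production pipeline would presumably use a far stronger classifier and Arabic-specific preprocessing; the sketch only shows the shape of a score-and-threshold filter.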
Leveraging state-of-the-art 3D parallelism, NOOR was trained on a High-Performance Computing resource with 128 A100 GPUs, allowing for the distribution of computations and ensuring efficient use of the available hardware resources.
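In 3D parallelism, the GPUs are arranged along three axes (data, tensor, and pipeline parallelism), and the product of the three degrees equals the number of GPUs. The release does not state the layout used for NOOR, so the following is a minimal sketch that enumerates the layouts possible on 128 GPUs and maps a flat GPU rank to grid coordinates. The function names and the example split of 4 x 8 x 4 are assumptions for illustration.

```python
from itertools import product

WORLD_SIZE = 128  # number of A100 GPUs stated in the release


def layouts(world_size):
    """Return every (data, tensor, pipeline) triple whose product is world_size."""
    divisors = [d for d in range(1, world_size + 1) if world_size % d == 0]
    return [
        (dp, tp, pp)
        for dp, tp, pp in product(divisors, repeat=3)
        if dp * tp * pp == world_size
    ]


def rank_to_coords(rank, dp, tp, pp):
    """Map a flat GPU rank to (data, pipeline, tensor) indices.

    The tensor index varies fastest, so ranks in the same tensor-parallel
    group are adjacent. This ordering is an assumption of the sketch.
    """
    if not 0 <= rank < dp * tp * pp:
        raise ValueError("rank out of range for this layout")
    tensor = rank % tp
    pipeline = (rank // tp) % pp
    data = rank // (tp * pp)
    return data, pipeline, tensor


all_layouts = layouts(WORLD_SIZE)

# Hypothetical split: 4-way data, 8-way tensor, 4-way pipeline parallelism.
coords = [rank_to_coords(r, dp=4, tp=8, pp=4) for r in range(WORLD_SIZE)]
```

Each of the 128 ranks receives a distinct coordinate, which is what lets every GPU hold one shard of the model and one slice of the batch.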
The Director of the AI Cross-Center Unit noted that this was only the first step in the Unit’s efforts to contribute to the wider UAE Strategy for Artificial Intelligence.
The model takes its name from the Arabic word for "light", a choice meant to link the Arabic language model with the idea of enlightening the mind.
About Technology Innovation Institute (TII)
For more information, visit www.tii.ae
*Source: AETOSWire
View source version on businesswire.com: https://www.businesswire.com/news/home/20220411005085/en/
Contact information
Technology Innovation Institute
Sneha Sivanand, sneha.sivanand@tii.ae
