Technology Innovation Institute Announces Launch of NOOR, the World’s Largest Arabic NLP Model
11.4.2022 14:16:00 EEST | Business Wire | Press release
Technology Innovation Institute (TII), a global research center and applied research pillar of Abu Dhabi’s Advanced Technology Research Council, today announced the launch of NOOR, the world’s largest Arabic natural language processing (NLP) model to date.
This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20220411005085/en/
Noor, the world's largest Arabic NLP Model - AI Cross-Center Unit, Technology Innovation Institute (Photo: AETOSWire)
TII’s team of advanced researchers and Artificial Intelligence (AI) specialists, has joined forces with LightOn, a technology company that unlocks extreme-scale machine intelligence for businesses, to transform the Arabic NLP model. The NOOR model has the capability to carry out tasks beyond the domain of language - offering end-to-end pipeline high quality data, including crawling, filtering, and curation at scale. The model facilitates extreme-scale distributed training and serving – to deliver applications with efficient inference and model specialization.
Dr. Ray O. Johnson, CEO, TII and ASPIRE, said: “With this development, we are well on track to enhance our research capabilities and credentials as well as elevate the status of Abu Dhabi and the UAE as a serious research ecosystem. Our expert teams have demonstrated yet again that this region can achieve breakthrough R&D outcomes to impact the world.”
Dr. Ebtesam Almazrouei, Director, AI Cross-Center Unit, TII, said: “Large language models have taken the world of natural language processing by storm, and we are proud to introduce this cutting-edge model with 10 billion parameters - the world’s largest Arabic NLP model. The uniquely large Arabic dataset collected to train the model is the result of months of work that included curating, scrapping, and filtering of varied sources. A special thank you to the entire team that worked on this project to make NOOR the go-to exploration model in Arabic for academicians and businesses everywhere.”
Speaking on the launch, Prof. Mérouane Debbah, Chief Researcher, Digital Science Research Center and AI Cross-Center Unit, TII, said: “With NOOR, TII has expanded the scope of the modern standard Arabic model by leveraging know-how in large language models to build cross-disciplinary, cutting-edge expertise in this new generation of AI research.”
To curate the world’s largest high-quality cross-domain Arabic datasets, NOOR’s unique dataset of more than 30 billion words combines web data with books, poetry, news articles, and technical information to significantly widen the applicability of the model.
Dr. Ebtesam Almazrouei said the NOOR model is based on the popular Transformer architecture. As a decoder-only model, similar in structure to GPT-3, it is programmed to tackle generative tasks with architecture upgraded to reflect the latest developments in the world of machine learning, including improvements such as better positional embeddings. To help ensure quality at scale in the NOOR dataset, the TII team designed an automated filtering pipeline based on machine learning techniques. These tools identify text like quality references and safeguard the model from exposure to spam content.
Leveraging state-of-the-art 3D parallelism, NOOR was trained on a High-Performance Computing resource with 128 A100 GPUs, allowing for the distribution of computations and ensuring efficient use of the available hardware resources.
The Director of the AI Cross-Center Unit noted that this was only the first step in the Unit’s efforts to contribute to the wider UAE Strategy for Artificial Intelligence.
Named for the Arabic word "light", the model has been so called to establish the correlation of the Arabic language model to enlightening the mind.
About Technology Innovation Institute (TII)
For more information, visit www.tii.ae
*Source: AETOSWire
To view this piece of content from cts.businesswire.com, please give your consent at the top of this page.
View source version on businesswire.com: https://www.businesswire.com/news/home/20220411005085/en/
Contact information
Technology Innovation Institute
Sneha Sivanand, sneha.sivanand@tii.ae
About Business Wire
For more than 50 years, Business Wire has been the global leader in press release distribution and regulatory disclosure.
Subscribe to releases from Business Wire
Subscribe to all the latest releases from Business Wire by registering your e-mail address below. You can unsubscribe at any time.
Latest releases from Business Wire
TVCMALL MWC 2026 -tapahtumassa: johtava mobiilitarvikkeiden tukkukauppa ja älykäs hankinta1.3.2026 08:00:00 EET | Tiedote
MWC Barcelona 2026 korostaa tekoälyn, yhteyksien ja älykkäämpien teknologisten järjestelmien kasvavaa merkitystä, ja samalla mobiilitarvikemarkkinat siirtyvät uuteen vaiheeseen, jota leimaavat nopeus ja monimutkaisuus. Tuotekategoriat laajenevat jatkuvasti, muotoilut ja tyylit päivittyvät yhä nopeammin, ja jälleenmyyjiltä odotetaan reagointia markkinamuutoksiin entistä lyhyemmissä sykleissä. Tuotevalikoiman ajantasaisena pitäminen samalla kun hankintaa hallitaan tehokkaasti on muodostunut todelliseksi haasteeksi jälleenmyyjille ja jakelijoille eri puolilla Eurooppaa. Tämä lehdistötiedote sisältää multimediaa. Katso koko julkaisu täällä: https://www.businesswire.com/news/home/20260121785166/fi/ TVCMALL MWC 2026 -tapahtumassa: johtava mobiilitarvikkeiden tukkukauppa ja älykäs hankinta MWC Barcelona 2026 - Tapahtumassa TVCMALL korostaa rooliaan Euroopan johtavana yhden luukun puhelin tarvikkeita tukkutoimittajana, keskittyen selkeästi siihen, että tukkukauppa ja hankinta olisivat helpompi
TVCMALL at MWC 2026: Leading Mobile Accessories Wholesale and Smarter Sourcing1.3.2026 08:00:00 EET | Press release
As MWC Barcelona 2026 highlights the growing role of AI, connectivity, and smarter technology systems, the mobile accessories market is entering a new phase defined by speed and complexity. Product categories continue to expand, designs and styles update faster, and retailers are expected to respond to market changes in shorter cycles. Keeping product lines up to date while managing sourcing efficiently has become a real challenge for retailers and distributors across Europe. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20260121405485/en/ TVCMALL at MWC 2026: Leading Mobile Accessories Wholesale and Smarter Sourcing At MWC Barcelona 2026, TVCMALL highlights its role as a leading one-stop mobile accessories wholesaler in Europe, with a clear focus on making wholesale and sourcing easier, faster, and more reliable. With more than 18 years of experience, TVCMALL works closely with 30+ leading retail partners across Europe, supp
Biocytogen Announces Clinical Milestone with First Patient Dosed in Phase 1 Trial of IDEAYA’s First-in-Class B7H3/PTK7 Bispecific TOP1 ADC IDE03428.2.2026 02:00:00 EET | Press release
Biocytogen Pharmaceuticals (Beijing) Co., Ltd. (Biocytogen, SSE: 688796; HKEX: 02315), a global biotechnology company that drives the research and development of novel antibody-based drugs with innovative technologies, today announced that its partner IDEAYA Biosciences, Inc. (“IDEAYA”; Nasdaq: IDYA) has dosed the first patient in IDEAYA’s Phase 1 dose-escalation/expansion clinical trial of IDE034, an investigational B7H3/PTK7 bispecific TOP1 ADC. Pursuant to the companies’ option and license agreement, first patient dosing triggers a $5 million milestone payment to Biocytogen. According to IDEAYA, the Phase 1 study is designed to characterize IDE034’s safety profile, tolerability, and PK as a monotherapy, and IDEAYA also intends to evaluate combination regimens with DNA damage response (DDR) -targeting agents such as its oral PARG inhibitor IDE161 as the program advances. IDE034 is a potential first-in-class bispecific B7H3/PTK7 TOP1 ADC, independently developed by Biocytogen and lice
IQM and Real Asset Acquisition Corp. to Host Conference Call/Webcast to Discuss Proposed Transaction27.2.2026 14:00:00 EET | Press release
IQM Finland Oy, a global leader in full-stack superconducting quantum computers (“IQM”, “IQM Quantum Computers” or the “Company”), and Real Asset Acquisition Corp. (Nasdaq: RAAQ), a special purpose acquisition company (“RAAQ”), announced that they will host a conference call to discuss their recently announced business combination, including certain transaction highlights. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20260227472716/en/ IQM Radiance quantum computer As previously disclosed, on February 23, 2026, IQM and RAAQ announced they have entered into a definitive business combination agreement, which will result in IQM becoming a public company and listing American Depositary Shares on one of the two leading U.S. stock exchanges. The transaction provides funding with the aim to accelerate IQM’s technology and commercial development towards fault-tolerance quantum computing, further advancing its position as a leading p
HighRadius Launches $0 Implementation Fee, $0 Subscription Fee via Outcome Based Pricing for oCFO Software27.2.2026 13:00:00 EET | Press release
HighRadius launches Office of the CFO first Outcome Based Pricing with $0 Implementation fee and $0 Subscription until Go-Live. Customers only pay a fraction of realized gains based on P&L impact. Chapter 1: Outcome Based Pricing (OBP) Introduction of OBP: HighRadius, a provider of 190+ AI agents for Order-to-Cash, Accounts Payable, Record-to-Report, and Treasury introduces Outcome Based Pricing (OBP). Three Components of OBP: Customers pay a) $0 in Implementation fees, b) $0 in Subscription fees until Go Live, c) HighRadius earns a fraction of the actual savings realized by the client. Chapter 2: US GAAP & ASC 606 Constraints Not Designed for Innovation: The traditional ASC 606 model requires companies to standardize and recognize revenue based on contractual obligations. For a traditional SaaS subscription, the obligation is access to software over time. AI agents are designed to deliver quantifiable, real-time Business Outcomes that do not fit the traditional accounting framework. C
In our pressroom you can read all our latest releases, find our press contacts, images, documents and other relevant information about us.
Visit our pressroom
