Business Wire

Dataocean AI Launched High-Quality Off-the-Shelf Datasets and Frontier Data Solutions at Interspeech 2024

In the rapidly growing AI market, which is especially focused on foundation models and generative AI, dataset quality directly affects model performance. In real-world applications, data is messy, and improving models is not the only way to get better performance. As AI continues to transform industries, high-quality datasets have become critical for developing responsive, adaptable, and intelligent systems.

This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20240919575026/en/

Dataocean AI at Interspeech 2024 (Photo: Business Wire)

At Interspeech 2024, Dataocean AI, a global leader in AI data solutions, officially launched its latest offerings: high-quality off-the-shelf datasets. This announcement further illustrates the company's position as a pioneer in the AI technology domain.

Dataocean AI introduced its newest corpus, the “Massively Multilingual Speech Corpus”, designed to meet the demands of various application scenarios. The corpus was recorded from 215,891 speakers, totals 259,672 hours, and covers over 100 languages. Along with this corpus, Dataocean AI also showcased its datasets in European languages. These meticulously labeled, high-quality datasets cover English, French, Spanish, Turkish, and Swedish. Known for their diversity and accuracy, they promise to enhance the performance of AI models across industries and applications such as smart finance, AI assistants, in-cabin systems, smart home, and other trending AI topics.

The key strength of Dataocean AI’s datasets lies in their ability to deliver high precision across different fields.

  • For the data collection process, Dataocean AI leverages its extensive global network of native speakers, who record professionally in more than 200 languages. The company maintains a team of native and professional speakers for these recordings and uses high-fidelity equipment across recording environments, including professional studios and indoor, outdoor, and in-cabin settings.
  • For the data labeling process, the company offers datasets labeled on its advanced, self-developed platform with a human in the loop. The expert team consists of scholars and specialists covering multiple scenarios, and it has successfully built over 1,100 speech datasets that meet top quality standards, fulfilling the evolving needs of the AI industry.

In addition to speech datasets, Dataocean AI also owns over 1,600 high-quality training datasets with proprietary intellectual property rights, covering a wide range of fields including foundation models, autonomous driving, finance, healthcare, and law. Its self-developed data processing platform, DOTS, is equipped with more than 200 algorithms and hundreds of data processing tools and supports functions such as automated labeling and assisted labeling, helping customers reduce costs and increase efficiency. Additionally, the company complies with data security regulations such as the European GDPR and has obtained ISO 9001 and ISO 27001 certifications, ensuring safety and compliance.

Along with the high-quality datasets, Dataocean AI also empowers LLMs through world-class live data collection for pre-training; SFT, RLHF, and red-teaming data for fine-tuning; and model evaluation.

Dataocean AI’s goal is to deliver a one-stop data solution that ensures its partners and clients can build reliable, adaptable AI models. This commitment to excellence is central to the company's mission of driving innovation in AI.

For more information about Dataocean AI’s latest datasets and its innovative data solutions, visit the company's official website at www.dataoceanai.com.

About Dataocean AI

With nearly 20 years of project experience, Dataocean AI empowers more than 1,000 internet companies, AI enterprises, and academic institutes with total data solutions. We offer over 1,600 high-quality off-the-shelf datasets and frontier data services, including data collection and data labeling for deep learning technology, enabling clients’ AI models to lead in the market.

View source version on businesswire.com: https://www.businesswire.com/news/home/20240919575026/en/

Contacts

contact@dataoceanai.com

About Business Wire

For more than 50 years, Business Wire has been the global leader in press release distribution and regulatory disclosure.

www.businesswire.com
