Business Wire

KAYTUS Enhances KSManage for Intelligent Management of Liquid-Cooled AI Data Centers

Share

KAYTUS, a leading provider of end-to-end AI and liquid cooling solutions, has announced the release of the enhanced KSManage V2.3, its advanced device management platform for AI data centers. The latest version introduces expanded monitoring and control capabilities tailored for GB200 and B200 systems, including integrated liquid cooling detection features. Leveraging intelligent automation, KSManage V2.3 enables AI data centers to operate with greater precision, efficiency, and sustainability, delivering comprehensive refined management across IT infrastructure and maximizing overall performance.

As Generative AI technology accelerates, AI data centers have emerged as critical infrastructure for enabling innovations in artificial intelligence and big data. Next-generation devices such as NVIDIA’s B200 and GB200 are being rapidly adopted to meet growing AI compute demands. However, their advanced architectures differ substantially from traditional systems, driving the need for more sophisticated management solutions. For instance, the GB200 integrates two B200 Blackwell GPUs with an Arm-based Grace CPU, creating a high-performance configuration that poses new management challenges. From hardware status monitoring to software scheduling, more precise and intelligent control mechanisms are essential to maintain operational efficiency. Moreover, the elevated computing power of these devices leads to higher energy consumption, increasing the risk of performance bottlenecks, or even system outages in the event of failures. As a result, energy efficiency and real-time system monitoring have become mission-critical for ensuring the stability and sustainability of AI data center operations.

KSManage Provides Intelligent, Refined Management for AI Data Centers

KSManage builds on a wealth of experience in traditional device management and supports more than 5,000 device models. Its comprehensive management framework spans IT, network, security, and other infrastructure components. The platform enables real-time monitoring of critical server components, including CPU, memory, and storage drives. Leveraging intelligent algorithms, KSManage can predict potential faults, issue early warnings, and support preventive maintenance, helping ensure servers operate at peak performance and reducing the risk of unplanned downtime.

The upgraded KSManage delivers comprehensive monitoring of key performance indicators for GB200 and B200 devices, including GPU performance, CPU utilization, and memory bandwidth. Through 3D real-time modeling, it dynamically visualizes resource distribution and intelligently adjusts allocation based on workload demands. The platform also features automated network topology management, enabling real-time optimization of NVLink connectivity, and contributing to a 90% boost in operational efficiency. During large model training, KSManage automatically allocates more computing resources to relevant tasks, optimizing the distribution of CPU, GPU, and other components. This ensures higher device utilization, improved computational efficiency, and significantly faster training times.

Specific for intelligent fault detection, the upgraded KSManage introduces a three-tier monitoring framework spanning the component, machine, and cluster levels. At the component level, it leverages the PLDM protocol to enable precise monitoring of critical metrics such as GPU memory status. When computational errors are detected in B200 GPUs, KSManage rapidly analyzes error logs to distinguish between hardware faults and software conflicts, achieving over 92% accuracy in fault localization and taking timely corrective actions. At the machine level, KSManage integrates both BMC out-of-band logs and OS in-band logs to support fast and reliable hardware diagnostics. At the cluster level, federated management technology enables cross-domain alarm correlation and analysis, and triggers self-healing engines capable of responding to risks within seconds. The system also synchronizes with a high-precision liquid leak monitoring solution to enhance equipment safety. Collectively, these capabilities significantly reduce Mean Time to Repair (MTTR) and improve Mean Time Between Failures (MTBF), ensuring higher stability and resilience across AI data center operations.

Intelligent Management of Green, Liquid-Cooled AI Data Centers

As power density in AI data centers continues to increase, cooling has become a critical factor influencing both device performance and operational lifespan. To address this challenge, liquid cooling technology—recognized for its high thermal efficiency—has been widely adopted across next-generation AI infrastructure.

The upgraded KSManage introduces a new liquid cooling detection feature that enhances both the efficiency and safety of liquid cooling operations in AI data centers. The system provides real-time monitoring of key parameters such as coolant flow rate, temperature, and pressure, ensuring stable and optimal performance of the liquid cooling infrastructure. By analyzing data from chip power consumption and cooling circuit pressure, KSManage employs a multi-objective optimization algorithm to dynamically adjust flow rates and calculate the optimal coolant distribution under varying workloads. Powered by AI-driven precision control, the platform achieves a 50% improvement in flow utilization and delivers up to 10% additional energy savings in the liquid cooling system.

In addition, KSManage enhances operational reliability by providing real-time anomaly detection in the liquid cooling system. When issues such as abnormal flow rates, pressure fluctuations, temperature control failures, or condensation are detected, the system triggers instant alerts and delivers detailed fault diagnostics, enabling maintenance teams to quickly identify and resolve problems. In the event of a critical coolant leak, KSManage coordinates with the Coolant Distribution Unit (CDU) to deliver a millisecond-level response. Upon detection, the system immediately shuts off coolant flow and initiates an automatic power-down of the CDU, ensuring maximum protection of devices and infrastructure.

For high-power devices such as the GB200 and B200, KSManage delivers fine-grained energy consumption management at the GPU level. It dynamically adjusts the Thermal Design Power (TDP) thresholds of H100/B200 GPUs, while integrating intelligent temperature regulation technologies—such as variable-frequency fluorine pumps—within the liquid cooling system. These optimizations help reduce Power Usage Effectiveness (PUE) to below 1.3. Additionally, the platform’s power-environment interaction module leverages AI algorithms to predict potential cooling system failures. Through synergistic optimization of computing power and energy consumption, KSManage reduces the power usage per cabinet by 20%, effectively lowering device failure rates and improving overall energy efficiency.

KSManage has been successfully deployed across a wide range of industries globally, including internet, finance, and telecommunications. With its intelligent, refined, and sustainable management capabilities, it has become an essential tool for overseeing device operations in AI data centers. In one notable case, an AI data center in Central Asia achieved more than a fourfold increase in operational efficiency by leveraging KSManage’s intelligent diagnostic features. Device fault handling time was also reduced by 80%. Monitoring and control of the liquid cooling system, and firmware optimization collectively contributed to a 20% reduction in energy consumption. Additionally, the hardware service lifespan was extended by one to two years.

KSManage continues to play a critical role in ensuring the efficient, stable, and sustainable operation of AI data center infrastructure.

About KAYTUS

KAYTUS is a leading provider of end-to-end AI and liquid cooling solutions, delivering a diverse range of innovative, open, and eco-friendly products for cloud, AI, edge computing, and other emerging applications. With a customer-centric approach, KAYTUS is agile and responsive to user needs through its adaptable business model. Discover more at KAYTUS.com and follow us on LinkedIn and X

View source version on businesswire.com: https://www.businesswire.com/news/home/20250626477464/en/

Contacts

Media contact
media@kaytus.com

About Business Wire

For more than 50 years, Business Wire has been the global leader in press release distribution and regulatory disclosure.

www.businesswire.com

Subscribe to releases from Business Wire

Subscribe to all the latest releases from Business Wire by registering your e-mail address below. You can unsubscribe at any time.

Latest releases from Business Wire

Data4Industry-X Awarded BAIDATA Excellence 2025 for Advancing Interoperability of Industrial Data Spaces26.6.2025 15:30:00 EEST | Press release

Data4Industry-X, the trusted Industry Data Space solution, has been awarded the BAIDATA Excellence Award 2025 in the category “Deployment of Innovative Pilots and Use Cases”,by BAIDATA, the Spanish Association for the development of data sovereignty and data economy in the Iberian Peninsula. This recognition highlights the comprehensive achievement made by Dawex, Schneider Electric, Valeo, CEA, and Prosyst in demonstrating with Data4Industry-X solution, the first international interoperability of data spaces for the exchange of digital product passport information, as showcased at Hannover Messe, in April 2025 on Plattform Industrie 4.0. stand. This award underlines the French technology and industry excellence in fostering seamless, cross-border interoperability of industrial data spaces, and the impact of Data4Industry-X solution in advancing the development of federated, sovereign and interoperable data spaces, accelerating the adoption of trusted data exchanges across Europe’s indu

TH Global Capital Announces Its Growth Advisory Practice: ‘TH Growth Strategy + Deal’ to Drive Sustained Value Creation for Clients26.6.2025 15:13:00 EEST | Press release

TH Global Capital, an award-winning global investment banking firm recognized as Boutique Investment Banking Firm of the Year for three consecutive years, with a track record of closing transactions in 29 countries, is pleased to announce the launch of ‘TH Growth Strategy + Deal’. This enhanced growth advisory practice is designed to help mid-market companies around the world unlock transformative growth, drive strategic revenue expansion, and execute high impact deals through a hands on, results driven approach. The growth advisory practice enables clients to exit, raise capital or recapitalize at a higher valuation. TH Global Capital partners closely with clients to elevate their businesses through tailored interventions that are designed to drive measurable improvements in revenue, accelerate order bookings, expand geographic reach, improve EBITDA margins, and build a sustained sales capability engine with the goal of unlocking long term value. Vivek Subramanyam, Founder and CEO of

Groups Representing Patients, Healthcare Professionals and Pharmaceutical Industry Author New Principle on Use of AI In Healthcare26.6.2025 15:00:00 EEST | Press release

Six leading international organizations representing patients, physicians, pharmacists, nurses, hospitals, and the pharmaceutical industry have today adopted the first joint ethical principle in the healthcare industry on the responsible use of health data and technology, including artificial intelligence. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20250626149027/en/ The new principle joins the International Consensus Framework for Ethical Collaboration (ICF), a longstanding principles-based voluntary agreement that guides ethical collaboration across these major healthcare bodies, working together to deliver high quality care for patients. The ICF was established in 2014 as a global platform to ensure that relationships across the health ecosystem are grounded in ethical, transparent, and responsible decision-making. It unites six leading health bodies representing patient organizations, healthcare professionals, and the

Bentley Systems, Enactus Launch 2025 iTwin4Good Challenge Amid Global Infrastructure Workforce Shortage26.6.2025 15:00:00 EEST | Press release

As the global infrastructure sector faces a critical workforce shortage, Bentley Systems, Incorporated (Nasdaq: BSY), the infrastructure engineering software company, and Enactus, a global nonprofit advancing student innovation and entrepreneurship, announce the start of the 2025 iTwin4Good Challenge. This international competition empowers university students to develop digital twin solutions to help address global infrastructure challenges. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20250626207358/en/ Finalists of the 2024 iTwin4Good Challenge on the competition stage in Kazakhstan. (Image courtesy of Bentley Systems and Enactus) This initiative comes at a pivotal time. In recent years, global infrastructure sectors have faced a critical workforce shortage, despite rising demand for better and more resilient infrastructure. For example, the American Council of Engineering Companies (ACEC) Research Institute reports that

ExaGrid Expands Partnership with Cohesity with New Certified Integrations, Enabling Seamless Backup Storage for Both Cohesity NetBackup and DataProtect Customers26.6.2025 15:00:00 EEST | Press release

ExaGrid®, the world’s largest independent backup storage vendor, has announced that it will be an archive storage target for Cohesity DataProtect with its Tiered Backup Storage product line. Today, ExaGrid supports Cohesity NetBackup with 9 certified integrations. As Cohesity continues to support both NetBackup and DataProtect customers, ExaGrid’s Tiered Backup Storage appliances will continue to work as an archive storage target for all Cohesity customers. ExaGrid’s support for Cohesity is expected to be Generally Available (GA) in the first half of 2026. “We’re excited to support Cohesity, including both their NetBackup and DataProtect customers,” said Bill Andrews, President and CEO of ExaGrid. “With customers in over 80 countries, we’re seeing growing interest in Cohesity, and we want to ensure that their investment in ExaGrid remains protected during that transition.” ExaGrid Tiered Backup Storage features a unique architecture with a front-end Landing Zone for fast archiving and

In our pressroom you can read all our latest releases, find our press contacts, images, documents and other relevant information about us.

Visit our pressroom
World GlobeA line styled icon from Orion Icon Library.HiddenA line styled icon from Orion Icon Library.Eye