Gcore Unveils Inference at the Edge – Bringing AI Applications Closer to End Users for Seamless Real-Time Performance
6.6.2024 11:30:00 EEST | Business Wire | Press release
Gcore, the global edge AI, cloud, network, and security solutions provider, today announced the launch of Gcore Inference at the Edge, a breakthrough solution that provides ultra-low latency experiences for AI applications. This innovative solution enables the distributed deployment of pre-trained machine learning (ML) models to edge inference nodes, ensuring seamless, real-time inference.
This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20240606719181/en/
Gcore Inference at the Edge empowers businesses across diverse industries with cost-effective, scalable, and secure AI model deployment (Graphic: Gcore)
Gcore Inference at the Edge empowers businesses across diverse industries—including automotive, manufacturing, retail, and technology—with cost-effective, scalable, and secure AI model deployment. Use cases such as generative AI, object recognition, real-time behavioural analysis, virtual assistants, and production monitoring can now be rapidly realised on a global scale.
Gcore Inference at the Edge runs on Gcore's extensive global network of 180+ edge nodes, all interconnected by Gcore's sophisticated low-latency smart routing technology. Each high-performance node sits at the edge of the Gcore network, strategically placing servers close to end users. Inference at the Edge runs on NVIDIA L40S GPUs, the market-leading chip designed specifically for AI inference. When a user sends a request, an edge node determines the route to the nearest available inference region with the lowest latency, achieving a typical response time of under 30 ms.
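The routing step described above, choosing the nearest available inference region with the lowest latency, can be illustrated with a minimal sketch. This is not Gcore's implementation: the `Region` type, the `pick_region` function, the region names, and the latency figures are all hypothetical and serve only to show the selection logic.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Region:
    name: str          # hypothetical region label
    latency_ms: float  # assumed latency from the edge node to this region
    available: bool    # whether the region can accept requests right now


def pick_region(regions: list[Region]) -> Optional[Region]:
    """Return the available region with the lowest latency, or None if none is available."""
    candidates = [r for r in regions if r.available]
    return min(candidates, key=lambda r: r.latency_ms, default=None)


# Illustrative values only, not measurements from Gcore's network.
regions = [
    Region("region-a", 12.0, True),
    Region("region-b", 8.0, False),   # lowest latency, but unavailable
    Region("region-c", 25.0, True),
]
print(pick_region(regions).name)
```

In this sketch the lowest-latency region is skipped because it is marked unavailable, so the request would go to the next-best available one.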
The new solution supports a wide range of foundation and custom ML models. Open-source foundation models available in the Gcore ML Model Hub include LLaMA Pro 8B, Mistral 7B, and Stable-Diffusion XL. Models can be selected and trained agnostically to suit any use case, then distributed globally to Gcore Inference at the Edge nodes. This addresses a significant challenge faced by development teams, where AI models are typically run on the same servers they were trained on, resulting in poor performance.
Benefits of Gcore Inference at the Edge include:
- Cost-effective deployment: A flexible pricing structure ensures customers only pay for the resources they use.
- Inbuilt DDoS protection: ML endpoints are automatically protected from DDoS attacks through Gcore’s infrastructure.
- Outstanding data privacy and security: The solution features built-in compliance with GDPR, PCI DSS, and ISO/IEC 27001 standards.
- Model autoscaling: Autoscaling is available to handle load spikes, so a model is always ready to support peak demand and unexpected surges.
- Unlimited object storage: Scalable S3-compatible cloud storage that grows with evolving model needs.
Andre Reitenbach, CEO at Gcore, comments: “Gcore Inference at the Edge empowers customers to focus on getting their machine learning models trained, rather than worrying about the costs, skills, and infrastructure required to deploy AI applications globally. At Gcore, we believe the edge is where the best performance and end-user experiences are achieved, and that is why we are continuously innovating to ensure every customer receives unparalleled scale and performance. Gcore Inference at the Edge delivers all the power with none of the headache, providing a modern, effective, and efficient AI inference experience.”
Learn more at https://gcore.com/inference-at-the-edge
About Gcore
Gcore is the global edge AI, cloud, network, and security solutions provider. Gcore provides its solutions to global leaders in numerous industries. The company manages its own global IT infrastructure across six continents, with one of the best network performances in Europe, Africa, and LATAM and an average response time of 30 ms worldwide. Gcore’s network consists of 180+ points of presence around the world in reliable Tier IV and Tier III data centres, with a total capacity exceeding 200 Tbps.
View source version on businesswire.com: https://www.businesswire.com/news/home/20240606719181/en/
Contact information
Gcore press contact
pr@gcore.com
