Gcore Unveils Inference at the Edge – Bringing AI Applications Closer to End Users for Seamless Real-Time Performance

en de

New AI solution enables fast, secure, and cost-effective deployment of pre-trained machine learning models globally, at the edge.

<< Back
12/06/2024 |
  • Andre Reitenbach CEO at Gcore27

    Andre Reitenbach, CEO at Gcore

Gcore, the global edge AI, cloud, network, and security solutions provider, today announced the launch of Gcore Inference at the Edge, a breakthrough solution that provides ultra-low latency experiences for AI applications. This innovative solution enables the distributed deployment of pre-trained machine learning (ML) models to edge inference nodes, ensuring seamless, real-time inference.

Gcore Inference at the Edge empowers businesses across diverse industries—including automotive, manufacturing, retail, and technology—with cost-effective, scalable, and secure AI model deployment. Use cases such as generative AI, object recognition, real-time behavioural analysis, virtual assistants, and production monitoring can now be rapidly realised on a global scale.

Gcore Inference at the Edge runs on Gcore's extensive global network of 180+ edge nodes, all interconnected by Gcore’s sophisticated low-latency smart routing technology. Each high-performance node sits at the edge of the Gcore network, strategically placing servers close to end users. Inference at the Edge runs on NVIDIA L40S GPUs, the market-leading chip designed specifically for AI inference. When a user sends a request, an edge node determines the route to the nearest available inference region with the lowest latency, achieving a typical response time of under 30 ms. 

The new solution supports a wide range of fundamental ML and custom models. Available open-source foundation models in the Gcore ML Model Hub include LLaMA Pro 8B, Mistral 7B, and Stable-Diffusion XL. Models can be selected and trained agnostically to suit any use case, before distributing them globally to Gcore Inference at the Edge nodes. This addresses a significant challenge faced by development teams, where AI models are typically run on the same servers they were trained on, resulting in poor performance.

Benefits of Gcore Inference at the Edge include:

  • Cost-effective deployment: A flexible pricing structure ensures customers only pay for the resources they use.
  • Inbuilt DDoS protection: ML endpoints are automatically protected from DDoS attacks through Gcore’s infrastructure.
  • Outstanding data privacy and security: The solution features built-in compliance with GDPR, PCI DSS, and ISO/IEC 27001 standards.
  • Model autoscaling: Autoscaling is available to handle load spikes, so a model is always ready to support peak demand and unexpected surges.
  • Unlimited object storage: Scalable S3-compatible cloud storage that grows with evolving model needs.

Andre Reitenbach, CEO at Gcore comments: “Gcore Inference at the Edge empowers customers to focus on getting their machine learning models trained, rather than worrying about the costs, skills, and infrastructure required to deploy AI applications globally. At Gcore, we believe the edge is where the best performance and end-user experiences are achieved, and that is why we are continuously innovating to ensure every customer receives unparalleled scale and performance. Gcore Inference at the Edge delivers all the power with none of the headache, providing a modern, effective, and efficient AI inference experience.”

Gcore Inference at the Edge is available now.
Learn more at https://gcore.com/inference-at-the-edge

Back to top  | << Back

Communiqués liés

gcore-cdn copy
27/06/2024

Gcore Launches Advanced AI Solution for Real-Time Online Con...

AI-based video content moderation combines computer vision, optical character re...

GCore
Andre Reitenbach CEO at Gcore27
12/06/2024

Gcore Unveils Inference at the Edge – Bringing AI Applicat...

New AI solution enables fast, secure, and cost-effective deployment of pre-train...

GCore
GCore
07/05/2024 Personnalités

Gcore Welcomes International Business Visionary, Dr. Philipp...

Expert in business growth strategies and former German Vice Chancellor takes up ...

GCore
Gcore Recognised as Highly Commended in the Industry Innovator Category at the EMEA NVIDIA Partner Network Awards
23/04/2024

Gcore Recognised as Highly Commended in the Industry Innovat...

Gcore acknowledged for successful launch of first AI speech-to-text solution for...

GCore
Left to right; Jin-yong Kim HyunYong Jung Jacques Flies Minwoo Kang (002)

Gcore opens the first H100-based data center in Korea - Part...

Gcore the global edge AI, cloud, network, and security solutions provider, will ...

GCore
Gcore AI-powered Speech Recognition service Sets New Speed and Scalability Standard for Broadcasters VOD and Content Owners
20/03/2024

Gcore AI-powered Speech Recognition Service Sets New Speed a...

Powerful automated speech recognition service delivers cost-efficiency to conten...

GCore

Il n'y a aucun résultat pour votre recherche

We use cookies to ensure the best experience on our website. By accepting you agree the use of cookies. OK Learn more