Hugging Face Blog

The New and Fresh analytics in Inference Endpoints

Published March 21, 2025

Analytics is important

Erik Kaunismäki
Thibault Goehringer
Remy
Corentin Regal
Michelle Habonneau

Analytics and metrics are the cornerstone of understanding what's happening with your deployment. Are your Inference Endpoints overloaded? How many requests are they handling? Having well-visualized, relevant metrics displayed in real time is crucial for monitoring and debugging.

We realized that our analytics dashboard needed a refresh. Since we debug a lot of endpoints ourselves, we’ve felt the same pain as our users. That’s why we sat down to plan and make several improvements to provide a better experience for you.

What’s New?

⏰ Real-Time Metrics: Data now updates in real time, giving you an accurate, up-to-the-second view of your endpoint’s performance. Whether you’re monitoring request latency, response times, or error rates, you can now see events as they happen. We’ve also reworked the backend of our analytics dashboard so that data loads swiftly, especially for high-traffic endpoints. No more waiting around for metrics to populate. Just open the dashboard and get instant insights.

🔬 Customizable Time Ranges & Auto-Refresh: We know that different users need different views, so we’ve made it easier to zoom in on a specific time range or track long-term trends. You can also enable auto-refresh, ensuring that your dashboard stays up to date without needing to manually reload.

🔄 Replica Lifecycle View: Understanding what’s happening with your replicas is crucial, so we’ve introduced a detailed view of each replica’s lifecycle. You can now track replicas from initialization to termination, observing every state transition in between. This helps you understand what's going on with your endpoint, even when it has several moving parts.

Even though we’ve rolled out these updates, we’re actively iterating on them. Things will continue to improve, and we welcome all feedback.

Let us know what works, what doesn’t, and what you’d like to see next! 🙌

Head to Inference Endpoints to check out the changes!

Source

Hugging Face Blog - huggingface.co
