Khalil Nouisser

Client: ENGIE · Platform & Cloud · Period: Jul 2024 to present

End-to-end Grafana observability

A complete Grafana stack — Alloy and Vector for collection, Mimir, Loki, and Tempo for storage — deployed across multiple clusters on ENGIE's DevOps platform.

Results

6 production EKS clusters covered

Context

ENGIE Digital & IT's DevOps platform runs on 6 production EKS clusters and 200+ VMs, serving 600+ organizations. Metrics, logs, and traces are produced there continuously, at scale.

Challenge

Unify three signals — metrics, logs, traces — across several clusters, in a coherent, operable stack, without piling up agents or creating one silo per team.

Solution

  1. Unified collection: Grafana Alloy and Vector deployed across all clusters.

  2. A dedicated backend per signal: metrics with Prometheus and Mimir, logs with Loki, traces with Tempo.

  3. Grafana dashboards and alerting, wired to the internal alert centralization tool (Python).

  4. Multi-cluster deployment and operations on the production environments.
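
As an illustration of step 3, here is a minimal Python sketch of how a Grafana webhook notification could be flattened into one record per alert before being handed to an alert centralization tool. The internal tool's interface is not described in this case study, so the CentralAlert record, the function name, and the "cluster" and "severity" label conventions are assumptions made for the example. Only the payload field names (alerts, labels, annotations, fingerprint, startsAt) follow Grafana's webhook contact point format.

```python
from dataclasses import dataclass
from typing import Any


@dataclass(frozen=True)
class CentralAlert:
    """Hypothetical flat record for a central alert store."""
    fingerprint: str
    name: str
    status: str      # "firing" or "resolved"
    cluster: str
    severity: str
    summary: str
    starts_at: str


def normalize_grafana_webhook(payload: dict[str, Any]) -> list[CentralAlert]:
    """Flatten one Grafana webhook notification into per-alert records."""
    records = []
    for alert in payload.get("alerts", []):
        labels = alert.get("labels", {})
        annotations = alert.get("annotations", {})
        records.append(
            CentralAlert(
                fingerprint=alert.get("fingerprint", ""),
                name=labels.get("alertname", "unknown"),
                status=alert.get("status", "firing"),
                # "cluster" and "severity" are assumed label conventions,
                # not something the case study confirms.
                cluster=labels.get("cluster", "unknown"),
                severity=labels.get("severity", "none"),
                summary=annotations.get("summary", ""),
                starts_at=alert.get("startsAt", ""),
            )
        )
    return records


if __name__ == "__main__":
    # Made-up sample payload, for illustration only.
    sample = {
        "status": "firing",
        "alerts": [
            {
                "status": "firing",
                "labels": {"alertname": "HighErrorRate", "cluster": "prod-1"},
                "annotations": {"summary": "5xx ratio above threshold"},
                "startsAt": "2024-07-01T10:00:00Z",
                "fingerprint": "abc123",
            }
        ],
    }
    for record in normalize_grafana_webhook(sample):
        print(record)
```

Keying each record on the fingerprint and carrying the cluster label through is one way to keep alerts from several clusters distinguishable once they land in a single place.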

Stack

  • Grafana
  • Alloy
  • Vector
  • Prometheus
  • Mimir
  • Loki
  • Tempo
  • EKS

