Deliver end-to-end observability solutions for
clients: assess current state, design Open
Telemetry-based architectures, and implement
collectors, instrumentation (auto + manual), APM
platforms, and production-grade telemetry
pipelines.
Architect and deploy observable cloud-native
systems on AWS (primary), Azure, or GCP,
including microservices, Kubernetes, serverless, and AI/ML workloads.
Lead distributed tracing, golden signals, SLO
implementation, service mapping, and intelligent
alerting to reduce client MTTR and improve
system reliability.
Optimize observability costs through smart
sampling, data tiering, and efficient pipeline design.