Observability Specialist at Deel

Deel

About the Role

Join a globally distributed, high-growth SaaS company transforming the future of work. As an Observability Specialist, you will play a critical role in designing and maintaining scalable monitoring systems across cloud-native environments. You’ll work closely with DevOps, SRE, and Engineering teams to ensure system reliability, performance, and cost efficiency at scale.

Key Responsibilities

  • Design, implement, and manage observability solutions for cloud-native infrastructure
  • Own monitoring across AWS and Kubernetes (EKS) environments
  • Operate and maintain self-hosted monitoring tools such as Prometheus, Grafana, Mimir, Loki, and Tempo
  • Manage and optimize Datadog (metrics, logs, APM, alerts, and cost monitoring)
  • Improve observability architecture for high availability, scalability, and fault tolerance
  • Implement cost optimization strategies (log/trace sampling, retention policies, storage optimization)
  • Automate observability infrastructure using Terraform, Helm, and scripting tools
  • Integrate monitoring and alerting into CI/CD pipelines (e.g., GitHub Actions)
  • Support performance tuning and capacity planning initiatives
  • Collaborate cross-functionally to embed best practices and drive continuous improvement

Required Skills & Experience

  • 5+ years of experience in observability or monitoring engineering
  • Strong hands-on experience with AWS and Kubernetes
  • Deep understanding of metrics, logs, traces, SLIs, SLOs, and alerting systems
  • Proven experience managing self-hosted monitoring stacks (Prometheus ecosystem preferred)
  • Experience designing observability architectures at scale
  • Hands-on experience with Datadog
  • Strong knowledge of high-availability and fault-tolerant systems
  • Experience with Infrastructure as Code (Terraform, Helm)
  • Familiarity with CI/CD pipelines and deployment workflows
  • Experience with performance optimization and capacity planning

Key Competencies

  • Strong analytical and problem-solving skills
  • Ability to take ownership of complex systems independently
  • Excellent collaboration and communication skills
  • Proactive mindset with a focus on continuous improvement

Why Join Deel?

  • Be part of one of the fastest-growing SaaS companies globally
  • Work in a fully remote, flexible environment
  • Access competitive compensation, stock options, and benefits
  • Contribute to building infrastructure that powers global work

Application Deadline: May 13, 2026

How to Apply
Interested and qualified candidates should:
Click here to apply

To apply for this job please visit jobs.ashbyhq.com.

Scroll to Top