Deel
About the Role
Join a globally distributed, high-growth SaaS company transforming the future of work. As an Observability Specialist, you will play a critical role in designing and maintaining scalable monitoring systems across cloud-native environments. You’ll work closely with DevOps, SRE, and Engineering teams to ensure system reliability, performance, and cost efficiency at scale.
Key Responsibilities
- Design, implement, and manage observability solutions for cloud-native infrastructure
- Own monitoring across AWS and Kubernetes (EKS) environments
- Operate and maintain self-hosted monitoring tools such as Prometheus, Grafana, Mimir, Loki, and Tempo
- Manage and optimize Datadog (metrics, logs, APM, alerts, and cost monitoring)
- Improve observability architecture for high availability, scalability, and fault tolerance
- Implement cost optimization strategies (log/trace sampling, retention policies, storage optimization)
- Automate observability infrastructure using Terraform, Helm, and scripting tools
- Integrate monitoring and alerting into CI/CD pipelines (e.g., GitHub Actions)
- Support performance tuning and capacity planning initiatives
- Collaborate cross-functionally to embed best practices and drive continuous improvement
Required Skills & Experience
- 5+ years of experience in observability or monitoring engineering
- Strong hands-on experience with AWS and Kubernetes
- Deep understanding of metrics, logs, traces, SLIs, SLOs, and alerting systems
- Proven experience managing self-hosted monitoring stacks (Prometheus ecosystem preferred)
- Experience designing observability architectures at scale
- Hands-on experience with Datadog
- Strong knowledge of high-availability and fault-tolerant systems
- Experience with Infrastructure as Code (Terraform, Helm)
- Familiarity with CI/CD pipelines and deployment workflows
- Experience with performance optimization and capacity planning
Key Competencies
- Strong analytical and problem-solving skills
- Ability to take ownership of complex systems independently
- Excellent collaboration and communication skills
- Proactive mindset with a focus on continuous improvement
Why Join Deel?
- Be part of one of the fastest-growing SaaS companies globally
- Work in a fully remote, flexible environment
- Access competitive compensation, stock options, and benefits
- Contribute to building infrastructure that powers global work
Application Deadline: May 13, 2026
How to Apply
Interested and qualified candidates should:
Click here to apply
To apply for this job please visit jobs.ashbyhq.com.
