Site Reliability Engineer
Actively Reviewing the Applications
Haystack
Chennai
Full-Time
4–8 years
Posted 2 days ago
Apply by June 11, 2026
Job Description
We're working with a fast-growing UK-based ISP and telecommunications innovator on this exciting opportunity.
This is your chance to take ownership of high-scale production systems for a tech-driven telecommunications provider. You will bridge the gap between development and operations, leveraging modern tools like Kubernetes, GCP, and Prometheus to ensure mission-critical platforms remain resilient and scalable as the user base expands.
The Role
- Act as the technical authority for production health, managing incident triage and high-level escalations for live services.
- Build and maintain robust observability frameworks using Prometheus, Grafana, and the ELK stack to proactively monitor system performance.
- Drive platform resilience by implementing automation practices, CI/CD pipelines via GitHub Actions, and container orchestration.
- Collaborate with network engineering teams to optimize ISP-specific infrastructure, including DNS, DHCP, and IPv4/IPv6 networking.
- Ensure new service deployments are "SRE-ready," focusing on scalability, security, and operational reliability from day one.
Requirements
- Proven experience in an SRE, DevOps, or Systems Engineering role managing production environments at scale.
- Deep technical knowledge of Linux administration and scripting proficiency in Bash or Python.
- Hands-on expertise with Google Cloud Platform (GCP) and container technologies including Docker and Kubernetes.
- Strong understanding of ISP/Networking fundamentals such as RADIUS, PPPoE, and core networking protocols.
- Familiarity with Infrastructure as Code (IaC) tools like Terraform or Ansible to drive automation.
Benefits
- Competitive salary up to £70,000 + comprehensive benefits package.
- True flexibility with Remote-First and Hybrid working options in a supportive team culture.
- Premium perks including free gym access, electric vehicle schemes, and a financial wellbeing platform.
- A modern tech stack and clear progression paths within a rapidly scaling organization.