Cloud Site Reliability Engineer
Cornerstone OnDemand
Gurgaon
Full-Time
4–8 years
Posted 4 days ago
Apply by June 11, 2026
Job Description
We're looking for a Cloud Site Reliability Engineer. This role is office-based, at the Mumbai office.
We are seeking a highly skilled Site Reliability Engineer (SRE) with strong experience in Kubernetes troubleshooting, incident response, and deep knowledge of monitoring and alerting systems, along with solid experience in CI/CD pipeline design and maintenance. You will play a key role in building and maintaining reliable infrastructure, enhancing observability, and ensuring uptime for mission-critical systems.
Spark Greatness. Shatter Boundaries. Share Success. Are you ready? Because here, right now – is where the future of work is happening. Where curious disruptors and change innovators like you are helping communities and customers enable everyone – anywhere – to learn, grow and advance. To be better tomorrow than they are today.
Who We Are
Cornerstone powers the potential of organizations and their people to thrive in a changing world. Cornerstone Galaxy, the complete AI-powered workforce agility platform, meets organizations where they are. With Galaxy, organizations can identify skills gaps and development opportunities, retain and engage top talent, and provide multimodal learning experiences to meet the diverse needs of the modern workforce. More than 7,000 organizations and 100 million+ users in 180+ countries and in nearly 50 languages use Cornerstone Galaxy to build high-performing, future-ready organizations and people today.
Check us out on LinkedIn, Comparably, Glassdoor, and Facebook!
In this role, you will…
- Diagnose and resolve issues in Kubernetes clusters, including deployments, pod failures, networking issues, and autoscaling.
- Lead incident management efforts including on-call response, root cause analysis, and continuous improvement of incident playbooks.
- Design and maintain monitoring, logging, and alerting systems using tools such as Prometheus, Grafana, and ELK (Elasticsearch, Logstash, Kibana).
- Set up and manage Kibana dashboards and maintain the ELK stack to ensure high availability and performance of logging infrastructure.
- Integrate metrics, logs, and traces into a unified observability platform.
- Build and maintain alerting pipelines that cut noise and improve the signal-to-noise ratio of production alerts.
- Contribute to infrastructure automation using tools such as Terraform and Helm.
- Set up and support CI/CD pipelines for automated testing, deployment, and rollback across multiple environments.
- Participate in shift rotations and continuously improve observability and response systems.
Requirements
- 2+ years in an SRE, DevOps, or Infrastructure Engineer role.
- Bachelor's degree in computer science, IT, or related technical field.
- Hands-on experience with AWS and GCP.
- Deep hands-on experience with Kubernetes (EKS, AKS, GKE).
- Strong understanding of Linux internals, container orchestration, and microservice architecture.
- Hands-on experience with monitoring/logging tools:
  - Prometheus, Grafana, InfluxDB
  - ELK stack (Elasticsearch, Logstash, Kibana)
- Proficient with incident response and alerting tools (e.g., PagerDuty).
- Basic knowledge of:
  - Kafka: topic monitoring, consumer health
  - ElastiCache / Redis: caching patterns and troubleshooting
  - InfluxDB: time-series metrics storage
- Experience writing and maintaining automation scripts in Bash, Python, or Go.