
Site Reliability Engineering - GCP

Hyderabad, Telangana, India

3 weeks ago

Applicants: 0

Salary Not Disclosed

2 days left to apply

Job Description

We are seeking a Site Reliability Engineer with expertise in Google Cloud Platform to join our multi-functional SRE team. You will focus on enhancing operational automation and monitoring to improve efficiency within our cloud environments. Your role involves identifying repetitive tasks and implementing automated solutions to minimize manual effort. If you have a strong background in cloud engineering and are driven to optimize system reliability, we invite you to apply and contribute to our team.

Responsibilities

- Act as subject matter expert for operation automation and monitoring on Google Cloud Platform
- Identify toil in existing systems and processes and recommend solutions to reduce manual tasks
- Design and implement automated workflows to improve team efficiency
- Define and create customer user journeys, service level objectives, service level indicators, and error budgeting based on non-functional requirements
- Develop and maintain infrastructure as code using Terraform and GitHub
- Write and maintain scripts using Bash, PowerShell, Python, and Ansible to support automation
- Manage containerized environments using Kubernetes
- Collaborate with team members to reduce toil in software development life cycle and IT operations
- Utilize source control management tools including Git, GitHub, and SonarQube
- Apply understanding of IT service management processes to support operational excellence
- Monitor and analyze system performance metrics using Prometheus and Grafana
- Provide proactive and analytical insights to improve system reliability

Requirements

- Experience of 5 to 10 years in site reliability engineering or related cloud engineering roles
- Strong knowledge of Google Cloud Platform and cloud engineering practices
- Expertise in defining and implementing customer user journeys, service level objectives, service level indicators, and error budgeting
- Proficiency in infrastructure as code tools such as Terraform and GitHub
- Skills in scripting languages, including Bash, PowerShell, Python, and Ansible
- Competency in container orchestration with Kubernetes
- Experience designing and implementing automated workflows to reduce manual effort
- Familiarity with source control management tools like Git, GitHub, and SonarQube
- Understanding of IT service management processes
- Analytical and proactive mindset to identify and solve operational challenges

Additional Information

Company Name
EPAM Systems
Industry
N/A
Department
N/A
Role Category
N/A
Job Role
Mid-Senior level
Education
No Restriction
Job Types
On-site
Gender
No Restriction
Notice Period
Less Than 30 Days
Year of Experience
1 - Any Yrs
Job Posted On
3 weeks ago
Application Ends
2 days left to apply
