DevOps Engineer
Huxley
Job Description
Budget: 60 lakhs INR
DevOps Engineer – Mobile Media Platform
About the Role
We are seeking a DevOps Engineer to support a large-scale mobile application that delivers personalized advertising and reward-based user experiences. The service integrates with multiple internal systems, processes high user traffic, and requires a highly reliable and scalable backend environment.
In this role, you will take ownership of the platform’s infrastructure design, daily operations, deployment workflows, and ongoing technical evolution. You will work closely with engineers across regions, collaborate with domain experts, and lead initiatives that strengthen reliability, performance, observability, and cost efficiency.
This position is ideal for someone who enjoys solving unfamiliar problems, researching new technologies, and driving improvements across the entire platform lifecycle.
What You Will Do
Platform & Infrastructure Ownership
- Design, build, and enhance cloud‑based infrastructure that supports rapid development cycles and stable production operations.
- Lead architectural discussions, propose improvements, and enforce best practices for scalability, reliability, and system maintainability.
- Keep operational processes, documentation, and platform guidelines up to date.
Kubernetes & Cloud Responsibilities
- Manage and optimize container-based workloads running on managed Kubernetes environments.
- Drive infrastructure upgrades—including cloud services, middleware, and platform components—with minimal user impact.
- Implement strategies that improve availability, resilience, and performance across varied operating scenarios.
Observability, Reliability & Cost Management
- Establish monitoring and alerting patterns aligned with business and technical needs.
- Identify service bottlenecks and systematically improve performance through metrics, logging, and tracing.
- Continuously assess infrastructure usage, eliminate unnecessary spending, and contribute to a long-term cost-efficient roadmap.
Production Support & Incident Response
- Ensure production systems remain stable and meet high availability expectations.
- Lead troubleshooting of complex incidents and coordinate recovery efforts across teams.
- Apply security guidelines, remediate vulnerabilities, and maintain compliance with organizational standards.
Project Leadership
- Deliver large-scale infrastructure initiatives from planning through implementation.
- Partner with cross-functional teams, mentor peers, and provide technical guidance in reviews and architecture sessions.
Required Experience
- 7+ years in DevOps, SRE, or platform engineering roles.
- Strong hands-on expertise with cloud platforms (preferably GCP) and container orchestration (GKE/EKS) at scale.
- Experience supporting distributed system architectures such as microservices, event-driven systems, or high-traffic APIs.
- Deep knowledge of middleware technologies (caching layers, message queues, RPC frameworks) and core networking/IAM concepts.
- Demonstrated experience maintaining service uptime, recovering from system incidents, and operating in fast-moving production environments.
- Familiarity with resource planning, cost control strategies, and performance tuning for large-scale systems.
- Strong ownership mindset and proactive approach to problem-solving.
Preferred Qualifications
- Experience designing reusable DevOps tooling, shared CI/CD workflows, or automation libraries.
- Ability to translate business goals into technical requirements and influence system direction.
- Strong experience designing alerting strategies tied to business KPIs.
- Understanding of data workflow orchestration (e.g., Airflow) and experience with cloud‑native data pipelines.