MLOps Engineer - Azure/Kubernetes
The IT Firm
Bengaluru
Full-Time
4–8 years
Posted 2 days ago
Apply by June 11, 2026
Job Description
Location: Bangalore (Work from Office / Hybrid)
Experience: 5 to 8 Years
Employment Type: Full-Time
About The Role
We are looking for a highly skilled Senior Ops/MLOps Engineer to drive the deployment, scalability, and operational excellence of GenAI, LLM, and Machine Learning workloads. This role requires deep expertise in the Azure cloud ecosystem, Kubernetes platforms, and modern LLMOps practices.
You will play a critical role in building reliable, scalable, and production-grade AI platforms, enabling seamless deployment of ML models, Large Language Models (LLMs), and GenAI-based applications.
Key Responsibilities
- CI/CD & Automation
  - Design, implement, and maintain CI/CD/CT pipelines for ML models, LLMs, and GenAI workloads
  - Automate end-to-end model lifecycle including build, test, deployment, and monitoring
  - Implement GitOps-based deployment strategies using modern tooling
- Model Deployment & Platform Engineering
  - Deploy and operationalize:
    ii. AI agents and GenAI applications
  - Work with platforms such as Azure Databricks, MLflow, AKS, and ARO
  - Enable scalable and highly available model serving infrastructure
- GenAI & LLM Ecosystem Integration
  - Integrate and manage GenAI services including:
    ii. Hugging Face models
    iii. Retrieval-Augmented Generation (RAG) pipelines
  - Work with vector databases such as FAISS, Pinecone, Chroma, etc.
  - Support development and deployment of both custom-built and pre-trained AI models/agents
- Databricks & ML Platform Management
  - Manage and optimize:
    ii. Clusters and compute resources
    iii. MLflow Model Registry
    iv. Job orchestration pipelines
  - Ensure efficient utilization and performance tuning
- Kubernetes & Cloud Infrastructure
  - Own end-to-end lifecycle management of AKS / ARO clusters
  - Handle:
    ii. Networking and security configurations
    iii. Helm-based deployments
    iv. GitOps workflows
  - Ensure platform reliability and fault tolerance
- Observability & Reliability Engineering
  - Implement robust monitoring and observability for AI/ML systems:
    ii. Data drift and model drift
    iii. System reliability and uptime
  - Establish alerting and incident response mechanisms
- Security, Governance & Cost Optimization
  - Enforce cloud security best practices, IAM, and compliance policies
  - Implement governance frameworks for ML and AI workloads
  - Optimize infrastructure and cloud cost usage
- Strong hands-on experience with:
  ii. Kubernetes (AKS/ARO)
  iii. Azure Databricks & MLflow
- Experience with:
  ii. RAG pipelines and vector databases (FAISS, Pinecone, Chroma, etc.)
- Proficiency in:
  ii. CI/CD tools (GitHub Actions preferred)
- Solid understanding of:
  ii. Distributed systems and cloud-native architecture
Good To Have Skills
- Experience with GenAI frameworks (LangChain, LlamaIndex, etc.)
- Exposure to Helm and GitOps tools (Argo CD / Flux)
- Familiarity with containerization (Docker)
- Knowledge of model evaluation, prompt engineering, and fine-tuning
- Strong problem-solving and analytical mindset
- Ability to work in a fast-paced, innovation-driven environment
- Experience working with cross-functional teams (Data Science, Engineering, DevOps)
- Ownership mindset with focus on scalability and reliability
- Work on cutting-edge GenAI & LLM technologies
- Opportunity to build scalable AI platforms from the ground up
- Collaborative and growth-focused environment