GenAI DevOps Engineer
2 weeks ago
Applicants: 0
Share
1 week left to apply
Job Description
Job Description Position: GenAI DevOps Engineer About the Role As a GenAI DevOps Engineer on our AI Engineering team, you?ll be responsible for building, deploying, and maintaining the infrastructure and automation that powers our generative AI products. You?ll work closely with software engineers and data scientists to ensure scalable, secure, and reliable operations across cloud environments, with a strong focus on CI/CD, observability, and orchestration. Core Responsibilities CI/CD & Workflow Automation Design and maintain automated CI/CD pipelines for GenAI services using GitHub Actions , Azure DevOps , or Jenkins Implement DevOps best practices including code linting, testing, and validation gates to ensure high-quality and secure deployments Automate build, test, and release processes for GenAI components such as prompt orchestration scripts, vector retrieval services, and APIs Integrate infrastructure updates into CI/CD workflows using Terraform or CloudFormation for version-controlled provisioning Deployment & Runtime Management Containerize GenAI services using Docker and deploy to Kubernetes clusters (AKS/EKS/GKE) for scalable and resilient operations Deploy serverless applications using platforms like Cloud Run , AWS Lambda , or Azure Functions to optimize cost and performance Implement safe rollout strategies (canary, blue/green deployments) to minimize risk during updates Manage runtime environments and ensure high availability and scalability of GenAI endpoints Monitoring & Alerting Instrument services with key metrics (latency, throughput, error rates) and structured logs Build dashboards and alerts using Prometheus , Grafana , Datadog , or cloud-native tools to monitor performance and detect anomalies Support drift detection and regression monitoring for GenAI workflows Cloud Infrastructure & Automation Operate and optimize compute resources on Azure , AWS , or GCP , including managed services like Databricks, SageMaker, or Vertex AI Maintain Infrastructure-as-Code (IaC) for provisioning clusters, storage, and networking components Ensure secure and compliant infrastructure with secrets management, encryption, and access controls GenAI Workflow Orchestration & Retrieval Use orchestration frameworks (e.g., LangGraph , LangChain , Langfuse ) to automate GenAI workflows and prompt chaining Support embedding-based retrieval pipelines, including vector index maintenance and refresh processes Collaboration & Documentation Collaborate with cross-functional teams to integrate GenAI capabilities into production systems Produce clear documentation including runbooks, architecture diagrams, and operational guides for on-call support and incident response Growth Areas & Nice-to-Have Skills Security & Compliance : API authentication (OAuth/JWT), secrets management (Key Vault, AWS KMS), and data encryption Testing & Validation : Data-quality checks, adversarial testing, and automated validation gates Scalability & Cost Optimization : Load testing (Locust, JMeter), spot-instance usage, and caching strategies (Redis) Reliability Engineering : On-call rotations, post-mortems, SLOs, and chaos testing fundamentals Experimentation Lifecycle : Tracking experiments, configuration sweeps, and supporting A/B testing Tooling Flexibility : Familiarity with alternative DevOps frameworks (Kubeflow, TFX) and observability stacks (OpenTelemetry) Required Qualifications 5 years in DevOps roles, with at least 1 year supporting GenAI or AI-driven solutions Hands-on experience with one major cloud provider ( Azure , AWS , or GCP ) Strong proficiency in Docker , Kubernetes , and serverless deployment strategies Proven ability to build and maintain CI/CD pipelines and Infrastructure-as-Code Experience with monitoring and alerting stacks (Prometheus/Grafana, Datadog, or cloud-native) Familiarity with GenAI orchestration tools and vector retrieval concepts Soft Skills Clear communicator who can bridge technical and non-technical teams Strong problem-solver with the ability to troubleshoot across infrastructure, data, and application layers Fast learner who thrives in dynamic, iterative environments Additional Information Our Benefits Flexible working environment Volunteer time off LinkedIn Learning Employee-Assistance-Program (EAP) About NIQ NIQ is the world?s leading consumer intelligence company, delivering the most complete understanding of consumer buying behavior and revealing new pathways to growth. In 2023, NIQ combined with GfK, bringing together the two industry leaders with unparalleled global reach. With a holistic retail read and the most comprehensive consumer insights?delivered with advanced analytics through state-of-the-art platforms?NIQ delivers the Full View?. NIQ is an Advent International portfolio company with operations in 100+ markets, covering more than 90% of the world?s population. For more information, visit NIQ.com Want to keep up with our latest updates? Follow us on: LinkedIn | Instagram | Twitter | Facebook Our commitment to Diversity, Equity, and Inclusion At NIQ, we are steadfast in our commitment to fostering an inclusive workplace that mirrors the rich diversity of the communities and markets we serve. We believe that embracing a wide range of perspectives drives innovation and excellence. All employment decisions at NIQ are made without regard to race, color, religion, sex (including pregnancy, sexual orientation, or gender identity), national origin, age, disability, genetic information, marital status, veteran status, or any other characteristic protected by applicable laws. We invite individuals who share our dedication to inclusivity and equity to join us in making a meaningful impact. To learn more about our ongoing efforts in diversity and inclusion, please visit the https://nielseniq.com/global/en/news-center/diversity-inclusion
Required Skills
Additional Information
- Company Name
- NielsenIQ
- Industry
- N/A
- Department
- N/A
- Role Category
- SRE (Site Reliability Engineer)
- Job Role
- Mid-Senior level
- Education
- No Restriction
- Job Types
- On-site
- Gender
- No Restriction
- Notice Period
- Immediate Joiner
- Year of Experience
- 1 - Any Yrs
- Job Posted On
- 2 weeks ago
- Application Ends
- 1 week left to apply
Similar Jobs
Quick Apply
Upload your resume to apply for this position