Senior Observability Engineer (SRE)
Cognite
India, Karnataka, Bengaluru
Full-Time
On-site
Posted 4 days ago
Apply by June 9, 2026
Job Description
Cognite operates at the forefront of industrial digitalization, building AI and data solutions that solve some of the world’s hardest, highest-impact problems. With unmatched industrial heritage and a comprehensive suite of AI capabilities, including low-code AI agents, Cognite accelerates the digital transformation to drive operational improvements.
Our moonshot is bold: unlock $100B in customer value by 2035 and redefine how global industry works.
What Cognite is Relentless to achieve
We thrive on challenges. We challenge assumptions. We execute with speed and ownership. If you view obstacles as signals to step forward - not step back - you’ll feel at home here. Join us in this venture where AI and data meet ingenuity, and together, we forge the path to a smarter, more connected industrial future.
About The Role
We are seeking a Senior Observability Engineer with strong expertise in designing, implementing, and optimizing observability solutions. In this role, you will be key to shaping the future of observability at Cognite, assessing existing observability frameworks, identifying gaps, and building robust capabilities encompassing log aggregation, event correlation, noise reduction, and comprehensive telemetry analysis to enable proactive operational excellence and reliability for our services.
How you’ll demonstrate Ownership
- Conduct assessments of existing observability architectures to identify gaps and improvement opportunities.
- Design and implement scalable log aggregation pipelines for centralized and efficient data collection.
- Apply noise-reduction techniques to filter irrelevant or false-positive alerts, enhancing focus on actionable issues.
- Develop and maintain monitoring dashboards that deliver actionable insights across applications and infrastructure.
- Lead the migration from Lightstep to Honeycomb, ensuring seamless data pipeline transitions, OpenTelemetry alignment, and stakeholder adoption.
- Collaborate with infrastructure and product teams to integrate observability tooling into CI/CD workflows and cloud environments.
- Analyze telemetry data (metrics, logs, traces) to troubleshoot complex system behaviors and recommend improvements.
- Participate in production debugging and incident troubleshooting using telemetry data.
- Mentor junior engineers on log management, event correlation, distributed tracing, and alert management.
- Stay current on observability innovations and recommend adoption strategies aligned with organizational goals.
- Support post-incident reviews and continuous improvement through data-driven root cause analysis.
- Drive continuous improvement in reliability and operational excellence through proactive observability initiatives.
Qualifications
- 6+ years of experience in software or systems engineering, with at least 3 years focused on observability or SRE practices.
- Hands-on experience with observability tools such as Honeycomb, VictoriaMetrics, Lightstep, Prometheus, Grafana, OpenTelemetry, Splunk, Datadog, or New Relic.
- Strong knowledge of OpenTelemetry instrumentation (metrics, traces, logs) and SLIs/SLOs for reliability tracking.
- Experience with distributed tracing, event correlation, and noise reduction frameworks.
- Proficiency in one or more programming/scripting languages such as Python, Java, Kotlin, Go, or Shell.
- Working knowledge of Infrastructure as Code (Terraform) and CI/CD pipelines (Jenkins, GitHub Actions, ...).
- Familiarity with cloud platforms (AWS, Azure, GCP) and container orchestration (Kubernetes).
- Strong analytical, troubleshooting, and communication skills with the ability to work effectively across teams.
- Experience conducting observability gap assessments and defining improvement plans.
- Experience working in complex or multi-cloud environments is preferred.
Cognite is committed to creating a diverse and inclusive environment at work and is proud to be an equal opportunity employer. All qualified applicants will receive the same level of consideration for employment.
Required Skills
Communication
Engineering
Data Collection
Troubleshooting
Monitoring
Python
Root Cause Analysis
Cloud Platforms
AWS
Kotlin
Jenkins
Kubernetes
Terraform
GitHub
Prometheus
Grafana
Splunk
Azure
GitHub Actions
Datadog
New Relic
CI/CD
Continuous Improvement
Debugging
Scripting languages
Scripting
Instrumentation
Systems Engineering
Orchestration
Correlation
SLOs
Migration
Dashboards
Honeycomb
Cloud environments
Log management
Noise
Telemetry
SRE
Aggregation
Java
Infrastructure as Code
Shell
Observability
OpenTelemetry
Container
Incident
Traces
Data Pipeline
Gap Assessments
Container orchestration