Bestkaam Logo
Uplers Logo

DevOps Engineer - Platform

Ahmedabad, Gujarat, India

2 days ago

Applicants: 0

Salary Not Disclosed

3 weeks left to apply

Job Description

Experience : 5.00 + years Salary : Confidential (based on experience) Expected Notice Period : 30 Days Shift : (GMT+05:30) Asia/Kolkata (IST) Opportunity Type : Remote Placement Type : Full Time Permanent position(Payroll and Compliance to be managed by: Incept Labs) (*Note: This is a requirement for one of Uplers' client - Incept Labs) What do you need for this opportunity? Must have skills required: and building tools, Proactive about automation, testing, caching strategies, experience building and maintaining distributed systems, Fluent in containerization, Orchestration, Performance Optimization Incept Labs is Looking for: DevOps Engineer - Platform We''re looking for an Engineer experienced in DevOps, preferably with experience filling in as an infrastructure/backend engineering generalist. We view infrastructure as a critical component, as it enables every breakthrough. You''ll work directly with researchers to accelerate experiments, optimize efficiency, and enable key insights across our models and data assets. You''ll join a high-impact, compact team responsible for both architecture and scaling of Incept''s core infrastructure. Work will span the full technical stack and involve solving complex distributed systems problems as well as building resilient, scalable platforms. Responsibilities Design, build, and operate scalable, fault-tolerant infrastructure to support distributed computing and data orchestration for LLM Research. Develop and maintain high-throughput systems for data ingestion, processing, and transformation to support LLM model development. Collaborate with research teams to improve system efficiency and accelerate model training and evaluation cycles. Implement and maintain monitoring and alerting to support platform reliability and performance. Build systems for traceability, reproducibility, and robust quality control to ensure adherence to industry compliance standards. Requirements 5+ years of experience building and maintaining distributed systems, ideally supporting high-scale applications or research platforms. Fluent in containerization, orchestration, and distributed computing frameworks. Extensive experience with performance optimization, caching strategies, and system scalability patterns. Knowledge of and experience with specialized hardware (GPUs, TPUs) computing and GPU cluster management. Strong knowledge of databases, storage systems, and how architecture choices impact performance at scale. Deeply familiar with cloud infrastructure, microservices architectures, and both synchronous and asynchronous processing. Proactive about automation, testing, and building tools that empower engineering teams. How to apply for this opportunity? Step 1: Click On Apply! And Register or Login on our portal. Step 2: Complete the Screening Form & Upload updated Resume Step 3: Increase your chances to get shortlisted & meet the client for the Interview! About Uplers: Our goal is to make hiring reliable, simple, and fast. Our role will be to help all our talents find and apply for relevant contractual onsite opportunities and progress in their career. We will support any grievances or challenges you may face during the engagement. (Note: There are many more opportunities apart from this on the portal. Depending on the assessments you clear, you can apply for them as well). So, if you are ready for a new challenge, a great work environment, and an opportunity to take your career to the next level, don't hesitate to apply today. We are waiting for you!

Additional Information

Company Name
Uplers
Industry
N/A
Department
N/A
Role Category
SRE (Site Reliability Engineer)
Job Role
Mid-Senior level
Education
No Restriction
Job Types
Remote
Gender
No Restriction
Notice Period
Less Than 30 Days
Year of Experience
1 - Any Yrs
Job Posted On
2 days ago
Application Ends
3 weeks left to apply

Similar Jobs

McDonald's

1 month ago

Platform Engineer III - Data Platform Support [T500-21291]

McDonald's

Bristol Myers Squibb

1 month ago

Senior Data & DevOps Engineer

Bristol Myers Squibb

Marvell Technology

2 days ago

Validation Engineer (C, Python, Ethernet PHY Testing, Networking L1, L2, L3 (SAI), Switch)

Marvell Technology

Turing

2 days ago

Software Engineer (C++) - 30570

Turing

Turing

2 days ago

Python Developer - 17852

Turing

Harrison.ai

1 month ago

Software Engineer

Harrison.ai

Arm

1 month ago

Staff DevOps Engineer

Arm

Optum

1 month ago

Senior Site Reliability Engineer

Optum

Turing

2 days ago

Software Engineer (C++) - 30570

Turing

Uplers

1 month ago

Microsoft Power Platform Engineer / Automation Consultant

Uplers