ML GPU Kernel Development Engineer

Hyderabad, Telangana, India

3 days ago

Applicants: 0

Salary Not Disclosed

3 weeks left to apply

Job Description

WHAT YOU DO AT AMD CHANGES EVERYTHING

At AMD, our mission is to build great products that accelerate next-generation computing experiences, from AI and data centers to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges, striving for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.

ML GPU Kernel Development Engineer

The Role

We are seeking a talented Machine Learning Kernel Developer to design, develop, and optimize low-level machine learning kernels for AMD GPUs using the ROCm software stack. In this role, you will work on high-impact projects to accelerate AI frameworks and libraries, with a focus on emerging technologies like Large Language Models (LLMs) and other generative AI workloads.

The Person

The ideal candidate will have hands-on experience with GPU programming (ROCm or CUDA) and a passion for pushing the boundaries of AI performance.

Key Responsibilities

Design and implement highly optimized ML kernels (e.g., matrix operations, attention mechanisms) for AMD GPUs using ROCm.
Profile, debug, and tune kernel performance to maximize hardware utilization for AI workloads.
Collaborate with ML researchers and framework developers to integrate kernels into AI frameworks (e.g., PyTorch, TensorFlow) and inference engines (e.g., vLLM, SGLang).
Contribute to the ROCm software stack by identifying and resolving bottlenecks in libraries like MIOpen, BLAS, or Composable Kernel.
Stay updated on the latest AI/ML trends (LLMs, quantization, distributed inference) and apply them to kernel development.
Document and communicate technical designs, benchmarks, and best practices.
Troubleshoot and resolve issues related to GPU compatibility, performance, and scalability.

Required Experience

2+ years of experience in GPU kernel development for machine learning (ROCm or CUDA).
Proficiency in C/C++ and Python, with experience in performance-critical programming.
Strong understanding of ML frameworks (PyTorch, TensorFlow) and GPU-accelerated libraries.
Basic knowledge of modern AI technologies (LLMs, transformers, inference optimization).
Familiarity with parallel computing, memory optimization, and hardware architectures.
Problem-solving skills and ability to work in a fast-paced environment.

Preferred Experience

Direct experience with AMD ROCm development (HIP, MIOpen, Composable Kernel).
Knowledge of LLM-specific optimizations (e.g., FlashAttention, PagedAttention in vLLM).
Experience with distributed training/inference or model compression techniques.
Contributions to open-source ML projects or GPU compute libraries.

Academic Credentials

Bachelor's/Master's in Computer Science, Electrical Engineering, or related field.

Benefits offered are described: AMD benefits at a glance.

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.

Additional Information

Company Name
AMD
Industry
N/A
Department
N/A
Role Category
Machine Learning Engineer
Job Role
Mid-Senior level
Education
No Restriction
Job Types
On-site
Gender
No Restriction
Notice Period
Less Than 30 Days
Year of Experience
1 - Any Yrs
Job Posted On
3 days ago
Application Ends
3 weeks left to apply

Similar Jobs

Numerize AI

3 weeks ago

Full Stack Developer - AI & Agent Systems

Numerize AI

Uplers

3 weeks ago

Python Backend Engineer

Uplers

Turing

3 weeks ago

Software Engineer (Full Stack) - 17853

Turing

UST

3 weeks ago

Associate III - Data Science (3-5 Years)

UST

Talent Worx

2 months ago

DevOps Engineer

Talent Worx

Maven, Linux, SQL +1
SGC IT Solutions Private Limited

3 weeks ago

Full Stack Engineer

SGC IT Solutions Private Limited

C, HTML, CSS
Tesco Bengaluru

2 months ago

Senior Associate - Business Intelligence & Data Warehousing

Tesco Bengaluru

IBM

3 days ago

Application Developer-AWS Cloud FullStack

IBM

Accenture in India

3 weeks ago

S&C Global Network - AI - CDI - Data Science Analyst

Accenture in India

Polestar Analytics

3 days ago

Sr. Python Developer

Polestar Analytics