Performance Tester – GenAI

Orion Innovation

Chennai, On-site
Posted 5 days ago; apply by June 17, 2026

Job Description

Orion Innovation is a premier, award-winning, global business and technology services firm. Orion delivers game-changing business transformation and product development rooted in digital strategy, experience design, and engineering, with a unique combination of agility, scale, and maturity. We work with a wide range of clients across many industries including financial services, professional services, telecommunications and media, consumer products, automotive, industrial automation, professional sports and entertainment, life sciences, ecommerce, and education.


Role: Performance Test Engineer – Generative AI

Experience: 5+ years (with hands-on performance testing in GenAI / LLM-based applications)

Role Overview:

We are seeking a skilled and detail-oriented Performance Tester with strong experience in Generative AI (GenAI) projects. The ideal candidate will be responsible for ensuring scalability, reliability, and optimal performance of AI-powered applications, including Large Language Model (LLM) integrations, conversational AI systems, and Retrieval-Augmented Generation (RAG) pipelines. This role requires expertise in performance engineering, cloud platforms, and testing of AI/ML workloads in production environments.

Key Responsibilities

  • Performance Strategy & Planning:
      • Define and implement performance testing strategies for GenAI and LLM-based applications.
      • Identify performance bottlenecks across APIs, model inference layers, vector databases, and cloud infrastructure.
      • Establish performance benchmarks, SLAs, and scalability targets for AI-driven systems.
  • Performance Testing & Engineering:
      • Design, develop, and execute load, stress, spike, endurance, and scalability tests for GenAI applications.
      • Perform performance testing of LLM-powered APIs (e.g., ChatGPT-like applications) hosted on cloud platforms.
      • Validate latency, throughput, token usage, concurrency handling, and cost-performance trade-offs.
      • Conduct performance validation for RAG pipelines, including embedding generation and vector search.
      • Analyze model inference time, GPU/CPU utilization, memory usage, and autoscaling behavior.
  • Tools & Automation:
      • Develop automated performance test scripts using tools such as JMeter, LoadRunner, k6, or Gatling.
      • Monitor system performance using APM tools like Dynatrace, AppDynamics, Azure Monitor, or AWS CloudWatch.
      • Integrate performance testing into CI/CD pipelines using Azure DevOps or similar platforms.
      • Create dashboards and reports for performance metrics and trend analysis.
  • Cloud & Infrastructure Testing:
      • Conduct performance testing on AI solutions deployed on Azure, AWS, or GCP.
      • Validate autoscaling configurations, containerized deployments (Docker, Kubernetes), and serverless architectures.
      • Assess performance of vector databases such as Chroma, Pinecone, Weaviate, or FAISS under load.
  • Collaboration & Optimization:
      • Collaborate with AI engineers, data scientists, DevOps, and architects to optimize model serving and API performance.
      • Recommend improvements in prompt engineering, caching strategies, batching, and parallelization.
      • Support capacity planning and cost optimization for LLM-based applications.
  • Governance & Reporting:
      • Document performance test results, bottlenecks, and optimization recommendations.
      • Ensure compliance with security and data privacy standards in performance environments.
      • Present findings to stakeholders and provide actionable insights.

Key Requirements

  • Technical Skills:
      • 5+ years of experience in performance testing and engineering.
      • Hands-on experience in performance testing GenAI / LLM-based applications.
      • Experience working with LLM platforms such as OpenAI GPT models, Gemini, Llama 2, Claude, or Grok.
      • Understanding of concepts like tokenization, embeddings, vector search, and RAG architecture.
      • Experience testing AI services hosted on Azure AI Services, Azure ML, AWS Bedrock, or Google Vertex AI.
      • Proficiency in performance testing tools such as JMeter, LoadRunner, k6, or Gatling.
      • Knowledge of API testing tools like Postman or Rest Assured.
      • Familiarity with monitoring tools such as Azure Monitor, AWS CloudWatch, Grafana, or Prometheus.
      • Experience with containerization (Docker) and orchestration (Kubernetes).
      • Basic scripting knowledge in Python or Java for test automation.
      • Understanding of CI/CD pipelines and DevOps practices.
  • GenAI-Specific Knowledge:
      • Experience testing conversational AI applications and chatbot performance.
      • Knowledge of inference latency optimization techniques for LLMs.
      • Understanding of GPU-based workloads and performance considerations.
      • Exposure to agentic frameworks like LangChain, Semantic Kernel, AutoGen, or CrewAI (preferred).
      • Experience validating performance of vector databases (Chroma, Pinecone, Weaviate, FAISS).

Qualifications

  • Bachelor’s degree in Computer Science, Information Technology, or a related field.
  • 5+ years of experience in performance testing, with at least 2 years in AI/ML or GenAI projects.
  • Experience in testing cloud-native, microservices-based applications.
  • Strong analytical and troubleshooting skills.
  • Excellent communication and stakeholder management skills.

Orion is an equal opportunity employer, and all qualified applicants will receive consideration for employment without regard to race, color, creed, religion, sex, sexual orientation, gender identity or expression, pregnancy, age, national origin, citizenship status, disability status, genetic information, protected veteran status, or any other characteristic protected by law.

Candidate Privacy Policy

Orion Systems Integrators, LLC, its subsidiaries, and its affiliates (collectively, “Orion,” “we,” or “us”) are committed to protecting your privacy. This Candidate Privacy Policy (orioninc.com) (“Notice”) explains:

  • What information we collect during our application and recruitment process and why we collect it;
  • How we handle that information; and
  • How to access and update that information.

Your use of Orion services is governed by any applicable terms in this Notice and our general Privacy Policy.
