3 weeks left to apply
Job Description
AI Researcher, Speech & Audio, Intern - Internship Opportunity at JoshTalks AI Lab (ai.joshtalks.com) Location: Gurgaon, India Type: Full-time Internship (6?12 months) Who: Final-year engineering students or recent graduates passionate about AI/ML in speech About Us At JoshTalks AI Lab, we believe that voice will be the primary medium of interaction between man and machine. Our mission is simple yet ambitious: ? Help machines talk like humans. ? Build the benchmarks and datasets that become the backbone of global progress in speech AI. ? Drive improvements not just through compute or algorithms ? but through high-quality, diverse, real-world data. Our datasets today power some of the largest and most widely used speech models in the world (you?ve de?nitely used them, even if we can?t name them ??). What You?ll Work On This is not a ?just another internship.? You?ll be directly contributing to the global race to perfect speech AI: 1. Benchmarking the world?s speech models ? Design and run evaluations for ASR and speech-to-speech systems. ? Create benchmarks that will guide top AI labs on where their models fail and where they shine. 2. Modeling & Fine-Tuning ? Fine-tune speech recognition systems (like Whisper/wav2vec2) to push Word Error Rates toward ~5%. ? Experiment with multilingual, code-switched, and noisy speech to mimic real-world conditions. 3. Impact at Scale ? Your work won?t just sit in a paper. It will in?uence how the world?s largest AI models get built, tested, and improved. Who We?re Looking For ? Final-year undergraduates (B.Tech/B.E.) in CSE, EE, AI/ML, or related ?elds. ? Strong interest in speech, audio, NLP, or multimodal AI. ? Hands-on experience in one or more of: ? Fine-tuning speech or language models (Whisper, wav2vec2, HuBERT, SER, etc.) ? Building speech-driven projects (assistants, classi?ers, chatbots, SER systems) ? Working with PyTorch, TensorFlow, or Hugging Face transformers. ? Bonus: past projects on GitHub, Kaggle, or research papers. Why Join Us ? Ownership: Even as a ?nal-year student, you?ll get the chance to own problems of global importance ? from reducing ASR word error rates toward 5% to building benchmarks that in?uence how the next generation of speech-to-speech models are developed. These are not side projects: the problems you?ll work on may de?ne how billions of people interact with machines in the future. ? Front-row seat in speech AI: Your work will shape benchmarks and datasets used by the world?s top model labs. ? Learning: Work with experts solving speech challenges across 20+ Indian languages and noisy, real-world audio. ? Impactful projects: The benchmarks and models you help build will set direction for global AI progress. ? Startup energy, global scale: Small team, big impact ? perfect for ambitious builders. ? Co-Authorship: If any of the work you contribute to is published as a paper, benchmark report, or dataset release, you will be credited as a co-author. This means your contributions won?t just stay inside the lab ? they?ll be visible to the wider research community and part of the academic and industry record. Details ? Location: Gurgaon (on-site preferred for collaboration) ? Duration: 6?12 months ? Type: Paid Internship (full-time) ? Start Date: Flexible for ?nal-year students (aligns with academic calendar) If you?re someone who dreams of making speech AI as natural as human conversation, this is your chance to work on the real frontier. Super interested? You can also directly write to our founder Shobhit at [email protected] To Apply write to [email protected]
Required Skills
Additional Information
- Company Name
- Josh Talks
- Industry
- N/A
- Department
- N/A
- Role Category
- Robotics Software Engineer
- Job Role
- Internship
- Education
- No Restriction
- Job Types
- On-site
- Gender
- No Restriction
- Notice Period
- Less Than 30 Days
- Year of Experience
- 1 - Any Yrs
- Job Posted On
- 6 days ago
- Application Ends
- 3 weeks left to apply