Required education
Bachelor's Degree
Preferred education
Master's Degree
Required technical and professional expertise
Excellent coding skills in Java/Python (including Pandas, NumPy); strong grounding in data structures, algorithms, problem solving, linear algebra, probability, and statistics; experience with VS Code, Jupyter notebooks, and Git
Cloud platforms and frameworks (AWS/Azure/IBM/Google), containerization (Docker/Kubernetes/OpenShift), virtualization (VMware, Hyper-V), networking, security, scripting, monitoring and logging, AI/ML fundamentals
* Good programming skills and hands-on experience in Python
* Familiarity with cloud technologies (containerization, Kubernetes)
* Exposure to different model types such as dense, MoE, Mamba, and multimodal models
* Experience with PyTorch and FSDP
* Exposure to tuning and GPU optimization
* Experience with the internals of training stacks
* Exposure to different tuning techniques, including SFT, LoRA, and RL
Eligibility Criteria
* B.E. / B.Tech
* M.E. / M.Tech (including Dual Degree programs)
* Ph.D.
* Minimum of 70% or 7.0 CGPA in the degree currently being pursued
Time Duration
The internship will be conducted between May 2026 and August 2026, for a maximum duration of 3 months.
Preferred technical and professional experience
* Exposure to Triton and Hugging Face
* Exposure to distributed foundation model training
* Familiarity with GPU architectures, NCCL, and compilers / PyTorch Compile