S

Backend Intern - Inference Pipelines & Diagnostics

SarvamBengaluruIndia3h ago
onsiteinternshipentry
19 views13 applicants
💼 Competitive Salary

Job Description

We're looking for a Backend Intern to join Sarvam's engineering team and own meaningful workstreams in two critical areas: our inference pipeline infrastructure and diagnostics API services. You'll work on the systems that serve AI models at scale, help build robust APIs for diagnostics and observability, and contribute to the data pipelines that keep everything running reliably. Strong performers will be fast-tracked to a full-time offer at the end of the internship. Preferred background: AI/ML or Computer Science. What You'll Do • Build and optimise backend services for LLM inference pipelines in Python or Node.js • Develop and maintain diagnostics API services for model observability and health monitoring • Integrate LLM APIs and manage request routing, latency, and error handling across inference flows • Design and query SQL and NoSQL databases to support pipeline state management and diagnostics data • Build and maintain data pipelines to support inference workloads and operational metrics • Deploy and manage services on cloud infrastructure (AWS or GCP) using version-controlled codebases on Git • Collaborate with ML engineers and platform teams to debug, profile, and improve system performance What We're Looking For • Proficiency in Python or Node.js; comfortable writing clean, production-quality backend code • Solid understanding of REST API design, including diagnostics and observability endpoints • Familiarity with SQL and at least one NoSQL database (e.g. MongoDB, Redis, or DynamoDB) • Working knowledge of Git for version control and collaborative development • Basic exposure to cloud platforms — AWS or GCP • Interest in LLMs and familiarity with LLM API integration patterns • Background in Computer Science, AI, or Machine Learning preferred Bonus Points • Prior exposure to inference serving frameworks (e.g. vLLM, TGI, Triton, or similar) • Experience with monitoring and observability tooling (e.g. Prometheus, Grafana, or OpenTelemetry) • Familiarity with containerisation and orchestration (Docker, Kubernetes) • Contributions to open-source projects in backend or ML infrastructure

About Sarvam

AI-POWERED

Resume Reviewer

Transform your resume with AI-powered insights and land your dream job

98%
Match Rate
3x
More Interviews
ATS-friendly optimization
Instant feedback & scoring
Industry-specific suggestions
Professional formatting tips
Analyze My Resume
Free to use
Instant results

Trending Jobs

A

Full Stack Developer Intern

Asuraa
Bengaluru
₹25K+
1w ago120 applicants
A

AI Summer Intern

Aryma Labs
Bengaluru
₹40K - ₹60K
1mo ago1 applicants
S

UI/UX Designer

Sunfox Technologies
Deharadun, Uttarakhand
₹40K - ₹70K
5d ago1 applicants

Ready to Start Your Journey?

Join thousands of professionals who found their dream job through our platform.