Rubrik empowers the world to stay in motion by organizing and protecting our customers' information. We’re a team of change makers, defining the future of cybersecurity. Our culture of empowerment encourages everyone to share their ideas, run with them, and leave their mark on what we’re building. United by our purpose to secure the world’s data, we continue to reach new heights. And the best part? We’re just getting started! Everyone here has the opportunity to grow by leaps and bounds, and there is unlimited potential in front of us. Let’s pioneer the next frontier in cybersecurity by securing the world’s data, together.
As an InfoSec SRE Intern focused on monitoring and alerting on our security infrastructure systems. You will work closely with our security engineering and operations teams to develop and maintain observability solutions. You will gain practical experience configuring cloud-native monitoring tools, ingesting metrics into Prometheus, and creating insightful dashboards and alerts in Grafana.
This internship is an excellent opportunity for students or recent graduates interested in Site Reliability Engineering, DevOps, or cloud infrastructure.
Key Responsibilities
Assist in implementing and maintaining monitoring and alerting systems using native cloud monitoring services (e.g., AWS CloudWatch, Azure Monitor, Google Cloud Operations).
Respond and remediate operational issues impacting performance or availability.
Configure Prometheus to scrape and store metrics from cloud resources.
Develop dashboards and alerts in Grafana to enable proactive incident detection.
Collaborate with teams to identify key reliability and performance indicators.
Support incident response by tuning alert thresholds and helping diagnose alerts.
Participate in documenting monitoring procedures and best practices.
Learn and apply SRE principles focused on reliability, scalability, and automation.
Qualifications
Currently pursuing or recently completed a degree in Computer Science, Engineering, or related field.
Basic understanding of cloud computing concepts (AWS, Azure, or GCP).
Familiarity with monitoring and observability tools like Prometheus and Grafana is a plus.
Knowledge of scripting or programming languages (e.g., Python, Bash) is desirable.
Strong problem-solving skills and eagerness to learn.
Good communication skills and ability to work collaboratively.
What We Offer
Mentorship from experienced SRE professionals.
Hands-on exposure to cutting edge technology.
Opportunity to contribute to meaningful projects with a direct impact on system reliability.
Flexible work arrangements.
Potential for full-time opportunities after successful completion.