SRE - Intern
Zycus is a global leader in Source-to-Pay (S2P) procurement software, helping large enterprises drive efficiency, compliance, and measurable value across their procurement and finance operations. Trusted by leading Fortune 1000 organizations worldwide, Zycus enables procurement teams to move from cost control to strategic value creation.
At the core of Zycus’ platform is Merlin AI, an advanced AI-powered engine that brings intelligence, automation, and predictive insights across the entire procurement lifecycle—from sourcing and contract management to procurement, invoicing, and supplier management. Merlin AI empowers Chief Procurement Officers and finance leaders to make faster, smarter decisions with real-time visibility and actionable insights.
Zycus is consistently recognized by top industry analysts such as Gartner, Forrester, and IDC for its innovation, depth of functionality, and strong customer outcomes. Known for its enterprise-grade solutions, global delivery model, and customer-first mindset, Zycus partners closely with organizations to modernize procurement and unlock long-term business value.
With a strong global presence across North America, EMEA, and APAC, Zycus continues to invest aggressively in product innovation, AI-led capabilities, and brand leadership—shaping the future of intelligent procurement.
We Are An Equal Opportunity Employer:
Zycus is committed to providing equal opportunities in employment and creating an inclusive work environment. We do not discriminate against applicants on the basis of race, color, religion, gender, sexual orientation, national origin, age, disability, or any other legally protected characteristic. All hiring decisions will be based solely on qualifications, skills, and experience relevant to the job requirements.
Job Description
Zycus is looking for an AI-focused Site Reliability Engineer (SRE) Intern who is excited about building, operating, and scaling reliable AI-driven systems. This role is ideal for candidates interested in the intersection of SRE, cloud infrastructure, and GenAI-driven platforms, with hands-on exposure to deploying and monitoring intelligent, Java-based enterprise applications powered by cloud-based LLMs.
Internship Details:
Duration: 6 months
Stipend: INR 25,000 per month
Full-Time Opportunity: High-performing interns will be considered for a full-time SRE role at Zycus
Roles and Responsibilities:
AI System Reliability: Ensure high availability, scalability, and performance of AI-driven, Java-based applications powered by LLM integrations.
GenAI Platform Support: Assist in managing and optimizing applications leveraging cloud-based LLMs and AI services.
Kubernetes & Microservices: Support containerized applications using Kubernetes, ensuring reliability of microservices architecture.
Monitoring & Observability: Track system health, latency, and performance using tools like Prometheus, Grafana, or similar.
Automation & AI-driven Ops: Leverage AI tools to automate repetitive SRE tasks, incident resolution, and operational workflows.
Incident Management: Troubleshoot production issues, identify root causes, and implement preventive measures.
Performance Optimization: Improve system efficiency, resource utilization, and application responsiveness.
Cloud Infrastructure Support: Work with cloud environments (AWS/GCP/Azure) to maintain reliable and scalable systems.
Collaboration: Partner with engineering teams to improve system resilience and deployment processes.
Documentation & Continuous Learning: Maintain system documentation and explore emerging trends in SRE + GenAI operations.
Eligibility & Skills:
Experience:
Final-year students or recent graduates (0–1 year experience)
Must Have Skills:
Programming & Scripting:
Basic proficiency in Java (preferred) and/or Python, along with Shell scriptingAI / GenAI Awareness:
Basic understanding of Generative AI concepts and familiarity with LLM-based applicationsContainers & Kubernetes (Basics):
Understanding of Docker and Kubernetes fundamentalsOperating Systems:
Basic knowledge of Linux/Unix systemsCloud Fundamentals:
Awareness of AWS/GCP/Azure environments
Good to Have Skills:
Exposure to AI-assisted development tools (e.g., GitHub Copilot, ChatGPT) for automation and troubleshooting
Basic understanding of microservices architecture
Familiarity with CI/CD pipelines
Knowledge of Infrastructure-as-Code (Terraform, Ansible)
Experience with monitoring/logging tools
Version control (Git)
What You’ll Gain:
Hands-on experience with GenAI-powered enterprise SaaS platforms
Exposure to LLM-integrated applications in production environments
Real-world experience in SRE + AI-driven operations
Mentorship from experts in cloud, SRE, and AI engineering
Opportunity to transition into a full-time role based on performance
Apply for this Job
Upload Your Resume
Acceptable formats are .docx or .pdf with a maximum file size of 5 MB.