SRE - Intern

J7TL7BGP5
Experience: 0-1 Years
Location: Mumbai (Hybrid)
Department: Global Delivery
Employment Type: Full Time

Zycus is a global leader in Source-to-Pay (S2P) procurement software, helping large enterprises drive efficiency, compliance, and measurable value across their procurement and finance operations. Trusted by leading Fortune 1000 organizations worldwide, Zycus enables procurement teams to move from cost control to strategic value creation.

At the core of Zycus’ platform is Merlin AI, an advanced AI-powered engine that brings intelligence, automation, and predictive insights across the entire procurement lifecycle—from sourcing and contract management to procurement, invoicing, and supplier management. Merlin AI empowers Chief Procurement Officers and finance leaders to make faster, smarter decisions with real-time visibility and actionable insights.

Zycus is consistently recognized by top industry analysts such as Gartner, Forrester, and IDC for its innovation, depth of functionality, and strong customer outcomes. Known for its enterprise-grade solutions, global delivery model, and customer-first mindset, Zycus partners closely with organizations to modernize procurement and unlock long-term business value.

With a strong global presence across North America, EMEA, and APAC, Zycus continues to invest aggressively in product innovation, AI-led capabilities, and brand leadership—shaping the future of intelligent procurement.

We Are An Equal Opportunity Employer:
Zycus is committed to providing equal opportunities in employment and creating an inclusive work environment. We do not discriminate against applicants on the basis of race, color, religion, gender, sexual orientation, national origin, age, disability, or any other legally protected characteristic. All hiring decisions will be based solely on qualifications, skills, and experience relevant to the job requirements.

Job Description

Zycus is looking for an AI-focused Site Reliability Engineer (SRE) Intern who is excited about building, operating, and scaling reliable AI-driven systems. This role is ideal for candidates interested in the intersection of SRE, cloud infrastructure, and GenAI-driven platforms, with hands-on exposure to deploying and monitoring intelligent, Java-based enterprise applications powered by cloud-based LLMs.


Internship Details:

  • Duration: 6 months

  • Stipend: INR 25,000 per month

  • Full-Time Opportunity: High-performing interns will be considered for a full-time SRE role at Zycus


Roles and Responsibilities:

  • AI System Reliability: Ensure high availability, scalability, and performance of AI-driven, Java-based applications powered by LLM integrations.

  • GenAI Platform Support: Assist in managing and optimizing applications leveraging cloud-based LLMs and AI services.

  • Kubernetes & Microservices: Support containerized applications using Kubernetes, ensuring reliability of microservices architecture.

  • Monitoring & Observability: Track system health, latency, and performance using tools such as Prometheus, Grafana, or similar monitoring stacks.

  • Automation & AI-driven Ops: Leverage AI tools to automate repetitive SRE tasks, incident resolution, and operational workflows.

  • Incident Management: Troubleshoot production issues, identify root causes, and implement preventive measures.

  • Performance Optimization: Improve system efficiency, resource utilization, and application responsiveness.

  • Cloud Infrastructure Support: Work with cloud environments (AWS/GCP/Azure) to maintain reliable and scalable systems.

  • Collaboration: Partner with engineering teams to improve system resilience and deployment processes.

  • Documentation & Continuous Learning: Maintain system documentation and explore emerging trends in SRE + GenAI operations.


Eligibility & Skills:

Experience:

  • Final-year students or recent graduates (0–1 year of experience)

Must Have Skills:

  • Programming & Scripting:
    Basic proficiency in Java (preferred) and/or Python, along with Shell scripting

  • AI / GenAI Awareness:
    Basic understanding of Generative AI concepts and familiarity with LLM-based applications

  • Containers & Kubernetes (Basics):
    Understanding of Docker and Kubernetes fundamentals

  • Operating Systems:
    Basic knowledge of Linux/Unix systems

  • Cloud Fundamentals:
    Awareness of AWS/GCP/Azure environments


Good to Have Skills:

  • Exposure to AI-assisted development tools (e.g., GitHub Copilot, ChatGPT) for automation and troubleshooting

  • Basic understanding of microservices architecture

  • Familiarity with CI/CD pipelines

  • Knowledge of Infrastructure-as-Code (Terraform, Ansible)

  • Experience with monitoring/logging tools

  • Version control (Git)


What You’ll Gain:

  • Hands-on experience with GenAI-powered enterprise SaaS platforms

  • Exposure to LLM-integrated applications in production environments

  • Real-world experience in SRE + AI-driven operations

  • Mentorship from experts in cloud, SRE, and AI engineering

  • Opportunity to transition into a full-time role based on performance
