Location: Chennai
Experience: 8–15 years
Compensation: ₹20L – ₹40L / year
Job Source: Cutshort.io
Role Overview
Seeking an experienced Staff Site Reliability Engineer (SRE) to ensure the health, scalability, and automation of its complex web-scale systems. This mission-critical role involves working closely with development teams from design through production, ensuring platforms are reliable, monitored, and optimized for growth.
The ideal candidate thrives in fast-paced environments, has deep knowledge of applications and infrastructure, and believes automation is key to operating large-scale systems.
6-Month Accomplishments
- Familiarize with tech stack and functional requirements.
- Gain comfort with automation tools/frameworks and deployment processes.
- Develop in-depth knowledge of product functionality and infrastructure.
- Contribute to small-to-medium scale projects.
- Participate in on-call rotation as secondary support.
12+ Month Accomplishments
- Execute projects independently with minimal guidance.
- Create meaningful alerts and dashboards for infrastructure monitoring.
- Identify gaps in infrastructure and propose improvements.
- Fully participate in on-call rotations.
Key Responsibilities
- Ensure the health, performance, and capacity of internet-facing services.
- Gain deep knowledge of complex applications.
- Assist in rollouts and deployments of new product features.
- Develop tools for rapid deployment and monitoring in large-scale UNIX environments.
- Collaborate with development teams to design platforms with operability in mind.
- Function effectively in a fast-paced, dynamic environment.
- Participate in 24x7 on-call rotation.
Desired Skills
- 5+ years in Systems Engineering/SRE roles, ideally in startups or fast-growing companies.
- Proven experience in UNIX-based large-scale web operations.
- Hands-on with cloud infrastructure (AWS, GCP, Azure).
- Experience with CI/CD tools (Jenkins), configuration management (Ansible), and monitoring tools (Nagios, New Relic, Graphite).
- Strong scripting/coding skills.
- Ability to leverage a wide variety of open-source technologies.
Technologies
- Languages/Frameworks: Ruby, JavaScript, Node.js, Tomcat, Nginx, HaProxy
- Databases/Messaging: MongoDB, RabbitMQ, Redis, ElasticSearch
- Cloud: AWS (EC2, RDS, CloudFront, S3)
- DevOps Tools: Terraform, Packer, Jenkins, Datadog, Kubernetes, Docker, Ansible
๐ Why Join?
- Be part of a global fashion resale leader.
- Work on large-scale, mission-critical systems.
- Collaborate with high-performing engineering teams.
- Competitive compensation and career growth opportunities.
๐ Apply now

0 comments:
Post a Comment