Position: Site Reliability Engineer – Python & DevOps (NV512 RM 2566)
Strong expertise in Python Development and DevOps tools is mandatory.
Python Development- 3+ years of strong experience with OOP concepts.
Kubernetes, Jenkins and DB query creation experience should also be strong (more than 3+ years)
Overall, 5+ years of resources with these skills will be most suitable to their requirements.
Job Description:
Site Reliability Engineers are critical team members with a breadth of knowledge encompassing all aspects of service delivery. They develop software solutions to enhance, harden, and support our service delivery processes. This can include building and managing CI/CD deployment pipelines, automated testing, capacity planning, performance analysis, monitoring, alerting, chaos engineering, and auto-remediation. The SRE should have an “automate everything” mindset, helping us bring value to our customers by deploying services with incredible speed, consistency, and availability. The SRE constantly evaluates products and services before and after production releases to prevent, identify, and fix problems that impact service availability in deploying, configuring, releasing, monitoring, recovering, and scaling.
Responsibilities:
- Ensure the scalability, performance, and resilience of our suite of products
- Work with the development and product team to establish the right monitoring and alerting strategy
- Develop build, test, and deployment automation that seamlessly targets multiple cloud regions
- Define and implement standards and best practices related to, system architecture, service delivery, metrics, and the automation of operational tasks
- Optimize the telemetry platform to identify customer-impacting events while providing relevant data to drive debugging
- Partner with the engineering team to optimize the performance of services for cloud architecture
- Debug Live Site events and conduct follow-up post-mortem and RCA analysis
Required Skills and Qualifications:
- Understanding of basic networking concepts
- B.E/B. Tech in Computer Science or equivalent
- 5 to 12 years of relevant experience
- Scripting languages like Bash, Python, etc.
- Exposure to operational knowledge of managing applications in AWS/GCP
- Experienced in automating software build, deployment, and server configuration management using tools such as Puppet, Chef, and Jenkins
- Hands-on experience with Linux/Unix Administration
- Good understanding of containerization concepts
- Docker, ECS, EKS, Kubernetes
- Experience with building tools such as Jenkins
- Working experience with NoSQL databases such as MongoDB, PostgreSQL, etc.
**************************************************************************************************************************************
Apply for this position
Mention correct information below. Mention skills aligned with the job description you are applying for. This would help us process your application seamlessly.