ZebPay is India’s oldest and most-loved Bitcoin and crypto asset company. Since 2014, we have helped ⅔ of India’s crypto owners buy their first Bitcoin. With new owners and leaders in 2020, we’re expanding our mission to help millions of people in India and around the world join the Bitcoin revolution by creating best-in-class technology and user experiences.
A career at ZebPay is all about being part of our Ohana (Hawaiian for family!) and working on some of the most challenging yet fun projects you can find in the software industry. You will be welcomed into a dedicated and inclusive environment where you can learn from and collaborate with some of the most talented people in the tech industry.
With the rapid growth of blockchain globally and other long-term initiatives, the successful candidate will work with bleeding-edge technology in an internationally established team. We are looking for a strong team player with great attention to detail and excellent communication skills.
What you will do:
- You will proactively build and implement services that help the IT and support teams do their jobs better, and you will fix support escalation issues.
- As a site reliability engineer, you will take on-call responsibilities and add automation and context to alerts, leading to a better real-time collaborative response from on-call responders. You will also update runbooks, tools and documentation to help prepare on-call teams for future incidents.
What you will need:
- Work experience with AWS or other cloud service providers
- Experience in building systems for monitoring, logging and alerting
- Experience in leading a team of at least 4 people and running SRE teams
- Expertise in setting up monitoring and alerting for AWS services such as EC2, Redis, OpenSearch, DynamoDB, API Gateway, RDS, Lambda, CloudFront, CloudFormation, VPC, containers, EKS, CloudWatch, Route 53, etc.
- Expertise in setting up logging, monitoring and alerting for the containerization platform
- Expertise in the ELK stack is a plus
- In-depth understanding of microservices, containerization, and highly scalable and reliable platforms
- Extensive experience in debugging HTTP error codes such as 502 and 503, and components such as nginx, load balancers, firewalls and WAFs
- Ability to read application logs and generate metrics from them
What you will get:
- Work at a company that stays ahead of the curve and encourages the use of cutting-edge technology.
- Learn more about blockchain, a hot, in-demand skill
- Constant Learning Curve
- Flexible Timings
- You can be as creative as you like
- You are treated as part of one extended family
- Learning and Development Policy