Site Reliability Engineer

at Truecaller

Bengaluru,India

About Us

Remember the days when the phone rang and you didn’t know who it was? If it was the company you always dreamt of working for? A call from a hospital trying to tell you someone close to you got sick? Or just that stubborn sales guy.

Our mission is to make it possible for you to know who’s trying to contact you, and also tell you when not to pick up. We want to remove all uncertainty, making your communication safe and efficient by separating the important stuff from the noise and create trust, no matter if it’s in the beginning of a call, in the middle of a transaction or at the end of a signature. We are building a platform which empowers our users to take control of their own digital identity and making their communication more safe and efficient. 

We are a Diverse organisation with over 90 of the best minds coming from different backgrounds, joining hands to ensure our vision of building trust everywhere.  

Truecaller is one of the fastest growing tech companies in the world. We have 100 million daily active users around the world with the strongest presence in South Asia, Middle East and North Africa. 

We are backed by some of the most prominent investors in the world such as Sequoia Capital, Atomico, and Kleiner Perkins Caufield & Byers.

Your Mission

The main purpose of Site Reliability Engineering at Truecaller is to design, manage and maintain infrastructure services over the course of their life cycle. responsible for building and improving the performance, reliability, availability, security, and evolution of the
TC production sites.

 Role responsibilities

  • Building tooling to ease the provisioning and scaling of infrastructure resources.
  • Continuously improve and extend infrastructure components to handle growth.
  • Improve overall systems performance and investigate failures taking part of actively in future improvements discussion.
  • Ensure systems availability, reachability, maintainability, and testability buildingthe necessary instrumentation, tooling, and alarming systems in order to escalate abnormalities.
  • Being influential in monitoring and capacity planning together with the system development teams and in alignment with the business goals.  
  • Being one of the thought leaders in the tech team for scalability and reliability of systems, communicating ideas clearly and taking steps for continuous improvements.
  • Take part of the team on-call schedule.
 
Team responsibilities:
  • Gather requirements and formulate acceptance criteria from the business units collaboratively as part of my team.
  • Collaborate with the other appropriate members of my team to provide just enough design to get the feature we’re working on done.
  • Collaborate with the other appropriate members of my team to provide just enough documentation to support the features we are working on.
  • As part of the team, help implement and improve processes for planning, operations and continuous improvements.
  • Attend all team meetings
 
Needed competencies/experience
 
  • Extensive knowledge of programming (Python & Shell) and system administration on Linux environments, preferably working on high throughput and low latency systems.
  • Excellent understanding of distributed system design across process and site boundaries.
  • Hands-on experience with service orchestration and management, deployment activities, configuration management and all necessary automation.
  • Strong grasp of Process Isolation,Virtualization and Containerization concepts and being able to apply them when necessary.
  • Good understanding of software development lifecycle, versioning, building, testing, staging and deployment processes with a strong continuous delivery mindset.
  • Extensive experience with Public Cloud platforms preferably GCP.
  • Extensive experience with Kubernetes, Ansible and Terraform.
  • Strong tendency to keep things simple and maintainable (stick to KISS + YAGNI).

Good to have

  • Experience in troubleshooting & optimizing JVM based applications.
  • Experience in ElasticSearch / Splunk stack.
  • Experienced in research-oriented methodology.

We offer

At Truecaller we have built a dynamic and diverse culture where we are keen to take ownership of what we are doing, learn and develop ourselves and are willing to share their knowledge with others. We love to experiment with new tools and technologies to push the envelope and be able to deliver the best product to our users and we believe that failure is halfway to success. At Truecaller you will find challenges and a team with passion for what we do.

 

Apply for this Job

* Required
  
  
When autocomplete results are available use up and down arrows to review
+ Add Another Education