About Workato

Workato is the only integration and automation platform that is as simple as it is powerful — and because it’s built to power the largest enterprises, it is quite powerful. 

Simultaneously, it’s a low-code/no-code platform. This empowers any user (dev/non-dev) to painlessly automate workflows across any apps and databases.

We’re proud to be named a leader by both Forrester and Gartner and trusted by 7,000+ of the world's top brands such as Box, Grab, Slack, and more. But what is most exciting is that this is only the beginning. 

Why join us?

Ultimately, Workato believes in fostering a flexible, trust-oriented culture that empowers everyone to take full ownership of their roles. We are driven by innovation and looking for team players who want to actively build our company. 

But, we also believe in balancing productivity with self-care. That’s why we offer all of our employees a vibrant and dynamic work environment along with a multitude of benefits they can enjoy inside and outside of their work lives. 

If this sounds right up your alley, please submit an application. We look forward to getting to know you!

Also, feel free to check out why:

  • Business Insider named us an “enterprise startup to bet your career on”

  • Forbes’ Cloud 100 recognized us as one of the top 100 private cloud companies in the world

  • Deloitte Tech Fast 500 ranked us as the 17th fastest growing tech company in the Bay Area, and 96th in North America

  • Quartz ranked us the #1 best company for remote workers

All full-time employees in Singapore will also have the following benefits:

  • Workato Stock Options at one of Silicon Valley’s fastest growing startups

  • Flexible and personalised medical and wellness benefits (protection for hospitalisation and surgical procedures, clinical outpatient visits, accident coverage and more...) 

  • Up to 20 weeks of paid maternity leave, and 10 weeks of paid paternity leave

Responsibilities

At Workato, we run the world’s mission critical business processes. Our customers connect with 1000s of APIs that make for a complex operational environment. Add four 9 uptime, and the fact that when any of the APIs that we connect to doesn’t work, we need to be resilient and not lose customer data. If you are looking for a real challenge in terms of mission criticality, diversity of managed services, with cutting edge technology like. 

As our business scales, we are currently seeking an experienced SRE Leader to take ownership and commit to providing the highest levels of uptime for our customers. We are searching for an unconventional thinker, who brings fresh ideas, demonstrates a unique and informed viewpoint, and enjoys collaborating with a cross-functional team to develop real-world solutions and positive user experiences at every interaction.

You will also be responsible to:

  • Run the production environment to provide the highest levels of uptime, performance, and reliability

  • Work with development teams to make sure the applications are production-ready, scalable, reliable, and observable from the ground up

  • Identify and drive opportunities to improve automation for code deployment, management, and visibility of application services

  • Develop tools and framework to automate operational tasks, deployment of machines, services, applications

  • Establish end-to-end monitoring and alerting on all critical components of the application

  • Participate in the on-call rotation supporting the platform and or the production application

  • Manage end-to-end availability, performance of critical services, and build automation to prevent problem recurrence.

  • Independently troubleshoot complex systems and environments including applications, microservices, networking components. Directs root cause analysis of critical business and production issues

  • Recruit, develop, and mentor other SREs on standard methodology from Infra orchestration and troubleshooting application service in production

  • Represent SRE in design reviews and work cross-functionally with Engineering teams on operational readiness

  • Work with engineering teams to better address needs and enable more effective and efficient developer throughput.

  • Participate in roadmap and sprint planning, execution, and retrospectives.

Daily and Monthly Responsibilities

  • Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding

  • Partner with development teams to improve services through rigorous testing and release procedures

  • Participate in system design consulting, platform management, and capacity planning

  • Create sustainable systems and services through automation and uplifts

  • Balance feature development speed and reliability with well-defined service level objectives

Requirements

Qualifications / Experience / Technical Skills

  • 5+ years experience in an SRE Leadership role

  • Bachelor’s degree in computer science or other highly technical, scientific discipline.

  • Experience administering Kubernetes based microservices, web servers (nginx), and databases (Postgres, MySql, MongoDB; Desirable - Redis, Clickhouse)

  • Strong experience with AWS technologies such as EKS, ELB, RDS, S3/EBS/Glacier, and VPC

  • Experience architecting high scalability/availability systems and running them at scale within AWS

  • Experience with Infrastructure as Code tools (e.g. Terraform, CloudFormation)

  • Ability to program (structured and OO) with one or more high-level languages, such as Python, Java, C/C++, Ruby, and JavaScript.

General Requirements

  • BS or MS from top-notch CS programs 

  • 5+ years of professional experience in hands-on engineering roles 

  • 2+ years operating high traffic production environments in public clouds: AWS, GCP, Azure 

  • Python programming experience in production environments 

  • Experience with modern cloud environments: containerization, infrastructure-as-code, DevOps, CI/CD pipelines, and general automation 

  • Hands-on experience with network security, databases systems

Preferred Requirements

  • Operating Kubernetes clusters 

  • Experience with cloud and infrastructure security regulations & compliance: SOC2, ISO27001, HIPAA, GDPR, CCPA 

  • Experience with ML Ops: Spark, TensorFlow, GPUs 

  • Experience with Terraform

Apply for this Job

* Required