Clearwater Analytics® is a global SaaS solution for automated investment data aggregation, reconciliation, accounting, and reporting. Clearwater helps thousands of organizations make the most of investment portfolio data with cloud-native software and client-centric servicing. Every day, investment professionals worldwide trust Clearwater to deliver timely, validated investment data and in-depth reporting. Clearwater aggregates, reconciles, and reports on more than $5 trillion in assets across many Fortune 500 clients.
The Platform Engineering Division is looking for a Sr. Site Reliability Engineer to help us shape the future of our infrastructure and evolve our SRE practices. In this role, you’ll join a team of engineers responsible for the infrastructure that internal and external customers rely on to run the business, with observability, scalability, and capacity planning as its main functions. The team you’ll join and help grow is a talented and diverse group of engineers focused on providing a highly reliable and performant platform that Clearwater’s engineering teams use to deliver cutting-edge features. This team is responsible for all aspects of the platform, including maintenance, upgrades, automation, and new features that support the platform's scale and reliability.
What you’ll be doing as a Site Reliability Engineer
Designing, automating, maintaining and monitoring our development environment and supporting infrastructure, applications and continuous integration
Designing and supporting robust build, deployment, and configuration management systems for multi-tier Java applications
Working closely with the System Operations team to establish baseline architecture requirements and best practices for optimization, security, high availability, and resource planning
Performing proactive application monitoring, and configuring and maintaining a healthy system using industry-standard monitoring tools
Utilizing DevOps principles and tools to improve systems and processes
Aiding troubleshooting efforts at the infrastructure, container, and application layers
Logging and monitoring critical log events
What we're looking for
Extensive experience testing, troubleshooting, and improving both application and database performance
Scripting and automation experience
Attitude to thrive in a fun, fast-paced, start-up-like environment
Ability to excel at problem solving, adapt easily to change, and contribute both as part of a team and individually
Passion for DevOps, systems monitoring, automation, and reliability
Experience in software development (Java) is preferred but not required.