Continuously improve Duolingo’s infrastructure through automation and monitoring.
As a Senior Operations Engineer, you will use and develop cutting-edge tools to enable growth and stability across all Duolingo services and platforms. You will also act as an advisor for how to best utilize these tools internally to properly deploy, manage, monitor, and control the costs of production services.
Automate the deployment and management of Duolingo software services
Monitor for performance and security anomalies
Participate in weekly on-call rotations
Streamline operational and development practices
Analyze current costs and plan for future growth
You Have...A Bachelor’s degree in Computer Science or related technical field
5+ years of experience managing large services/infrastructure
Knowledge of Linux system administration and automation
Strong background in computer network security, performance, and capacity planning
Considerable experience with one or more configuration management (Ansible, Puppet, Chef) or infrastructure management tools (Terraform, CloudFormation)
Familiarity with containerization runtimes (Docker, rkt)
Experience deploying and maintaining large-scale monitoring and logging systems (Zabbix, Nagios, ELK, Splunk)
Scripting experience with Python and Bash
Exceptional Candidates Will Have...
Master’s degree in Computer Science or related technical field
Experience managing a large infrastructure on AWS or Google Cloud
Experience implementing continuous integration pipelines in Jenkins
The ability to configure and troubleshoot enterprise-grade routers, switches, and wireless access points
Extensive experience with relational and no-SQL database management
We aim to return to office, and as such are requiring all employees to be fully vaccinated against COVID-19.