Coursera was launched in 2012 by two Stanford Computer Science professors, Andrew Ng and Daphne Koller, with a mission to provide universal access to world-class learning. It is now one of the largest online learning platforms in the world, with 148 million registered learners as of March 31, 2024.
Coursera works with over 325 leading university and industry partners to offer a broad catalog of content and credentials, including courses, Specializations, Professional Certificates, Guided Projects, and bachelor’s and master’s degrees. Institutions around the world use Coursera to upskill and reskill their employees, citizens, and students in fields such as data science, technology, and business. Coursera became a B Corp in February 2021.
Join us in our mission to create a world where anyone, anywhere can transform their life through access to education. We're seeking talented individuals who share our passion and drive to revolutionize the way the world learns.
We at Coursera are committed to building a globally diverse team and are thrilled to extend employment opportunities to individuals in any country where we have a legal entity. We require candidates to possess eligible working rights and have a compatible timezone overlap with their team to facilitate seamless collaboration. As a remote-first company, our interviews and onboarding are entirely virtual, providing a smooth and efficient experience for our candidates.
Overview:
As a Staff Software Engineer on our team, you will play a critical role in designing, implementing, and maintaining our highly available, scalable, and fault-tolerant infrastructure on AWS. You will collaborate with software engineering teams to ensure the reliability, performance, and scalability of our applications and services. This position requires a strong sense of ownership, technical expertise, and the ability to mentor other engineers.
Responsibilities:
- Design, implement, and maintain highly available, scalable, and fault-tolerant infrastructure on AWS using Terraform
- Collaborate with software engineering teams to ensure that applications and services are designed and architected for high availability, reliability, and performance
- Develop and implement monitoring and alerting systems to proactively identify and mitigate potential issues before they impact users
- Automate deployment, testing, and configuration management using tools like Jenkins
- Collaborate with cross-functional teams to design and implement disaster recovery plans and procedures
- Participate in incident response and post-mortem activities to identify root causes and prevent future occurrences
- Mentor and train junior SREs on best practices for infrastructure design, implementation, and maintenance
- Stay up to date with emerging technologies and industry trends to drive continuous improvement of our infrastructure and processes
Requirements:
- Bachelor's degree in Computer Science or related field, or equivalent experience
- 8+ years of experience in SRE or DevOps roles, with a focus on AWS
- Strong experience with infrastructure automation tools such as Terraform
- Deep understanding of AWS services such as EC2, ECS, IAM, and VPC, and ability to design and implement highly available, scalable, and fault-tolerant architectures using these services
- Strong understanding of networking concepts and protocols, including TCP/IP, DNS, load balancers, and firewalls
- Proficiency in at least one programming language (Python, Go, Java, etc.)
- Excellent communication and collaboration skills, with a passion for mentoring and training junior team members
- Strong problem-solving and analytical skills, with the ability to work independently and as part of a team
If you are passionate about transforming lives through learning, and you have the skills and experience we're looking for, we'd love to hear from you. Join our team at Coursera and help us build a better world through education.