We accept applications from remote employees in the following US states: MA, CO, CT, ME, NH, NJ, NY, RI, VT.
edX’s Site Reliability Engineering (SRE) team is foundational to the success of our growth and scale. We have come this far, educating more than 38 million learners on edX.org, enabling them to advance their careers and their lives.
The ideal candidate has a strong SRE profile and is comfortable working with complex systems in a cloud environment.
You will be part of a team based in the US, with additional support from engineers offshore. The team forms a center of operational expertise for the organization, currently leading us in a transformational journey to containerized kubernetes clusters and modern configuration and deployment pipelines. In addition to providing these platform-level services, the team functions as an enabler for product delivery teams - with training, automation, and operational support.
- Build tools to automate and accelerate edX engineering teams’ ability to ship code
- Design and develop systems and processes that enable highly available, scalable systems
- Leverage technology such as Python, Django, AWS, Docker, Kubernetes, Bash, Terraform, MySQL, Redis, elasticsearch, and more to advance our platform
- Champion best practices by collaborating with other teams on a shared vision of success
- Help the Open edX community stand up their own Open edX platforms and make contributions to edX’s open-source codebases
- Rapidly diagnose and resolve faults with services as a member of an on-call rotation focused on investing in actionable alerting and automation to reduce alert fatigue
- Contribute to the company in multiple areas, constantly pushing yourself to be a better engineer and to level up peers within your team and across edX
- 4+ years of professional experience with at least two years responsible for production systems
- Adept skills in at least one programming language, preferably Python
- A working knowledge of Linux both as an end-user and as an administrator
- Experience running web applications in a production environment
- Strong business communication skills for interfacing with technical and non-technical business stakeholders
- Nginx, MySQL, MongoDB, Django, Splunk, Git, and Jenkins knowledge
- A robust understanding of automation tools, continuous integration pipelines, using CI/CD systems and configuration management
Why You’ll Like it Here
- edX is collaborative at its core. You’ll work within your team and across the organization, allowing for continuous learning and discovery.
- We’re on a mission to unlock our learners potential on a global level, seeking to create a more diverse, equitable, and inclusive world.
- We set outcomes that matter and provide value in all that we do, from building meaningful products to serving the edX community.
We understand that applying for a job can be intimidating. Applicants rarely meet every single job requirement, and we know there are many skills and backgrounds that will contribute to success in this role.
That’s why we provide new employees with:
- Employee on-boarding and training sessions
- Personalized 30/60/90+ day plans
- Individual quarterly and annual goals
- Career pathways
And much more! If this role looks like a great next step for you, please apply… even if you can’t “check every box.” We’d love to hear from you!
edX is the education movement for restless learners. Together with our founding partners Harvard and MIT, we’ve brought together more than 38 million learners, the majority of top-ranked universities in the world, and industry-leading companies onto one online learning platform that supports learners at every stage. And we’re not stopping there—as a global nonprofit, we’re relentlessly pursuing our vision of a world where every learner can access education to unlock their potential, without the barriers of cost or location.