At 98point6, our mission is to bring technology and medicine together to reinvent primary healthcare. You’ll collaborate with multiple teams, work across disciplines, and encounter and conquer many new and exciting challenges. Each day you’ll be working toward profoundly transforming primary care.
Your Role and Impact
As a Site Reliability Engineer (SRE) at 98point6, you will design, implement, and operate systems and software that deal with federally protected health information. You will play a central role in supporting and advising other engineering teams (e.g. backend, mobile, web) as they build and operate services and applications that incorporate principles of operational excellence. As we grow, you will optimize existing services, expand our AWS infrastructure with serverless components, and operate highly available and reliable systems.
Participate in the design, architecture and implementation of the systems, software, networks and services required to operate 98point6 products
Identify and develop processes, tools, automation, and software changes to address top operational issues
Lead development of solutions to complex operational or reliability challenges
Work in close collaboration with software development leadership and support operations technical leads to shape the future roadmap and establish strong operational readiness across teams
Identify and exploit efficiencies in operational engineering through automation
Design the systems and processes that engineers use to manage and deploy their software into production
Practice sustainable incident response, root cause analysis and blameless postmortems
Improve the cost efficiency of our operational infrastructure
Bachelor’s or master’s degree in computer science or math, or equivalent experience
3+ years of experience designing and coding in Java, C#, Python or a similar language for production services that operate on the public internet
Familiarity with Linux fundamentals and network protocols such as TCP, HTTP and DNS
Experience using AWS (API Gateway, Lambda, DynamoDB, WAF, CloudFront, RDS, VPC) or similar solutions to build infrastructure
Experience running mission-critical, 24/7 systems
Experience with incident response and root-cause analysis
Excellent written and oral communication skills, including the ability to communicate with both technical and business teams
98point6 provides equal employment opportunities to all without regard to race, color, religion, sex (including sexual orientation or gender identity), national origin, age, disability, genetic information or other protected status.