The team at SIDEARM provides the technology platform that powers the official websites, mobile apps, statistical integration, live audio and video streaming, and e-commerce platforms of more than 1,600 collegiate athletic partners across the nation.

We’re a passionate mix of technologists, creatives, and strategists who care deeply about the products we create and the people we create them for. We stand at the intersection of sports and technology, and are constantly innovating and evolving our products to deeply integrate the two industries and provide exceptional products for our partners and their fans. Our team embraces a casual and collaborative work environment that moves at a rapid pace, where all team members are deeply involved in the success of our products and services.

We're fortunate enough to be trusted by some of the biggest brands in the industry, including 300 NCAA Division I programs and more than 55 of the 65 Power 5 athletic departments. We are proud that the work we do is experienced by millions of sports fans each year.

As we continue to grow, we need an outstanding Senior Site Reliability Engineer to join us on our journey. Our SRE team influences the technology growth that impacts millions of customers across the entertainment space. Opportunities abound to grow our products - and grow your career!

What you’ll do

Work in cross-discipline teams to ensure service reliability, availability, and performance
Collaborate with our domain engineering and Site Reliability Engineering teams to architect and maintain live services
Plan and forecast service capacity and demand, analyze software performance, and tune systems and software
Solve mission-critical incidents and build automation to prevent problem recurrence; automate away all toil
Identify root causes of production issues, and recommend permanent solutions for them
Set up and improve monitoring (metrics, logs, alerts, etc.) to identify issues quickly
Develop effective documentation, tooling, and alerts to identify and address risks
Actively participate in and offer solutions for keeping our environment secure. Review compliance and internal scans and work with development teams to stay ahead of security vulnerabilities
Develop runbooks for the Level I NOC team to reduce MTTD/MTTR for alerts
Participate in the on-call rotation with other members of the Site Reliability Engineering team

Qualifications

Experience with Linux container technologies (Docker, Kubernetes)
Experience with public and private clouds: GCP, OpenStack, AWS, and/or Azure
Understanding of cloud orchestration frameworks (Terraform, Kubernetes, Argo CD, Spinnaker, etc.) and their role in IT transformation
5+ years’ experience working with Linux systems and related tooling (kernel, shell, system libraries, file systems, client-server protocols, etc.)
The ability to read/write code fluently in C#, Python, or Go
Deep understanding of the software development lifecycle, including Git-based CI and CD pipelines
Networking: experience with network theory and protocols, e.g. TCP/IP, UDP, DNS, HTTP, TLS, and load balancing
Strong experience in distributed systems architectures – layered, event-driven, service mesh, etc.
Familiarity with distributed message buses such as Kafka or Confluent
Strong interpersonal and communication skills

Learfield offers a full spectrum of benefits for eligible employees including Medical, Dental, Vision, Health Savings Account, Life Insurance, Disability and Other Insurance Plans, Flexible Paid Time Off (including Parental Leave), Paid Holidays, 401(k), and Short/Long Term Disability.

All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, or disability status. Learfield is an Equal Opportunity Employer: Female / Minority / Disability / Protected Veteran / Sexual Orientation / Gender Identity.

#2334314

Sr. Site Reliability Engineer
