The team at SIDEARM provides the technology platform that powers the official websites, mobile apps, statistical integration, live audio and video streaming, and e-commerce platforms of more than 1,600 collegiate athletic partners across the nation.
We’re a passionate mix of technologists, creatives, and strategists that care deeply about the products we create, and the people we create them for. We stand at the intersection of sports and technology, and are constantly innovating and evolving our products to deeply integrate the industries and provide exceptional products for our partners and their fans. Our team embraces a casual and collaborative work environment that moves at a rapid pace, where all team members are deeply involved in the success of our products and services.
We're fortunate enough to be trusted by some of the biggest brands in the industry, including 300 NCAA Division I programs, and over 55 of the 65 Power 5 athletic departments. We are proud that the work we do is experienced by millions of sports fans each year.
As we continue to grow, we need an outstanding Senior Site Reliability Engineer to travel on our journey together. Our SRE team influences the technology growth that impacts millions of customers across the entertainment space. Opportunities abound to grow our products - and grow your career!
What you’ll do
- Work in cross-discipline teams to ensure service reliability, availability, and performance
- Collaborate with our domain engineering and Site Reliability Engineering teams to architect and maintain live services
- Plan and forecast service capacity and demand, analyze software performance, and tune systems and software
- Solve mission-critical incidents and build automation to prevent problem recurrence; automate away all toil
- Identify root causes of production issues, and recommend permanent solutions for them
- Setup and improve monitoring (metrics, logs, alerts, etc) to identify issues quickly
- Develop effective documentation, tooling, and alerts to identify and address risks
- Actively participate/offer solutions to keep our environment secure. Review compliance and internal scans and work with development teams to stay ahead of security vulnerabilities
- Develop Run Books for Level I NOC team to reduce MTTD/MTTR for alerts.
- Participate in on-call rotation with other members of Site Reliability Engineering team
Qualifications
- Experience with Linux container technologies (Docker, Kubernetes)
- Experience with public and private clouds: GCP, OpenStack, AWS, and/or Azure
- Understanding of cloud orchestration frameworks (terraform, Kubernetes, argoCD, spinnaker, etc) and their role in IT transformation
- 5+ years’ experience working with Linux systems and related tooling (kernel, shell, system libraries, file systems, client-server protocols, etc)
- The ability to read/write code fluently in C#, Python, or Go
- Deep understanding of software development lifecycle including git-based CI and CD pipelines
- Networking: experience with network theory and protocols, e.g. TCP/IP, UDP, DNS, HTTP, TLS, and load balancing.
- Strong experience in distributed systems architectures – layered, event-driven, service mesh, etc.
- Familiarity with distributed message buses such as Kafka, Confluent
- Strong interpersonal and communication skills
Learfield offers a full spectrum of benefits for eligible employees including Medical, Dental, Vision, Health Savings Account, Life Insurance, Disability and Other Insurance Plans, Flexible Paid Time Off (including Parental Leave), Paid Holidays, 401(k), and Short/Long Term Disability.
All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, or disability status. Learfield is an Equal Opportunity Employer: Female / Minority / Disability / Protected Veteran / Sexual Orientation / Gender Identity.
#2334314