At Pachyderm, we're building an open-source enterprise-grade data science platform that lets you deploy and manage multi-stage, language-agnostic data pipelines while maintaining complete reproducibility and provenance. Our system, developed with open source roots, shifts the paradigm of data science workflows by providing reproducibility, data provenance, and opportunity for true collaboration. Pachyderm utilizes modern technologies like Docker and Kubernetes to build an entirely new method of analyzing data. Offered both as an in-house solution as well as hosted-service, Pachyderm brings together version-control for data with the tools to build scalable end-to-end ML/AI pipelines while empowering users to use any language, framework, or tool they want. If you want to learn more about our grand vision, read what has become our"manifesto."
Pachyderm is a rapidly growing, early-stage company funded by the top VC’s — Benchmark, Decibel, M12, and YCombinator. Like many modern companies, Pachyderm embraces a “Remote-first” approach to growing our team. It gives us a huge advantage in hiring top talent and diverse talent across the country while giving our team members the flexibility to work from anywhere.
Pachyderm is hiring distributed systems engineers to help us build out the core product -- a distributed version-controlled filesystem and data processing engine. You’ll be solving hard algorithmic and distributed systems problems every day and building a first-of-its-kind, containerized, data infrastructure platform.
While your primary focus will of course be building the core product, you’ll also have direct exposure to users and enterprise customers via our open source support channels. At Pachyderm, OSS user and customer feedback is major driver of our product roadmap and we believe that everyone within the company should experience that first-hand.
Pachyderm is just a small team right now, so you'd be getting in right at the ground floor and have an enormous impact on the success and direction of the company and product. You can of course check out the product on GitHub because it’s open-source.
We offer significant equity, full benefits, and all the usual startup perks.
2+ years of experience working in distributed systems, data infrastructure, back-end systems or related development work.
Major contribution to prominent and related open-source projects are a plus or can be a replacement for work experience in some circumstances (e.g. You’ve been a student just finishing your degree)
While it is a bonus, experience with Golang is not a strict requirement. Programming languages are just part of your arsenal and we’ve found that great engineers have no problem learning new tools.
Must have strong communication skills when talking about technical concepts. Our interview process strongly tests for communication as we have a very collaborative work environment where many parts of the codebase interact in complex ways.
Things change quickly as our product develops and breaking down major features into smaller and more easily executable PRs is an imperative skill.