DataCamp is building the future of data science education. Our students get real hands-on experience by completing self-paced, interactive data science courses from the best instructors in the world, right in the browser. In fact, nearly two million students around the world have completed more than 87 million DataCamp exercises to date!
We are looking for a talented data engineer who can empower us to take advantage of our data. You will be responsible for managing our entire data flow, ingesting data from a variety of sources including SQL, streams, and external APIs. We're not looking for an ETL engineer, but rather someone who can build reusable and scalable tools that empower the data team. You will work closely with the data scientists and infrastructure engineers to provide the platforms and technology needed to create and support recommendation engines, internal dashboards, and A/B testing. You will take the lead in assessing priorities and planning your work.
The ideal candidate has
A proven track record of manipulating, processing, and extracting value from large, disconnected datasets
Advanced SQL knowledge and experience working with relational databases, as well as familiarity with a variety of data stores
Experience and willingness to work in Python
Experience with Spark
Experience with AWS or another cloud computing platform (Azure, Google Cloud, etc.)
Comfort in an agile environment with rapid iteration
It’s a plus if you have
Experience working closely with data scientists
Experience with a NoSQL data store such as MongoDB or DocumentDB