Our product infrastructure team is looking for a data engineer to reform the Wikimedia Foundation’s approach to data instrumentation, so that we can continuously learn how to make Wikipedia and its sister projects more useful. This is a great position for you if you love instrumenting the full stack and cherish the opportunity to help software engineers, data scientists, and product managers through better data engineering tools and techniques. You will play an important role in improving our data insight capabilities, nurturing an even more data informed culture in our products and programs.

You will have the opportunity to work with software that serves over half a billion pages per day through Wikipedia (and other Wikimedia sites), build components that operate in a privacy first manner, and modernize code and practices that support experimentation, measurement, monitoring & alerting, and reporting in our product department.

Are you ready to write open source software that operates on data at the scale and speed of Wikipedia traffic, innovate in applied data science that respects user privacy, and help create a world in which everyone can freely share in the sum of all knowledge?

We are a family friendly employer and have a great benefits package. We look forward to your application.

You will do these things:

  • Develop an overall approach to product instrumentation that fits the Wikimedia Foundation’s unique goals and responsibilities to our communities
  • Clearly and concisely communicate complex ideas about software, instrumentation, and statistics
  • Author polished code that makes it possible to generate and validate data, using a number of languages in contexts such as A/B testing and the data pipeline - Python, JavaScript, PHP, Java, Scala and more
  • Work with product analysts to integrate incoming data into ad hoc analytics tasks and automated analytics workflows
  • Coach web and native iOS & Android software engineers on effective use of instrumentation and other data tooling
  • Build up a collection of repeatable patterns and practices and ensure that data engineering components and architecture will scale sustainably
  • Partner with our Analytics Engineering team and Product Analytics team on enhancements to big data architecture

We would like you to have these skills:

  • 3+ years experience working with data in a professional environment
  • 2+ years building web and apps involving instrumentation components, frameworks, and data models.
  • Experience with Hadoop (HDFS, Hive, YARN, MapReduce) and later technology like Spark
  • Experience with NoSQL databases like Cassandra or Druid and “conventional” SQL ones like MySQL
  • Experience with data friendly languages like Python, R, Scala, or Hadoop-flavored Java
  • Application server experience such as PHP or Node.js
  • Focused software engineering: you enjoy writing unit tests, reviewing code and responding to code reviews, and discussing architectural approach
  • A bachelor’s, master’s, or doctorate in data science, computer science, or other scientific field; or equivalent experience.
  • A love of free knowledge and open access

These are also pluses:

  • Exposure to applied machine learning (ML), deep learning, natural language processing (NLP)
  • Native apps programming experience (e.g., Objective-C, Swift, Android Java / Android SDK, Kotlin)
  • Experience with Jupyter, Mathematica, or similar notebook technology
  • Professional use of distributed and in-memory databases
  • Application of information security to big data
  • Experience with an internet software environment operating at scale. For example: messaging platforms that process hundreds of thousands of events per second.
  • Track record of working remotely with teams distributed across many time zones
  • Examples of connecting software to CI tools like Jenkins or Travis
  • Contributions to Wikipedia (or editing wikitext on other wikis) and other open access / open source ecosystems


If you have any existing open source software that you've developed (these could be your own software or patches to other packages), please share the URLs for the source. Links to your projects on GitHub, GitLab, BitBucket, etc. are exceptionally useful.

The Wikimedia Foundation is... 

...the nonprofit organization that supports Wikipedia and the other Wikimedia free knowledge projects. Our vision is a world in which every single human can freely share in the sum of all knowledge. We believe that everyone has the potential to contribute something to our shared knowledge, and that everyone should be able to access that knowledge, free of interference. We host the Wikimedia projects, build software experiences for reading, contributing, and sharing Wikimedia content, support the volunteer communities and partners who make Wikimedia possible, and advocate for policies that enable Wikimedia and free knowledge to thrive. The Wikimedia Foundation is a charitable, not-for-profit organization that relies on donations. We receive financial support from millions of individuals around the world, with an average donation of about $15. We also receive donations through institutional grants and gifts. The Wikimedia Foundation is a United States 501(c)(3) tax-exempt organization with offices in San Francisco, California, USA.

The Wikimedia Foundation is an equal opportunity employer, and we encourage people with a diverse range of backgrounds to apply

Benefits & Perks *

  • Fully paid medical, dental and vision coverage for employees and their eligible families (yes, fully paid premiums!)
  • The Wellness Program provides reimbursement for mind, body and soul activities such as fitness memberships, baby sitting, continuing education and much more
  • The 401(k) retirement plan offers matched contributions at 4% of annual salary
  • Flexible and generous time off - vacation, sick and volunteer days, plus 19 paid holidays - including the last week of the year.
  • Family friendly! 100% paid new parent leave for seven weeks plus an additional five weeks for pregnancy, flexible options to phase back in after leave, fully equipped lactation room.
  • For those emergency moments - long and short term disability, life insurance (2x salary) and an employee assistance program
  • Pre-tax savings plans for health care, child care, elder care, public transportation and parking expenses
  • Telecommuting and flexible work schedules available
  • Appropriate fuel for thinking and coding (aka, a pantry full of treats) and monthly massages to help staff relax
  • Great colleagues - diverse staff and contractors speaking dozens of languages from around the world, fantastic intellectual discourse, mission-driven and intensely passionate people

* for benefits eligible staff, benefits may vary by location

More information



Annual Report - 2017

Wikimedia 2030

Apply for this Job
* Required

File   X
File   X

U.S. Equal Opportunity Employment Information (Completion is voluntary)

Individuals seeking employment at Wikimedia Foundation are considered without regards to race, color, religion, national origin, age, sex, marital status, ancestry, physical or mental disability, veteran status, gender identity, or sexual orientation. You are being given the opportunity to provide the following information in order to help us comply with federal and state Equal Employment Opportunity/Affirmative Action record keeping, reporting, and other legal requirements.

Completion of the form is entirely voluntary. Whatever your decision, it will not be considered in the hiring process or thereafter. Any information that you do provide will be recorded and maintained in a confidential file.

Race & Ethnicity Definitions

If you believe you belong to any of the categories of protected veterans listed below, please indicate by making the appropriate selection. As a government contractor subject to Vietnam Era Veterans Readjustment Assistance Act (VEVRAA), we request this information in order to measure the effectiveness of the outreach and positive recruitment efforts we undertake pursuant to VEVRAA. Classification of protected categories is as follows:

A "disabled veteran" is one of the following: a veteran of the U.S. military, ground, naval or air service who is entitled to compensation (or who but for the receipt of military retired pay would be entitled to compensation) under laws administered by the Secretary of Veterans Affairs; or a person who was discharged or released from active duty because of a service-connected disability.

A "recently separated veteran" means any veteran during the three-year period beginning on the date of such veteran's discharge or release from active duty in the U.S. military, ground, naval, or air service.

An "active duty wartime or campaign badge veteran" means a veteran who served on active duty in the U.S. military, ground, naval or air service during a war, or in a campaign or expedition for which a campaign badge has been authorized under the laws administered by the Department of Defense.

An "Armed forces service medal veteran" means a veteran who, while serving on active duty in the U.S. military, ground, naval or air service, participated in a United States military operation for which an Armed Forces service medal was awarded pursuant to Executive Order 12985.

Form CC-305

OMB Control Number 1250-0005

Expires 1/31/2020

Voluntary Self-Identification of Disability

Why are you being asked to complete this form?

Because we do business with the government, we must reach out to, hire, and provide equal opportunity to qualified people with disabilities1. To help us measure how well we are doing, we are asking you to tell us if you have a disability or if you ever had a disability. Completing this form is voluntary, but we hope that you will choose to fill it out. If you are applying for a job, any answer you give will be kept private and will not be used against you in any way.

If you already work for us, your answer will not be used against you in any way. Because a person may become disabled at any time, we are required to ask all of our employees to update their information every five years. You may voluntarily self-identify as having a disability on this form without fear of any punishment because you did not identify as having a disability earlier.

How do I know if I have a disability?

You are considered to have a disability if you have a physical or mental impairment or medical condition that substantially limits a major life activity, or if you have a history or record of such an impairment or medical condition.

Disabilities include, but are not limited to:

  • Blindness
  • Deafness
  • Cancer
  • Diabetes
  • Epilepsy
  • Autism
  • Cerebral palsy
  • Schizophrenia
  • Muscular dystrophy
  • Bipolar disorder
  • Major depression
  • Multiple sclerosis (MS)
  • Missing limbs or partially missing limbs
  • Post-traumatic stress disorder (PTSD)
  • Obsessive compulsive disorder
  • Impairments requiring the use of a wheelchair
  • Intellectual disability (previously called mental retardation)
Reasonable Accommodation Notice

Federal law requires employers to provide reasonable accommodation to qualified individuals with disabilities. Please tell us if you require a reasonable accommodation to apply for a job or to perform your job. Examples of reasonable accommodation include making a change to the application process or work procedures, providing documents in an alternate format, using a sign language interpreter, or using specialized equipment.

1Section 503 of the Rehabilitation Act of 1973, as amended. For more information about this form or the equal employment obligations of Federal contractors, visit the U.S. Department of Labor's Office of Federal Contract Compliance Programs (OFCCP) website at

PUBLIC BURDEN STATEMENT: According to the Paperwork Reduction Act of 1995 no persons are required to respond to a collection of information unless such collection displays a valid OMB control number. This survey should take about 5 minutes to complete.