About TrialSpark

TrialSpark is a technology-driven drug development company that runs end-to-end clinical trials, focused on bringing new treatments to patients faster and more efficiently.

The biggest bottleneck in bringing new treatments to patients is the clinical trial. On average, getting a drug through the trial process takes nearly a decade and frequently costs $1B+. To combat this industry problem, TrialSpark has built a technology platform that optimizes all aspects of a clinical trial, enabling more efficient trial design, faster trial completion, and higher trial data quality.

TrialSpark recently raised its Series C and is putting the capital to work by in-licensing and co-developing drug programs through in-house development, joint ventures, and NewCos. Together with doctors, patients, and communities, TrialSpark is working to develop the treatments of tomorrow.

About the Position

As a Senior Data Engineer on our Data Platform Team, you will own TrialSpark’s data infrastructure. You will make key data architecture decisions and lead significant greenfield initiatives to implement the next generation of our data platform and pipelines. You’ll enable product and analytical teams with timely, high-quality data from diverse sources, including application databases, partner Electronic Health Record systems, and medical devices. You’ll become a domain expert in clinical data and its application to products and operations across the company.

In this role, you’ll collaborate with Product Engineers and our Infrastructure Team to build data transformations that unlock efficiencies in the clinical trial process, for example by ingesting hundreds of millions of complicated health records, cleaning and structuring this data for analytical and product use cases, and identifying patients who will be served by previously inaccessible treatments. You’ll partner with the Analytics, Product, and Medical teams to set and achieve targets for data quality and latency, and build a learning feedback loop to improve on these targets over time. As a founding member of the Data Platform team, you will play a significant role in developing the team’s culture and strategy. Ultimately, you will leverage data to bring treatments to patients who may not have had access otherwise.


  • Evolve our infrastructure and data architecture to accommodate growing business needs, data volume, and data complexity
  • Collaborate with operational and product partners to achieve business and mission outcomes
  • Design and build data pipelines to clean and structure clinical data
  • Deploy tools to continuously monitor, test, and optimize data pipelines to ensure timely delivery and high data quality
  • Partner with Analytical stakeholders to assess the quality of our data and automate targeted improvements
  • Safeguard patient privacy and trial data integrity by implementing data privacy and security measures, such as de-identification of Personally Identifiable Information
  • Partner with our Analytics team to maintain and evolve our modern data stack as necessary (Looker, Redshift, DBT, Stitch)
  • Help enforce best practices and promote testability and maintainability throughout our systems and codebase


  • Three or more years of professional software development experience, preferably in a data-oriented role (e.g. Data Engineer)
  • Professional experience building and maintaining data pipelines (e.g. Airflow, Prefect, Luigi, AWS Glue or Batch)
  • Fluency in SQL and at least one other programming language (Python preferred)
  • Strong knowledge of data modeling with a track record of creating simple models to solve for nuanced product needs from complex data
  • Experience architecting data systems
  • Comfortable with Linux, Docker, and cloud technologies
  • Excellent problem solving and debugging skills
  • Strong written and verbal communication skills with the ability to convey complicated systems to both technical and non-technical audiences

Nice to have

  • Experience building cross-functional feedback loops
  • Experience with infrastructure-as-code tools (e.g. Terraform, Ansible, Pulumi)
  • Experience performance tuning row-based (e.g. PostgreSQL) and columnar (e.g. Redshift) data stores
  • Experience working in a regulated environment (Healthcare, Finance, etc.)
  • Experience working with healthcare data (Electronic Health Records, Insurance Claims, etc.)
  • B.S. in Computer Science or related field

You will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or veteran status.

Voluntary Self-Identification

For government reporting purposes, we ask candidates to respond to the below self-identification survey. Completion of the form is entirely voluntary. Whatever your decision, it will not be considered in the hiring process or thereafter. Any information that you do provide will be recorded and maintained in a confidential file.

As set forth in TrialSpark’s Equal Employment Opportunity policy, we do not discriminate on the basis of any protected group status under any applicable law.

If you believe you belong to any of the categories of protected veterans listed below, please indicate by making the appropriate selection. As a government contractor subject to the Vietnam Era Veterans Readjustment Assistance Act (VEVRAA), we request this information in order to measure the effectiveness of the outreach and positive recruitment efforts we undertake pursuant to VEVRAA. Classification of protected categories is as follows:

A "disabled veteran" is one of the following: a veteran of the U.S. military, ground, naval or air service who is entitled to compensation (or who but for the receipt of military retired pay would be entitled to compensation) under laws administered by the Secretary of Veterans Affairs; or a person who was discharged or released from active duty because of a service-connected disability.

A "recently separated veteran" means any veteran during the three-year period beginning on the date of such veteran's discharge or release from active duty in the U.S. military, ground, naval, or air service.

An "active duty wartime or campaign badge veteran" means a veteran who served on active duty in the U.S. military, ground, naval or air service during a war, or in a campaign or expedition for which a campaign badge has been authorized under the laws administered by the Department of Defense.

An "Armed Forces service medal veteran" means a veteran who, while serving on active duty in the U.S. military, ground, naval or air service, participated in a United States military operation for which an Armed Forces service medal was awarded pursuant to Executive Order 12985.

Form CC-305

OMB Control Number 1250-0005

Expires 05/31/2023

Voluntary Self-Identification of Disability

Why are you being asked to complete this form?

We are a federal contractor or subcontractor required by law to provide equal employment opportunity to qualified people with disabilities. We are also required to measure our progress toward having at least 7% of our workforce be individuals with disabilities. To do this, we must ask applicants and employees if they have a disability or have ever had a disability. Because a person may become disabled at any time, we ask all of our employees to update their information at least every five years.

Identifying yourself as an individual with a disability is voluntary, and we hope that you will choose to do so. Your answer will be maintained confidentially and not be seen by selecting officials or anyone else involved in making personnel decisions. Completing the form will not negatively impact you in any way, regardless of whether you have self-identified in the past. For more information about this form or the equal employment obligations of federal contractors under Section 503 of the Rehabilitation Act, visit the U.S. Department of Labor’s Office of Federal Contract Compliance Programs (OFCCP) website at www.dol.gov/ofccp.

How do you know if you have a disability?

You are considered to have a disability if you have a physical or mental impairment or medical condition that substantially limits a major life activity, or if you have a history or record of such an impairment or medical condition.

Disabilities include, but are not limited to:

  • Autism
  • Autoimmune disorder, for example, lupus, fibromyalgia, rheumatoid arthritis, or HIV/AIDS
  • Blind or low vision
  • Cancer
  • Cardiovascular or heart disease
  • Celiac disease
  • Cerebral palsy
  • Deaf or hard of hearing
  • Depression or anxiety
  • Diabetes
  • Epilepsy
  • Gastrointestinal disorders, for example, Crohn's disease or irritable bowel syndrome
  • Intellectual disability
  • Missing limbs or partially missing limbs
  • Nervous system condition, for example, migraine headaches, Parkinson’s disease, or multiple sclerosis (MS)
  • Psychiatric condition, for example, bipolar disorder, schizophrenia, PTSD, or major depression

1. Section 503 of the Rehabilitation Act of 1973, as amended. For more information about this form or the equal employment obligations of Federal contractors, visit the U.S. Department of Labor's Office of Federal Contract Compliance Programs (OFCCP) website at www.dol.gov/ofccp.

PUBLIC BURDEN STATEMENT: According to the Paperwork Reduction Act of 1995, no persons are required to respond to a collection of information unless such collection displays a valid OMB control number. This survey should take about 5 minutes to complete.