Senior C++ Software Engineer – Data Engines
We are looking for a highly motivated Senior C++ Software Engineer – Data Engines to join our team at Voltron Data. In this role, you’ll have the opportunity to work directly on Theseus, our accelerator-native data processing engine built for composability. You will collaborate closely with Voltron Data’s development teams to build, optimize, and maintain our data execution framework, enhancing its scalability, performance, and feature set. You’ll also contribute to core architectural components designed to handle Petabyte-scale data processing.
Why Work at Voltron Data?
- We are Going for Impact: Voltron Data is post-Series A, venture-backed startup building a new foundation for data analytics with Apache Arrow. Our mission is to drive innovation in data processing by harnessing the speed and efficiency of modern hardware.
- We are Committed to Bridging Open Source Communities: Our team includes open-source maintainers who have led the development of C++, Python, and R ecosystems over the past 15 years.
- We are Building a Diverse, Inclusive Company: Voltron Data values creating an equitable workplace that prioritizes growth for all employees. Every voice is essential to shaping the company’s future.
Responsibilities
Upon Joining:
- Gain a deep understanding of Apache Arrow, the compute primitives in Theseus, query parsing, optimizers, and foundational components.
- Learn the architecture of the data processing engine, including component interactions and compute graph data flows.
- Master memory management mechanics, including memory spilling from GPU to Host and Disk.
- Embrace Voltron Data’s software development culture.
Within 1 Month:
- Profile single-node and distributed query executions, analyze telemetry, and solve distributed engine issues.
- Study distributed relational algebra algorithms to identify areas for improvement.
- Work with the team on bug fixes, code optimizations, and refactoring.
Within 6 Months:
- Build new relational algebra components to expand SQL/DataFrame functionality.
- Implement improvements in resource management, task scheduling, and fault tolerance.
Within 12 Months:
- Propose and implement core architectural enhancements for the engine.
- Work on complex projects such as language-agnostic user-defined functions, multi-query concurrency, and multi-tenancy.
- Integrate the engine with other company-developed components to deliver enterprise-grade solutions.
Required Experience
- Expertise with data processing engines or frameworks.
- Experience in distributed High-Performance Computing (HPC) systems.
- Hands-on knowledge of GPU and CPU resource management, including memory and thread pools.
- Several years of experience developing in C++.
- Experience developing for multiple hardware types (CPU, GPU).
- Familiarity with UCX or libfabric.
- Experience working with open-source projects such as Apache Arrow, Flight, Rapids, Velox, and Data Fusion.
Preferred Skills
- Deep understanding of distributed and multi-threaded systems.
- Proficiency in SQL and non-SQL systems with their computational abstractions.
US Compensation
The salary range for this role is between $171,000.00 to $210,000.00. Compensation varies globally and will be based on factors such as work location, skills, and experience. This position is also eligible for additional incentives, including equity awards.
Join us at Voltron Data and play a critical role in shaping the future of data analytics!
Benefits
To All Agencies: Please, no phone calls or emails to any employee of Voltron Data outside of the Talent Acquisition team. Voltron Data's policy is to only accept resumes from agencies via the Voltron Data Agency Portal. Agencies must have a valid fee agreement in place and they must have been assigned the specific requisition to which they submit resumes, by the Talent Acquisition team. Any resume submitted outside of this process will be deemed the sole property of Voltron Data and in the event a candidate is submitted outside of this policy is hired, no fee or payment of any kind will be paid