Petuum Inc. has a Senior Software Engineer position in Pittsburgh, PA responsible for building robust, effective, and well-packaged modern machine learning systems, as well as contributing to our CASL open source projects. Specific duties include: (i) collaborating with system architects, designers, and engineers to support the development of robust machine learning systems; (ii) contributing high-quality code and lead efforts in building Petuum’s open-source CASL projects, such as AdaptDL, AutoDist, and Tuun; (iii) developing parallel programming techniques to simplify distributed ML programming; (iv) assessing and recommending technology choices and directions in consideration of cost-benefit trade-off; (v) communicating work to a broader audience through talks, tutorials, and blog posts; and (vi) mentoring junior engineers to facilitate the development process.
- Must have a master’s degree (or foreign equivalent) in Computer Science, Machine Learning, or a related field plus five (5) years of experience in a related position.
- Must also have five (5) years of experience with each of the following (can be concurrent): (i) database systems; (ii) network systems; (iii) storage systems; and (iv) C or C++.
- Must have three (3) years of experience with both of the following (can be concurrent): (i) distributed systems; and (ii) system performance analysis of distributed systems.
- Must also know: (i) Python; (ii) Machine Learning or Deep Learning; (iii) PyTorch or TensorFlow; and (iv) development and troubleshooting in Docker environment.