Data Analytics Remote Internship – Natural Language Processing and Active Learning
Sumitovant is the parent company of five biopharmaceutical companies: Myovant, Urovant, Enzyvant, Altavant and Spirovant. Our companies have a robust pipeline of innovative drug treatments targeting a broad range of diseases. With a technology-enabled approach to drug discovery, development and commercialization, we remove friction from the process of bringing promising new compounds to market.
Our mission is to improve the lives of people globally by using technology to rapidly develop and deliver new therapies that address unmet needs. The Computational Research team is focused on building an analytical engine to enable the identification and valuation of drugs for potential acquisition.
As a Computational Research intern, you’ll be part of a team using data-driven approaches to streamline and support drug identification and evaluation. You will help set up our active learning infrastructure, improve existing NLP models and develop new ones as needed. We apply NLP models daily to thousands of documents and collect curation feedback from end users, which we use for active learning. We’d also like to explore reducing the size and inference time of BERT-based models (using pruning or distillation methods).
Who You Are:
Currently pursuing a Master’s degree (preferably in Computer Science applied to NLP)
Programming experience in Python
Experience using an ML framework such as TensorFlow 2, Keras or PyTorch
Experience working in a Linux environment is a plus
Passionate about our mission to systematically reduce the time, cost and risk of delivering new medicines to market
Flexible and able to adapt well to unstructured and ambiguous environments
Self-motivated, proactive and able to thrive in ambiguous situations