About AlphaSense: 

AlphaSense is a market intelligence platform used by the world's leading companies and financial institutions. Since 2011, our AI-based technology has helped professionals make smarter business decisions by delivering insights from an extensive universe of public and private content—including company filings, event transcripts, news, trade journals, and equity research. Our platform is trusted by over 1,800 enterprise customers, including a majority of the S&P 500. Headquartered in New York City, AlphaSense employs over 450 people across offices in the U.S., U.K., Finland, and India. For more information, please visit www.alpha-sense.com.
 
Check out what we’ve built so far: https://www.youtube.com/watch?v=QPMfWLwv23I

The Role: 

You will build a scraping framework and new web sources can be added easily using it. This system will fetch millions of documents every month. The ideal candidate has strong crawling & scraping skills using Python and accompanied by solid experience with working with cloud computing. 

You will join our team of world-class experts developing the AlphaSense platform. The team is right at the very core of what we do and responsible for implementing the cutting edge technology for scalable, distributed processing of crawling, searching and text processing functions.  

Role Responsibilities: 

  • Developing scalable systems to crawl and scrape data (in form of HTML pages or documents)
  • Quickly create proof of concepts for new crawlers/scrapers
  • Leveraging cloud computing resources (AWS) to optimally execute back-end processing

Candidate Requirements: 

Must Have

  • Bachelor’s or Master’s Degree in Computer Science or a related discipline.
  • Minimum 5 years of software development experience mainly with Python
  • Minimum 2 years of hands-on experience in crawling/scraping using frameworks such as Scrapy, Beautiful Soup, Selenium
  • Strong fundamental C.S. skills (Data structures, algorithms, multithreading etc.).
  • Solid hands on experience in working in distributed and scalable application environments.
  • Excellent oral and written communication skills.

Nice to have

  • Experience with working on Linux platforms, Docker, K8s
  • Working knowledge on AWS Services (S3, DynamoDB, RDS, SQS, SNS)
  • Experience with Java. 

 

Apply for this Job

* Required