Job Overview

We are seeking a skilled and motivated individual to join our team as a Data Engineer. The successful candidate will assist in research, support model training, and optimize machine learning solutions. The role requires organizing relevant data, analyzing test results, and fine-tuning models. Additionally, the candidate will be responsible for analyzing and interpreting large datasets to support our B2B AI large language model (LLM) Verification solution.

OneDegree Tech Blog: https://medium.com/onedegree-tech-blog

 

Responsibilities

  • Assist in Research and Support Model Training:
    1. Provide technical support during the model training phase, ensuring the process runs smoothly and efficiently.
    2. Conduct literature reviews and stay updated with the latest advancements in machine learning and AI.
  • Evaluate and Optimize Machine Learning Solutions:
    1. Perform thorough evaluations of machine learning models to assess their performance and accuracy.
    2. Implement optimization techniques to enhance model efficiency and effectiveness.
  • Organize Relevant Data Based on Requirements and Optimize Related Models:
    1. Collect and organize datasets according to project requirements, ensuring data quality and integrity.
    2. Continuously optimize models based on data insights and project needs.
  • Analyze Test Results and Fine-Tune Models:
    1. Conduct detailed analyses of test results to identify patterns, anomalies, and areas for improvement.
  • Analyze and Interpret Large Datasets:
    1. Work with large volumes of data to extract meaningful insights that inform model development.

 

Requirements

  • Proficient in Data Warehousing, Data Preprocessing Concepts, Data Cleaning, Data Preparation, Tokenization, and Related Data.
  • Skilled in tokenization and transforming raw data into structured formats suitable for machine learning models.
  • Proficient in Python (including numpy, scipy, pandas) and OpenAI.
  • Experience with SQL and Database Design (e.g., SQL/NoSQL/Vector DB).
    1. Understanding of database design principles, including both SQL and NoSQL databases.
    2. Familiarity with vector databases and their application in AI and machine learning contexts.
  • Experience with OpenAI, Langchain, and RAG Architecture:
    1. Hands-on experience with OpenAI technologies and integrating them into machine learning workflows.
    2. Knowledge of Langchain and RAG (Retrieval-Augmented Generation) architecture, and their implementation in practical projects.
  • Demonstrated ability to analyze large datasets, using statistical and machine learning techniques to derive insights.
  • Strong Team Collaboration and Self-Learning Abilities:
    1. Proven ability to work effectively in a team environment, collaborating with colleagues to achieve common goals.
    2. Self-motivated with a strong desire to continuously learn and stay updated with the latest industry trends and technologies.
  • Preferred Qualifications
  • Familiarity with Machine Learning Frameworks (e.g., Keras, TensorFlow, PyTorch) is a Plus:
    1. Experience using popular machine learning frameworks such as Keras, TensorFlow, or PyTorch.

  • Familiarity with NLU/NLG/NLP Architectures (e.g., BERT, Transformers) is a Plus:
    1. Knowledge of Natural Language Understanding (NLU), Natural Language Generation (NLG), and Natural Language Processing (NLP) architectures.
    2. Understanding of Java programming language and its application in data engineering and machine learning projects.

 

Other Benefits 

To us, people are our greatest asset, and we are more than happy to invest in employees! We create a healthy work atmosphere and provide you with the tools and support for doing your job successfully. With a culture of flexibility and transparency, we believe there should be no barriers, and everyone’s contributions matter. 

Work Life Balance is a must  

  • 15 days annual leaves (pro-rata for partial month at first year) 
  • 5 days full-pay sick leaves, 3 days menstrual leaves 
  • Health check subsidy 
  • Ergonomic-design chair and fully-equipped devices for work
  • Hybrid remote work and flexible working hour.

Grow together & keep learning

  • Conferences & external subsidy 
  • Learning clubs to share technical skill (e.g: Frontend/Backend tech sharing, Blockchain...etc) 

Work Hard, Play even Harder 

  • Various entertainment & sports clubs, attend basketball clubs today, and play board game tomorrow! 
  • Snacks & beverage to refill your energy anytime 

Apply for this Job

* Required
resume chosen  
(File types: pdf, doc, docx, txt, rtf)
cover_letter chosen  
(File types: pdf, doc, docx, txt, rtf)


Enter the verification code sent to to confirm you are not a robot, then submit your application.

This application was flagged as potential bot traffic. To resubmit your application, turn off any VPNs, clear the browser's cache and cookies, or try another browser. If you still can't submit it, contact our support team through the help center.