AI Engineer (Data Cleaning) for AI Singapore (Products)

Date: 1 Oct 2024

Location: UNIV ADMIN, Kent Ridge Campus, SG

Company: National University of Singapore

Job Description

AI Singapore (AISG) is a national AI programme launched by the National Research Foundation (NRF) to anchor deep national capabilities in Artificial Intelligence (AI).

 

The programme office is hosted by the National University of Singapore (NUS) and brings together all Singapore-based research institutions and the vibrant ecosystem of AI start-ups and companies developing AI products to perform use-inspired research, grow the knowledge, create the tools, and develop the talent to power Singapore's AI efforts.

 

We are looking for an AI Engineer to perform development and engineering work for the SEA-LION project under AI Singapore.

 

Duties & Responsibilities

  • Develop data processing pipeline for multilingual dataset
  • Develop data processing tools for multiple data sources (e.g. txt, pdf, html)
  • Develop web scraping pipeline for targeted scraping of data sources
  • Develop and train classifier for data quality filtering
  • Perform literature review of academic machine learning papers
  • Keep abreast of latest developments in NLP, AI and LLMs.

Qualifications

  • Degree/Master in Computer Science, Machine Learning, AI, Statistics, Mathematics, Engineering or equivalent practical experience
  • Familiar with NLP tasks and modern NLP models
  • Familiar with text based data processing
  • Familiar with EDA techniques for text data
  • Fluent in Python, PyTorch, Hugging Face, AllenNLP and related nlp libraries
  • Fluent in data processing and web scraping tools
  • Previous experience with language modeling will be a plus
  • Ability to write and communicate in English
  • Candidates with no experience are welcome to apply

More Information

Location: Kent Ridge Campus

Organization: Office of the Deputy President(Res&Tech)

Department : AI Singapore

Employee Referral Eligible: No

Job requisition ID : 26527