Share this Job

Corpus Developer for AI Singapore (Makerspace)

Date: 26-Jan-2023

Location: UNIV ADMIN, Kent Ridge Campus, SG

Company: National University of Singapore

Job Description

AI Singapore (AISG) is a national AI programme launched by the National Research Foundation (NRF) to anchor deep national capabilities in Artificial Intelligence (AI).


The programme office is hosted by the National University of Singapore (NUS) and brings together all Singapore-based research institutions and the vibrant ecosystem of AI start-ups and companies developing AI products to perform use-inspired research, grow the knowledge, create the tools, and develop the talent to power Singapore's AI efforts.


AI Singapore’s NLPHub is developing publicly available datasets for Natural Language Processing (NLP) in Southeast Asian languages. Due to the rapid growth of our operations, we are looking for a Corpus Developer trained in linguistics to manage our data annotation projects for various tasks and languages.



Duties & Responsibilities 
●    Manage data annotation projects for Southeast Asian languages
●    Adapt existing annotation guidelines (for English etc.) or develop new ones for data annotation projects
●    Perform linguistic research to inform decision-making in guideline formulation
●    Curate data sources to select suitable data for the specific task at hand
●    Perform quality control on annotated data and collaborate closely with external quality controllers and data annotators to iteratively improve on data quality
●    Keep abreast of the latest developments in language resources for NLP


●    A degree in Linguistics, Computational Linguistics or equivalent practical experience
●    Proficiency in English and at least one Southeast Asian language is highly preferred
●    Experience in corpus development
●    Experience in project management, especially for data annotation
●    Experience working with data annotation or crowdsourcing platforms
●    Familiar with linguistic concepts (strong understanding of morphology, syntax and semantics is preferred)
●    Able to work with and analyse unfamiliar languages
●    Familiarity with NLP tasks is a plus
●    Knowledge of programming languages, especially Python, is a plus
●    Good communication skills (as work is highly collaborative)

Covid-19 Message

At NUS, the health and safety of our staff and students are one of our utmost priorities, and COVID-vaccination supports our commitment to ensure the safety of our community and to make NUS as safe and welcoming as possible. Many of our roles require a significant amount of physical interactions with students/staff/public members. Even for job roles that may be performed remotely, there will be instances where on-campus presence is required.

Taking into consideration the health and well-being of our staff and students and to better protect everyone in the campus, applicants are strongly encouraged to have themselves fully COVID-19 vaccinated to secure successful employment with NUS.

More Information

Location: Kent Ridge Campus

Organization: Office of the Deputy President(Res&Tech)

Department : AI Singapore

Employee Referral Eligible: No

Job requisition ID : 18178