Home
Public Media
- #HoeNou?!
CV & Profiles

Van Huyssteen, Puttkammer, Pilon & Groenewald 2007

2007, Afrikaans, bootstrapping, linguistic data, machine learning, Setswana, TurboAnnotate

Van Huyssteen, Gerhard B., Martin J. Puttkammer, Suléne Pilon, and Hendrik J. Groenewald. 2007. “Using machine learning to annotate data for NLP tasks semi-automatically.” Proceedings of International Workshop on Computer-Aided Language Processing.

Download PDF

DOI

Abstract

Developing digital resources is an expensive and time-consuming endeavour; especially in the case of less-resourced languages. We developed TurboAnnotate in an attempt to accelerate the annotation of linguistic data by means of bootstrapping linguistic data for machine-learning purposes. The design and functionality of the tool is given to show how machine learning is used in the annotation process. It is shown that TurboAnnotate does not only promise to help increase the accuracy of human annotators; but also to save enormously on human effort in terms of time.

Written in:

English

Dealing with:

Afrikaans and Setswana

Keywords

machine learning, bootstrapping, linguistic data, TurboAnnotate, Afrikaans, Setswana

Afrikaans keywords

Afrikaans, masjienleer, Setswana, skoenlussteekproefneming, taalkundige data, TurboAnnotate

This is my work, my life

Facebook
YouTube
WhatsApp
Mail