English: machine learning, bootstrapping, linguistic data, TurboAnnotate, Afrikaans, Setswana
Afrikaans: Afrikaans, masjienleer, Setswana, skoenlussteekproefneming, taalkundige data, TurboAnnotate
English: The development of digital resources is an expensive and time-consuming endeavor; especially in the case of less-resourced languages. In this paper; we describe a freely available; open-source system; called TurboAnnotate; for bootstrapping linguistic data for machine-learning purposes, or for manually creating gold standards or other annotated lists. A detailed description of the design and functionalities of the tool is given, focusing on how the requirements of end-users are being addressed through it. It is indicated that TurboAnnotate does not only promise to help increase the accuracy of human annotators, but also to save enormously on human effort in terms of time.
On: Afrikaans and Setswana