Share on email
Share on whatsapp
Share on facebook
Share on twitter

Van Huyssteen 2021

Van Huyssteen, Gerhard B. 2021. Afrikaans stoplist for corpus research 1.0. Potchefstroom: Centre for Text Technology (CTexT), North-West University.

English: Afrikaans, corpus linguistics, Taalkommissie (TK), stoplist

Afrikaans: Afrikaans, korpuslinguistiek, Taalkommissie (TK), stoplys

English: The Afrikaans stoplist for corpus research (version 1.0) comprises a master list of 1,298 items, based on frequency counts in the Taalkommissie corpus 1.1. The list has been curated based on relative word frequency classes, and Zipf values. In addition, each item has been categorised in terms of length, typecase, selection category, lexical type (i.e. content or function word), and part-of-speech category. For ease of use, three subsets of the main list is also provided.


Afrikaans: Die Afrikaanse stoplys vir korpusnavorsing (weergawe 1.0) bevat ‘n meesterlys van 1 298 items, gebaseer op frekwensietellings in die Taalkommissiekorpus 1.1. Die lys is saamgestel op grond van relatiewe woordfrekwensieklasse en Zipf-waardes. Daarbenewens is elke item gekategoriseer in terme van lengte, letterkas, seleksiekategorie, leksikale tipe (d.i. inhoud- of funksiewoord) en woordsoortkategorie. Gemaklikheidshalwe word daar ook drie substelle van die hooflys verskaf.

In: Afrikaans

On: Afrikaans