Corpora are ubiquitous in linguistic research, yet to date, there has
been no consensus on how to conceptualize corpus representativeness
and collect corpus samples. This pioneering book bridges this gap by
introducing a conceptual and methodological framework for corpus
design and representativeness. Written by experts in the field, it
shows how corpora can be designed and built in a way that is both
optimally suited to specific research agendas, and adequately
representative of the types of language use in question. It considers
questions such as 'what types of texts should be included in the
corpus?', and 'how many texts are required?' – highlighting that the
degree of representativeness rests on the dual pillars of domain
considerations and distribution considerations. The authors introduce,
explain, and illustrate all aspects of this corpus representativeness
framework in a step-by-step fashion, using examples and activities to
help readers develop practical skills in corpus design and evaluation.
Les mer
A Practical Framework for Corpus Representativeness
Produktdetaljer
ISBN
9781009254762
Publisert
2022
Utgiver
Cambridge University Press
Språk
Product language
Engelsk
Format
Product format
Digital bok