Corpus linguistics
From The Art and Popular Culture Encyclopedia
Related e |
Featured: |
Corpus linguistics is the study of language as expressed in samples (corpora) or "real world" text. This method represents a digestive approach to deriving a set of abstract rules by which a natural language is governed or else relates to another language. Originally done by hand, corpora are now largely derived by an automated process.
[edit]
See also
- Concordance (KWIC)
- Collocation
- Collostructional analysis
- Keyword (linguistics)
- Lexical priming
- Linguistic Data Consortium
- Machine translation
- Natural Language Toolkit
- Pattern grammar
- Search engines: they access the "web corpus".
- Semantic prosody
- Text corpus
- Translation memory
- Treebank
- Xaira: a general purpose XML aware open-source corpus analysis tool
Unless indicated otherwise, the text in this article is either based on Wikipedia article "Corpus linguistics" or another language Wikipedia page thereof used under the terms of the GNU Free Documentation License; or on research by Jahsonic and friends. See Art and Popular Culture's copyright notice.