Abstract
This article introduces a quantitative method for identifying newly emerging word forms in large time-stamped corpora of natural language and then describes an analysis of lexical emergence in American social media using this method, based on a multi-billion- word corpus of Tweets collected between October 2013 and November 2014. In total 29 emerging word forms, which represent various semantic classes, grammatical parts-of- speech and word formation processes, were identified through this analysis. These 29 forms are then examined from various perspectives in order to begin to better understand the process of lexical emergence.
Original language | English |
---|---|
Pages (from-to) | 99-127 |
Number of pages | 29 |
Journal | English Language & Linguistics |
Volume | 21 |
Issue number | 1 |
Early online date | 25 May 2016 |
DOIs | |
Publication status | Published - Mar 2017 |
Bibliographical note
© Cambridge University Press 2016ASJC Scopus subject areas
- Language and Linguistics