Against the Arrow of Time. Theory and Practice of Mining Massive Corpora of Polish Historical Texts for Linguistic and Historical Research
- In Stock: in stock
- ISBN: 978-83-232-3504-0
- Category: Outlet, English Philology, Polish Philology, Linguistics
- Year of publication: 2019
As efforts to preserve cultural heritage grow worldwide and more print material (old newspapers, books, documents and posters) is digitised and made available online, language units can be studied diachronically through “distant reading” at various levels of analysis. This book presents the theory and practice of mining Polish language units, such as words, collocations, names and folk tales, from massive diachronic corpora. The book takes a bottom-up approach: it starts with the issues of handling and searching large diachronic collections, while the third part focuses on techniques for modelling the Polish language along the time dimension. Finally, the apparatus developed in the book is applied to specific types of linguistic, folkloristic and historical research. The author draws on the ideas of “culturomics” (the study of culture through the lens of diachronic corpora), while also exposing their limitations.
Detailed information | |
---|---|
Publication Version | printed |
Format | 19.0 x 24.5 cm |
Type of publication | Monograph |
Edition | I |
ISBN | 978-83-232-3504-0 |
Number of pages | 316 |
Number of publishing sheets | 15.00 |
Type of binding | paperback |