Language
Mobile navigation Home page

Against the Arrow of Time. Theory and Practice of Mining Massive Corpora of Polish Historical Texts for Linguistic and Historical Research

PLN 15.00

As more and more efforts are made to preserve cultural heritage all over the world and more and more print material — old newspapers, books, documents or posters — is being digitised and made available online, language units can be studied diachronically with “distant reading” on various levels of analysis. This book aims to present the theory and practice of mining Polish language units, such as words, collocations, names and folk-tales from massive diachronic corpora. The book takes a bottom-up approach, starting from the issues related to handling large diachronic collections and searching in them, whereas the third part focuses on the techniques used to model the Polish language along the time dimension. Finally, the apparatus developed in the book is applied in the context of specific types of linguistic, folkloristic and historical research. The author draws on the ideas of “culturomics” (studying the culture through the lens of diachronic corpora), though their limitations are exposed in the book.

Table of contents
(Size: 56.6 KB)
Introduction
(Size: 85.7 KB)
Write Your Own Review
You're reviewing:Against the Arrow of Time. Theory and Practice of Mining Massive Corpora of Polish Historical Texts for Linguistic and Historical Research
Detailed information
Publication Version printed
Format 19,0 x 24,5
Type of publication Monografia
Edition I
ISBN 978-83-232-3504-0
Number of pages 316
Number of publishing sheets 15,00
Type of binding paperback
Sign up