Delivery included to the United States

Programming for Corpus Linguistics With Python and Dataframes

Programming for Corpus Linguistics With Python and Dataframes - Cambridge Elements. Elements in Corpus Linguistics

Paperback (20 Jun 2024)

  • $24.80
Add to basket

Includes delivery to the United States

10+ copies available online - Usually dispatched within 2-3 weeks

Other formats & editions

New
Hardback (20 Jun 2024) RRP $68.35 $63.99

Publisher's Synopsis

This Element offers intermediate or experienced programmers algorithms for Corpus Linguistic (CL) programming in the Python language using dataframes that provide a fast, efficient, intuitive set of methods for working with large, complex datasets such as corpora. This Element demonstrates principles of dataframe programming applied to CL analyses, as well as complete algorithms for creating concordances; producing lists of collocates, keywords, and lexical bundles; and performing key feature analysis. An additional algorithm for creating dataframe corpora is presented including methods for tokenizing, part-of-speech tagging, and lemmatizing using spaCy. This Element provides a set of core skills that can be applied to a range of CL research questions, as well as to original analyses not possible with existing corpus software.

About the Publisher

Cambridge University Press

Cambridge University Press dates from 1534 and is part of the University of Cambridge. We further the University's mission by disseminating knowledge in the pursuit of education, learning and research at the highest international levels of excellence.

Book information

ISBN: 9781108822589
Publisher: Cambridge University Press
Imprint: Cambridge University Press
Pub date:
DEWEY: 410.188
DEWEY edition: 23
Language: English
Number of pages: 104
Weight: 176g
Height: 228mm
Width: 151mm
Spine width: 9mm