Comparing Lexical Bundles across Corpora of Different Sizes: The Zipfian Problem

Volume: 27, Issue: 3, Pages: 272 - 290
Published: Feb 5, 2019
Abstract
Formulaic sequences in language use are often studied by means of the automatic identification of frequently recurring series of words, often referred to as ‘lexical bundles’, in corpora that contrast different registers, academic disciplines, etc. As corpora often differ in size, a critically important assumption in this field states that the use of a normalized frequency threshold, such as 20 occurrences per million words, allows for an...
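To make the normalization mentioned in the abstract concrete, the sketch below converts a per-million-word threshold into the raw occurrence count it implies for corpora of different sizes. It is only illustrative arithmetic, not the paper's code: the function names and the example corpus sizes are assumptions introduced here.

```python
WORDS_PER_MILLION = 1_000_000


def raw_threshold(per_million: float, corpus_size: int) -> float:
    """Raw occurrence count equivalent to a per-million-word threshold."""
    return per_million * corpus_size / WORDS_PER_MILLION


def normalized_frequency(raw_count: int, corpus_size: int) -> float:
    """Occurrences per million words for a raw count in a corpus."""
    return raw_count * WORDS_PER_MILLION / corpus_size


if __name__ == "__main__":
    # Hypothetical corpus sizes (in words), chosen only for illustration.
    for size in (100_000, 1_000_000, 5_000_000):
        print(size, raw_threshold(20, size))
```

The same normalized threshold of 20 per million words therefore corresponds to very different raw cut-offs depending on corpus size, which is the setting in which the abstract's comparability assumption operates.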