CORPUS LINGUSITICS RESEARCH

ALL ISSUE

Year :

Volume :

Extraction of Functional Sentence Stems (FSSs) in English Academic Texts

Jingjie Li

CORPUS LINGUSITICS RESEARCH :: Vol.2 No. pp.54-55

Abstract

Extraction of Functional Sentence Stems (FSSs) in English Academic Texts ×

Pawley and Syder (1983) propose the concept of lexicalized sentence stems as “a unit of clause length or longer whose grammatical form and lexical content is wholly or largely fixed”(ibid: 191-192), and they conclude that native speakers rely much on stringing lexicalized sentence stems together for their fluent communication (ibid: 202). The concept of lexicalized sentence stems has values in phraseology; however, ‘lexicalization’ is not particularly operationalizable in empirical research, as corpus studies have shown that most phrasal units are only partially lexicalized (see Moon 1998: 37, etc.). Granger and Paquot(2008: 44) develop further the notion of lexicalized sentence stems and propose textual sentence stems. According to them (ibid: 44), textual sentence stems are routinized fragments ofsentences that serve textual functions, and a textual sentence stem typically involves a subject and a verb. Examples of textual sentence stems are the final point is, another thing is,it will be shown that andI will discuss. The concept of textual sentence stems lends itself more easily to empirical research than the lexicalized sentence stem in that it attaches more importance to functions of the sentence stem rather than its lexicalization. However, Granger and Paquot did not propose the practical methods and procedures for identifying textual sentence stems so defined, particularly in corpus-based studies. In addition, sentence stems, as the clause-level phraseological units, may perform such functions as the research-oriented function or participant-oriented function in Hylland’s term (2008:13-14), other than the textual or text-oriented function. And in some cases a sentence stems may perform multiple functions simultaneously. For these reasons, in the present research, we propose to opt for the term ‘functional sentence stem’ (henceforward FSS). This paper aims to explore common functional sentence stems in academic English texts, with a view to uncovering their typical forms and specific functions in context, in order to better inform ESP students of language resources with higher pedagogic utility. Functional sentence stems are temporarily defined as “An FSS is a recurrent contiguous lexico-grammatical sequence which contains a subject-predicate structure and which is associated with a particular function pertaining to a particular textual environment. The FSS may have a range of variations. ” The definition spells out several important defining features of the linguistic entity. Firstly a FSS consists of a subject-predicate structure, which makes it more compete in grammatical structure than the lexical bundle. For example, previous studies show that, data indicate that and it is not surprising that are all FSSs dealt with in the present study. Second, statistically, an FSS has to be significantly recurrent to ensure that it is a common means for meanings and functions. In the present study, only those FSSs whose frequencies have reached a frequency threshold of 10 or above and whose new-MI scores have reached 3.0 or above will be counted as a FSS. Thirdly, an FSS performs particular functions in association with co-occurring lexical and grammatical features in the local textual environment. And the functions can be of different types and nature, but most often a FSS is typically used to help organize textual information or to express the writer’s attitudinal or evaluative meanings. Fourthly, a FSS is usually not a fixed expression and it may have a range of varied forms. For instance, this paper describes may be varied to this paper discusses, the present paper examines, etc. We then use the new-MI measure (Wei & Li 2013) to extract from an academic English corpus a large number of FSSs, structural and functional characteristics of which are carefully examined and described in association with their co-selection patterns. Results indicate that FSSs are important means for a wide variety of specific discourse acts pertaining to characteristic local textual environments. The findings have potentially valuable implications for ESP pedagogy, offering, in particular, insights for improving non-native novice writers’ academic writing performance.

Download PDF Export Citation

Extraction of Functional Sentence Stems (FSSs) in English Academic Texts ×

EndNote
RefWorks
Scholar's Aid
BibTeX

Export Citation Cancel

The Inauguration of Corpus Linguistics Research

Se-Eun Jhang

CORPUS LINGUSITICS RESEARCH :: Vol.1 No. pp.-2--1

Abstract

The Inauguration of Corpus Linguistics Research ×

Download PDF Export Citation

The Inauguration of Corpus Linguistics Research ×

EndNote
RefWorks
Scholar's Aid
BibTeX

Export Citation Cancel

Stance and Grammatical Complexity in Conversation: An Unlikely Partnership Discovered through Corpus Analysis

Douglas Biber

CORPUS LINGUSITICS RESEARCH :: Vol.1 No. pp.1-19

Abstract

Stance and Grammatical Complexity in Conversation: An Unlikely Partnership Discovered through Corpus Analysis ×

The present paper attempts to synthesize results from two independent lines of corpus-based research: One focused on grammatical complexity, and the second focused on the expression of stance. The paper begins by describing an unexpected pattern of use in conversational discourse: Despite the fact that conversation is co-constructed by multiple participants, producing language in real-time and discussing personal topics, it is characterized by an extremely dense use of dependent clauses. Corpus-based findings regarding the use of stance expressions are less surprising, showing how stance devices are more commonly used in conversation than in academic writing. The main focus of the present paper is to explore the intersection between these two lines of research, showing how many grammatically complex structures in conversation are used to support the functional prominence given to the expression of stance in that register. That is, utterances in conversation often involve two grammatical components, with an idea or a report of an action occurring as the dependent clause, and an expression of stance occurring as the main clause that provides the interpretive frame for the information in the dependent clause. As a result, it is not a coincidence that personal expressions of stance as well as complex grammatical structures are both so prevalent in conversational discourse.

Download PDF Export Citation

Stance and Grammatical Complexity in Conversation: An Unlikely Partnership Discovered through Corpus Analysis ×

EndNote
RefWorks
Scholar's Aid
BibTeX

Export Citation Cancel

Utility Specialists in Hong Kong : A Corpus Study of Perception of Communicative Competence

Winnie Cheng

CORPUS LINGUSITICS RESEARCH :: Vol.1 No. pp.21-51

Abstract

Utility Specialists in Hong Kong : A Corpus Study of Perception of Communicative Competence ×

The paper reports on part of a larger scale collaborative and interdisciplinary project among English Applied Studies, Land Surveying and Geo-informatics, and utility specialists industry in engineering in Hong Kong. The present study interviewed 36 utility specialists in Hong Kong to find out their understanding of and perceived importance of communicative competences in English situated in their professional workplaces. The corpora created from the interview data were analysed in several complementary and inter-connected ways, namely word lists, concgram lists, and key semantic fields, to ascertain the kinds of meanings or topical concerns and themes expressed by the professionals when they talked about different types of competences in communication, namely linguistic, sociolinguistic, discourse, strategic, socio-cultural, and social in their workplace. The project has also achieved effective outcomes in terms of knowledge sharing and exchange with the industry.

Download PDF Export Citation

Utility Specialists in Hong Kong : A Corpus Study of Perception of Communicative Competence ×

EndNote
RefWorks
Scholar's Aid
BibTeX

Export Citation Cancel

Linguistic Dimensions of Learner Speech in English Interviews

Eric Friginal,Brittany Polat

CORPUS LINGUSITICS RESEARCH :: Vol.1 No. pp.53-82

Abstract

Linguistic Dimensions of Learner Speech in English Interviews ×

This paper discusses the utilization of multi-dimensional (MD) analysis (Biber, 1988; 1995) to examine linguistic variation in learner speech represented by transcribed interviews from the Louvain International Database of Spoken English Interlanguage (LINDSEI). LINDSEI is the first large-scale corpus of spoken learner English with sub-corpora of interview responses from eleven different mother-tongue backgrounds. The primary goals of this study are: to extract and identify the linguistic dimensions of learner speech from LINDSEI, to functionally interpret these dimensions, and to compare how these dimensions are distributed across speakers’ eleven first language backgrounds. Results show that the four primary functional dimensions of learner speech are: Involved Conversational Style vs. Informational Production; Complex Statement of Opinion; Formal, Academic Focus of Discussion vs. Informal, Non-Academic Discourse; and Personal Narrative Prose vs. Non-Narrative Discourse. Interesting differences are observed in how these dimensions are used by learners across first language backgrounds and interview tasks.

Download PDF Export Citation

Linguistic Dimensions of Learner Speech in English Interviews ×

EndNote
RefWorks
Scholar's Aid
BibTeX

Export Citation Cancel

A Corpus-based Study of Collocation in Chinese EFL Learners' Oral Production

Lihui Zheng,Richard Zhonghua Xiao

CORPUS LINGUSITICS RESEARCH :: Vol.1 No. pp.83-108

Abstract

A Corpus-based Study of Collocation in Chinese EFL Learners' Oral Production ×

This article provides a systematic account of collocational use in Chinese EFL learners’ oral production and explores some of the issues involved, by adopting a corpus-based error analysis approach. The distribution of six types of collocational errors extracted from two sizeable Chinese learner English corpora shows that verb-noun collocations pose the greatest difficulty for Chinese learners. An exploration of the correlation between the learners’ English proficiency and collocational performance finds that the learners’ knowledge of collocation has not developed alongside their knowledge of vocabulary in general. Our further descriptive and diagnostic analyses of verb-noun errors indicate that 1) Chinese EFL learners have the greatest difficulty with the verbs when using verb-noun collocations; 2) the learners’ use of nouns is also not satisfactory; 3) due attention should be paid to the inappropriate use of the non-lexical elements (prepositions and articles); and 4) the main causes of verb-noun collocational errors include L1 transfer, assumed synonyms, overgeneralization and misselection of the target word. It is suggested that university English teaching in China should attach more importance to the examination and diagnosis of collocational errors and also integrate learner-centered, corpus-based methods into vocabulary teaching.

Download PDF Export Citation

A Corpus-based Study of Collocation in Chinese EFL Learners' Oral Production ×

EndNote
RefWorks
Scholar's Aid
BibTeX

Export Citation Cancel

The Most Frequent Formulaic Sequences In College Engineering Textbooks

Wenhua Hsu

CORPUS LINGUSITICS RESEARCH :: Vol.1 No. pp.109-132

Abstract

The Most Frequent Formulaic Sequences In College Engineering Textbooks ×

This research describes an attempt to establish a pedagogically useful list of the most frequent formulaic sequences for engineering undergraduates who need to read the textbooks of their fields in English. The Engineering English Formulae/Formulaic Sequences List (EEFL) was derived from a corpus containing 4.57 million tokens of one hundred college textbooks across twenty engineering subjects. In consideration of formulae for widespread use and pedagogical relevance, a series of criteria were applied. Comparable to a list of 1,000 high-frequency individual words, a total of 1,000 two- to six-word sequences were selected into the EEFL and they accounted for 25.73% of the running words in the Engineering Textbook Corpus. The EEFL, not highly technical in nature, contains the most commonly-used multi-word units traversing the subfields of the engineering domain and engineering majors may encounter these word sequences very often. For matriculating engineering students, the present EEFL and the EEWL (Engineering English Word List common to engineering subjects) may be mutually complementary in providing a pathway to the engineering register and may be helpful for ESP teachers without a background of science and technology when preparing engineering English teaching materials for an EST course required by most engineering-related departments.

Download PDF Export Citation

The Most Frequent Formulaic Sequences In College Engineering Textbooks ×

EndNote
RefWorks
Scholar's Aid
BibTeX

Export Citation Cancel

1 2 3 4 5 6 7 8 9