Search for collections on Wintec Research Archive

Automatically augmenting academic text for language learning: PhD abstract corpora with the British library


Full text not available from this repository. (Request a copy)


This chapter describes the automated FLAX language system ( that extracts salient linguistic
features from academic text and presents them in an interface designed for L2 students who are learning
academic writing. Typical lexico-grammatical features of any word or phrase, collocations, and lexical
bundles are automatically identified and extracted in a corpus; learners can explore them by searching
and browsing, and inspect them along with contextual information. This chapter uses a single running
example, the PhD abstracts corpus of 9.8 million words derived from the open access Electronic Theses
Online Service (EThOS) at the British Library, but the approach is fully automated and can be applied to
any collection of English writing. Implications for reusing open access publications for non-commercial
educational and research purposes are presented for discussion. Design considerations for developing
teaching and learning applications that focus on the rhetorical and lexico-grammatical patterns found
in the abstract genre are also discussed.

Item Type: Book Chapter
Uncontrolled Keywords: augmenting, academic text, language learning
Subjects: L Education > LC Special aspects of education > LC5201 Education extension. Adult education. Continuing education
Q Science > QA Mathematics > QA76 Computer software
Divisions: Schools > Centre for Business, Information Technology and Enterprise > School of Information Technology
Depositing User: Alex Yu
Date Deposited: 30 Apr 2018 01:43
Last Modified: 21 Jul 2023 06:54

Actions (login required)

View Item
View Item