Dr Andrew Kehoe
Associate Professor, Deputy Head of School, MA Course Director
Andrew Kehoe is Deputy Head of School and Director of the Research and Development Unit for English Studies (RDUES). He studied at the University of Liverpool, gaining qualifications in both English and Computer Science. He researches in the field of Corpus Linguistics, with a particular emphasis on the use of the web as a source of natural language data and on the development of software tools to facilitate this.
Andrew was lead software developer on the WebCorp project and manager of the JISC-funded eMargin project, building an online collaborative text annotation tool for use in teaching. He also led the School of English return to REF2014 and wrote a blog post on the School's outstanding performance.
Andrew is an elected member of the executive committee of University English and the executive board of the International Computer Archive of Modern and Medieval English (ICAME). He has co-edited two volumes on Corpus Linguistics, and published widely in the field.
Andrew is Course Director of the School's distance-learning MA in English Linguistics.
PhD in Linguistics, Birmingham City University.
MSc Information Systems, University of Liverpool.
BA (Hons) English Language and Literature, University of Liverpool.
Andrew is an elected member of the Executive Board of the International Computer Archive of Modern and Medieval English (ICAME) and the Executive Committee of University English.
Andrew teaches on the School's distance-learning MA in English Linguistics, of which he is also Course Director.
Andrew has research interests in all aspects of Corpus Linguistics, including the development of software tools for the identification and visualisation of language change across time. He has a particular interest in the use of the web as a source of natural language data and has expertise in the areas of search engine design, topic detection and indexing, web document formats, and the extraction of authorship date from web documents.
2012-13 JISC Embedding Benefits Grant - Integration of eMargin with Virtual Learning Environments (Project Manager)
2011-12 JISC Learning and Teaching Innovation Grant - eMargin: an online collaborative textual annotation resource (Project Manager)
Recent Invited Talks
2015 Reader comments on online news articles: a corpus-based analysis. CRAL Corpus Linguistics Workshop, University of Nottingham, February 20.
2014 "Your blog is (the) sh*t" - the role of context in the analysis of swearing in blogs (with Ursula Lutzky), English Department Research Seminar, University of Liverpool, December 10.
2014 Reader comments on online news articles: a corpus-based analysis. English Department Research Seminar, University of Liverpool, May 21.
2013 The role of context in the analysis of swearing in blogs (with Ursula Lutzky). Workshop on politeness and impoliteness in digital communication: Corpus-related explorations. ESRC Centre for Corpus Approaches to Social Science, Lancaster University, September 20.
2012 eMargin and Linguistic Analysis. UCREL Corpus Research Seminar, Lancaster University, December 6.
2012 eMargin and Text Annotation, AHRC Hidden Collections Doctoral Training Programme, University of Nottingham, November 23.
2012 eMargin in Literary Study, HEA Workshop, University of Leicester, July 5.
2012 Introduction to eMargin, Digital Conversations Workshop, British Library, March 30.
2009-11 Introducing A-Level English Language students to empirical text study using the WebCorp Linguist's Search Engine (AHRC Knowledge Transfer Fellowship) Research Associate / Co-author
2006-08 WebCorp Linguist's Search Engine (EPSRC / HEFCE-SRIF) Technical Lead
2006-07 Repulsion: The investigation of an organising force in text (EPSRC) Researcher Co-investigator / Software Developer
2001-04 SHARES: System of Hypermatrix Analysis, Retrieval, Evaluation and Summarisation (EPSRC) Research Associate / Software Developer
2000-01 WebCorp: The Web as Corpus (EPSRC) Research Assistant / Software Developer
1999-2000 APRIL: Analysis and prediction of innovation in the lexicon (EPSRC) Research Assistant / Software Developer
2009 with Renouf, A. (eds.) Corpus Linguistics: Refinements and Reassessments, Amsterdam: Rodopi.
2006 with Renouf, A. (eds.) The Changing Face of Corpus Linguistics, Amsterdam: Rodopi.
2009 with Gee, M. Weaving Web data into a diachronic corpus patchwork in A. Renouf and A. Kehoe (eds.) Corpus Linguistics: Refinements and Reassessments, Amsterdam: Rodopi.
2006 Diachronic Linguistic Analysis on the Web with WebCorp in A. Renouf and A. Kehoe (eds.) The Changing Face of Corpus Linguistics, Amsterdam: Rodopi.
2004 with Renouf, A. and D. Mezquiriz "The Accidental Corpus: Some Issues in Extracting Linguistic Information from the Web", in K. Aijmer and B. Altenberg (eds.) Advances in Corpus Linguistics, Amsterdam: Rodopi.
2013 with Renouf, A. 'Filling the gaps: Using the WebCorp Linguist's Search Engine to supplement existing text resources'. International Journal of Corpus Linguistics 18:2, 167-198.
2013 with Gee, M. eMargin: A Collaborative Textual Annotation Tool. Ariadne, Issue 71.
2012 with Gee, M. Reader comments as an aboutness indicator in online texts: introducing the Birmingham Blog Corpus in S. Oksefjell Ebeling, J. Ebeling and H. Hasselgård (eds.) Studies in Variation, Contacts and Change in English Volume 12: Aspects of Corpus Linguistics: Compilation, Annotation, Analysis, University of Helsinki e-journal.
2011 with Gee, M. Social Tagging: A new perspective on textual 'aboutness' in P. Rayson, S. Hoffmann and G. Leech (eds.) Studies in Variation, Contacts and Change in English Volume 6: Methodological and Historical Dimensions of Corpus Linguistics, University of Helsinki e-journal.
2007 with Gee, M. New corpora from the web: making web text more 'text-like' in P. Pahta, I. Taavitsainen, T. Nevalainen and J. Tyrkkö (eds.) Towards Multimedia in Corpus Studies, electronic publication, University of Helsinki.
2010 Review article on 'ConcGram 1.0' software, in ICAME Journal: Computers in English Linguistics, No. 34, April 2010.
2005 with Renouf, A. and J. Banerjee The WebCorp Search Engine: a holistic approach to Web text Search in Proceedings from the Corpus Linguistics Conference Series, Vol. 1, no.1, University of Birmingham.
2004 with Renouf, A. 'Textual Distraction as a Basis for Evaluating Automatic Summarisers', in M.T. Lino et al. (eds.) Proceedings of the 4th International Conference on Language Resources and Evaluation (LREC 2004), Paris: ELRA, Vol IV pp. 1347-1350.
2003 with Morley, B. and A. Renouf Linguistic Research with the XML / RDF aware WebCorp Tool. World Wide Web 2003 Conference, Budapest.
2002 with Renouf, A. WebCorp: Applying the Web to Linguistics and Linguistics to the Web. World Wide Web 2002 Conference, Honolulu, Hawaii.
2010 The Birmingham Blog Corpus (with Matt Gee and Ursula Lutzky)
2000-ongoing WebCorp software and user guide.
2000 APRIL (Analysis and Prediction of Innovation in the Lexicon) project software, databases and web front-end.
1999 Discourse Tree Manipulation Algorithms: Using Rhetorical Structure Theory to Restructure and Summarise Texts, MSc Dissertation, University of Liverpool (with accompanying C++ software).
Work With Industry
Andrew worked as linguistic consultant to the Grey London communications agency on behalf of the fashion brand Puma and their fragrance partner Procter & Gamble. This work resulted in the creation of the critically acclaimed Puma Dance Dictionary website and accompanying Europe-wide TV advertising campaign to launch the Puma Sync fragrance range.