The UCREL Corpus Research Seminar (CRS) is a forum for all staff, visiting academics, and postgraduate research students interested in corpus-based research in any area of linguistics. CRS is run by UCREL (University Centre for Computer Corpus Research on Language), a research centre between the Department of Linguistics and English Language and the School of Computing and Communications.
CRS meetings offer an opportunity to present work in progress and receive helpful feedback, discuss relevant research, approaches and methods, get experience in using corpus interfaces and tools, and stay up to date with corpus-based research at Lancaster University. We welcome anyone who is a newcomer to this exciting and growing area of linguistics. We welcome presentations from researchers of other departments and universities.
On this site, along with general information, you will find a list of upcoming seminars and an archive of past seminars. If you have any suggestions of things to add to the site, then please get in touch.
In 2015/2016 CRS meetings will be on Thursdays at 3pm during term time, unless otherwise indicated.
Notifications of seminars are sent to the UCREL Mailing List, sign up if you would like to receive them and other UCREL related messages. You can also follow us on Twitter, where we post updates on upcoming seminars.
If you need more information or want to give a presentation, please contact one of the CRS organisers:
[...] = add @lancaster.ac.uk for email address.
The previous website has an archive of past seminars.
We acknowledge the following funding for external speakers: CASS and UCREL research centres, the Faculty of Arts and Social Sciences, the Department of Linguistics and English Language and the School of Computing and Communications.
The UCREL Corpus Research Seminars this academic year (2015/16) will be on Thursdays 3pm-4pm during term time, unless otherwise indicated, please check our upcoming page for the time and location of any future presenations. If you would like to give a talk or have a suggestion for an external speaker to invite, then please get in touch.
Thursday 12th May 2016
Furness LT 3
What can CATS bring to researchers who want to analyze tweets?
Twitter's API (Application Programming Interface) makes it possible for anyone to collect tweets for various purposes, and the increasing number of studies based on tweets confirms the usefulness of such corpora. However, the programing knowledge required to carry this out is a problem for a lot of people who do not have those skills, or the resources to work with computer programers, and this is especially true in social sciences. CATS (Collection and Analysis of Tweets made Simple) is a web interface developed by researchers from the universities of Lyon 1 and 2, and its purpose is to enable everyone to generate corpora of tweets with specific demands (keywords, geolocation, language etc...), even for people without any programing background. Not only does CATS allow people to collect tweets, it also enables one to analyze them thanks to various tools (vocabulary exploration, topic modeling, event detection, word clouds).
The goal of this presentation will be to introduce CATS, and what it can do thanks to a demonstration of the tool, and some practical examples based on previously collected corpora. However, the interface also has certain limitations, that I will be happy to discuss as well.
Week 28: 9th June 2016 (2:00-4:00pm)
Furness LT 3
Introducing Spoken BNC2014, Written BNC2014, and CorCenCC
Week 29: 16th June 2016 (3:00-4:00pm)
Furness LT 3
Poles Apart? Exploring the Extent of Similarity between Extreme and Non-Extreme Message Content