Biggest ever linguistic survey on Twitter could find the next 'selfie' or 'twerk'

March 3, 2014
Credit: Aston University

Academics from Aston University (UK) will analyse more than one billion tweets from the UK and US in a linguistic study to discover how new words emerge and spread.

A team of academics from Aston University is beginning work on a project  analysing more than one billion tweets from the UK and US in a which could uncover the next 'selfie' or 'twerk'. 

The investigation, led by Dr Jack Grieve, lecturer in Forensic Linguistics at Aston, will use Twitter to map out for the first time the way in which new words become popular and how they spread.  Online data is increasingly being used to research language variation, and Dr Grieve's study represents by far the largest of its kind. 

Dr Grieve said: "I'm very excited to begin work on this project. No previous linguistic report has had so much data to work with so we have a great opportunity to map the emergence of new words and their lexical diffusion. 

"In addition to charting the internal movement of words in the UK and US, we hope to look at how words spread across the Atlantic, between the two countries – the first study to do so using the same methods in both nations." 

Many tweets contain location data alongside the time they were sent, and appear similar to spontaneous speech, making them particularly valuable to the study of the spread of new words and expressions. 

Another of the project's research goals is to analyse recent patterns of human migration to gain an understanding of how the movement of people influences linguistic variation. 

Aston University is partnering with the University of South Carolina in the United States, who will conduct research into modern and historic migration patterns using millions of online family trees. The two universities will then share data to assess how modern dialects line up with these

Dr Grieve said: "Throughout history, migration has been a key force in shaping and transforming language. Very little research, however, has looked at how more recent population mobility has shaped dialect variation today. Hopefully, we will be able to discover new and exciting findings." 

The project is being funded by the 'Digging into Data Challenge', which aims to utilise large amounts of complex data, known as 'big data', in humanities and . The challenge seeks to show how computer-based research can be used to ask new questions and gain new insights into the world. 

Online language on sites such as Twitter has been at the forefront of recent linguistic developments, with such as 'selfie', 'twerk', 'vom', 'buzzworthy' and 'squee' all making it into the Oxford Dictionaries Online in 2013. 

Explore further: Map of hateful tweets shows hotspots are mostly in eastern half of U.S.

Related Stories

Recommended for you

The dark side of Nobel prizewinning research

October 4, 2015

Think of the Nobel prizes and you think of groundbreaking research bettering mankind, but the awards have also honoured some quite unhumanitarian inventions such as chemical weapons, DDT and lobotomies.

How much for that Nobel prize in the window?

October 3, 2015

No need to make peace in the Middle East, resolve one of science's great mysteries or pen a masterpiece: the easiest way to get yourself a Nobel prize may be to buy one.

Search for Egypt's Nefertiti gains new momentum (Update)

September 29, 2015

The search for ancient Egypt's Queen Nefertiti in an alleged hidden chamber in King Tut's tomb gained new momentum as Egypt's Antiquities Minister said Tuesday he is now more convinced a queen's tomb may lay hidden behind ...


Please sign in to add a comment. Registration is free, and takes less than a minute. Read more

Click here to reset your password.
Sign in to get notified via email when new comments are made.