MIT develops lecture search engine to aid students

Nov 08, 2007
MIT develops lecture search engine to aid students
The Lecture Server, as shown in this screenshot of MIT physics professor Walter Lewin, displays video and highlighted search terms.

Imagine you are taking an introductory biology course. You're studying for an exam and realize it would be helpful to revisit the professor's explanation of RNA interference. Fortunately for you, a digital recording of the lecture is online, but the 10-minute explanation you want is buried in a 90-minute lecture you don't have time to watch.

A new lecture search engine developed at MIT's Computer Science and Artificial Intelligence Laboratory (CSAIL) could help with this dilemma. Created by a team of researchers and students led by MIT associate professor Regina Barzilay and principal research scientist James Glass, the web-based technology allows users to search hundreds of MIT lectures for key topics.

"Our goal is to develop a speech and language technology that will help educators provide structure to these video recordings, so it's easier for students to access the material," said Glass, who is head of CSAIL's Spoken Language Systems Group.

More than 200 MIT lectures are currently available on the site (web.sls.csail.mit.edu/lectures/). So far, most of the users are international students who access the lectures through MIT's OpenCourseWare (OCW) initiative, which makes curriculum materials for most MIT courses available to anyone with Internet access. Although the lecture-browsing system is still in the early development stages, a recent announcement in OCW's newsletter has drawn increased traffic to the site.

Barzilay and Glass expect the system will be most useful for OCW users and for MIT students who want to review lecture material. MIT World, a web site that provides video of significant MIT events such as lectures by speakers from MIT and around the world, is also participating in the project.

Many MIT professors record their lectures and post them online, but it's difficult to search them for specific topics. Because there is no way to easily scan audio, as you can with printed text, "you end up watching the whole thing, and it's hard to keep focused," said Barzilay, the Douglas T. Ross Career Development Associate Professor of Software Development in the Department of Electrical Engineering and Computer Science.

On the prototype web site, users can search lectures for any term they want and then play the relevant sections.

The lecture transcripts are created by speech recognition software. One major challenge is that the lectures usually contain many technical terms that might not be in the computer program's vocabulary, so the researchers use textbooks, lecture notes and abstracts to identify key terms and feed them into the computer.

"These lectures can have a very specialized vocabulary," said Glass. "For example, in an algebra class, the professor might talk about Eigenvalues."

When properly adapted to a speaker and topic, the lecture-based speech recognizer gets about four out of five words correct, however most of the errors occur in words that are not critical to the lecture topic, i.e., not the key vocabulary terms that people would use to search.

Once the transcript is complete, a language processing program divides the text into sections by topic. Chunks of text, about 100 words each, are compared with each other using a mathematical formula that calculates the number of overlapping words between the text blocks. Each word is weighted so that repetition of key terms has more weight than less important words, and chunks with the most similar words are grouped into sections.

In the future, Barzilay and Glass hope to add a lecture summarization feature to the language processing system. They also want to get users more involved in the project, by incorporating a Wikipedia-like function that would let users correct errors in lecture transcripts and allow them to add lecture notes.

The researchers presented their project at the Interspeech 2007 conference in Antwerp, Belgium, in August. The project was originally funded by Microsoft through the iCampus program and is now funded by the National Science Foundation.

Source: MIT

Explore further: Study reveals mature motorists worse at texting and driving

add to favorites email to friend print save as pdf

Related Stories

MIT pulls online lectures over harassment claim

Dec 09, 2014

The Massachusetts Institute of Technology has removed a retired physics professor's lectures from an online learning platform because the school concluded he had sexually harassed a woman, university officials said.

Online classes really do work, according to study

Sep 24, 2014

It's been two years since a New York Times article declared the "year of the MOOC" —short for "massive open online courses." Now, for the first time, researchers have carried out a detailed study that shows ...

30C3: SD card tricks can deliver MITM attacks

Jan 01, 2014

(Phys.org) —This year's 30th Chaos Communication Congress (30C3) in Hamburg from December 27 to December 30 carried numerous informative presentations, including a reverse-engineering story about SD cards, ...

Fluid mechanics suggests alternative to quantum orthodoxy

Sep 12, 2014

The central mystery of quantum mechanics is that small chunks of matter sometimes seem to behave like particles, sometimes like waves. For most of the past century, the prevailing explanation of this conundrum ...

Recommended for you

Study reveals mature motorists worse at texting and driving

16 hours ago

A Wayne State University interdisciplinary research team in the Eugene Applebaum College of Pharmacy and Health Sciences has made a surprising discovery: older, more mature motorists—who typically are better drivers in ...

Napster co-founder to invest in allergy research

Dec 17, 2014

(AP)—Napster co-founder Sean Parker missed most of his final year in high school and has ended up in the emergency room countless times because of his deadly allergy to nuts, shellfish and other foods.

LA mayor plans 7,000 police body cameras in 2015

Dec 16, 2014

Mayor Eric Garcetti announced a plan Tuesday to equip 7,000 Los Angeles police officers with on-body cameras by next summer, making LA's police department the nation's largest law enforcement agency to move ...

Merriam-Webster names 'culture' word of the year

Dec 15, 2014

A nation, a workplace, an ethnicity, a passion, an outsized personality. The people who comprise these things, who fawn or rail against them, are behind Merriam-Webster's 2014 word of the year: culture.

In Curiosity Hacked, children learn to make, not buy

Dec 14, 2014

With her right hand, my 8-year-old daughter, Kalian, presses the red-hot soldering iron against the circuit board. With her left hand, she guides a thin, tin wire until it's pressing against both the circuit board and the ...

User comments : 0

Please sign in to add a comment. Registration is free, and takes less than a minute. Read more

Click here to reset your password.
Sign in to get notified via email when new comments are made.