澳门金沙网址

How to sound out the Net's big talkers

作者:还悫    发布时间:2019-03-07 02:12:03    

By David Brake THERE’S a lot of information squirrelled away on the Internet as sound files, but how do you find the ones you’re looking for? If you want to find Bill Clinton’s wise words on Kosovo, for instance, where do you start? Compaq’s research team in Cambridge, Massachusetts, thinks it has the answer: Speechbot, a new search engine that can listen to and index the content of speech files on the Net. The Speechbot team took public domain speech recognition software and tuned it to cope better with the low-quality audio available on the Net. Sound files—such as those in RealAudio format or Microsoft’s Media Player—are played into Sphinx and the resulting rough transcript is indexed using a conventional search engine. The results are not accurate enough for reading, but according to Compaq’s business development manager Ron Gentile, repeated keywords allow effective indexing of the file’s content. Speechbot can also match the words in a transcript to the times they were said in a given clip so you don’t have to listen to a three-hour speech to get to the bit you want. Compaq has yet to decide how it will expand and commercialise the service, but some commentators are already impressed. “Other attempts to search multimedia files rely on closed caption information and other indexing material. This is the first system I have heard of which doesn’t do that,” says Danny Sullivan,

 

Copyright © 网站地图