Title: CMU Sphinx - Speech Recognition Toolkit
Description:hrefhttp:lilwondermat.comchatter
Keywords:Body:
CMU Sphinx - Speech Recognition Toolkit
CMU Sphinx
Open Source Toolkit For Speech Recognition
Project by Carnegie Mellon University
Search
a
Download
Learn
Research
Develop
Communicate
CMLLR Adaptation in SphinxTrain
Published on February 17, 2011
The problem is that with a complexity of ASR algorithms it #8217;s very hard to implement them all. While some of them are sometimes better, some are worse. For specific application you can always choose most reasonable approach but it may be not readily available in your system and it might be quite resource-consuming to implement them. That #8217;s why frameworks
like CMUSphinx are valuable for both researchers and speech application developers. That #8217;s why we are so happy to see your contributions to CMUSphinx.
Good example of this is a set of approaches to train MLLR transform. Basically there is MLLR where mean and variance of the gaussians are estimated alone or CMLLR where mean and variance of the gaussian distribution are estimated together. CMLLR is more complex to estimate but because of smaller amount of parameters it does make sense to apply
it when your adaptation data is small. For example if you have just a minute of speech to adapt, CMLLR can give you better results than MLLR.
Why do we write this today you #8217;ll ask? Easy. Today CMLLR estimation code landed in Sphinxtrain trunk. See the file cmllr.py. Thanks a lot to Stephan Vanni who contributed that part, that #8217;s really valuable addition! Enjoy!
Filed in Random stuff |
No replies
We are happy to see every app using CMUSphinx
Published on November 30, 2010
It #8217;s interesting that CMUSphinx gives the developers all over the world the ability to build speech systems, interact with voice and build something unique and useful
embed
This Kinect imitation is not really very impressive or complicated speech recognition, it #8217;s all about freedom of creativity. If you want to build interactive application, just write few lines of code and it will work. In many languages, for many people.
And we #8217;ll be satisfied as well.
Filed in Random stuff |
No replies
InproTK Demonstraction
Published on November 4, 2010
Please consider Sphinx4-based spoken dialog system toolkit demonstration http://www.okkoblog.com/2010/11/04/inprotk-demonstration/ by University of Potsdam and the Inpro Group. Few interesting improvements to the core are there such as prosody-driven end-of-turn classification, mid-utterance action execution and display of partial ASR hypotheses
Filed in Random stuff |
No replies
JVoiceXML and CMUSphinx
Published on October 11, 2010
Are you one of those who think that CMUSphinx is build for mad scientists to support DARPA sniffing on the Arabic broadcasts? Let me show you something interesting. Today #8217;s story is about a very important project that uses CMUSphinx #8211; JVoiceXML, a VoiceXML browser written in Java. It serves hundreds of VoiceXML browsers in the VoIP world. Beside VoiceXML gateways, it provides development tools, including a VoiceXML plugin for Eclipse.
We asked the lead JVoiceXML developer Dr. Dirk Schnelle-Walka several questions about JVoixeXML and CMUSphinx. Here are his answers.
JVoiceXML and CMUSphinx continued raquo;
Filed in Random stuff |
No replies
OpenEars, speech library for IPhone using CMUSphinx
Published on September 18, 2010
We are very pleased to see the ongoing progress on OpenEars. Please consider
http://www.politepix.com/openears/
OpenEars is an iOS library for continuous, multithreaded speech recognition and text-to-speech using CMU Pocketsphinx and CMU Flite, for use in iPhone and iPad development. OpenEars can:
• Do continuous speech recognition on a managed background thread that uses less than 10% CPU on average on an iPhone 3G while listening (decoding and text-to-speech use more CPU),
• Quickly suspend and resume continuous recognition on demand,
• Choose between 8 Flite voices for text-to-speech using a simple config file,
• Suspend recognition during Flite speech automatically when using the external speaker,
• Make use of a Cocoa-standard static library project, allowing SDK and architecture re-targeting from the application project,
• Do management and notification of the state of the Audio Session to handle microphone changes and interruptions like incoming calls,
• Return input/output decibel metering of the audio functions so it is ready for your UI.
• Let you use these features via Objective-C methods.
Please report any bugs at http://www.politepix.com/forums/
Since you probably urge to know more, we asked the OpenEars developer, Halle Winkler a few questions. Halle is a professional developer specializing in software development for the iPhone, iPad, and iPod Touch, as well as UX design, with an emphasis on usability and the emerging interaction possibilities of multitouch platforms.
OpenEars, speech library for IPhone using CMUSphinx continued raquo;
Filed in Random stuff |
No replies
Sphinx4-1.0 beta 5 released
Published on September 1, 2010
Congratulations with the new release.
Get it here:
https://sourceforge.net/projects/cmusphinx/files/sphinx4/1.0 beta5/
New Features and Improvements:
* Alignment demo and grammar to align long speech recordings to
transcription and get word times
* Lattice grammar for multipass decoding
* Explicit-backoff in LexTree linguist
* Significant LVCSR speedup with proper LexTree compression
* Simple filter to drop zero energy frames
* Graphviz for grammar dump vizualization instead of AISee
* Voxforge decoding accuracy test
* Lattice scoring speedup
* JSAPI-free JSGF parser
Bug Fixes:
* Insertion probabilities are counted in lattice scores
* Don #8217;t waste resources and memory on dummy acoustic model
transformations
* Small DMP files are loaded properly
* JSGF parser fixes
* Documentation improvements
* Debian package stuff
Thanks:
Antoine Raux, Marek Lesiak, Yaniv Kunda, Brian Romanowski, Tony
Robinson, Bhiksha Raj, Timo Baumann, Michele Alessandrini, Francisco
Aguilera, Peter Wolf, David Huggins-Daines, Dirk Schnelle-Walka.
Filed in Random stuff |
No replies
Voxforge Spanish Model Released
Published on August 28, 2010
As of today, CMUSphinx project is proud to present you the Spanish acoustic model. It was trained on the acoustic data to support both wideband recognition of microphone speech recordings and narrowband recognition of the telephone speech. Now you can build Spanish IVR using CMUSphinx tools.
You can find the model HERE.
Please try and report the accuracy results.
Also, note that there are models for many languages like German, Dutch, French or Russian. In the future we are going to expand the set of languages for your convenience.
Filed in Random stuff |
7 replies
Python HTK Converter
Published on August 21, 2010
If you are using HTK still and has headaches with it, you have last chance to move to CMUSphinx! You can even take your models with you!
The htk2s3conv package which converts HTK model to SphinxTrain format has been landed in our VCS recently, check it out at
https://cmusphinx.svn.sourceforge.net/svnroot/cmusphinx/trunk/htk2s3conv
Don #8217;t forget to read the documentation!
Filed in Random stuff |
No replies
Sphinx4 Creates Future
Published on August 21, 2010
Recently we wrote about one interesting project where sphinx4 was used in contemporary art installation. That #8217;s really enouraging that being very easy for art designer sphinx4 allows to create new virtual reality. Please consider another interesting one:
Chatter from Mia Sorensen on Vimeo.
If you got interested, code and description is here.
Filed in Random stuff |
No replies
Pocketsphinx Is A Perfect Fit For Mobile
Published on August 11, 2010
Pocketsphinx is going to win on mobile platforms. Right now there is no alternative to using it because of performance, accuracy and flexibility. Using pretrained US English acoustic models one can get amazing results.
That was proved once again by amazing Nightingale Browser Project. Check this video how it works
embed
If you like it, feel free to contribute to the project. Checkout source from Gitorious http://gitorious.org/nightingale, clone, send patches, report bugs!
And, to not make you confused. Pocketsphinx is not only for mobile. You can perfectly use it in desktop applications, even in large scalable application. It #8217;s accuracy is the best we can suggest you right now!
Filed in Random stuff |
No replies
laquo; Older Posts
CMU Sphinx In Action
Polls
Is it a problem to have external library dependencies?
You can't implement everything inside CMUSphinx
It's better to avoid external code
Depends on the quality of the library
View Results
Loading ...
Polls Archive
RSS Feeds
Posts
Comments
Content copy; CMU Sphinx |
Hosting by SourceForge |
Powered by WordPress
amp;
built on WordPreciousss
| Top