Pronunciation lexicon
From A Cat's Wiki
A pronunciation lexicon maps the written forms of words to their pronunciations. The mapping can be used for automatic speech recognition or for text-to-speech applications.
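As a minimal sketch of such a mapping, the Python snippet below reads one entry per line (a word followed by its phoneme symbols) into a dictionary. The line format, the `;;;` comment prefix, the function name `parse_lexicon`, and the sample entries are assumptions made for illustration; they are not taken from any particular lexicon file.

```python
from collections import defaultdict


def parse_lexicon(lines):
    """Build a mapping word -> list of pronunciations (tuples of phoneme symbols).

    Assumed (hypothetical) format: "<word> <phoneme> <phoneme> ...", one entry
    per line; lines starting with ';;;' are treated as comments.
    """
    lexicon = defaultdict(list)
    for line in lines:
        line = line.strip()
        if not line or line.startswith(";;;"):
            continue  # skip blank lines and comments
        word, *phonemes = line.split()
        if not phonemes:
            continue  # skip entries without a pronunciation
        # A word may have several pronunciations, so keep a list per word.
        lexicon[word.lower()].append(tuple(phonemes))
    return dict(lexicon)


# Illustrative sample entries (phoneme symbols are opaque strings here).
sample = [
    ";;; hypothetical sample entries",
    "hello HH AH0 L OW1",
    "hello HH EH0 L OW1",
    "cat K AE1 T",
]
lexicon = parse_lexicon(sample)
```

Keeping a list of pronunciations per word lets a speech recognizer accept variants, while a text-to-speech system could simply pick the first entry.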
Pronunciations may be represented using the SAMPA standard, which is US-ASCII compatible. The CMU pronouncing dictionary uses the Arpabet, which is also US-ASCII compatible. The IPA standard is currently not widely adopted for this purpose because it is not US-ASCII compatible. This compatibility issue can be resolved by encoding IPA transcriptions in UTF-8, which is increasingly well supported by the software development community.
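The compatibility point can be illustrated with a short Python sketch. The three transcriptions of "cat" and the helper name `is_ascii` are chosen for illustration only: the SAMPA and Arpabet strings encode as plain ASCII, while the IPA string does not, but it survives a UTF-8 round trip unchanged.

```python
def is_ascii(text):
    """Return True if the string can be encoded as US-ASCII."""
    try:
        text.encode("ascii")
        return True
    except UnicodeEncodeError:
        return False


# Illustrative transcriptions of the word "cat".
sampa = '"k{t'       # SAMPA: '"' marks primary stress, '{' stands for the vowel æ
arpabet = "K AE1 T"  # Arpabet: the digit on the vowel marks stress
ipa = "ˈkæt"         # IPA: uses characters outside US-ASCII

ascii_ok = {name: is_ascii(s) for name, s in
            [("sampa", sampa), ("arpabet", arpabet), ("ipa", ipa)]}

# UTF-8 can represent the IPA string; decoding restores it exactly.
ipa_bytes = ipa.encode("utf-8")
roundtrip = ipa_bytes.decode("utf-8")
```

In practice this means an IPA lexicon file only needs to be read and written with an explicit UTF-8 encoding, for example `open(path, encoding="utf-8")`.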
German pronunciation lexicon
The dictionary acquisition project collects IPA transcriptions to build a German pronunciation lexicon, which will be licensed under the GPL. A first version of this lexicon has recently been published; the details are being discussed here. To obtain the German pronunciation lexicon, download the file "Lexicon.tgz" from VoxForge.