Simon/Tips, Tricks and Best Practices

From KDE Wiki Sandbox
Revision as of 09:48, 12 July 2012 by Bedahr

Recordings

Because simon generates the speech model specifically for each user, the training corpus is one of the most important parts of the equation.


Frequent Mistakes

This section lists a few mistakes that are frequently made when recording training utterances, along with possible solutions.

Loudness

If you have not used your microphone with simon before, please double-check that it is set to an appropriate level.

Louder is generally better, but your microphone should never clip. It is therefore best to start out low and increase the level step by step until it just reaches the maximum amplitude when you speak loudly (you can check the current amplitude with, e.g., Audacity).

Newer versions of simon include a level-meter which is displayed while recording samples. The volume is perfectly set up if the meter stays approximately in the center while you speak.
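
For recordings made outside of simon, the amplitude check described above can also be scripted. The following is a minimal sketch, not part of simon: it assumes 16-bit PCM WAV files, and the helper names (read_pcm16, peak_level, is_clipping) as well as the "three consecutive full-scale samples" heuristic are illustrative choices.

```python
import array
import sys
import wave

FULL_SCALE = 32767  # largest positive value of a signed 16-bit sample


def read_pcm16(path):
    """Read a 16-bit PCM WAV file into an array of signed integers.

    Hypothetical helper; for stereo files the channels stay interleaved,
    which is fine for a simple peak check.
    """
    with wave.open(path, "rb") as wav:
        if wav.getsampwidth() != 2:
            raise ValueError("this sketch only handles 16-bit PCM")
        samples = array.array("h")
        samples.frombytes(wav.readframes(wav.getnframes()))
    if sys.byteorder == "big":
        samples.byteswap()  # WAV data is little-endian
    return samples


def peak_level(samples, full_scale=FULL_SCALE):
    """Return the peak amplitude as a fraction of full scale (0.0 to 1.0)."""
    if not samples:
        return 0.0
    return min(1.0, max(abs(s) for s in samples) / full_scale)


def is_clipping(samples, full_scale=FULL_SCALE, min_run=3):
    """Heuristic: True if min_run consecutive samples sit at full scale."""
    run = 0
    for s in samples:
        if abs(s) >= full_scale:
            run += 1
            if run >= min_run:
                return True
        else:
            run = 0
    return False
```

A recording of loud speech whose peak_level is close to 1.0 while is_clipping stays False matches the advice above. If is_clipping returns True, lower the microphone level and record again.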

Pauses

simon tries to learn the pronunciation of its users. Of course, simon never hears only what the user is saying: it also picks up all of the environmental noise.

That is why simon must also learn what "silence" sounds like. This depends on your environment, but also on the microphone that you are using.

simon treats everything at the beginning and at the end of the sample as "silence". For that to work, it is best if the user leaves about one or two seconds of silence at the beginning and end of each recording.
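
To verify that an existing recording leaves enough silence at both ends, a simple amplitude threshold can be applied. This is a rough sketch under stated assumptions and does not reflect how simon itself detects silence: the function names and the threshold of 500 (for signed 16-bit samples) are hypothetical and would need tuning for your microphone and environment.

```python
def silence_margins(samples, sample_rate, threshold=500):
    """Return (leading, trailing) seconds in which |sample| stays below threshold.

    The threshold is an assumed value for signed 16-bit samples.
    """
    loud = [i for i, s in enumerate(samples) if abs(s) >= threshold]
    if not loud:
        # Nothing exceeds the threshold: the whole recording counts as silence.
        total = len(samples) / sample_rate
        return total, total
    leading = loud[0] / sample_rate
    trailing = (len(samples) - 1 - loud[-1]) / sample_rate
    return leading, trailing


def has_enough_silence(samples, sample_rate, min_seconds=1.0, threshold=500):
    """Check that both ends of the recording hold at least min_seconds of silence."""
    leading, trailing = silence_margins(samples, sample_rate, threshold)
    return leading >= min_seconds and trailing >= min_seconds
```

The default of min_seconds=1.0 corresponds to the lower end of the "one or two seconds" recommended above.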