This forum has been archived. All content is frozen. Please use KDE Discuss instead.

Russian language bug

Tags: None
(comma "," separated)
User avatar
Ivan_Balalaykin
Registered Member
Posts
7
Karma
0
OS

Russian language bug

Sat Feb 16, 2013 2:53 pm
I downloaded and installed Simon 0.4 for Windows 7, download and copy to a folder c:\Program Files\Simon\bin\ exe files HTK.
Download and install the shadow dictionary of the Russian language from here: Link.
Imported language model from here: Link.
I added to the active vocabulary of 40 words from the shadow dictionary and I trained them 4 times.
When processing language model there is a error:
Code: Select all
The recognition reported the following error:
Failed to setup recognition:


I'm doing something wrong or is this a bug of the program?

P.S.: Sorry for my English.
User avatar
DrHouse123
Registered Member
Posts
7
Karma
0
OS

Re: Russian language bug

Sat Feb 16, 2013 3:24 pm
Maybe there is wrong dictionary?
Did you tried another version of programm and dictionaries?
User avatar
Ivan_Balalaykin
Registered Member
Posts
7
Karma
0
OS

Re: Russian language bug

Sat Feb 16, 2013 3:35 pm
DrHouse123 wrote:Maybe there is wrong dictionary?

Maybe. I do not know.

DrHouse123 wrote:Did you tried another version of programm

I think that early versions of the program are even more unstable.

DrHouse123 wrote:and dictionaries?

I found only this. :'(
User avatar
DrHouse123
Registered Member
Posts
7
Karma
0
OS

Re: Russian language bug

Sat Feb 16, 2013 4:21 pm
Did you tried Simon on Linux ?
Maybe only windows-version has that bug.
User avatar
Ivan_Balalaykin
Registered Member
Posts
7
Karma
0
OS

Re: Russian language bug

Sun Feb 17, 2013 3:44 pm
For linux I have too low IQ)
I want to start Simon under Windows.
bedahr
Moderator
Posts
141
Karma
0
OS

Re: Russian language bug

Sun Feb 17, 2013 11:06 pm
Hi Ivan!

Simon dev here. Did you set up an appropriate grammar? If you haven't read it already, have a look at our manual!
(I just set up the model with the Russian model you linked and it is indeed compatible to Simon 0.4)

Best regards,
Peter
User avatar
Ivan_Balalaykin
Registered Member
Posts
7
Karma
0
OS

Re: Russian language bug

Mon Feb 18, 2013 5:39 pm
Hi Peter!
bedahr wrote:Did you set up an appropriate grammar?

No. I'll try.

bedahr wrote:If you haven't read it already, have a look at our manual!

I read The Simon Handbook. Its best part)

bedahr wrote:I just set up the model with the Russian model you linked and it is indeed compatible to Simon 0.4

I tested [EN/H4W/SPHINX] HUB4 WSJ 1.0 base model with the scenario [EN/H4W] of Mouse 0.1. It worked perfectly without any critical errors.
But when I load the Russian base model with own scenario, an error occurs: "The recognition reported the following error:Failed to setup recognition: ".
Peter, please test my basic model as adapted, and scenario under Windows 7.
Download link
User avatar
Ivan_Balalaykin
Registered Member
Posts
7
Karma
0
OS

Re: Russian language bug

Mon Feb 18, 2013 6:49 pm
Great progress! After I set up an appropriate grammar, I got two new error messages! But not at once, and after dictionary training.
Code: Select all
As the server compiled the model the following error occurred:
Failed to pack to archive. Source directory does not exist ("c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx//default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/model_parameters/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}.cd_semi_200/")

"C:/Program Files/Simon/bin/python.EXE" "C:/Program Files/Simon/bin/sphinxtrain.EXE" -t default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260} setup
C:/Program Files/Simon/bin/..//lib/sphinxtrain
Setting up the database default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}

"C:/Program Files/Simon/bin/python.EXE" "C:/Program Files/Simon/bin/sphinxtrain.EXE" run
MODULE: 000 Computing feature from audio files
Extracting features from segments starting at (part 1 of 1)
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
Extracting features from segments starting at (part 1 of 1)
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
Feature extraction is done
MODULE: 00 verify training files
Phase 1: Checking to see if the dict and filler dict agrees with the phonelist file.
Found 13 words using 24 phones
Phase 2: Checking to make sure there are not duplicate entries in the dictionary
Phase 3: Check general format for the fileids file; utterance length (must be positive); files exist
WARNING: Error in 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/etc/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}_train.fileids', the feature file 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/feat/один_S1_2013-02-19_00-20-42.0.mfc' does not exist, or is empty
WARNING: Error in 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/etc/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}_train.fileids', the feature file 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/feat/два_S2_2013-02-19_00-20-42.0.mfc' does not exist, or is empty
WARNING: Error in 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/etc/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}_train.fileids', the feature file 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/feat/три_S3_2013-02-19_00-20-42.0.mfc' does not exist, or is empty
WARNING: Error in 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/etc/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}_train.fileids', the feature file 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/feat/четыре_S4_2013-02-19_00-20-42.0.mfc' does not exist, or is empty
WARNING: Error in 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/etc/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}_train.fileids', the feature file 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/feat/пять_S5_2013-02-19_00-20-42.0.mfc' does not exist, or is empty
WARNING: Error in 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/etc/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}_train.fileids', the feature file 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/feat/шесть_S6_2013-02-19_00-20-42.0.mfc' does not exist, or is empty
WARNING: Error in 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/etc/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}_train.fileids', the feature file 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/feat/семь_S7_2013-02-19_00-20-42.0.mfc' does not exist, or is empty
WARNING: Error in 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/etc/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}_train.fileids', the feature file 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/feat/восемь_S8_2013-02-19_00-20-42.0.mfc' does not exist, or is empty
WARNING: Error in 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/etc/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}_train.fileids', the feature file 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/feat/девять_S9_2013-02-19_00-20-42.0.mfc' does not exist, or is empty
WARNING: Error in 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/etc/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}_train.fileids', the feature file 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/feat/отмена_S12_2013-02-19_00-20-42.0.mfc' does not exist, or is empty
Phase 4: Checking number of lines in the transcript file should match lines in fileids file
Phase 5: Determine amount of training data, see if n_tied_states seems reasonable.
Estimated Total Hours Training: -2.13675213675214e-006
This is a small amount of data, no comment at this time
Phase 6: Checking that all the words in the transcript are in the dictionary
Words in dictionary: 10
Words in filler dictionary: 3
Phase 7: Checking that all the phones in the transcript are in the phonelist, and all phones in the phonelist appear at least once
MODULE: 0000 train grapheme-to-phoneme model
Skipped (set $CFG_G2P_MODEL = 'yes' to enable)
Feature type is s2_4x which is 4 streams
LDA/MLLT only has sense for single stream features, for example 1s_c_d_dd
Skipping LDA training
Feature type is s2_4x which is 4 streams
LDA/MLLT only has sense for single stream features, for example 1s_c_d_dd
Skipping MLLT training
MODULE: 05 Vector Quantization
This step had 102 ERROR messages and 0 WARNING messages. Please check the log file for details.
This step had 1 ERROR messages and 0 WARNING messages. Please check the log file for details.
MODULE: 10 Training Context Independent models for forced alignment and VTLN
Skipped: $ST::CFG_FORCEDALIGN set to 'no' in sphinx_train.cfg
Skipped: $ST::CFG_VTLN set to 'no' in sphinx_train.cfg
MODULE: 11 Force-aligning transcripts
Skipped: $ST::CFG_FORCEDALIGN set to 'no' in sphinx_train.cfg
MODULE: 12 Force-aligning data for VTLN
Skipped: $ST::CFG_VTLN set to 'no' in sphinx_train.cfg
MODULE: 20 Training Context Independent models
Phase 1: Cleaning up directories:
accumulator...logs...qmanager...models...
Phase 2: Flat initialize
This step had 1 ERROR messages and 0 WARNING messages. Please check the log file for details.
Phase 3: Forward-Backward
Baum welch starting for 64 Gaussian(s), iteration: 1 (1 of 1)
0%
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
Only 0 parts of 1 of Baum Welch were successfully completed
Parts 1 failed to run!
Training failed in iteration 1
MODULE: 30 Training Context Dependent models
Phase 1: Cleaning up directories:
accumulator...logs...qmanager...
Phase 2: Initialization
This step had 1 ERROR messages and 0 WARNING messages. Please check the log file for details.
Phase 3: Forward-Backward
Baum welch starting for iteration: 1 (1 of 1)
0%
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
Only 0 parts of 1 of Baum Welch were successfully completed
Parts 1 failed to run!
Training failed in iteration 1
MODULE: 40 Build Trees
Phase 1: Cleaning up old log files...
Phase 2: Make Questions
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
Phase 3: Tree building
Processing each phone with each state
oo 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
oo 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
oo 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
mm 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
mm 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
mm 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
s 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
s 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
s 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
t 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
t 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
t 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
v 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
v 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
v 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
vv 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
vv 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
vv 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
ii 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
ii 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
ii 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
tt 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
tt 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
tt 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
rr 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
rr 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
rr 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
ee 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
ee 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
ee 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
pp 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
pp 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
pp 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
sh 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
sh 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
sh 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
aa 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
aa 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
aa 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
yy 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
yy 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
yy 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
a 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
a 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
a 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
ch 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
ch 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
ch 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
ae 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
ae 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
ae 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
d 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
d 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
d 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
e 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
e 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
e 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
ss 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
ss 1
This step had 2 ERROR messages and 0 WARNI


Code: Select all
As the server compiled the model the following error occurred:
Failed to pack to archive. Source directory does not exist ("c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx//default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/model_parameters/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}.cd_semi_200/")

"C:/Program Files/Simon/bin/python.EXE" "C:/Program Files/Simon/bin/sphinxtrain.EXE" -t default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260} setup
C:/Program Files/Simon/bin/..//lib/sphinxtrain
Setting up the database default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}

"C:/Program Files/Simon/bin/python.EXE" "C:/Program Files/Simon/bin/sphinxtrain.EXE" run
MODULE: 000 Computing feature from audio files
Extracting features from segments starting at (part 1 of 1)
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
Extracting features from segments starting at (part 1 of 1)
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
Feature extraction is done
MODULE: 00 verify training files
Phase 1: Checking to see if the dict and filler dict agrees with the phonelist file.
Found 13 words using 24 phones
Phase 2: Checking to make sure there are not duplicate entries in the dictionary
Phase 3: Check general format for the fileids file; utterance length (must be positive); files exist
WARNING: Error in 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/etc/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}_train.fileids', the feature file 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/feat/один_S1_2013-02-19_00-20-42.0.mfc' does not exist, or is empty
WARNING: Error in 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/etc/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}_train.fileids', the feature file 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/feat/два_S2_2013-02-19_00-20-42.0.mfc' does not exist, or is empty
WARNING: Error in 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/etc/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}_train.fileids', the feature file 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/feat/три_S3_2013-02-19_00-20-42.0.mfc' does not exist, or is empty
WARNING: Error in 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/etc/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}_train.fileids', the feature file 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/feat/четыре_S4_2013-02-19_00-20-42.0.mfc' does not exist, or is empty
WARNING: Error in 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/etc/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}_train.fileids', the feature file 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/feat/пять_S5_2013-02-19_00-20-42.0.mfc' does not exist, or is empty
WARNING: Error in 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/etc/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}_train.fileids', the feature file 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/feat/шесть_S6_2013-02-19_00-20-42.0.mfc' does not exist, or is empty
WARNING: Error in 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/etc/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}_train.fileids', the feature file 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/feat/семь_S7_2013-02-19_00-20-42.0.mfc' does not exist, or is empty
WARNING: Error in 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/etc/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}_train.fileids', the feature file 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/feat/восемь_S8_2013-02-19_00-20-42.0.mfc' does not exist, or is empty
WARNING: Error in 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/etc/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}_train.fileids', the feature file 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/feat/девять_S9_2013-02-19_00-20-42.0.mfc' does not exist, or is empty
WARNING: Error in 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/etc/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}_train.fileids', the feature file 'c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/feat/отмена_S12_2013-02-19_00-20-42.0.mfc' does not exist, or is empty
Phase 4: Checking number of lines in the transcript file should match lines in fileids file
Phase 5: Determine amount of training data, see if n_tied_states seems reasonable.
Estimated Total Hours Training: -2.13675213675214e-006
This is a small amount of data, no comment at this time
Phase 6: Checking that all the words in the transcript are in the dictionary
Words in dictionary: 10
Words in filler dictionary: 3
Phase 7: Checking that all the phones in the transcript are in the phonelist, and all phones in the phonelist appear at least once
MODULE: 0000 train grapheme-to-phoneme model
Skipped (set $CFG_G2P_MODEL = 'yes' to enable)
Feature type is s2_4x which is 4 streams
LDA/MLLT only has sense for single stream features, for example 1s_c_d_dd
Skipping LDA training
Feature type is s2_4x which is 4 streams
LDA/MLLT only has sense for single stream features, for example 1s_c_d_dd
Skipping MLLT training
MODULE: 05 Vector Quantization
This step had 102 ERROR messages and 0 WARNING messages. Please check the log file for details.
This step had 1 ERROR messages and 0 WARNING messages. Please check the log file for details.
MODULE: 10 Training Context Independent models for forced alignment and VTLN
Skipped: $ST::CFG_FORCEDALIGN set to 'no' in sphinx_train.cfg
Skipped: $ST::CFG_VTLN set to 'no' in sphinx_train.cfg
MODULE: 11 Force-aligning transcripts
Skipped: $ST::CFG_FORCEDALIGN set to 'no' in sphinx_train.cfg
MODULE: 12 Force-aligning data for VTLN
Skipped: $ST::CFG_VTLN set to 'no' in sphinx_train.cfg
MODULE: 20 Training Context Independent models
Phase 1: Cleaning up directories:
accumulator...logs...qmanager...models...
Phase 2: Flat initialize
This step had 1 ERROR messages and 0 WARNING messages. Please check the log file for details.
Phase 3: Forward-Backward
Baum welch starting for 64 Gaussian(s), iteration: 1 (1 of 1)
0%
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
Only 0 parts of 1 of Baum Welch were successfully completed
Parts 1 failed to run!
Training failed in iteration 1
MODULE: 30 Training Context Dependent models
Phase 1: Cleaning up directories:
accumulator...logs...qmanager...
Phase 2: Initialization
This step had 1 ERROR messages and 0 WARNING messages. Please check the log file for details.
Phase 3: Forward-Backward
Baum welch starting for iteration: 1 (1 of 1)
0%
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
Only 0 parts of 1 of Baum Welch were successfully completed
Parts 1 failed to run!
Training failed in iteration 1
MODULE: 40 Build Trees
Phase 1: Cleaning up old log files...
Phase 2: Make Questions
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
Phase 3: Tree building
Processing each phone with each state
oo 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
oo 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
oo 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
mm 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
mm 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
mm 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
s 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
s 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
s 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
t 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
t 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
t 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
v 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
v 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
v 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
vv 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
vv 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
vv 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
ii 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
ii 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
ii 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
tt 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
tt 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
tt 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
rr 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
rr 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
rr 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
ee 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
ee 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
ee 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
pp 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
pp 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
pp 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
sh 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
sh 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
sh 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
aa 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
aa 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
aa 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
yy 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
yy 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
yy 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
a 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
a 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
a 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
ch 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
ch 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
ch 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
ae 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
ae 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
ae 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
d 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
d 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
d 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
e 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
e 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
e 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
ss 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
ss 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
ss 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
i 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
i 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
i 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
dd 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
dd 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
dd 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
n 0
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
n 1
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
n 2
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
Skipping SIL
MODULE: 45 Prune Trees
Phase 1: Tree Pruning
This step had 1 ERROR messages and 0 WARNING messages. Please check the log file for details.
Phase 2: State Tying
This step had 1 ERROR messages and 0 WARNING messages. Please check the log file for details.
MODULE: 50 Training Context dependent models
Phase 1: Cleaning up directories:
accumulator...logs...qmanager...
Phase 2: Copy CI to CD initialize
This step had 1 ERROR messages and 0 WARNING messages. Please check the log file for details.
Phase 3: Forward-Backward
Baum welch starting for 64 Gaussian(s), iteration: 1 (1 of 2)
0%
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
Baum welch starting for 64 Gaussian(s), iteration: 1 (2 of 2)
0%
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
Only 0 parts of 2 of Baum Welch were successfully completed
Parts 1 2 failed to run!
Training failed in iteration 1
MODULE: 65 MMIE Training
Skipped: $ST::CFG_MMIE set to 'no' in sphinx_train.cfg
MODULE: 90 deleted interpolation
Phase 1: Cleaning up directories: logs...
Phase 2: Doing interpolation...
This step had 1 ERROR messages and 0 WARNING messages. Please check the log file for details.
Phase 3: Dumping senones for PocketSphinx...
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
MODULE: DECODE Decoding using models previously trained
Decoding 10 segments starting at 0 (part 1 of 1)
0%
This step had 2 ERROR messages and 0 WARNING messages. Please check the log file for details.
Aligning results to find error rate
C:/Program Files/Simon/bin/..//lib/sphinxtrain
Running the training
""C:/Program Files/Simon/bin/perl.exe" "C:/Program Files/Simon/bin/..//lib/sphinxtrain/scripts/000.comp_feat/slave_feat.pl""
""C:/Program Files/Simon/bin/perl.exe" "C:/Program Files/Simon/bin/..//lib/sphinxtrain/scripts/00.verify/verify_all.pl""
""C:/Program Files/Simon/bin/perl.exe" "C:/Program Files/Simon/bin/..//lib/sphinxtrain/scripts/0000.g2p_train/g2p_train.pl""
""C:/Program Files/Simon/bin/perl.exe" "C:/Program Files/Simon/bin/..//lib/sphinxtrain/scripts/01.lda_train/slave_lda.pl""
""C:/Program Files/Simon/bin/perl.exe" "C:/Program Files/Simon/bin/..//lib/sphinxtrain/scripts/02.mllt_train/slave_mllt.pl""
""C:/Program Files/Simon/bin/perl.exe" "C:/Program Files/Simon/bin/..//lib/sphinxtrain/scripts/05.vector_quantize/slave.VQ.pl""
""C:/Program Files/Simon/bin/perl.exe" "C:/Program Files/Simon/bin/..//lib/sphinxtrain/scripts/10.falign_ci_hmm/slave_convg.pl""
""C:/Program Files/Simon/bin/perl.exe" "C:/Program Files/Simon/bin/..//lib/sphinxtrain/scripts/11.force_align/slave_align.pl""
""C:/Program Files/Simon/bin/perl.exe" "C:/Program Files/Simon/bin/..//lib/sphinxtrain/scripts/12.vtln_align/slave_align.pl""
""C:/Program Files/Simon/bin/perl.exe" "C:/Program Files/Simon/bin/..//lib/sphinxtrain/scripts/20.ci_hmm/slave_convg.pl""
""C:/Program Files/Simon/bin/perl.exe" "C:/Program Files/Simon/bin/..//lib/sphinxtrain/scripts/30.cd_hmm_untied/slave_convg.pl""
""C:/Program Files/Simon/bin/perl.exe" "C:/Program Files/Simon/bin/..//lib/sphinxtrain/scripts/40.buildtrees/slave.treebuilder.pl""
""C:/Program Files/Simon/bin/perl.exe" "C:/Program Files/Simon/bin/..//lib/sphinxtrain/scripts/45.prunetree/slave.state-tying.pl""
""C:/Program Files/Simon/bin/perl.exe" "C:/Program Files/Simon/bin/..//lib/sphinxtrain/scripts/50.cd_hmm_tied/slave_convg.pl""
""C:/Program Files/Simon/bin/perl.exe" "C:/Program Files/Simon/bin/..//lib/sphinxtrain/scripts/60.lattice_generation/slave_genlat.pl""
""C:/Program Files/Simon/bin/perl.exe" "C:/Program Files/Simon/bin/..//lib/sphinxtrain/scripts/61.lattice_pruning/slave_prune.pl""
""C:/Program Files/Simon/bin/perl.exe" "C:/Program Files/Simon/bin/..//lib/sphinxtrain/scripts/62.lattice_conversion/slave_conv.pl""
""C:/Program Files/Simon/bin/perl.exe" "C:/Program Files/Simon/bin/..//lib/sphinxtrain/scripts/65.mmie_train/slave_convg.pl""
""C:/Program Files/Simon/bin/perl.exe" "C:/Program Files/Simon/bin/..//lib/sphinxtrain/scripts/90.deleted_interpolation/deleted_interpolation.pl""
""C:/Program Files/Simon/bin/perl.exe" "C:/Program Files/Simon/bin/..//lib/sphinxtrain/scripts/decode/slave.pl""

Can't open perl script "C:/Program Files/Simon/bin/..//lib/sphinxtrain/scripts/60.lattice_generation/slave_genlat.pl": No such file or directory
Can't open perl script "C:/Program Files/Simon/bin/..//lib/sphinxtrain/scripts/61.lattice_pruning/slave_prune.pl": No such file or directory
Can't open perl script "C:/Program Files/Simon/bin/..//lib/sphinxtrain/scripts/62.lattice_conversion/slave_conv.pl": No such file or directory
Failed to copy c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/model_architecture/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}.200.mdef to c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/model_parameters/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}.cd_semi_200_delinterp/mdef: No such file or directory at C:/Program Files/Simon/bin/..//lib/sphinxtrain/scripts/90.deleted_interpolation/deleted_interpolation.pl line 110.
Can't open c:/users/test3/appdata/roaming/.kde/tmp-deeptown/simond/default/compile/sphinx/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}/result/default{b4291551-fc4f-4b3a-8b3b-dcfe1399b260}-1-1.match
word_align.pl failed with error code 256 at C:/Program Files/Simon/bin/..//lib/sphinxtrain/scripts/decode/slave.pl line 173.


My .kde folder:
Download link
bedahr
Moderator
Posts
141
Karma
0
OS

Re: Russian language bug

Mon Feb 18, 2013 8:35 pm
Hi!

I've had a similar problem from another user using adapted SPHINX base models on Windows. I think there could be a deeper issue there but as I'm not running Windows, it's hard to investigate.

Could you please compress the %appdata%\.kde\tmp-* folder(s), upload and link the archive here so that I can take a look?
Thanks.

Best regards,
Peter
User avatar
Ivan_Balalaykin
Registered Member
Posts
7
Karma
0
OS

Re: Russian language bug

Tue Feb 19, 2013 12:58 pm
Link to the archive tmp folder

Link to the all .kde folder

Click on the big black button with a strange Russian word)
bedahr
Moderator
Posts
141
Karma
0
OS

Re: Russian language bug  Topic is solved

Tue Feb 19, 2013 4:51 pm
Thank you Ivan. Because of your data files, I have finally been able to reproduce the problem and pinpoint the issue.

The reason why the compilation fails is that Simon names the training samples according to the words that are recorded. So if you record "Test" then the sample will be stored as something like "Test_<date>.wav".
It also stores the transcription (in this case: "TEST") and other information in a text file (that is UTF-8 encoded).

However, because Russian uses lots of special characters, those file names need to be encoded as well - and Windows does this with a local 8-bit character set (I presume - it doesn't appear to be UTF-8 and I haven't had time to look it up).

In any case, the files are not found during the adaption because the file names do no longer match (due to the different encoding).

I'm afraid, there is no fix that doesn't require a bit of coding and, more importantly, updated binaries.
There are basically two work-arounds for the mean time:
a.) Don't use special characters in your words. If you want to write the words out afterwards, you can always "link" the safe ascii-command to the UTF-8 text with e.g. a text-macro command. If you go this route, don't forget to remove your old samples (main screen > manage training data > clear training data).
b.) Keep everything the same but manually re-name the training samples later to a "safe" (ASCII) file name. You can find the samples in the path "%appdata%\.kde\share\apps\simon\model\training.data". Afterwards, you also need to re-write the "prompts" file in "%appdata%\.kde\share\apps\simon\model" to reflect the changed file names. Then, update the "TrainingDate" field in "%appdata%\.kde\share\apps\simon\model\modelsrcrc" to the current date / time to let Simon know about the changes to trigger a synchronization. Don't forget to quit Simon before you do this!

I've also added a ticket at https://bugs.kde.org/show_bug.cgi?id=315460. You can add yourself to the CC list if you want to get a message as soon as the bug is fixed.
Sorry for the inconvenience.

Best regards,
Peter

Edit:
Some additional information:
The Linux version should generally be not affected as most distributions use UTF-8 as the default file system encoding.

Using a static base model of course also avoids this problem altogether.

The HTK backend already includes the appropriate workarounds for Windows.
However, you'd have to check if someone built and released a Russian HTK model. If not, you could still use a user-generated model as well.
User avatar
Ivan_Balalaykin
Registered Member
Posts
7
Karma
0
OS

Re: Russian language bug

Wed Feb 20, 2013 2:17 pm
Thanks!!!
You're a real digital magician! Simon worked! According to your advice I transforming Russian letters to a Latin.


Bookmarks



Who is online

Registered users: Bing [Bot], Google [Bot], Sogou [Bot]