Registered Member
|
The product Simon does not work at all. It cannot be installed correctly. It no serviceable documentation like one gives my opinion this product gets to run at all. The speech recognition is not really developed further. There are many handicapped people these are dependent on a speech recognition. The installation does not work already at all.
My experiences till now: 1. At the installation the acoustics model cannot be downloaded. I click on download "accoustic model". I got an error message. "Do you want to change to the web page"? I am passed on a web page. Then being I passed on "GitHub" again. I found hundreds of files on "GitHub". Which file do I have to load now? I am really confused. I skip all questions. 2. What do I have to select in Simon main window? What are the next steps? Must I download the acoustics model, which does not work or go to the manage scenarios first? I want to control the PC with speech recognition and not become an IT specialist. This main window is confusing. Can somebody give me instructions to make this software run. It really cannot be so difficult! I already have tried everything out but it simply does not work. |
Registered Member
|
No help?
My system: OpenSuse 13.2 Thumbleweed Kernel: 3.19.2 Simon: Version 0.4.1 KDE 4.14.5 Installed: sphinxbase-5prealpha sphinxtrain-5prealpha Simon Base Model from TOD [DE/VF/SPHINX] de-2014-04-16 1.4 German voxforge model for SHINX. Scenario from Bedahr Erkennungskontrolle [DE/VF] What is wrong with my simon SR? I've got the error: INFO: cmd_ln.c(691): Parsing command line: \ -hmm /tmp/kde-xpc//simond/default/sphinx/ \ -jsgf /tmp/kde-xpc//simond/default/sphinx/default{dde8dba9-eb73-48be-b290-3e19f15fe26f}.jsgf \ -dict /tmp/kde-xpc//simond/default/sphinx/default{dde8dba9-eb73-48be-b290-3e19f15fe26f}.dic \ -samprate 16000 Current configuration: [NAME] [DEFLT] [VALUE] -agc none none -agcthresh 2.0 2,000000e+00 -alpha 0.97 9,700000e-01 -ascale 20.0 2,000000e+01 -aw 1 1 -backtrace no no -beam 1e-48 1,000000e-48 -bestpath yes yes -bestpathlw 9.5 9,500000e+00 -bghist no no -ceplen 13 13 -cmn current current -cmninit 8.0 8.0 -compallsen no no -debug 0 -dict /tmp/kde-xpc//simond/default/sphinx/default{dde8dba9-eb73-48be-b290-3e19f15fe26f}.dic -dictcase no no -dither no no -doublebw no no -ds 1 1 -fdict -feat 1s_c_d_dd 1s_c_d_dd -featparams -fillprob 1e-8 1,000000e-08 -frate 100 100 -fsg -fsgusealtpron yes yes -fsgusefiller yes yes -fwdflat yes yes -fwdflatbeam 1e-64 1,000000e-64 -fwdflatefwid 4 4 -fwdflatlw 8.5 8,500000e+00 -fwdflatsfwin 25 25 -fwdflatwbeam 7e-29 7,000000e-29 -fwdtree yes yes -hmm /tmp/kde-xpc//simond/default/sphinx/ -input_endian little little -jsgf /tmp/kde-xpc//simond/default/sphinx/default{dde8dba9-eb73-48be-b290-3e19f15fe26f}.jsgf -kdmaxbbi -1 -1 -kdmaxdepth 0 0 -kdtree -latsize 5000 5000 -lda -ldadim 0 0 -lextreedump 0 0 -lifter 0 0 -lm -lmctl -lmname default default -logbase 1.0001 1,000100e+00 -logfn -logspec no no -lowerf 133.33334 1,333333e+02 -lpbeam 1e-40 1,000000e-40 -lponlybeam 7e-29 7,000000e-29 -lw 6.5 6,500000e+00 -maxhmmpf -1 -1 -maxnewoov 20 20 -maxwpf -1 -1 -mdef -mean -mfclogdir -min_endfr 0 0 -mixw -mixwfloor 0.0000001 1,000000e-07 -mllr -mmap yes yes -ncep 13 13 -nfft 512 512 -nfilt 40 40 -nwpen 1.0 1,000000e+00 -pbeam 1e-48 1,000000e-48 -pip 1.0 1,000000e+00 -pl_beam 1e-10 1,000000e-10 -pl_pbeam 1e-5 1,000000e-05 -pl_window 0 0 -rawlogdir -remove_dc no no -round_filters yes yes -samprate 16000 1,600000e+04 -seed -1 -1 -sendump -senlogdir -senmgau -silprob 0.005 5,000000e-03 -smoothspec no no -svspec -tmat -tmatfloor 0.0001 1,000000e-04 -topn 4 4 -topn_beam 0 0 -toprule -transform legacy legacy -unit_area yes yes -upperf 6855.4976 6,855498e+03 -usewdphones no no -uw 1.0 1,000000e+00 -var -varfloor 0.0001 1,000000e-04 -varnorm no no -verbose no no -warp_params -warp_type inverse_linear inverse_linear -wbeam 7e-29 7,000000e-29 -wip 0.65 6,500000e-01 -wlen 0.025625 2,562500e-02 INFO: cmd_ln.c(691): Parsing command line: \ -lowerf 130 \ -upperf 6800 \ -nfilt 25 \ -transform dct \ -lifter 22 \ -feat 1s_c_d_dd \ -agc none \ -cmn current \ -varnorm no Current configuration: [NAME] [DEFLT] [VALUE] -agc none none -agcthresh 2.0 2,000000e+00 -alpha 0.97 9,700000e-01 -ceplen 13 13 -cmn current current -cmninit 8.0 8.0 -dither no no -doublebw no no -feat 1s_c_d_dd 1s_c_d_dd -frate 100 100 -input_endian little little -lda /tmp/kde-xpc//simond/default/sphinx//feature_transform -ldadim 0 0 -lifter 0 22 -logspec no no -lowerf 133.33334 1,300000e+02 -ncep 13 13 -nfft 512 512 -nfilt 40 25 -remove_dc no no -round_filters yes yes -samprate 16000 1,600000e+04 -seed -1 -1 -smoothspec no no -svspec -transform legacy dct -unit_area yes yes -upperf 6855.4976 6,800000e+03 -varnorm no no -verbose no no -warp_params -warp_type inverse_linear inverse_linear -wlen 0.025625 2,562500e-02 INFO: acmod.c(242): Parsed model-specific feature parameters from /tmp/kde-xpc//simond/default/sphinx//feat.params INFO: feat.c(713): Initializing feature stream to type: '1s_c_d_dd', ceplen=13, CMN='current', VARNORM='no', AGC='none' INFO: cmn.c(142): mean[0]= 12,00, mean[1..12]= 0.0 INFO: acmod.c(153): Reading linear feature transformation from /tmp/kde-xpc//simond/default/sphinx//feature_transform INFO: mdef.c(520): Reading model definition: /tmp/kde-xpc//simond/default/sphinx//mdef INFO: bin_mdef.c(173): Allocating 101196 * 8 bytes (790 KiB) for CD tree INFO: tmat.c(205): Reading HMM transition probability matrices: /tmp/kde-xpc//simond/default/sphinx//transition_matrices INFO: acmod.c(117): Attempting to use SCHMM computation module INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /tmp/kde-xpc//simond/default/sphinx//means INFO: ms_gauden.c(292): 3177 codebook, 1 feature, size: INFO: ms_gauden.c(294): 16x29 INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /tmp/kde-xpc//simond/default/sphinx//variances INFO: ms_gauden.c(292): 3177 codebook, 1 feature, size: INFO: ms_gauden.c(294): 16x29 INFO: ms_gauden.c(354): 522 variance values floored INFO: acmod.c(119): Attempting to use PTHMM computation module INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /tmp/kde-xpc//simond/default/sphinx//means INFO: ms_gauden.c(292): 3177 codebook, 1 feature, size: INFO: ms_gauden.c(294): 16x29 INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /tmp/kde-xpc//simond/default/sphinx//variances INFO: ms_gauden.c(292): 3177 codebook, 1 feature, size: INFO: ms_gauden.c(294): 16x29 INFO: ms_gauden.c(354): 522 variance values floored INFO: ptm_mgau.c(800): Number of codebooks exceeds 256: 3177 INFO: acmod.c(121): Falling back to general multi-stream GMM computation INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /tmp/kde-xpc//simond/default/sphinx//means INFO: ms_gauden.c(292): 3177 codebook, 1 feature, size: INFO: ms_gauden.c(294): 16x29 INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /tmp/kde-xpc//simond/default/sphinx//variances INFO: ms_gauden.c(292): 3177 codebook, 1 feature, size: INFO: ms_gauden.c(294): 16x29 INFO: ms_gauden.c(354): 522 variance values floored INFO: ms_senone.c(160): Reading senone mixture weights: /tmp/kde-xpc//simond/default/sphinx//mixture_weights INFO: ms_senone.c(211): Truncating senone logs3(pdf) values by 10 bits INFO: ms_senone.c(218): Not transposing mixture weights in memory INFO: ms_senone.c(277): Read mixture weights for 3177 senones: 1 features x 16 codewords INFO: ms_senone.c(331): Mapping senones to individual codebooks INFO: ms_mgau.c(122): The value of topn: 4 INFO: dict.c(306): Allocating 4102 * 32 bytes (128 KiB) for word entries INFO: dict.c(321): Reading main dictionary: /tmp/kde-xpc//simond/default/sphinx/default{dde8dba9-eb73-48be-b290-3e19f15fe26f}.dic ERROR: "dict.c", line 194: Line 1: Phone 'f' is mising in the acoustic model; word 'fortsetzen' ignored ERROR: "dict.c", line 194: Line 2: Phone 'p' is mising in the acoustic model; word 'pausieren' ignored ERROR: "dict.c", line 194: Line 3: Phone 's' is mising in the acoustic model; word 'System' ignored INFO: dict.c(212): Allocated 0 KiB for strings, 0 KiB for phones INFO: dict.c(324): 0 words read INFO: dict.c(330): Reading filler dictionary: /tmp/kde-xpc//simond/default/sphinx//noisedict INFO: dict.c(212): Allocated 0 KiB for strings, 0 KiB for phones INFO: dict.c(333): 3 words read INFO: dict2pid.c(396): Building PID tables for dictionary INFO: dict2pid.c(404): Allocating 59^3 * 2 bytes (401 KiB) for word-initial triphones INFO: dict2pid.c(131): Allocated 84016 bytes (82 KiB) for word-final triphones INFO: dict2pid.c(195): Allocated 84016 bytes (82 KiB) for single-phone word triphones INFO: fsg_search.c(145): FSG(beam: -1080, pbeam: -1080, wbeam: -634; wip: -26, pip: 0) INFO: jsgf.c(581): Defined rule: INFO: jsgf.c(581): Defined rule: INFO: jsgf.c(581): Defined rule: INFO: jsgf.c(581): Defined rule: PUBLIC INFO: fsg_model.c(215): Computing transitive closure for null transitions INFO: fsg_model.c(270): 7 null transitions added ERROR: "fsg_search.c", line 332: The word 'System' is missing in the dictionary |
Moderator
|
Hello there,
Simon actually comes with quite a comprehensive handbook. Did you look into that? Best regards, Peter |
Registered users: bartoloni, Bing [Bot], Google [Bot], Sogou [Bot], Yahoo [Bot]