Programmer Guide/Command Reference/EVAL/em: Difference between revisions
From STX Wiki
< Programmer Guide | Command Reference | EVAL
Jump to navigationJump to search
(Created page with '{{DISPLAYTITLE:{{SUBPAGENAME}}}} Implementation of the Expectation Maximization (EM) algorithm for finding maximum likelihood estimates of the parameters of a Gaussian Mixture Mo…') |
No edit summary |
||
(2 intermediate revisions by the same user not shown) | |||
Line 14: | Line 14: | ||
::*if '''1''', the accurate log-likelihood estimation is used | ::*if '''1''', the accurate log-likelihood estimation is used | ||
::*otherwise use the Intel IPP fast log. add method | ::*otherwise use the Intel IPP fast log. add method | ||
;Result: The estimated model parameters are stored in the references ''meantable'', ''sigmatable'', ''wtable'' and (optional) ''aposterioritable''. The return value | ;Result: The estimated model parameters are stored in the references ''meantable'', ''sigmatable'', ''wtable'' and (optional) ''aposterioritable''. The return value ''r'' is vector with 10 elements, containing informations about the estimation process. | ||
::{|class="einrahmen" | ::{|class="einrahmen" | ||
|''r''[0] ||number of feature vectors | |''r''[0] ||number of feature vectors | ||
|''r''[ | |- | ||
|''r''[ | |''r''[1] ||number of dimensions | ||
|''r''[ | |- | ||
|''r''[ | |''r''[2] ||number of mixture components | ||
|''r''[ | |- | ||
|''r''[ | |''r''[3] ||number of iterations performed by the algorithm | ||
|''r''[ | |- | ||
|''r''[ | |''r''[4] ||joint log-likelihood of the data in the estimated model | ||
|''r''[ | |- | ||
|''r''[5] ||'''-1''' (initialisation max. iterations, apparently unused) | |||
|- | |||
|''r''[6] ||maximum number of iterations | |||
|- | |||
|''r''[7] ||log-likelihood threshold | |||
|- | |||
|''r''[8] ||value of the "accurate" parameter | |||
|- | |||
|''r''[9] ||'''0''' (= method used is EM for diagonal covariance matrices) | |||
|} | |} | ||
;See also: [[../haclust|haclust]], [[../modclust|modclust]], [[../density|density]], [[../svd|svd]] | ;See also: [[../haclust|haclust]], [[../modclust|modclust]], [[../density|density]], [[../svd|svd]] | ||
[[../#Functions|<function list>]] | [[../#Functions|<function list>]] |
Latest revision as of 07:19, 21 April 2011
Implementation of the Expectation Maximization (EM) algorithm for finding maximum likelihood estimates of the parameters of a Gaussian Mixture Model (GMM) (i.e. estimates the model parameters for which the observed data are most likely).
- Usage
em(x, meantable, membertable, sigmatable, wtable, aposterioritable, minvar, emmaxit, likthresh {, accurate})
- x
- feature vectors, 1 feature vector per row ((Nvectors x Ndimensions)
- meantable
- output matrix for estimated mean vectors (Nmixtures x Ndimensions)
- membertable
- vector with Nvectors elements, defining the feature vector memberships to mixtures, resulting from prior clustering
- sigmatable
- output matrix for estimated covariance vectors (i.e. diagonal of covariance matrices) of mixture model (Nmixtures x Ndimensions)
- wtable
- output vector of estimated mixture weights (Nmixtures)
- aposterioritable
- if a table item is given, it will contain the output a posteriori probabilities of the feature vectors, otherwise (if a number is given) it won't be changed
- minvar
- minimum variance
- emmaxit
- maximum number of iterations
- liktresh
- log-likelihood threshold; if the difference between log-likelihoods of two consecutive iterations is less than this threshold, the algorithm halts
- accurate
-
- if 1, the accurate log-likelihood estimation is used
- otherwise use the Intel IPP fast log. add method
- Result
- The estimated model parameters are stored in the references meantable, sigmatable, wtable and (optional) aposterioritable. The return value r is vector with 10 elements, containing informations about the estimation process.
r[0] number of feature vectors r[1] number of dimensions r[2] number of mixture components r[3] number of iterations performed by the algorithm r[4] joint log-likelihood of the data in the estimated model r[5] -1 (initialisation max. iterations, apparently unused) r[6] maximum number of iterations r[7] log-likelihood threshold r[8] value of the "accurate" parameter r[9] 0 (= method used is EM for diagonal covariance matrices)