Programmer Guide/Command Reference/EVAL/formants: Difference between revisions

From STX Wiki

Latest revision as of 19:16, 21 April 2011

Formant tracking algorithm.

Usage
formants(cf, ca, f0, mflag, bias, lmin)
cf
a matrix containing the frequencies of the formant candidates in Hz
  • number of frames: N=nrow(cf)
  • maximum number of candidates M=ncol(cf)
  • each row of cf contains the candidates of one frame
  • a zero ends the candidate list of a frame: if cf[i,k]=0 (0≤i<N, 0≤k<M), frame i contains k candidates
  • the frequencies of frame i must be in ascending order: cf[i,j]<cf[i,j+1] (0≤j<k-1)
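The layout rules for cf above can be checked before calling formants. The following is a minimal Python sketch, not STX code; the helper names candidate_counts and is_ascending and the sample values are hypothetical and serve only to illustrate the zero-terminated, ascending row layout.

```python
from typing import List


def candidate_counts(cf: List[List[float]]) -> List[int]:
    """Return the number of valid candidates in each frame (row) of cf.

    The first zero in a row ends that row's candidate list; a row
    without a zero holds the maximum of M candidates.
    """
    counts = []
    for row in cf:
        k = len(row)
        for idx, value in enumerate(row):
            if value == 0:
                k = idx
                break
        counts.append(k)
    return counts


def is_ascending(cf: List[List[float]]) -> bool:
    """Check that the valid candidates of every frame are strictly ascending."""
    for row, k in zip(cf, candidate_counts(cf)):
        for j in range(k - 1):
            if not row[j] < row[j + 1]:
                return False
    return True


# Hypothetical data: N = 3 frames, at most M = 4 candidates per frame.
cf = [
    [310.0, 1250.0, 2400.0, 3300.0],  # 4 candidates, row is full
    [320.0, 1300.0, 0.0, 0.0],        # 2 candidates, zero at column 2
    [0.0, 0.0, 0.0, 0.0],             # no candidates in this frame
]
print(candidate_counts(cf))
print(is_ascending(cf))
```

A frame whose row contains no zero simply uses all M columns, so no extra terminator column is needed.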
ca
a matrix containing the amplitudes of formant candidates in dB
  • ca must be a matrix with the same dimensions as cf
    or a scalar, if amplitudes should not be used for formant tracking
f0
f0 vector
  • f0 must be a vector with length N containing the f0 values in Hz
    or a scalar, if f0 should not be used for formant tracking
  • if f0 is supplied, the formant tracking is only applied to voiced regions
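To illustrate what restricting the tracking to voiced regions means, the Python sketch below splits an f0 vector into runs of voiced frames. It assumes that unvoiced frames are marked by an f0 value of zero; this page does not state how formants itself detects voicing, and the function name voiced_regions is hypothetical.

```python
from typing import List, Tuple


def voiced_regions(f0: List[float]) -> List[Tuple[int, int]]:
    """Return (start, end) frame index pairs, end exclusive, of runs with f0 > 0.

    Assumption: a frame counts as unvoiced when its f0 value is zero.
    """
    regions = []
    start = None
    for i, value in enumerate(f0):
        if value > 0 and start is None:
            start = i                      # a voiced run begins here
        elif value <= 0 and start is not None:
            regions.append((start, i))     # the run ended at the previous frame
            start = None
    if start is not None:
        regions.append((start, len(f0)))   # the run reaches the last frame
    return regions


# Hypothetical f0 vector in Hz for N = 6 frames.
print(voiced_regions([0.0, 120.0, 125.0, 0.0, 0.0, 110.0]))
```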
mflag
selects the tracking method
  • currently only one method (mflag=0) is implemented (see sp-atom FORMANTS for more details)
  • a second method using a formant model is under development
bias
tracking bias in Hz
lmin
the minimum length of a track in frames
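An earlier revision of this page described how bias is used: two candidates can be connected if |cf[i1,j1]-cf[i2,j2]| / (cf[i1,j1]+cf[i2,j2]+2*bias) is lower than 0.1. The Python sketch below illustrates that test and a simple lmin filter. It is an assumption-based illustration, not the STX implementation: the names can_connect and keep_long_tracks are hypothetical, and whether the test still applies unchanged in the current version is not confirmed here.

```python
from typing import List


def can_connect(fa: float, fb: float, bias: float, limit: float = 0.1) -> bool:
    """Relative-distance test between two candidate frequencies in Hz.

    A larger bias enlarges the denominator, so the test becomes more
    tolerant, most noticeably for low frequencies.
    Assumes positive frequencies and a non-negative bias.
    """
    return abs(fa - fb) / (fa + fb + 2.0 * bias) < limit


def keep_long_tracks(tracks: List[List[float]], lmin: int) -> List[List[float]]:
    """Keep only tracks that span at least lmin frames (one value per frame)."""
    return [track for track in tracks if len(track) >= lmin]


# 300 Hz and 400 Hz are too far apart without bias, but connect with bias = 200 Hz.
print(can_connect(300.0, 400.0, bias=0.0))
print(can_connect(300.0, 400.0, bias=200.0))

# Hypothetical tracks; with lmin = 3 the two-frame track is dropped.
tracks = [[500.0, 510.0, 505.0, 498.0], [1500.0, 1520.0]]
print(keep_long_tracks(tracks, lmin=3))
```

With bias=0 the test is a pure relative distance; adding bias makes the same absolute difference acceptable at low frequencies while changing little at high ones.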
Result
A matrix with N rows and L columns. Each column l contains the formant track with order l+1. L is the maximum formant order or the maximum number of parallel tracks.
See also
f0ac, lpc, cepstrum, ipeak, sp-atom FORMANTS

