Programmer Guide/Command Reference/EVAL/formants: Difference between revisions

From STX Wiki
Jump to navigationJump to search
m (1 revision: Initial import)
No edit summary
Line 1: Line 1:
{{DISPLAYTITLE:{{SUBPAGENAME}}}}
{{DISPLAYTITLE:{{SUBPAGENAME}}}}
=====formants=====
Format tracking algorithm.
Format tracking algorithm.
;Usage:<code>formants(<var>cf</var>, <var>ca</var>, <var>f0</var>, <var>mflag</var>, <var>bias</var>, <var>lmin</var>)</code>
:;<var>cf</var>:a matrix containing the frequencies of the formant candidates in Hz
:;<var>ca</var>:a matrix containing the amplitudes of formant candidates in dB
::*each row of ''cf''/''ca'' contains the candidates of one frame
::*number of frames: N=<code>nrow(''cf'')</code>
::*maximum number of candidates M=<code>ncol(''cf'')</code>
::*''ca'' must be a matrix with the same dimensions as ''cf'' or a scalar (if amplitudes should not be used for formant tracking)
:;<var>f0</var>:f0 vector
::*''f0'' must be a vector with length N containing the f0 values or a scalar (if f0 should not be used for formant tracking)
::*if ''f0'' is supplied, the formant tracking is only applied to voiced regions
:;<var>mflag</var>selects the tracking method
::*currently only one method (''mflag''=0) is implemented
::*a second method using a formant modell is under development
:;<var>bias</var>:tracking bias in Hz; This value is used to check if two candidates can be connected (<code>|cf[i1,j1]-cf[i2,j2]) / (cf[i1,j1]+cf[i2,j2]+2*bias)</code> must be lower than '''0.1''').
:;<var>lmin</var>:the minimum length of a track in frames
;Result: The f0 vector. The vector contains the f0 values in Hz. The vector length depends mainly on the length of the signal vector ''x'' and the hopsize ''dt''. For unvoiced frames the f0 values are set to zero.
;See also: [[../f0ac|f0ac]], [[../lpc|lpc]], [[../cepstrum|cepstrum]], [[../ipeaks|ipeaks]],  [[Programmer_Guide/SPU_Reference/FORMANTS]]


=====Usage:=====
[[../#Functions|<function list>]]
 
<code>formants(<var>type</var>, <var>fcd</var>, <var>f0</var>, <var>fmd</var>, <var>dt</var>, <var>kdt</var>, <var>minsd</var>)</code>
 
=====Parameters:=====
 
;<var>type</var>
 
:The type of formant candidate array:
 
:<code>0</code> - frequencies only. <code>fcd(i,j)</code> = frq of candidate j in frame i
 
:<code>1</code> - frequencies and amplitudes. <code>fcd(i,2*j)</code> = frq of candidate j in frame i
 
:<code>fcd(i,2*j+1)</code> = mp of candidate j in frame i
 
;<var>fcd</var>
 
:The array of formant candidates, format defined by the argument <var>type</var>.
 
;<var>f0</var>
 
:The f0 vector.
 
;<var>fmd</var>
 
:The formant model parameters:
 
:fmd(i,0) = minimum frequency of formant i
 
:fmd(i,1) = maximum frequency of formant i
 
:fmd(i,2) = maximum frequency slope of formant i (Hz / kdt)
 
:fmd(i,3) = maximum amplitude slope of formant i (dB / kdt)
 
;<var>dt</var>
 
:The hopsize.
 
;<var>kdt</var>
 
:The reference time for the slope values of the model (in seconds). The default is the same as <var>dt</var>.
 
;<var>minsd</var>
 
:The minimum duration of the tracking sections (in seconds). The default is <code>2 *</code> <var>dt</var>.
 
=====Function:=====
 
Formant tracking algorithm.
 
=====Result:=====
 
The formant frequency tracks.
 
=====Return Type:=====
 
array

Revision as of 14:49, 20 April 2011

Format tracking algorithm.

Usage
formants(cf, ca, f0, mflag, bias, lmin)
cf
a matrix containing the frequencies of the formant candidates in Hz
ca
a matrix containing the amplitudes of formant candidates in dB
  • each row of cf/ca contains the candidates of one frame
  • number of frames: N=nrow(cf)
  • maximum number of candidates M=ncol(cf)
  • ca must be a matrix with the same dimensions as cf or a scalar (if amplitudes should not be used for formant tracking)
f0
f0 vector
  • f0 must be a vector with length N containing the f0 values or a scalar (if f0 should not be used for formant tracking)
  • if f0 is supplied, the formant tracking is only applied to voiced regions
mflagselects the tracking method
  • currently only one method (mflag=0) is implemented
  • a second method using a formant modell is under development
bias
tracking bias in Hz; This value is used to check if two candidates can be connected (|cf[i1,j1]-cf[i2,j2]) / (cf[i1,j1]+cf[i2,j2]+2*bias) must be lower than 0.1).
lmin
the minimum length of a track in frames
Result
The f0 vector. The vector contains the f0 values in Hz. The vector length depends mainly on the length of the signal vector x and the hopsize dt. For unvoiced frames the f0 values are set to zero.
See also
f0ac, lpc, cepstrum, ipeaks, Programmer_Guide/SPU_Reference/FORMANTS

<function list>

Navigation menu

Personal tools