Programmer Guide/Command Reference/EVAL/f0ac: Difference between revisions

m (1 revision: Initial import)
No edit summary
 
(4 intermediate revisions by the same user not shown)
Line 1: Line 1:
{{DISPLAYTITLE:{{SUBPAGENAME}}}}
=====f0ac=====
This function implements an F0 detection algorithm using autocorrelation, as described by Paul Boersma (1993: Accurate Short-Term Analysis of the Fundamental Frequency and the Harmonics-To-Noise Ratio of a Sampled Sound. Proceedings of the IFA. Vol 17. pp 97-110).
;Usage:<code>f0ac(<var>x</var>, <var>sr</var>, <var>dt</var>, <var>minf0</var>, <var>maxf0</var>, <var>nppw</var>, <var>nMaxCandidates</var>, <var>thSilence</var>, <var>thVoice</var>, <var>costOct</var>, <var>costOctJump</var>, <var>costVUV</var>)</code>
:;<var>x</var>:signal vector.
:;<var>sr</var>:sampling rate in Hertz.
:;<var>dt</var>:analysis hop size in seconds; if this argument equals '''0''', the hop size is set to <code>''nppw'' / ''minf0'' / 4</code> (default=0)
:;<var>minf0</var>:minimum f0 in Hz (default=75)
:;<var>maxf0</var>:maximum f0 in Hz (default=400)
:;<var>nppw</var>:number of f0 periods per analysis window (default=3)
:;<var>nMaxCandidates</var>:maximum number of f0 candidates (default=15)
:;<var>thSilence</var>:silence threshold (default=0.05)
:;<var>thVoice</var>:voicing threshold (default=0.4)
:;<var>costOct</var>:octave cost (default=0.01)
:;<var>costOctJump</var>:octave jump cost (default=0.4)
:;<var>costVUV</var>:voiced/unvoiced transition cost (default=0.2)
;Result: The f0 vector, with values in Hz. Its length depends mainly on the length of the signal vector ''x'' and the hop size ''dt''. For unvoiced frames, the f0 value is set to zero.
;See also: [[../formants|formants]], [[Programmer_Guide/SPU_Reference/F0AC|SPU F0AC]]
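The default hop size rule for <var>dt</var> can be checked with a few lines of arithmetic. The following is a minimal Python sketch, not STx code: the helper names are hypothetical, and the frame-count estimate is only an assumption (about one frame per hop), since the page states merely that the result length depends mainly on the signal length and the hop size.

```python
def default_hopsize(nppw=3, minf0=75.0):
    """Hop size in seconds used when dt == 0: nppw / minf0 / 4."""
    return nppw / minf0 / 4


def effective_hopsize(dt=0.0, nppw=3, minf0=75.0):
    """Return dt unchanged unless it is 0, in which case apply the default rule."""
    return default_hopsize(nppw, minf0) if dt == 0 else dt


def approx_frame_count(n_samples, sr, dt=0.0, nppw=3, minf0=75.0):
    """Rough estimate only (assumption): about one frame per hop.

    The exact length of the vector returned by f0ac may differ,
    for example because of how the signal edges are handled.
    """
    hop = effective_hopsize(dt, nppw, minf0)
    return round(n_samples / sr / hop)


if __name__ == "__main__":
    hop = effective_hopsize()
    print(f"default hop size: {hop:.4f} s")
    print("approx. frames for 1 s at 16 kHz:", approx_frame_count(16000, 16000))
```

With the documented defaults (<var>nppw</var>=3, <var>minf0</var>=75) the rule gives a hop size of 10 ms; raising <var>minf0</var> shortens the hop, and lowering it lengthens the hop.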


F0 detection using autocorrelation, as described by Paul Boersma (1993: Accurate Short-Term Analysis of the Fundamental Frequency and the Harmonics-To-Noise Ratio of a Sampled Sound. Proceedings of the IFA. Vol 17. pp 97-110).
[[../#Functions|<function list>]]
 
=====Usage:=====
 
<code>f0ac(<var>x</var>, <var>sr</var>, <var>hopsize</var>, <var>minf0</var>, <var>maxf0</var>, <var>nppw</var>, <var>nMaxCandidates</var>, <var>thSilence</var>, <var>thVoice</var>, <var>costOct</var>, <var>costOctJump</var>, <var>costVUV</var>)</code>
 
=====Parameters:=====
 
;<var>x</var>
 
:The signal vector.
 
;<var>sr</var>
 
:The sampling rate in Hertz.
 
;<var>hopsize</var>
 
:The hop size in seconds. The default is <code>0</code>.
 
;<var>minf0</var>
 
:The minimum f0 in Hertz. The default is <code>75</code>.
 
;<var>maxf0</var>
 
:The maximum f0 in Hertz. The default is <code>400</code>.
 
;<var>nppw</var>
 
:The number of periods per window. The default is <code>3</code>.
 
;<var>nMaxCandidates</var>
 
:The maximum number of candidates. The default is <code>15</code>.
 
;<var>thSilence</var>
 
:The silence threshold. The default is <code>0.05</code>.
 
;<var>thVoice</var>
 
:The voicing threshold. The default is <code>0.4</code>.
 
;<var>costOct</var>
 
:The octave cost. The default is <code>0.01</code>.
 
;<var>costOctJump</var>
 
:The octave jump cost. The default is <code>0.4</code>.
 
;<var>costVUV</var>
 
:The voiced/unvoiced transition cost. The default is <code>0.2</code>.
 
=====Result:=====
 
An f0 vector in Hertz.

Latest revision as of 09:52, 1 October 2015

This function implements an F0 detection algorithm using autocorrelation, as described by Paul Boersma (1993: Accurate Short-Term Analysis of the Fundamental Frequency and the Harmonics-To-Noise Ratio of a Sampled Sound. Proceedings of the IFA. Vol 17. pp 97-110).

Usage
f0ac(x, sr, dt, minf0, maxf0, nppw, nMaxCandidates, thSilence, thVoice, costOct, costOctJump, costVUV)
x
signal vector.
sr
sampling rate in Hertz.
dt
analysis hop size in seconds; if this argument equals 0, the hop size is set to nppw / minf0 / 4 (default=0)
minf0
minimum f0 in Hz (default=75)
maxf0
maximum f0 in Hz (default=400)
nppw
number of f0 periods per analysis window (default=3)
nMaxCandidates
maximum number of f0 candidates (default=15)
thSilence
silence threshold (default=0.05)
thVoice
voicing threshold (default=0.4)
costOct
octave cost (default=0.01)
costOctJump
octave jump cost (default=0.4)
costVUV
voiced/unvoiced transition cost (default=0.2)
Result
The f0 vector, with values in Hz. Its length depends mainly on the length of the signal vector x and the hop size dt. For unvoiced frames, the f0 value is set to zero.
See also
formants, SPU F0AC
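
Because unvoiced frames are reported as zero, statistics over the result should normally skip those frames. The following is a minimal Python sketch of that post-processing, not STx code: the helper names and the sample track are hypothetical, and it assumes the f0 vector has already been exported as a plain list of numbers.

```python
def voiced_only(f0):
    """Keep only voiced frames; f0ac marks unvoiced frames with 0."""
    return [v for v in f0 if v > 0]


def voiced_fraction(f0):
    """Share of frames that are voiced (0.0 for an empty track)."""
    return len(voiced_only(f0)) / len(f0) if f0 else 0.0


def mean_voiced_f0(f0):
    """Mean f0 in Hz over voiced frames, or None if no frame is voiced."""
    voiced = voiced_only(f0)
    return sum(voiced) / len(voiced) if voiced else None


if __name__ == "__main__":
    # Hypothetical f0 track in Hz; zeros stand for unvoiced frames.
    track = [0.0, 0.0, 120.0, 122.0, 118.0, 0.0]
    print("voiced fraction:", voiced_fraction(track))
    print("mean voiced f0 (Hz):", mean_voiced_f0(track))
```

Averaging the raw vector instead would pull the mean toward zero in proportion to the number of unvoiced frames.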
