Programmer Guide/Command Reference/EVAL/f0ac: Difference between revisions
From STX Wiki
< Programmer Guide | Command Reference | EVAL
Jump to navigationJump to search
No edit summary |
No edit summary |
||
(2 intermediate revisions by the same user not shown) | |||
Line 5: | Line 5: | ||
:;<var>sr</var>:sampling rate in Hertz. | :;<var>sr</var>:sampling rate in Hertz. | ||
:;<var>dt</var>:analysis hopsize in seconds; If this argument equals '''0''', the hopsize is set to <code>''nppw'' / ''minf0'' / 4</code> (default=0) | :;<var>dt</var>:analysis hopsize in seconds; If this argument equals '''0''', the hopsize is set to <code>''nppw'' / ''minf0'' / 4</code> (default=0) | ||
:;<var>minf0</var>minimum f0 in Hz (default=75) | :;<var>minf0</var>:minimum f0 in Hz (default=75) | ||
:;<var>maxf0</var>maximum f0 in Hz (default=400) | :;<var>maxf0</var>:maximum f0 in Hz (default=400) | ||
:;<var>nppw</var>number of f0 periods per analysis window (default=3) | :;<var>nppw</var>:number of f0 periods per analysis window (default=3) | ||
:;<var>nMaxCandidates</var>:maximum number of f0 candidates (default=15) | :;<var>nMaxCandidates</var>:maximum number of f0 candidates (default=15) | ||
:;<var>thSilence</var>:silence threshold (default=0.05) | :;<var>thSilence</var>:silence threshold (default=0.05) | ||
:;<var>thVoice</var>:voicing threshold (default=0.4) | :;<var>thVoice</var>:voicing threshold (default=0.4) | ||
:;<var>costOct</var>:octave cost (default=0.01) | :;<var>costOct</var>:octave cost (default=0.01) | ||
:;<var>costOctJump</var>octave jump cost (default=0.4) | :;<var>costOctJump</var>:octave jump cost (default=0.4) | ||
:;<var>costVUV</var>:voiced/unvoiced transition cost (default=0.2) | :;<var>costVUV</var>:voiced/unvoiced transition cost (default=0.2) | ||
;Result: The f0 vector. The vector contains the f0 values in Hz. The vector length depends mainly on the length of the signal vector ''x'' and the hopsize ''dt''. For unvoiced frames the f0 values are set to zero. | ;Result: The f0 vector. The vector contains the f0 values in Hz. The vector length depends mainly on the length of the signal vector ''x'' and the hopsize ''dt''. For unvoiced frames the f0 values are set to zero. | ||
;See also: [[../formants|formants]], [[Programmer_Guide/SPU_Reference/ | ;See also: [[../formants|formants]], [[Programmer_Guide/SPU_Reference/F0AC|SPU F0AC]] | ||
[[../#Functions|<function list>]] | [[../#Functions|<function list>]] |
Latest revision as of 09:52, 1 October 2015
This function implements a F0 detection algorithm using autocorrelation, as described by Paul Boersma (1993: Accurate Short-Term Analysis of the Fundamental Frequency and the Harmonics-To-Noise Ratio of a Sampled Sound. Proceedings of the IFA. Vol 17. pp 97-110).
- Usage
f0ac(x, sr, dt, minf0, maxf0, nppw, nMaxCandidates, thSilence, thVoice, costOct, costOctJump, costVUV)
- x
- signal vector.
- sr
- sampling rate in Hertz.
- dt
- analysis hopsize in seconds; If this argument equals 0, the hopsize is set to
nppw / minf0 / 4
(default=0) - minf0
- minimum f0 in Hz (default=75)
- maxf0
- maximum f0 in Hz (default=400)
- nppw
- number of f0 periods per analysis window (default=3)
- nMaxCandidates
- maximum number of f0 candidates (default=15)
- thSilence
- silence threshold (default=0.05)
- thVoice
- voicing threshold (default=0.4)
- costOct
- octave cost (default=0.01)
- costOctJump
- octave jump cost (default=0.4)
- costVUV
- voiced/unvoiced transition cost (default=0.2)