Zero-crossing rate

The zero-crossing rate (ZCR) is the rate of sign changes along a signal, i.e., the rate at which the signal changes from positive to negative or back.[1] This feature has been used heavily in both speech recognition and music information retrieval, where it is a key feature for classifying percussive sounds.[2]

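ZCR is defined formally as

$$ zcr = \frac{1}{T-1} \sum_{t=1}^{T-1} \mathbb{1}_{\mathbb{R}_{<0}}(s_t s_{t-1}) $$

where $s$ is a signal of length $T$ and $\mathbb{1}_{\mathbb{R}_{<0}}$ is an indicator function.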
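As a minimal sketch of this definition, the ZCR of a sampled signal can be computed by counting the adjacent sample pairs whose product is negative and dividing by the number of pairs. The function name `zero_crossing_rate` is illustrative and not taken from any particular library; the signal is assumed to be a plain Python sequence of numbers.

```python
def zero_crossing_rate(signal):
    """Return the fraction of adjacent sample pairs whose product is negative."""
    if len(signal) < 2:
        raise ValueError("need at least two samples")
    # A pair (previous, current) counts as a crossing only if the signs differ
    # strictly, so samples that are exactly zero do not produce a crossing.
    crossings = sum(1 for prev, cur in zip(signal, signal[1:]) if prev * cur < 0)
    return crossings / (len(signal) - 1)


print(zero_crossing_rate([1, -1, -1, 1, 1]))  # 0.5: two crossings in four pairs
```

Because the test is a strict inequality, a signal that touches zero without changing sign contributes nothing to the count in this sketch.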

In some cases only the "positive-going" or "negative-going" crossings are counted rather than all crossings, since between any two adjacent positive-going zero-crossings there must be exactly one negative-going zero-crossing.
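The relationship between the two kinds of crossing can be checked with a short sketch that counts them separately. The name `count_directional_crossings` is hypothetical, and samples that are exactly zero are ignored here as a simplifying assumption.

```python
def count_directional_crossings(signal):
    """Return (positive_going, negative_going) crossing counts."""
    positive_going = 0
    negative_going = 0
    for prev, cur in zip(signal, signal[1:]):
        if prev < 0 and cur > 0:
            positive_going += 1  # signal rises through zero
        elif prev > 0 and cur < 0:
            negative_going += 1  # signal falls through zero
    return positive_going, negative_going


rising, falling = count_directional_crossings([-1, 1, -1, 1, -1, 1])
print(rising, falling)  # 3 2
```

Since the two kinds alternate, the counts never differ by more than one, which is why counting a single direction loses essentially no information.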

For monophonic tonal signals, the zero-crossing rate can be used as a primitive pitch detection algorithm.
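To illustrate the idea, note that a pure tone of frequency f crosses zero about 2f times per second, so dividing the number of crossings per second by two gives a rough pitch estimate. The sketch below rests on that assumption; the function name `estimate_pitch_hz`, the 440 Hz test tone, and the 44100 Hz sample rate are all illustrative choices. It is not a robust pitch detector: noise or strong harmonics add extra crossings and bias the result.

```python
import math


def estimate_pitch_hz(signal, sample_rate):
    """Rough pitch estimate: half the number of zero crossings per second."""
    if len(signal) < 2:
        raise ValueError("need at least two samples")
    crossings = sum(1 for prev, cur in zip(signal, signal[1:]) if prev * cur < 0)
    duration_s = (len(signal) - 1) / sample_rate  # time spanned by the samples
    return crossings / (2.0 * duration_s)


# One second of a 440 Hz sine; the small phase offset avoids exact-zero samples.
sample_rate = 44100
tone = [math.sin(2 * math.pi * 440 * n / sample_rate + 0.1) for n in range(sample_rate)]
print(estimate_pitch_hz(tone, sample_rate))  # close to 440
```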

Applications

Zero-crossing rates are used for voice activity detection (VAD), i.e., determining whether human speech is present in an audio segment.
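Applications of this kind typically evaluate the ZCR on short frames rather than on the whole recording. The following sketch computes one ZCR value per frame; the name `framewise_zcr` and the `frame_length` and `hop_length` parameters are assumptions made for illustration. The decision rule that turns these values into a speech/non-speech label is deliberately left out, since it depends on the application and usually involves other features as well.

```python
def framewise_zcr(signal, frame_length, hop_length):
    """Return one zero-crossing rate per complete frame of the signal."""
    if frame_length < 2 or hop_length < 1:
        raise ValueError("frame_length must be >= 2 and hop_length >= 1")
    rates = []
    # Only complete frames are used; a trailing partial frame is dropped.
    for start in range(0, len(signal) - frame_length + 1, hop_length):
        frame = signal[start:start + frame_length]
        crossings = sum(1 for prev, cur in zip(frame, frame[1:]) if prev * cur < 0)
        rates.append(crossings / (frame_length - 1))
    return rates


# First frame alternates in sign on every sample, second frame never does.
samples = [1, -1] * 4 + [1] * 8
print(framewise_zcr(samples, frame_length=8, hop_length=8))  # [1.0, 0.0]
```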

References

  1. Chen, C. H., Signal processing handbook, Dekker, New York, 1988.
  2. Gouyon F., Pachet F., Delerue O. (2000), Classifying percussive sounds: a matter of zero-crossing rate?, in Proceedings of the COST G-6 Conference on Digital Audio Effects (DAFX-00), Verona, Italy, December 7–9, 2000. Accessed 26 April 2011.


This article is issued from Wikipedia - version of the 11/2/2016. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.