Robust Voice Activity Detection for Interview Speech in NIST Speaker Recognition Evaluation

The introduction of interview speech in recent NIST Speaker Recognition Evaluations (SREs) has necessitated the development of robust voice activity detectors (VADs) that can work at very low signal-to-noise ratios (SNRs). This paper highlights the characteristics of interview speech files in NIST SREs and discusses the difficulties of detecting speech/non-speech segments in these files. To alleviate these difficulties, this paper proposes a VAD that uses noise reduction as a pre-processing step. A strategy to avoid the undesirable effects of impulsive signals and sinusoidal background signals on the VAD is also proposed. The proposed VAD is compared with the VAD in the ETSI-AMR speech coder for removing silence regions of interview speech files. The results show that the proposed VAD is more robust in detecting speech segments at very low SNRs.
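The idea of running noise reduction before the speech/non-speech decision can be illustrated with a minimal sketch. The abstract does not say which noise-reduction algorithm or decision rule the paper uses, so everything below is an assumption made for illustration: plain magnitude spectral subtraction stands in for the noise-reduction step, a fixed energy threshold stands in for the decision rule, and the first few frames are assumed to contain noise only. The function names `denoised_frame_energies` and `simple_vad` and all parameter values are hypothetical, and the handling of impulsive and sinusoidal background signals is not covered.

```python
import numpy as np


def denoised_frame_energies(signal, frame_len=256, hop=128,
                            noise_frames=10, floor=0.01):
    """Per-frame energy after magnitude spectral subtraction.

    Assumption: the first `noise_frames` frames contain noise only, so
    their average magnitude spectrum serves as the noise estimate.
    """
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    # Slice the signal into overlapping, windowed frames.
    frames = np.stack([signal[i * hop:i * hop + frame_len] * window
                       for i in range(n_frames)])
    mag = np.abs(np.fft.rfft(frames, axis=1))
    noise_mag = mag[:noise_frames].mean(axis=0)
    # Subtract the noise estimate, keeping a small spectral floor so
    # magnitudes never become negative.
    clean_mag = np.maximum(mag - noise_mag, floor * noise_mag)
    energies = (clean_mag ** 2).sum(axis=1)
    noise_energy = (noise_mag ** 2).sum()
    return energies, noise_energy


def simple_vad(signal, ratio=0.5, **kwargs):
    """Flag a frame as speech when its denoised energy exceeds
    `ratio` times the estimated noise energy (illustrative rule only)."""
    energies, noise_energy = denoised_frame_energies(signal, **kwargs)
    return energies > ratio * noise_energy
```

Calling `simple_vad(x)` on a one-dimensional NumPy array `x` returns one boolean per frame. Subtracting the noise estimate first lowers the energy of noise-only frames relative to frames that contain a signal, which is what lets a single fixed threshold separate the two in this toy setting.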
