IITB-TCS

Current Work

The existing work in the literature is being reviewed. Source localization and beamforming was applied to existing TCS recordings. We initially take forward this work for a scenario of non-overlapping speakers. Post-filtering is done for single-channel recording of the microphone array using CNMF model with/without NMF model for speech. For illustration, sound files of a recorded non-overlapping speaker and it's beamformed output is given (obtained from the system mentioned in the proposed model):

	Distant speaker
	DSB (beamforming)
	CNMF-NMF (Post-filtering)

ASR needs to be performed to compare these methods with respect to the obtained word error rate (WER) ( for the recordings being shared on the google drive). Post filtering was done on mono-channel .wav files by considering a channel recording of microphone array. After obtaining the ASR results for the recordings being shared, the following needs to be updated:

Method	CD	f-SNR	SRMR	WER
Without beamforming	2.57	4.30	6.60	?
DSB	2.48	7.70	8.22	?
MVDR	2.33	6.23	6.77	?
LCMV	2.44	6.10	7.40	?
SDB	2.70	5.70	6.90	?

Single-channel enhancement methods	WER
CNMF	?
CNMF-NMF	?

TCS Project -Distant speaker recognition using microphone array

Electrical Engineering Dept. IIT Bombay, Powai

Latest Activity

None

Note: Recordings will be shared on Google Drive

Useful Links

Current Work