Current Work

The existing work in the literature is being reviewed. Source localization and beamforming was applied to existing TCS recordings. We initially take forward this work for a scenario of non-overlapping speakers. Post-filtering is done for single-channel recording of the microphone array using CNMF model with/without NMF model for speech. For illustration, sound files of a recorded non-overlapping speaker and it's beamformed output is given (obtained from the system mentioned in the proposed model):

Distant speaker
DSB (beamforming)
CNMF-NMF (Post-filtering)

ASR needs to be performed to compare these methods with respect to the obtained word error rate (WER) ( for the recordings being shared on the google drive). Post filtering was done on mono-channel .wav files by considering a channel recording of microphone array. After obtaining the ASR results for the recordings being shared, the following needs to be updated:

MethodCDf-SNRSRMRWER
Without beamforming2.574.306.60 ?
 DSB2.487.708.22 ?
 MVDR2.336.236.77 ?
 LCMV2.446.107.40 ?
 SDB2.705.706.90 ?
Single-channel
enhancement methods
WER
CNMF   ?
CNMF-NMF   ?