Statistics Pooling Time Delay Neural Network Based On X-Vector For Speaker Verification

This paper aims to improve the x-vector speaker embedding so that it captures more detailed information for speaker verification. We propose a statistics pooling time delay neural network (TDNN), in which the TDNN structure integrates stati
