An Efficient Optimal Reconstruction Based Speech Separation Based on Hybrid Deep Learning Technique

Yannam Vasantha Koteswararao; C.B. Rama Rao

doi:10.14429/dsj.72.17640

Authors

Yannam Vasantha Koteswararao National Institute Of Technology, Waranagal https://orcid.org/0000-0003-2040-5470
C.B. Rama Rao Dept. of ECE, National Institute of Technology, Warangal, Telangana, India

DOI:

https://doi.org/10.14429/dsj.72.17640

Keywords:

Deep learning, Integral fox ride optimization, Hybrid retrieval, Speech separation, Optimal reconstruction

Abstract

Conventional single-channel speech separation has two long-standing issues. The first issue, over-smoothing, is addressed, and estimated signals are used to expand the training data set. Second, DNN generates prior knowledge to address the problem of incomplete separation and mitigate speech distortion. To overcome all current issues, we suggest employing an efficient optimal reconstruction-based speech separation (ERSS) to overcome those problems using a hybrid deep learning technique. First, we propose an integral fox ride optimization (IFRO) algorithm for spectral structure reconstruction with the help of multiple spectrum features: time dynamic information, binaural and mono features. Second, we introduce a hybrid retrieval-based deep neural network (RDNN) to reconstruct the spectrograms size of speech and noise directly. The input signals are sent to Short Term Fourier Transform (STFT). STFT converts a clean input signal into spectrograms then uses a feature extraction technique called IFRO to extract features from spectrograms. After extracting the features, using the RDNN classification algorithm, the classified features are converted to softmax. ISTFT then applies to softmax and correctly separates speech signals. Experiments show that our proposed method achieves the highest gains in SDR, SIR, SAR STIO, and PESQ outcomes of 10.9, 15.3, 10.8, 0.08, and 0.58, respectively. The Joint-DNN-SNMF obtains 9.6, 13.4, 10.4, 0.07, and 0.50, comparable to the Joint-DNN-SNMF. The proposed result is compared to a different method and some previous work. In comparison to previous research, our proposed methodology yields better results.

An Efficient Optimal Reconstruction Based Speech Separation Based on Hybrid Deep Learning Technique

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

License

Most read articles by the same author(s)

Information

Make a Submission