Abstract

Speech recognition has the problem of low recognition accuracy because of poor denoising effect and low endpoint detection accuracy. Therefore, a new intelligent speech multifeature recognition method based on deep machine learning is proposed. In this method, speech signals are digitally processed, a first-order finite impulse response (FIR) high pass digital filter is used to preemphasize digital speech signals, and short-term energy and zero crossing rate are combined to detect speech signals to expand endpoints. The detected speech signal is input into the depth autoencoder, and the features of the speech signal are extracted through deep learning. The Gaussian mixture model of deep machine learning is constructed using a continuous distribution hidden Markov model, and the extracted features are input into the model to complete feature recognition. The experimental results show that the proposed method has high endpoint detection accuracy, good denoising effect, and high recognition accuracy, and this method has higher application value.

References

1.
Müezzinoğlu
T.
and
Karaköse
M.
, “
An Intelligent Human–Unmanned Aerial Vehicle Interaction Approach in Real Time Based on Machine Learning Using Wearable Gloves
,”
Sensors
21
, no. 
5
(March
2021
): 1766, https://doi.org/10.3390/s21051766
2.
Chi
O. H.
,
Jia
S.
,
Li
Y.
, and
Gursoy
D.
, “
Developing a Formative Scale to Measure Consumers’ Trust toward Interaction with Artificially Intelligent (AI) Social Robots in Service Delivery
,”
Computers in Human Behavior
118
(May
2021
): 106700, https://doi.org/10.1016/j.chb.2021.106700
3.
Sun
Z.
and
Tang
P.
, “
Automatic Communication Error Detection Using Speech Recognition and Linguistic Analysis for Proactive Control of Loss of Separation
,”
Transportation Research Record
2675
, no. 
5
(May
2021
):
1
12
, https://doi.org/10.1177/0361198120983004
4.
Lin
L.
and
Tan
L.
, “
Multi-distributed Speech Emotion Recognition Based on Mel Frequency Cepstogram and Parameter Transfer
,”
Chinese Journal of Electronics
31
, no. 
1
(January
2022
):
155
167
.
5.
Messaoudi
A.
,
Haddad
H.
,
Fourati
C.
,
Hmida
M. B.
,
Elhaj Mabrouk
A. B.
, and
Graiet
M.
, “
Tunisian Dialectal End-to-End Speech Recognition Based on DeepSpeech
,”
Procedia Computer Science
189
(
2021
):
183
190
, https://doi.org/10.1016/j.procs.2021.05.082
6.
Liang
Z.-Y.
,
Li
Y.-X.
,
Sun
Y.
, and
Yao
Q.
, “
Speech Recognition of Dysarthria Based on Multi-feature Combination
” (in Chinese),
Computer Engineering and Design
43
, no. 
2
(February
2022
):
567
572
, https://doi.org/10.16208/j.issn1000-7024.2022.02.037
7.
Zheng
J.
,
Tagawa
N.
,
Yoshizawa
M.
, and
Irie
T.
, “
Plane Wave Beamforming with Adaptively Weighted Frequency Compound Using Bandpass Filtering
,”
Japanese Journal of Applied Physics
60
, no. 
SD
(July
2021
): SDDB08, https://doi.org/10.35848/1347-4065/abf989
8.
Gomez-Garcia
R.
,
Psychogiou
D.
,
Munoz-Ferreras
J.-M.
, and
Yang
L.
, “
Avoiding RF Isolators: Reflectionless Microwave Bandpass Filtering Components for Advanced RF Front Ends
,”
IEEE Microwave Magazine
21
, no. 
12
(December
2020
):
68
86
, https://doi.org/10.1109/MMM.2020.3023222
9.
Ahmmed
K. T.
,
Chan
H. P.
, and
Li
B.
, “
Scalable Selective High Order Mode Pass Filter Architecture with Asymmetric Directional Coupler
,”
Optics Express
28
, no. 
19
(September
2020
):
28465
28478
, https://doi.org/10.1364/OE.402751
10.
Wang
P.
,
Zhang
D.
, and
Lu
B.
, “
Robust Fuzzy Sliding Mode Control Based on Low Pass Filter for the Welding Robot with Dynamic Uncertainty
,”
Industrial Robot
47
, no. 
1
(
2020
):
111
120
, https://doi.org/10.1108/IR-04-2019-0074
11.
Zhou
J.
,
Lu
J.
,
Zhou
G.
, and
Lu
C.
, “
Joint OSNR and Frequency Offset Estimation Using Signal Spectrum Correlations
,”
Journal of Lightwave Technology
39
, no. 
9
(May
2021
):
2854
2863
, https://doi.org/10.1109/JLT.2021.3063251
12.
Liu
X.
and
Zhang
Z.
, “
A Two-Stage Deep Autoencoder-Based Missing Data Imputation Method for Wind Farm SCADA Data
,”
IEEE Sensors Journal
21
, no. 
9
(May
2021
):
10933
10945
, https://doi.org/10.1109/JSEN.2021.3061109
13.
Min
B.
,
Yoo
J.
,
Kim
S.
,
Shin
D.
, and
Shin
D.
, “
Network Anomaly Detection Using Memory-Augmented Deep Autoencoder
,”
IEEE Access
9
(
2021
):
104695
104706
, https://doi.org/10.1109/ACCESS.2021.3100087
14.
Chen
L.
,
Su
W.
,
Wu
M.
,
Pedrycz
W.
, and
Hirota
K.
, “
A Fuzzy Deep Neural Network with Sparse Autoencoder for Emotional Intention Understanding in Human-Robot Interaction
,”
IEEE Transactions on Fuzzy Systems
28
, no. 
7
(July
2020
):
1252
1264
, https://doi.org/10.1109/TFUZZ.2020.2966167
15.
Yang
F.
,
Herranz
L.
,
van de Weijer
J.
,
Guitián
J. A. I.
,
López
A. M.
, and
Mozerov
M. G.
, “
Variable Rate Deep Image Compression with Modulated Autoencoder
,”
IEEE Signal Processing Letters
27
(
2020
):
331
335
, https://doi.org/10.1109/LSP.2020.2970539
16.
Saleem
N.
,
Khattak
M. I.
,
Al-Hasan
M.
, and
Jan
A.
, “
Learning Time-Frequency Mask for Noisy Speech Enhancement Using Gaussian-Bernoulli Pre-trained Deep Neural Networks
,”
Journal of Intelligent and Fuzzy Systems
40
, no. 
1
(
2020
):
849
864
, https://doi.org/10.3233/JIFS-201014
17.
Gupta
D.
and
Shekhawat
H. S.
, “
High-Band Feature Extraction for Artificial Bandwidth Extension Using Deep Neural Network and H Optimisation
,”
IET Signal Processing
14
, no. 
10
(December
2020
):
783
790
, https://doi.org/10.1049/iet-spr.2020.0214
18.
Sun
K.
,
Tao
W.
, and
Qian
Y.
, “
Guide to Match: Multi-layer Feature Matching with a Hybrid Gaussian Mixture Model
,”
IEEE Transactions on Multimedia
22
, no. 
9
(September
2020
):
2246
2261
, https://doi.org/10.1109/TMM.2019.2957984
19.
He
J.
,
Zhou
Y.
,
Shi
J.
, and
Tang
Q.
, “
Modulation Classification Method Based on Clustering and Gaussian Model Analysis for VLC System
,”
IEEE Photonics Technology Letters
32
, no. 
11
(June
2020
):
651
654
, https://doi.org/10.1109/LPT.2020.2991125
20.
Nettasinghe
B.
and
Krishnamurthy
V.
, “
Maximum Likelihood Estimation of Power-Law Degree Distributions via Friendship Paradox-Based Sampling
,”
ACM Transactions on Knowledge Discovery from Data
15
, no. 
6
(June
2021
):
1
28
, https://doi.org/10.1145/3451166
21.
Wu
C.-H.
,
Tsai
T.-R.
, and
Lee
M.-Y.
, “
Two-Stage Maximum Likelihood Estimation Procedure for Parallel Constant-Stress Accelerated Degradation Tests
,”
IEEE Transactions on Reliability
70
, no. 
2
(June
2021
):
446
458
, https://doi.org/10.1109/TR.2021.3053312
This content is only available via PDF.
You do not currently have access to this content.