Recognition of cross-language acoustic emotional valence using stacked ensemble learning

Zvarevashe, Kudakwashe; Olugbara, Oludayo O

Please use this identifier to cite or link to this item: https://hdl.handle.net/10646/4367

Full metadata record

DC Field	Value	Language
dc.contributor.author	Zvarevashe, Kudakwashe	-
dc.contributor.author	Olugbara, Oludayo O	-
dc.date.accessioned	2022-01-20T12:44:55Z	-
dc.date.available	2022-01-20T12:44:55Z	-
dc.date.issued	2020-09	-
dc.identifier.citation	Zvarevashe, K. and Olugbara, O. O. (2020). Recognition of cross-language acoustic emotional valence using stacked ensemble learning. Algorithms, 13 (246). http://doi:10.3390/a13100246	en_ZW
dc.identifier.issn	1999-4893	-
dc.identifier.uri	https://hdl.handle.net/10646/4367	-
dc.description.abstract	Most of the studies on speech emotion recognition have used single-language corpora, but little research has been done in cross-language valence speech emotion recognition. Research has shown that the models developed for single-language speech recognition systems perform poorly when used in different environments. Cross-language speech recognition is a craving alternative, but it is highly challenging because the corpora used will have been recorded in different environments and under varying conditions. The differences in the quality of recording devices, elicitation techniques, languages, and accents of speakers make the recognition task even more arduous. In this paper, we propose a stacked ensemble learning algorithm to recognize valence emotion in a cross-language speech environment. The proposed ensemble algorithm was developed from random decision forest, AdaBoost, logistic regression, and gradient boosting machine and is therefore called RALOG. In addition, we propose feature scaling using random forest recursive feature elimination and a feature selection algorithm to boost the performance of RALOG. The algorithm has been evaluated against four widely used ensemble algorithms to appraise its performance. The amalgam of five benchmarked corpora has resulted in a cross-language corpus to validate the performance of RALOG trained with the selected acoustic features. The comparative analysis results have shown that RALOG gave better performance than the other ensemble learning algorithms investigated in this study.	en_ZW
dc.language.iso	en	en_ZW
dc.publisher	MDPI	en_ZW
dc.subject	deep learning	en_ZW
dc.subject	feature elimination	en_ZW
dc.subject	speech emotion	en_ZW
dc.subject	speech recognition	en_ZW
dc.title	Recognition of cross-language acoustic emotional valence using stacked ensemble learning	en_ZW
dc.type	Article	en_ZW
Appears in Collections:	Department of Analytics and Informatics Staff Publications

Files in This Item:

File	Description	Size	Format
Zvarevashe_Recognition_of_cross_language_acoustic_emotional_valence.pdf		2.01 MB	Adobe PDF	View/Open

Show simple item record Recommend this item

DSpace JSPUI

DSpace preserves and enables easy and open access to all types of digital content including text, images, moving images, mpegs and data sets