• Login
    View Item 
    •   UZ eScholar Home
    • Faculty of Computer Engineering, Informatics and Communications
    • Department of Analytics and Informatics
    • Department of Analytics and Informatics Staff Publications
    • View Item
    •   UZ eScholar Home
    • Faculty of Computer Engineering, Informatics and Communications
    • Department of Analytics and Informatics
    • Department of Analytics and Informatics Staff Publications
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Recognition of cross-language acoustic emotional valence using stacked ensemble learning

    Thumbnail
    View/Open
    Zvarevashe_Recognition_of_cross_language_acoustic_emotional_valence.pdf (1.963Mb)
    Date
    2020-09
    Author
    Zvarevashe, Kudakwashe
    Olugbara, Oludayo O
    Type
    Article
    Metadata
    Show full item record

    Abstract
    Most of the studies on speech emotion recognition have used single-language corpora, but little research has been done in cross-language valence speech emotion recognition. Research has shown that the models developed for single-language speech recognition systems perform poorly when used in different environments. Cross-language speech recognition is a craving alternative, but it is highly challenging because the corpora used will have been recorded in different environments and under varying conditions. The differences in the quality of recording devices, elicitation techniques, languages, and accents of speakers make the recognition task even more arduous. In this paper, we propose a stacked ensemble learning algorithm to recognize valence emotion in a cross-language speech environment. The proposed ensemble algorithm was developed from random decision forest, AdaBoost, logistic regression, and gradient boosting machine and is therefore called RALOG. In addition, we propose feature scaling using random forest recursive feature elimination and a feature selection algorithm to boost the performance of RALOG. The algorithm has been evaluated against four widely used ensemble algorithms to appraise its performance. The amalgam of five benchmarked corpora has resulted in a cross-language corpus to validate the performance of RALOG trained with the selected acoustic features. The comparative analysis results have shown that RALOG gave better performance than the other ensemble learning algorithms investigated in this study.
    URI
    https://hdl.handle.net/10646/4367
    Additional Citation Information
    Zvarevashe, K. and Olugbara, O. O. (2020). Recognition of cross-language acoustic emotional valence using stacked ensemble learning. Algorithms, 13 (246). http://doi:10.3390/a13100246
    Publisher
    MDPI
    Subject
    deep learning
    feature elimination
    speech emotion
    speech recognition
    Collections
    • Department of Analytics and Informatics Staff Publications [3]

    University of Zimbabwe: Educating To Change Lives!
    DSpace software copyright © 2002-2020  DuraSpace | Contact Us | Send Feedback
     

     

    Browse

    All of UZ eScholarCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

    My Account

    LoginRegister

    Statistics

    View Usage StatisticsView Google Analytics Statistics

    University of Zimbabwe: Educating To Change Lives!
    DSpace software copyright © 2002-2020  DuraSpace | Contact Us | Send Feedback