Abstract: Speech emotion recognition is important in intelligent human-computer interaction, but modeling to handle long-range dependencies and local emotional cues remains challenging. This paper ...