Automation & Computer Technologies

Unraveling Predictive Mechanism in Speech Perception and Production: Insights from EEG Analyses of Brain Network Dynamics

Expand
  • 1. Key Laboratory of Linguistics, Chinese Academy of Social Sciences, Beijing 100732, China; 2. Corpus and Computational Linguistics Center, Institute of Linguistics, Chinese Academy of Social Sciences, Beijing 100732, China; 3. Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, Guangdong, China

Received date: 2023-12-19

  Accepted date: 2024-01-05

  Online published: 2024-04-22

Abstract

How neural networks coordinate to support speech perception and speech production represents a forefront research topic in both contemporary neuroscience and artificial intelligence. Despite the successful incorporation of hierarchical and predictive attributes from biological neural networks (BNNs) into artificial counterparts, substantial disparities persist, particularly in terms of real-time feedback and nonlinear regulation. To gain a more profound understanding of how BNNs manifest these attributes, the present study employed electroencephalography (EEG) techniques to examine the spatiotemporal brain network dynamics involved in listening and oral reading of identical sentences. These two tasks engage distinct sensorimotor modalities while sharing high-level semantic and syntactic representations. According to a hierarchical feedforward model, the low-level auditory and visual inputs would be progressively transformed towards abstract representations of the sentence meaning, leading to a convergence of brain network patterns in higher cognitive regions. However, our findings challenged this viewpoint by revealing an early resemblance of network activation in the prefrontal and parietal areas in both tasks. It implies a top-down predictive mechanism along with the bottom-up progression. This bidirectional interaction could be potentially implemented through frequency-specific synchronization and desynchronization between functional-specific cortical regions, laying the foundation of the speech chain system with common neural substrates.

Cite this article

Zhao Bin, Dang Jianwu, Li Aijun . Unraveling Predictive Mechanism in Speech Perception and Production: Insights from EEG Analyses of Brain Network Dynamics[J]. Journal of Shanghai Jiaotong University(Science), 2026 , 31(2) : 273 -281 . DOI: 10.1007/s12204-024-2729-9

References

[1] HOHENSTEIN J, KIZILCEC R F, DIFRANZO D, et al. Artificial intelligence in communication impacts language and social relationships [J]. Scientific Reports, 2023, 13(1): 5487.
[2] SHEWALKAR A, NYAVANANDI D, LUDWIG S A. Performance evaluation of deep neural networks applied to speech recognition: RNN, LSTM and GRU [J]. Journal of Artificial Intelligence and Soft Computing Research, 2019, 9(4): 235-245.
[3] SCHRIMPF M, BLANK I A, TUCKUTE G, et al. The neural architecture of language: Integrative modeling converges on predictive processing [J]. Proceedings of the National Academy of Sciences of the United States of America, 2021, 118(45): e2105646118.
[4] BEIER E J, CHANTAVARIN S, REHRIG G, et al. Cortical tracking of speech: Toward collaboration between the fields of signal and sentence processing [J]. Journal of Cognitive Neuroscience, 2021, 33(4): 574-593.
[5] VIGNEAU M, BEAUCOUSIN V, HERVÉ P Y, et al. Meta-analyzing left hemisphere language areas: Phonology, semantics, and sentence processing [J]. NeuroImage, 2006, 30(4): 1414-1432.
[6] ZHU Y M, XU M, LU J F, et al. Distinct spatiotemporal patterns of syntactic and semantic processing in human inferior frontal gyrus [J]. Nature Human Behaviour, 2022, 6: 1104-1111.
[7] HAMILTON L S, OGANIAN Y, HALL J, et al. Parallel and distributed encoding of speech across human auditory cortex [J]. Cell, 2021, 184(18): 4626-4639.e13.
[8] APŠVALKA D, FERREIRA C S, SCHMITZ T W, et al. Dynamic targeting enables domain-general inhibitory control over action and thought by the prefrontal cortex [J]. Nature Communications, 2022, 13: 274.
[9] BINDER J R, DESAI R H, GRAVES W W, et al. Where is the semantic system? A critical review and meta-analysis of 120 functional neuroimaging studies [J]. Cerebral Cortex, 2009, 19(12): 2767-2796.
[10] WALENSKI M, EUROPA E, CAPLAN D, et al. Neural networks for sentence comprehension and production: An ALE‐based meta‐analysis of neuroimaging studies [J]. Human brain mapping, 2019, 40(8): 2275-2304.
[11] HICKOK G, POEPPEL D. Dorsal and ventral streams: A framework for understanding aspects of the functional anatomy of language [J]. Cognition, 2004, 92(1/2): 67-99.
[12] FRIDRIKSSON J, YOURGANOV G, BONILHA L, et al. Revealing the dual streams of speech processing [J]. Proceedings of the National Academy of Sciences of the United States of America, 2016, 113(52): 15108-15113.
[13] FRIEDERICI A D, RÜSCHEMEYER S A, HAHNE A, et al. The role of left inferior frontal and superior temporal cortex in sentence comprehension: Localizing syntactic and semantic processes [J]. Cerebral Cortex, 2003, 13(2): 170-177.
[14] TOURVILLE J A, REILLY K J, GUENTHER F H. Neural mechanisms underlying auditory feedback control of speech [J]. NeuroImage, 2008, 39(3): 1429-1443.
[15] BERENT I, PLATT M, THEODORE R, et al. Speech perception triggers articulatory action: Evidence from mechanical stimulation [J]. Frontiers in Communication, 2020, 5: 34.
[16] LIBERMAN A M, MATTINGLY I G. The motor theory of speech perception revised [J]. Cognition, 1985, 21(1): 1-36.
[17] JUNG T P, MAKEIG S, BELL A J, et al. Independent component analysis of electroencephalographic and event-related potential data [M]// Central auditory processing and neural modeling. Boston: Springer, 1998: 189-197.
[18] MULLEN T, DELORME A, KOTHE C, et al. An electrophysiological information flow toolbox for EEGLAB [J]. Biological Cybernetics, 2010, 83: 35-45.
[19] MAKEIG S, DEBENER S, ONTON J, et al. Mining event-related brain dynamics [J]. Trends in Cognitive Sciences, 2004, 8(5): 204-210.
[20] KRIEGESKORTE N, MUR M, BANDETTINI P. Representational similarity analysis - connecting the branches of systems neuroscience [J]. Wellcome Open Research, 2008, 2: 4.
[21] DELORME A, MAKEIG S. EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis [J]. Journal of Neuroscience Methods, 2004, 134(1): 9-21.
[22] MULLEN T R, KOTHE C A E, CHI Y M, et al. Real-time neuroimaging and cognitive monitoring using wearable dry EEG [J]. IEEE Transactions on Biomedical Engineering, 2015, 62(11): 2553-2567.
[23] HSU S H, PION-TONACHINI L, PALMER J, et al. Modeling brain dynamic state changes with adaptive mixture independent component analysis [J]. NeuroImage, 2018, 183: 47-61.
[24] OOSTENVELD R, OOSTENDORP T F. Validating the boundary element method for forward and inverse EEG computations in the presence of a hole in the skull [J]. Human Brain Mapping, 2002, 17(3): 179-192.
[25] PION-TONACHINI L, KREUTZ-DELGADO K, MAKEIG S. ICLabel: An automated electroencephalographic independent component classifier, dataset, and website [J]. NeuroImage, 2019, 198: 181-197.
[26] DELORME A, MULLEN T, KOTHE C, et al. EEGLAB, SIFT, NFT, BCILAB, and ERICA: New tools for advanced EEG processing [J]. Computational Intelligence and Neuroscience, 2011, 2011: 10.
[27] SCHELTER B, WINTERHALDER M, EICHLER M, et al. Testing for directed influences among neural signals using partial directed coherence [J]. Journal of Neuroscience Methods, 2006, 152(1/2): 210-219.
[28] BONHAGE C E, MEYER L, GRUBER T, et al. Oscillatory EEG dynamics underlying automatic chunking during sentence processing [J]. NeuroImage, 2017, 152: 647-657.
[29] VON STEIN A, SARNTHEIN J. Different frequencies for different scales of cortical integration: From local gamma to long range alpha/theta synchronization [J]. International Journal of Psychophysiology, 2000, 38(3): 301-313.
[30] PALVA S, PALVA J M. New vistas for α-frequency band oscillations [J]. Trends in Neurosciences, 2007, 30(4): 150-158.
[31] CUELLAR M, BOWERS A, HARKRIDER A W, et al. Mu suppression as an index of sensorimotor contributions to speech processing: Evidence from continuous EEG signals [J]. International Journal of Psychophysiology, 2012, 85(2): 242-248.
[32] KOECHLIN E, ODY C, KOUNEIHER F. The architecture of cognitive control in the human prefrontal cortex [J]. Science, 2003, 302(5648): 1181-1185.
[33] HELFRICH R F, KNIGHT R T. Oscillatory dynamics of prefrontal cognitive control [J]. Trends in Cognitive Sciences, 2016, 20(12): 916-930.
[34] GERANMAYEH F, WISE R J S, MEHTA A, et al. Overlapping networks engaged during spoken language production and its cognitive control [J]. Journal of Neuroscience, 2014, 34(26): 8728-8740.
[35] BABILONI C, DEL PERCIO C, VECCHIO F, et al. Alpha, beta and gamma electrocorticographic rhythms in somatosensory, motor, premotor and prefrontal cortical areas differ in movement execution and observation in humans [J]. Clinical Neurophysiology, 2016, 127(1): 641-654.
[36] LIU C, HAN T, XU Z, et al. Modulating gamma oscillations promotes brain connectivity to improve cognitive impairment [J]. Cerebral Cortex, 2022, 32(12): 2644-2656.
Outlines

/