Rhythmic sampling and competition of target and distractor in a motion detection task

Sustained visual attention is required in many real-life situations such as driving a vehicle or operating machinery and is characterized by limited capacity; not all information available to the visual system can be processed in-depth. Recent work has suggested that to manage the limited capacity problem, the visual system samples the attended information in a rhythmic fashion, mediated by low-frequency intrinsic brain oscillations (Chota et al., 2022; Dugué et al., 2015; Fiebelkorn et al., 2013; Fiebelkorn et al., 2018; Fiebelkorn and Kastner, 2019; Helfrich et al., 2018; Michel et al., 2022; Re et al., 2019; VanRullen, 2013; Zalta et al., 2020). In this view, the cycle of a low-frequency intrinsic brain oscillation can be divided into two phases: a high excitability phase and a low excitability phase. When a stimulus occurs during the high excitability phase, behavioral performance tends to be better than average; conversely, if the stimulus occurs during the low excitability phase, performance is generally worse than average (Lakatos et al., 2008; VanRullen, 2013). Behavioral performance may thus exhibit rhythmic fluctuations at the frequency of the aforementioned low-frequency intrinsic brain oscillation. One paradigm that has been used to test the idea of rhythmic visual sampling is the cue-target paradigm (Posner, 1980; Posner et al., 1987; Posner et al., 1988). The cue at the beginning of each trial, in addition to providing instructions on how the impending target stimulus should be responded to, helps to reset the phase of the low-frequency intrinsic oscillation such that all the trials start at approximately the same phase. By varying the stimulus onset asynchrony (SOA) between the cue and the target, one obtains the behavioral response (e.g. accuracy and/or reaction time) as a function of the SOA. The rhythmic nature and the frequency of this function can then be assessed by applying time-domain and/or spectral-domain analysis.

When attending to one object in isolation, the frequency of rhythmic sampling tends to be in the high theta or low alpha frequency range, i.e., around 8 Hz (Fiebelkorn et al., 2013; Senoussi et al., 2019; van der Werf et al., 2023). When attention is directed to multiple objects in the environment, it has been suggested that rather than sampling all the objects simultaneously, the brain samples the objects in a serial fashion (Cohen et al., 1990; Wyart et al., 2012). This would then lead to a slower rhythmic sampling of any given object, in the low range of the theta frequency band, i.e., around 4 Hz (Thigpen et al., 2019). For example, when participants were cued to attend one visual hemifield but were asked to detect the appearance of a weak stimulus in either the cued or the uncued visual hemifield, the rhythmic detection rate for the target appearing in a given visual hemifield decreased from 8 Hz to 4 Hz (Chota et al., 2022; Fiebelkorn et al., 2013; VanRullen, 2013). Interestingly, when the detection rate functions of the cued and uncued targets were compared, a 180-degree relative phase was apparent, suggesting that the visual system indeed sampled the two visual hemifields in a serial, alternating fashion (Fiebelkorn et al., 2013; Jiang et al., 2024). In another example, two spatially overlapping clouds of moving dots, one in red color and the other in blue color, moved in orthogonal directions (Re et al., 2019), and the participant was cued to attend both the red dots and the blue dots and instructed to report the change in either the red dots or the blue dots as soon as it occurred. When there was only one cloud of moving dots, the detection accuracy exhibited rhythmic fluctuations as a function of the SOA at a frequency around 8 Hz. When both clouds of moving dots were present, rhythmic fluctuations in the accuracy of detecting changes in a given cloud of moving dots were again identified, and the sampling frequency was reduced to 4 Hz. In this case, however, no apparent 180-degree relative phase between the rhythmic behavioral response functions to the red and blue dots was found, suggesting that there was no serial, alternating sampling between the two attended objects if they appeared at the same spatial location.

 The real world visual environment contains both task-relevant information (target) and task-irrelevant (distractor) information. It is well established that in the presence of a distractor, the processing of the target is negatively impacted, leading to reduced task performance (Lavie, 2005; Murphy et al., 2016). This implies that the distractor, despite the need for it to be suppressed by the brain’s executive control system (Kastner et al., 1998; Kastner et al., 1999; Kastner and Pinsk, 2004; Seidl et al., 2012; Kastner and Ungerleider, 2000), is nevertheless processed in the brain, and the competition between the target and the distractor at the neural representational level causes the detriment in behavioral performance. Does the rhythmic sampling theory extend to the target-distractor scenario? If so, what is the temporal relationship between the rhythmic sampling of attended vs distracting stimuli? These questions have hitherto not been addressed. Part of the reason is that the majority of the studies on rhythmic environmental sampling focuses on behavioral evidence, e.g., rhythmicity in the aforementioned performance-vs-SOA function (Fiebelkorn and Kastner, 2019; Landau and Fries, 2012). Since the distractor is not responded to, its sampling by the visual system cannot be inferred purely on the basis of response behavior, and consequently, it is also not possible to study how the target and the distractor might compete for neural representations purely behaviorally.

 In this study, we addressed these limitations by recording neural activities and investigating rhythmic sampling during a target-distractor scenario using steady-state visual evoked potential (SSVEP) frequency tagging. The stimuli were a cloud of randomly moving dots (the target) superimposed on emotional images from the International Affective Picture System (IAPS; Lang et al., 1997) (the distractor). The target and the distractor were flickered at two different frequencies for an extended duration of ~12 s. The participants were asked to focus on the randomly moving dots and report the number of times the dots moved coherently. In this paradigm, the onset of the stimulus array is the event that resets the phase of the putative low-frequency brain oscillation underlying rhythmic sampling, and the time from the stimulus array onset, referred to as time-from-onset (TFO), is analogous to the SOA in the traditional cue-target paradigm. It is worth noting that, although this paradigm has been used extensively in studies of target-distractor competition with electroencephalography (EEG) (Hindi Attar and Müller, 2012; Müller et al., 2008), it has not yet been examined in the context of rhythmic sampling. Aided by frequency tagging, from the EEG data, we extracted neural representations of target and distractor processing separately as a function of TFO. By examining the rhythmicity of these representations as functions of TFO and the phase relationship between these functions, we assessed (1) whether the target and the distractor were sampled rhythmically and (2) how their temporal competition for neural representations impacted behavioral performance.

Continue Reading