Crossing Differently: Divergent Cognitive and Visual Strategies in Autistic and Neurotypical Pedestrians Facing Autonomous Vehicles

Abstract

The rise of autonomous vehicles (AVs) necessitates a deeper understanding of how diverse populations, particularly those with atypical cognitive profiles, perceive and interact with these systems. This study examined how autistic and neurotypical (NT) adults approached street crossing in a 3D simulated AV environment. Behavioral data, eye-tracking, and functional near-infrared spectroscopy (fNIRS) were integrated to evaluate decision-making, visual attention, and neural activation. Results revealed significantly different success rates and crossing strategies, with autistic individuals exhibiting delayed decision initiation and reduced autonomic arousal as indexed by pupil dilation. Despite these differences, crossing duration post-initiation remained similar. Neuroimaging data supported these findings, indicating distinct cognitive processing during crossing judgments. This study underscores the need for inclusive urban mobility design and highlights potential avenues for tailoring AV communication to accommodate neurodiversity.

Keywords

intention inference, human-machine interaction, autism spectrum disorder, self-driving cars, pedestrian safety, eye-tracking, fNIRS

1. Introduction

According to the World Health Organization’s global report on road safety, there were 1.19 million road traffic deaths worldwide in 2021. Pedestrians, the second largest group of fatalities, accounted for 23% of these deaths, highlighting their vulnerability to traffic. Almost 95% of people across 48 countries are identified as pedestrians (Global Status Report on Road Safety 2023 2023). Seniors, children, and individuals with disabilities (whether visible or not) are considered particularly at risk of injury as pedestrians (Xiang et al. 2006; Organization 2019; Schwartz et al. 2022). For people with visible disabilities, such as those using a white walking cane to signal visual impairment, human drivers tend to exhibit more yielding behaviors (Andrew Harrell 1992; Guth et al. 2005). What about people with invisible disabilities ? Individuals with cognitive or social impairments, who do not have easily recognizable signs, face additional stress and danger during daily road-crossing experiences, often without understanding or consideration from human drivers (“Going Our Own Way: Public Transit Accessibility for Neurodivergent People” 2023).

As one of the invisible disabilities, Autistic Spectrum Disorder (ASD, or autism) is a lifelong developmental disorder that can vary in individuals along a continuum of severity, characterized by difficulties in social interaction and communication, repetitive behaviors, and intense focus on specific interests (Diagnostic and Statistical Manual of Mental Disorders: DSM-5 2013; Lord et al. 2018). Every 1 in 36 (2.8%) patients was diagnosed with autism in the US (CDC 2024). The latest global reports show a remarkably increased prevalence estimated that at least 1 in 100 children on Earth have been diagnosed with autism (Zeidan et al. 2022; Talantseva et al. 2023). Although much of the existing literature focuses on Individuals with ASD in traffic as drivers (D. Y. Chee et al. 2017; D. Y. T. Chee et al. 2019), few studies have examined their role as pedestrians (Earl et al. 2016; Earl et al. 2018; Wilmut and Purcell 2021). This gap is critical because pedestrian activity is more common and exposes individuals to greater vulnerability to traffic (Global Status Report on Road Safety 2023 2023).

The clinical characteristics of ASD include deficits in social interaction and non-verbal communication, whereas social cues and non-verbal negotiations with the driver actually serve an important role in road-crossing when traffic lights are not available (Hamilton-Baillie 2004; Bishop, Biasini, and Stavrinos 2017). These social elements in road crossing exacerbate pedestrian vulnerability among autistic individuals. When traffic lights are absent in a crossing, an effective driver-pedestrian interaction becomes primal. Body language, eye contact, or even facial expressions can make a lot of difference in this negotiation that happens in the blinking of an eye (Hamilton-Baillie 2004; D. Y. T. Chee et al. 2019; Curry et al. 2021).

In driver-pedestrian interactions, intent is often conveyed through subtle, non-verbal cues such as gaze and eye contact, creating an unspoken negotiation as both parties assess who will proceed first. The ability to recognize and interpret these cues is crucial for safe road crossings. This cognitive skill—variously referred to as theory of mind (Baron-Cohen, Leslie, and Frith 1985), intention attribution (Dennett 1971), or mentalizing (Frith 2001)—plays a fundamental role in predicting others’ behavior in social interactions.

Neuroimaging studies have identified the temporoparietal junction (TPJ) as a key brain region involved in reasoning about others’ intentions, particularly in tasks requiring an understanding of beliefs, desires, or goals (Saxe and Kanwisher 2003; Schurz et al. 2014). However, individuals with ASD often exhibit atypical TPJ activation during theory of mind tasks, which may contribute to difficulties in real-time social interactions (Lombardo et al. 2011; Nijhof et al. 2018). In traffic environments, where rapid intent attribution is essential for pedestrian safety, these challenges could pose significant obstacles when engaging with human drivers.

With the rise of autonomous vehicles (AVs), these traditional interaction dynamics are set to change. The AV market is projected to expand to $21 billion by 2035, leading to increasing integration with human road users (Rezwana and Lownes 2024). Unlike human drivers, AVs remove the element of social interaction from pedestrian interactions, lowering significantly the uncertainty level (Wilmut and Purcell 2021; Rezwana and Lownes 2024). While this shift may create challenges for typically developed pedestrians, who rely heavily on social cues (Van Brummelen et al. 2018), it may offer a relative advantage for autistic individuals. Driving-related studies suggest that autistic drivers outperform their typically developed counterparts in rule-based tasks, such as using indicators at roundabouts and checking for cross-traffic at intersections (D. Y. Chee et al. 2017). However, AV-pedestrian interactions involve more than just recognizing patterns and following traffic rules; they introduce a broader cognitive question: Can humans accurately infer AV intentions based solely on movement and adjust their behavior accordingly? Research suggests that this is indeed possible, as humans are capable of detecting and classifying abstract agents’ action goals based solely on motion cues (Surden and Williams 2016; Mahadevan, Somanath, and Sharlin 2018).

In the current context, while typically developed pedestrians often rely on non-verbal cues from human drivers to assess crossing opportunities, autistic pedestrians may be less dependent on such social signals—even when interacting with human-driven vehicles (Earl et al. 2016). Instead, they may have already developed and refined a strategy that relies on analyzing the vehicle’s kinematic features, such as speed, trajectory, and indicator lights, to infer its intentions) (Zach Noonan et al. 2023). This would suggest that, even before the advent of AVs, autistic pedestrians might have been accustomed to interpreting machine behavior analytically, rather than relying on human social cues.

A substantial body of recent research has focused on improving AV-to-human communication by embedding external visual cues—such as light signals, gaze simulations, or symbolic displays—into autonomous vehicles to make their intentions clearer to pedestrians (Driggs-Campbell and Bajcsy 2016; Rezwana and Lownes 2024). These studies implicitly assume that such social cues are both effective and necessary, particularly for neurotypical (NT) individuals, who rely heavily on socially conventional signals in interpreting intent. However, fewer studies have investigated how different individuals—especially those with neurodivergent profiles—perceive and interpret the behavior of autonomous vehicles. That is, while much attention has been devoted to how AVs can better express intent, less is known about how pedestrians vary in their ability to extract intention from AV motion, even in the absence of explicit signaling.


In a seminal visuomotor priming experiment, Pierno et al. (2008) found that ASD children outperformed their NT peers when asked to imitate a reach-to-grasp action previously executed by a robotic arm, but not when the action was performed by a human agent. This result suggests that the mechanisms supporting action understanding in ASD may be selectively enhanced in response to artificial, predictable systems.

More broadly, this finding aligns with a growing body of literature suggesting that ASD individuals often demonstrate a distinctive affinity with technology, which may allow them to engage more effectively with structured, predictable systems than with socially complex human counterparts. For example, enhanced performance has been observed in ASD individuals in tasks involving robot-mediated tutoring, computer-assisted learning, and collaborative problem-solving with virtual agents (Diehl et al. 2012; Scassellati, Admoni, and Matarić 2012; Dubois-Sage et al. 2024).

This privileged relationship with artificial systems may stem from their lower reliance on implicit social cues and greater sensitivity to systematic patterns, allowing for more efficient interaction with technologies that offer consistent and transparent input-output dynamics. In the context of autonomous vehicles, this raises the hypothesis that ASD pedestrians may strategically leverage motion-based parameters to infer AV intent, potentially outperforming NT individuals in tasks that rely less on socially framed communication and more on structural motion analysis.

Both NT and ASD individuals attribute intentions to robots and moving objects (+ref). The movement of these objects provides additional information about their intentions, which can be crucial for safe interaction in shared zones and zebra-crossing. Intention attribution involves understanding the purpose of an object’s movement, often linked to low-level motor intentions, such as moving forward or stopping. Individuals with ASD might, therefore, have a stronger capability in human-machine theory of mind (P. C. Pantelis et al. 2014), which would allow them to read kinematic social cues when interacting with machines (P. Pantelis et al. 2011).

Previous observations on autistic advantage in human-robot interaction suggest that autistic pedestrians may have an edge in interacting with AVs in the near future. By investigating the influence of automation within a social perception framework, especially in simulated real-world traffic scenarios, this study offers a unique opportunity to test the theoretical autistic advantage beyond predictability. In practice, such studies could pave the way to improve safety of pedestrians with autism and help design AVs that are more accessible and supportive for individuals with ASD, ultimately contributing to safer and more inclusive road environments.


Pedestrians account for nearly one-fourth of all traffic-related deaths worldwide, with individuals possessing invisible disabilities such as autism facing heightened risk. Despite increasing prevalence, the behaviors of autistic pedestrians in dynamic traffic contexts—particularly those involving AVs—remain under-explored. This study bridges that gap by investigating how autistic versus NT adults interpret and respond to AV movement cues in a simulated street-crossing task, using multimodal neurocognitive and behavioral metrics.

Functional near-infrared spectroscopy (fNIRS) (Villringer and Chance 1997) has been recognized as a promising tool for investigating social engagement in children with autism because of its portability and noninvasiveness (Liu et al. 2019; Zhang and Roeyers 2019). Given that road crossing is a highly motor-related task that requires ecological validity, we consider fNIRS to be an equally suitable approach for assessing neural activity in autistic and neurotypical adults in this context.

2. Methods

2.1 Participants

A total of 53 participants took part in the study, including 12 adults with autism spectrum disorder (ASD group) and 41 neurotypical adults (NT group). All participants had normal or corrected-to-normal vision and provided informed written consent prior to participation. The study was approved by the ethics committee of the Université de Lyon.

Participants in the ASD group (9 females, 3 males; mean age = 37.6 years, SD = 14.2) were recruited through autism-related communities and forums. All individuals self-identified as autistic and reported having received a formal diagnosis of autism spectrum disorder (ASD) from a healthcare professional, with no history of attention deficit with or without hyperactivity (ADHD). While clinical documentation was not independently verified, participants’ autistic characteristics were further assessed using the Aspie Quiz (https://rdos.net/fr/), a widely used self-report instrument designed to profile neurodivergent and neurotypical traits. The questionnaire provides separate scores for autistic and neurotypical dimensions (range: 0–200). All included participants showed high neurodivergent scores (mean = 136, SD = 20.4) and low neurotypical scores (mean = 67.4, SD = 21.9), in line with typical profiles observed in diagnosed autistic populations (Ekblad 2013).

Participants in the NT group (33 females, 7 males; mean age = 20.6 years, SD = 1.6) were undergraduate students recruited from the Université de Lyon. Although no autism-specific screening instrument was administered to this group, participants reported no history of developmental or psychiatric disorders. Prior to the experiment, all NT participants completed the Smart Tools Proneness Questionnaire (STP-Q; (Navarro et al. 2022)), a 32-item scale assessing the individual tendency to engage with smart tools. Participants were selected if their scores fell at least one standard deviation above or below the population mean, forming a NT group with extreme smart-tool pronenesses.

2.2 Apparatus and Procedure

Participants performed a road-crossing task in a virtual 3D pedestrian simulator presented on a PC, using a Gamepad for realistic movement control, being able to move and view in limited range (see Figure 1).

Figure 1: Participants’ Control and Virtual Environment Setup. Participants navigated the road-crossing simulation as pedestrian avatars using a PS5 DualSense controller. The virtual environment was presented on a 27-inch screen (1920×1080 resolution) positioned approximately 1 meter in front of the participant. The pedestrian’s field of view was constrained to 60° to minimize visual distortion. The controller allowed for intuitive control of the avatar in two primary dimensions: View Control: The left joystick enabled lateral head movements (left or right), allowing participants to visually scan approaching traffic before deciding to cross. Movement Control: The right joystick controlled forward and backward locomotion across the road. Movement speed was continuous and responsive to the degree of joystick tilt, ranging from 3 km/h (minimal tilt) to a maximum of 6 km/h (full tilt). Releasing the joystick immediately halted movement, allowing participants to regulate both the starting time and duration of their crossings with precision. The simulated environment featured a minimalist rural road scenario designed to eliminate unnecessary visual distractions. The road was a one-way, single-lane route measuring approximately 200 meters in length and 5 meters in width. All vehicles were uniform in size and appearance.

This task was designed to investigate participants’ behavioral strategies and physiological responses during interactions with autonomous vehicles (AVs). The experimental environment consisted of a gently curved road, providing a clear line-of-sight for participants, along which virtual AVs approached continuously. Traffic scenarios were presented in three distinct experimental conditions, each differing systematically in how the available physical crossing gap evolved over time, while maintaining an identical temporal crossing window of 3 seconds at the crossing point (illustrated in Figure 2).

Figure 2: Illustration of the four traffic conditions. 1) Baseline: A single vehicle approaches the pedestrian crossing at a constant speed of 30 km/h, with no other vehicles present. This condition served as a physiological baseline for fNIRS recordings. 2) Const: Four groups of vehicles (each composed of three cars) approach at a uniform speed of 30 km/h. The inter-group gaps are temporally and physically consistent, resulting in stable crossing opportunities. 3) VarMinus: Vehicles are organized into four groups, with each successive group traveling faster than the previous one—30, 40, 50, and 60 km/h, respectively. This acceleration pattern leads to progressively shorter physical gaps between the groups, while maintaining a constant temporal gap (3s) at the crossing point. 4) VarPlus: All four vehicle groups initially travel at 60 km/h. The first group maintains this speed throughout. The second group decelerates to 50 km/h over 3 seconds and then maintains that speed. The third group slows further, from 60→50→40 km/h in two 3-second stages. The fourth group continues the same pattern until it reaches 30 km/h. This progressive deceleration results in increasingly widened physical gaps over time, despite the temporal gap remaining constant at 3s.

Being explicitly informed that the vehicles were all fully-automated, participants encountered three sequential inter-group gaps (Gap #1 to Gap #3) in each trial, simulating recurring crossing opportunities. They were instructed to cross as soon as possible, reflecting real-world pedestrian decision-making behaviors. However, only those trials in which participants initiated crossing during the first gap (Gap #1) were retained for subsequent analyses. The sequence of traffic conditions was randomized between trials to mitigate potential learning effects or carryover biases.

Eye-tracking data were captured at a sampling rate of 1000 Hz using an Eyelink Portable Duo eye tracker in remote tracking mode. Recorded metrics included gaze positions, fixation duration, and pupil diameter changes, precisely synchronized with key traffic events (e.g., the onset of the initial inter-group gap).

Concurrently, cerebral hemodynamic responses were monitored using a functional near-infrared spectroscopy (fNIRS) system with a 12×10 optode array covering frontal, temporal and parietal cortical regions, sampling at approximately 5.5 Hz (see Figure 3). fNIRS signals were acquired using an 26-channel fNIRS system (Cortivision Photo Cap C20).

The placement of fNIRS optodes was determined using a systematic and refined procedure based on an automated meta-analytic approach (Yarkoni et al. 2011). Specifically, we conducted an automated meta-analysis via the Neurosynth database, targeting brain regions consistently activated during tasks involving decision-making and Theory of Mind (ToM). Using the “fNIRS Optodes’ Location Decider” (fOLD) toolbox (Zimeo Morais, Balardin, and Sato 2018), we mapped these identified activation clusters onto the international 10–20 EEG electrode positioning standard.

Figure 3: fNIRS layout consisted of 12 emitters (red) and 10 detectors (purple), forming 4 short channels and 21 measurement channels covering the frontal, temporal, and parietal regions. The optode numbers were color-coded, with channels represented by blue arrows indicating connections from source to receptor optodes. Short channels were enclosed in blue squares, with their emitters labeled in parentheses.

The simulation environment was developed using Unity 3D, with precise synchronization between the virtual environment, eye-tracking data, and fNIRS recordings achieved via the Lab Streaming Layer (LSL) protocol(“LabStreamingLayers Documentation Labstreaminglayer 1.13 Documentation,” n.d.). Prior to experimental trials, each participant underwent individual calibrations for both eye-tracking and fNIRS systems, followed by a brief familiarization phase for vibration feedback if collision happens. The entire experimental session, including calibration, data collection, and participant debriefing, lasted approximately 45 minutes. Each participants went through 6 trials per conditions, resulting in 24 trials per participant.

2.3 Measures

[3.1.1 Success Rate:] Binary indicator of collision avoidance in each trial

[3.1.2 Crossing Start Time:] Time from trial onset to engaging with potential collision area

[3.1.3 Crossing duration:] Time spent in crossing

[3.1.4 Current Deviation:] Time difference between pedestrian and gap center trajectories

[3.1.5 Pedestrian Crossing Speed:] Speed of the pedestrian avatar in meter per second

[3.1.6 Temporal Error:] Temporal offset from ideal crossing moment

[3.2.1 Gaze Preference:] Standardized gaze position projected between gap-related vehicles

[3.2.1 Pupil Dilatation:] Proxy for autonomic arousal

3.3.1 HbO in Brain ROI: Proxy for cortex activation

2.4 Data Analysis

Linear mixed-effects models assessed group and condition effects across behavioral and physiological variables. Eye-tracking data were processed via dynamic dot-product alignment to AVs’ screen positions (see Figure 4).

Figure 4: Illustration of the calculation for Gaze Preference. It was computed using a standardized dot product projection along the axis defined by the gap-opening and gap-closing vehicles. Values range beyond the [0, 1] interval: a score of 0 indicates gaze precisely aligned with the gap-opening vehicle (V3), while a score of 1 reflects alignment with the gap-closing vehicle (V1). Intermediate values within [0, 1] indicate gaze directed between the two vehicles. When either vehicle fell outside the visual field, projection endpoints were adjusted accordingly: the origin was set to (0, 0) for missing V3 (left-bottom corner of the screen) and to (0, 0.5) for missing V1 (right edge of the screen).

fNIRS data preprocessing was completed using the MNE package in Python. The raw intensity data were first converted to optical density (OD) changes, then underwent TDDR correction (Fishburn et al. 2019) and bandpass filtering (0.01–0.09 Hz) to filter out motion and other physiological artefacts. Finally the OD data were converted to hemodynamic responses (HDR) using the modified Beer-Lambert Law (Kocsis, Herman, and Eke 2006). The oxygenated hemoglobin (HbO) data of six trials under Baseline traffic condition were averaged for each participant and then used for steady-state control and normalized by z-score methods to eliminate the effect of data units (?) and facilitate comparison between different traffic conditions. Finally, each channel’s HbO amplitude under the six trials of each participant in the three different traffic conditions (Const, VarMinus, VarPlus) was calculated to compare the effect of conditions and groups (ASD, NT).

3. Results

3.1 Behavioral Outcomes

3.1.1 Success Rate:

A linear mixed-effect model showed that the ASD group demonstrated a significantly higher likelihood of collisions compared to the NT group, as indicated by a positive fixed effect estimate for the group comparison (β = 0.206, SE = 0.086, t(51) = 2.379, p = .021).

Notably, in the VarPlus condition, both groups showed a numerical reduction in risk-taking behaviors such as crossing during intra-group gaps, compared to Const and VarMinus conditions. However, this reduction did not reach statistical significance in either group (ASD: z = -1.29, p = .20; NT: z = -0.99, p = .32), suggesting a trend toward more cautious behavior in the presence of physically expanding crossing gaps.

Figure 5: Success rate breakdown by Group and Condition. Each panel displays one condition. Within each panel, each bar displays one group. The bars represent the proportion of trial outcomes across each Group × Condition combination, the full bar height corresponds to 100% of the trials in the given Group × Condition combination. Trials were categorized into three mutually exclusive outcomes: Successful crossings (green), Failed attempts at inter-group gaps (blue), and Failed attempts at intra-group gaps (red). Inter-group gap failures reflect unsuccessful yet strategically appropriate decisions, whereas intra-group gap failures suggest riskier decision-making patterns.

3.1.2 Crossing Start Time:

We examined the effects of traffic condition and group on pedestrian crossing start time using a two-way mixed-design ANOVA with Condition as a within-subjects factor and Group as a between-subjects factor.

Crossing start time revealed a significant main effect on Condition (F(2,96) = 9085.91, p < .001, ηp2=.995), indicating that crossing decisions varied strongly across traffic conditions. Post-hoc tests on Condition showed that the difference majorly came from VarPlus (see Figure 6).

Figure 6: Post-hoc comparisons following the significant main effect of Condition on Crossing Start Time revealed that the VarPlus condition significantly differed from both the Const and VarMinus conditions (ps < .001). Specifically, participants in both the ASD and NT groups initiated crossing earlier under the VarPlus condition compared to the other two conditions. No significant difference was observed between the Const and VarMinus conditions (p > .05). These findings suggest that the increasing physical crossing opportunity in the VarPlus condition—despite identical temporal crossing opportunities across all conditions—promoted earlier initiation of crossing behavior.

The Group effect was also significant (F(1,48) = 5.997, p = .018, ηp2​ = .111), showing that overall, ASD participants started to cross later than NT participants across conditions.

To evaluate potential group differences in motor execution during the road-crossing task, we analyzed also the Crossing duration that participants spent within the potential collision zone—the centered area in crossing, superposed with the trajectory of the approaching vehicles.

3.1.3 Crossing Duration:

Notably, we found no significant group differences in crossing duration (F(1, 669) = 0, p = .99), suggesting intact motor execution in ASD group, once started crossing.

3.1.4 Current Deviation:

We examined current deviation—a behavioral measure reflecting whether pedestrians were adjusting their speed appropriately relative to vehicle positions—using a mixed-design ANOVA with Group as a between-subjects factor and Condition and Time (–5 s to 0 s relative to crossing midpoint) as within-subjects factors. The results revealed no significant main effect of Group ( F(1, 6) = .007, p = .934), no significant main effect of Condition (F(2, 12) = .693, p = .519), and no significant Group × Condition interaction (F(2, 12) = .133, p = .877).

Descriptively, some group- and condition-level differences were observed (see Figure 7), but these did not reach statistical significance. One potential reason for the null results lies in the data quality: the current deviation metric is calculated based on the instantaneous spatial relation between pedestrians and approaching vehicles. In our simulation, vehicle position data were updated at a resolution of 4-meter intervals, which introduced temporal imprecision and resulted in sparse or missing current deviation estimates in many trials, especially in ASD group where we already have limited number of participants.

Figure 7: Current deviation trajectories aligned to the pedestrian’s midpoint crossing time. The y-axis indicates current deviation (in seconds)—the temporal offset between the pedestrian’s projected crossing time (if they maintained their current speed) and the ideal physical midpoint of the inter-vehicle gap. A current deviation of 0 means the pedestrian was on track to cross precisely at the midpoint of the physical gap. Negative values reflect delayed timing (crossing would occur after the optimal midpoint), while positive values reflect anticipatory timing (crossing before the midpoint). The x-axis represents a 5-second countdown leading up to the actual midpoint crossing, allowing observation of real-time adjustments. Mean trajectories for each group and condition are shown as solid lines, with shaded ribbons denoting ±1 standard deviation. This visualization captures participants’ anticipatory or delayed adjustments to their speed as vehicles approached, reflecting their ability to dynamically align with moving gaps.

To circumvent this limitation, we further analyzed pedestrian crossing speed, which does not depend on vehicle position data and thus offers a more continuous behavioral time course.

3.1.5 Pedestrian Crossing Speed:

The pedestrian crossing speed could be interpreted as the slope factor in Figure 7. A mixed-design ANOVA on pedestrian speed revealed a significant main effect of Condition, F(2, 94) = 3.43, p = .036, ηp2=0.068, suggesting that, despite the limitations in current deviation measures, time-resolved modulations of movement speed captured condition-dependent strategies in crossing decisions. follow-up pairwise comparisons did not yield statistically significant differences. This discrepancy may be attributed to the violation of the sphericity assumption (as indicated by Mauchly’s test), which required corrections (e.g., Greenhouse-Geisser), thereby reducing the sensitivity of subsequent post-hoc tests.

3.1.6 Temporal Error:

The temporal error represented the deviation from the ideal mid-gap. For our 3-second gaps, the ideal mid-gap would be at 1.5-second. A mixed-design ANOVA revealed significant main effects of Condition (F(2,96) = 19.323, p < .001, ηp2 = .287) and of Group (F(1,48) = 6.226, p = .016, ηp2 = .115). Figure 8 showed that ASD participants crossed the mid-point significantly later than NT participants. The result was congruent with our previous finding on Crossing Start Time stating that ASD participants started later their crossing.

Figure 8: Temporal deviation from ideal mid-gap crossing time by group and condition. The y-axis indicates the temporal deviation (in seconds) between the actual pedestrian crossing time and the ideal temporal midpoint of a 3-second inter-group gap (i.e., 1.5 s after the gap-opening vehicle and 1.5 s before the gap-closing vehicle). A value of 0 represents a crossing precisely at the temporal mid-gap. Positive values reflect delayed crossings occurring in the second half of the available gap, whereas negative values indicate early crossings occurring before the temporal midpoint. Boxplots summarize the distribution of deviations for each condition and group, with error bars showing standard error.

3.2 Eye-Tracking and Pupillometry

3.2.1 Gaze Preference:

Figure 9: Standardized Gaze Projection onto the Gap Vector. This figure depicts the standardized projection of participants’ gaze positions onto a directional vector defined by two vehicles: the gap-opening vehicle (V3) and the gap-closing vehicle (V1). The projection translates each gaze point into a scalar value along this vector, indicating where attention was spatially oriented during the decision phase leading up to road crossing. To facilitate interpretation, the projection values were segmented into three meaningful ranges: [-0.25, 0.25]: gaze concentrated near the gap-opening vehicle (V3), [0.25, 0.75]: gaze oriented toward the inter-group gap — the physical space potentially available for crossing, [0.75, 1.25]: gaze concentrated near the gap-closing vehicle (V1). The x-axis represents time (in seconds) counting backward from the pedestrian reaching the road center (t = 0). Each dot represents the group-level mean projection value within a time bin; error ribbons indicate the standard error. The ASD group is shown as purple dots overlaid on red ribbons, while the NT group is represented by orange dots with teal ribbons.

Since the gaze data were sparse, we used a mixed linear model on gaze projections. With Subject as the random effect, Group, Condition, and Time as fixed-effect variables, we found a main effect of Condition (F(2,40.33) = 9.297, p < .01) as shown in Figure 10.

Figure 10: Overall Gaze Preference Across Conditions by Group. This figure summarizes the overall spatial distribution of gaze allocation in participants from the ASD and NT groups, aggregated across all experimental conditions and across a 12-second time window preceding the pedestrian’s arrival at the center of the road (t = 0). Gaze projection values were standardized and bounded within the range [−0.25,1.25]; values outside this interval were excluded to focus the analysis on task-relevant visual targets. The solid line represents the mean gaze projection for the ASD group, while the dotted line indicates the mean for the NT group. Error bars show 95% confidence intervals estimated from a linear mixed-effects model. Although visual inspection suggests group-level differences in spatial attention allocation, statistical analysis did not reveal a significant main effect of Group.

3.2.1 Pupil Dilatation:

Pupillometry (see Figure 11) has been widely used to assess cognitive load and emotional arousal, making it a suitable tool to examine how three traffic conditions translated on participants’ workload in different group. 

Figure 11: Pupil dilatation changes across conditions and groups. To account for the physiological delay in pupil dilation responses to cognitive load, we applied a 1000 milliseconds offset when extracting pupil size data relative to the onset of crossing decision-making. A 0.5s baseline sliding window was used for normalization.

Pupil dilation response showed that ASD participants were hypo-sensitive across all conditions, supporting hypo-arousal theory (Zhao, Liu, and Wei 2022). However, under VarPlus conditions, ASD participants showed a significant increase in pupil size, suggesting heightened cognitive demand under uncertainty.

3.3 fNIRS Findings

3.3.1 HbO in Brain ROI:

3.3.2 ERP Waveform:

Distinct activation patterns were observed in frontal cortical areas, aligning with behavioral findings.

Inside the ASD group, the VarPlus condition was associated with decreased activation in the left parietal cortex. This pattern may suggest a reduced demand for social inference or conflict monitoring, potentially reflecting the predictability or perceived cooperativeness of the autonomous agent. In contrast, the VarMinus condition elicited stronger activation in the ASD group, possibly due to increased vigilance or uncertainty.

4. Discussion

Findings highlight the need to consider neurodiverse cognitive strategies in AV-pedestrian communication. Autistic pedestrians showed greater hesitation and sensitivity to speed variability, consistent with predictive coding models of autism. Technology design should avoid relying solely on social cues (e.g., gestures) and incorporate dynamic, explicit, and low-ambiguity signals that better match autistic processing preferences.

This study also raises concerns about the generalizability of AV safety assessments if neurodiverse behaviors are not accounted for. Integrating multimodal physiological signals into AV learning algorithms may improve safety and inclusivity.

limits: statistic power in ASD group. Groups’age not matched. NT participants were pre-selected extremists on smart tool usage.

5. Conclusion

Autistic individuals exhibit distinct behavioral and cognitive patterns in crossing decisions involving AVs. Their challenges do not stem from motor execution but from upstream decision processes tied to sensory processing and arousal. Designing AV systems that accommodate such differences can promote safer, more inclusive mobility ecosystems.

Author contributions

Wenjie Huang: Conceptualization, Methodology, Software, Formal analysis, Investigation, Visualization, Data Curation, Writing – Original Draft.
Emanuelle Reynaud: Conceptualization, Methodology, Validation, Resources, Supervision, Writing – Review & Editing.
Julien Cegarra: Software, Conceptualization, Funding acquisition.
Arnaud Fournel: Software, Formal analysis, Resources, Data Curation.
Vivien Gaujoux: Conceptualization, Methodology, Investigation.
Jordan Navarro: Conceptualization, Supervision, Project administration, Funding acquisition, Writing – Review & Editing.

Declaration of Competing Interest

The authors declare no competing interests.

Funding

This research was supported by the a public funding from Délégation à la sécurité routière (DSR) (Grant no.xxxxx) for Jordan Navarro.

Declaration of Generative AI in the Writing Process

During the preparation of this article, ChatGPT was used to assist with language refinement and formatting. The final content was reviewed and verified by the authors.

Data Availability

Data underlying this article will be made available upon request due to privacy constraints regarding sensitive participant information.

References

Andrew Harrell, W. 1992. “Driver Response to a Disabled Pedestrian Using a Dangerous Crosswalk.” Journal of Environmental Psychology 12 (4): 345–54. https://doi.org/10.1016/S0272-4944(05)80083-0.
Baron-Cohen, Simon, Alan M. Leslie, and Uta Frith. 1985. “Does the Autistic Child Have a Theory of Mind ?” Cognition 21 (1): 37–46. https://doi.org/10.1016/0010-0277(85)90022-8.
Bishop, Haley Johnson, Fred J. Biasini, and Despina Stavrinos. 2017. “Social and Non-Social Hazard Response in Drivers with Autism Spectrum Disorder.” Journal of Autism and Developmental Disorders 47 (4): 905–17. https://doi.org/10.1007/s10803-016-2992-1.
CDC. 2024. “Autism and Developmental Disabilities Monitoring (ADDM) Network.” https://www.cdc.gov/autism/addm-network/index.html.
Chee, Derserri Yan Ting, Hoe Chung Yeung Lee, Ann-Helen Patomella, and Torbjörn Falkmer. 2019. “Investigating the Driving Performance of Drivers with and Without Autism Spectrum Disorders Under Complex Driving Conditions.” Disability and Rehabilitation 41 (1): 18. https://doi.org/10.1080/09638288.2017.1370498.
Chee, Derserri Y., Hoe C. Lee, Ann-Helen Patomella, and Torbjörn Falkmer. 2017. “Driving Behaviour Profile of Drivers with Autism Spectrum Disorder (ASD).” Journal of Autism and Developmental Disorders 47 (9): 2658–70. https://doi.org/10.1007/s10803-017-3178-1.
Curry, Allison E., Kristina B. Metzger, Meghan E. Carey, Emma B. Sartin, Patty Huang, and Benjamin E. Yerys. 2021. “Comparison of Motor Vehicle Crashes, Traffic Violations, and License Suspensions Between Autistic and Non-Autistic Adolescent and Young Adult Drivers.” Journal of the American Academy of Child & Adolescent Psychiatry 60 (7): 913–23. https://doi.org/10.1016/j.jaac.2021.01.001.
Dennett, D. C. 1971. “Intentional Systems.” The Journal of Philosophy 68 (4): 87–106. https://doi.org/10.2307/2025382.
Diagnostic and Statistical Manual of Mental Disorders: DSM-5. 2013. 5th ed. Washington: American psychiatric association.
Diehl, Joshua J., Lauren M. Schmitt, Michael Villano, and Charles R. Crowell. 2012. “The Clinical Use of Robots for Individuals with Autism Spectrum Disorders: A Critical Review.” Research in Autism Spectrum Disorders 6 (1): 249–62. https://doi.org/10.1016/j.rasd.2011.05.006.
Driggs-Campbell, Katherine, and Ruzena Bajcsy. 2016. “2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).” In, 3042–47. Daejeon, South Korea: IEEE. https://doi.org/10.1109/IROS.2016.7759471.
Dubois-Sage, Marion, Baptiste Jacquet, Frank Jamet, and Jean Baratgin. 2024. “People with Autism Spectrum Disorder Could Interact More Easily with a Robot Than with a Human: Reasons and Limits.” Behavioral Sciences 2 (2): 131. https://doi.org/10.3390/bs14020131.
Earl, Robyn, Torbjorn Falkmer, Sonya Girdler, Joakim Dahlman, Anette Rehnberg, and Marita Falkmer. 2016. “Visual Search Strategies of Pedestrians with and Without Visual and Cognitive Impairments in a Shared Zone: A Proof of Concept Study.” Land Use Policy 57 (November): 327–34. https://doi.org/10.1016/j.landusepol.2016.06.009.
Earl, Robyn, Torbjorn Falkmer, Sonya Girdler, Susan L. Morris, and Marita Falkmer. 2018. “Viewpoints of pedestrians with and without cognitive impairment on shared zones and zebra crossings.” PloS One 13 (9): e0203765. https://doi.org/10.1371/journal.pone.0203765.
Ekblad, Leif. 2013. “Autism, Personality, and Human Diversity: Defining Neurodiversity in an Iterative Process Using Aspie Quiz.” SAGE Open 3 (3): 2158244013497722. https://doi.org/10.1177/2158244013497722.
Fishburn, Frank A., Ruth S. Ludlum, Chandan J. Vaidya, and Andrei V. Medvedev. 2019. “Temporal Derivative Distribution Repair (TDDR): A Motion Correction Method for fNIRS.” NeuroImage 184 (January): 171–79. https://doi.org/10.1016/j.neuroimage.2018.09.025.
Frith, U. 2001. “Mind blindness and the brain in autism.” Neuron 32 (6): 969–79. https://doi.org/10.1016/s0896-6273(01)00552-9.
Global Status Report on Road Safety 2023. 2023. 1st ed. Geneva: World Health Organization.
“Going Our Own Way: Public Transit Accessibility for Neurodivergent People.” 2023. https://rootedinrights.org/going-our-own-way-public-transit-accessibility-for-neurodivergent-people/.
Guth, David, Daniel Ashmead, Richard Long, Robert Wall, and Paul Ponchillia. 2005. “Blind and Sighted Pedestrians’ Judgments of Gaps in Traffic at Roundabouts.” Human Factors 47 (2): 314–31. https://doi.org/10.1518/0018720054679533.
Hamilton-Baillie, Ben. 2004. “Urban Design: Why Don’t We Do It in the Road? Modifying Traffic Behavior Through Legible Urban Design.” Journal of Urban Technology 11 (1): 43–62. https://doi.org/10.1080/1063073042000341970.
Kocsis, L, P Herman, and A Eke. 2006. “The Modified BeerLambert Law Revisited.” Physics in Medicine and Biology 51 (5): N91–98. https://doi.org/10.1088/0031-9155/51/5/N02.
“LabStreamingLayers Documentation Labstreaminglayer 1.13 Documentation.” n.d. https://labstreaminglayer.readthedocs.io/.
Liu, Tao, Xingchen Liu, Li Yi, Chaozhe Zhu, Patrick S. Markey, and Matthew Pelowski. 2019. “Assessing Autism at Its Social and Developmental Roots: A Review of Autism Spectrum Disorder Studies Using Functional Near-Infrared Spectroscopy.” NeuroImage 185 (January): 955–67. https://doi.org/10.1016/j.neuroimage.2017.09.044.
Lombardo, Michael V., Bhismadev Chakrabarti, Edward T. Bullmore, and Simon Baron-Cohen. 2011. “Specialization of Right Temporo-Parietal Junction for Mentalizing and Its Relation to Social Impairments in Autism.” NeuroImage 56 (3): 1832–38. https://doi.org/10.1016/j.neuroimage.2011.02.067.
Lord, Catherine, Mayada Elsabbagh, Gillian Baird, and Jeremy Veenstra-Vanderweele. 2018. “Autism Spectrum Disorder.” The Lancet 392 (10146): 508–20. https://doi.org/10.1016/S0140-6736(18)31129-2.
Mahadevan, Karthik, Sowmya Somanath, and Ehud Sharlin. 2018. “CHI ’18: CHI Conference on Human Factors in Computing Systems.” In, 1–12. Montreal QC Canada: ACM. https://doi.org/10.1145/3173574.3174003.
Navarro, J., F. Osiurak, S. Ha, G. Communay, E. Ferrier-Barbut, A. Coatrine, V. Gaujoux, et al. 2022. “Development of the Smart Tools Proneness Questionnaire (STP-q): An Instrument to Assess the Individual Propensity to Use Smart Tools.” Ergonomics 65 (12): 1639–58. https://doi.org/10.1080/00140139.2022.2048895.
Nijhof, Annabel D., Lara Bardi, Marcel Brass, and Jan R. Wiersema. 2018. “Brain Activity for Spontaneous and Explicit Mentalizing in Adults with Autism Spectrum Disorder: An fMRI Study.” NeuroImage: Clinical 18 (January): 475–84. https://doi.org/10.1016/j.nicl.2018.02.016.
Organization, World Health. 2019. Global Status Report on Road Safety 2018. World Health Organization.
Pantelis, Peter C., Chris L. Baker, Steven A. Cholewiak, Kevin Sanik, Ari Weinstein, Chia-Chien Wu, Joshua B. Tenenbaum, and Jacob Feldman. 2014. “Inferring the Intentional States of Autonomous Virtual Agents.” Cognition 130 (3): 360–79. https://doi.org/10.1016/j.cognition.2013.11.011.
Pantelis, Peter, Steven Cholewiak, Paul Ringstad, Kevin Sanik, Ari Weinstein, Chia-Chien Wu, and Jacob Feldman. 2011. “Perception of Intentions and Mental States in Autonomous Virtual Agents.” Journal of Vision 11 (11): 733. https://doi.org/10.1167/11.11.733.
Rezwana, Saki, and Nicholas Lownes. 2024. “Interactions and Behaviors of Pedestrians with Autonomous Vehicles: A Synthesis.” Future Transportation 4 (3): 722–45. https://doi.org/10.3390/futuretransp4030034.
Saxe, R., and N. Kanwisher. 2003. “People thinking about thinking people. The role of the temporo-parietal junction in "theory of mind".” NeuroImage 19 (4): 1835–42. https://doi.org/10.1016/s1053-8119(03)00230-1.
Scassellati, Brian, Henny Admoni, and Maja Matarić. 2012. “Robots for Use in Autism Research.” Annual Review of Biomedical Engineering 14 (Volume 14, 2012): 275–94. https://doi.org/10.1146/annurev-bioeng-071811-150036.
Schurz, Matthias, Joaquim Radua, Markus Aichhorn, Fabio Richlan, and Josef Perner. 2014. “Fractionating Theory of Mind: A Meta-Analysis of Functional Brain Imaging Studies.” Neuroscience & Biobehavioral Reviews 42 (May): 9–34. https://doi.org/10.1016/j.neubiorev.2014.01.009.
Schwartz, Naomi, Ron Buliung, Arslan Daniel, and Linda Rothman. 2022. “Disability and Pedestrian Road Traffic Injury: A Scoping Review.” Health & Place 77 (September): 102896. https://doi.org/10.1016/j.healthplace.2022.102896.
Surden, Harry, and Mary-Anne Williams. 2016. “Technological Opacity, Predictability, and Self-Driving Cars.” SSRN Electronic Journal. https://doi.org/10.2139/ssrn.2747491.
Talantseva, Oksana I., Raisa S. Romanova, Ekaterina M. Shurdova, Tatiana A. Dolgorukova, Polina S. Sologub, Olga S. Titova, Daria F. Kleeva, and Elena L. Grigorenko. 2023. “The Global Prevalence of Autism Spectrum Disorder: A Three-Level Meta-Analysis.” Frontiers in Psychiatry 14 (February). https://doi.org/10.3389/fpsyt.2023.1071181.
Van Brummelen, Jessica, Marie O’Brien, Dominique Gruyer, and Homayoun Najjaran. 2018. “Autonomous Vehicle Perception: The Technology of Today and Tomorrow.” Transportation Research Part C: Emerging Technologies 89 (April): 384–406. https://doi.org/10.1016/j.trc.2018.02.012.
Villringer, Arno, and Britton Chance. 1997. “Non-Invasive Optical Spectroscopy and Imaging of Human Brain Function.” Trends in Neurosciences 20 (10): 435–42. https://doi.org/10.1016/S0166-2236(97)01132-6.
Wilmut, Kate, and Catherine Purcell. 2021. “The Nature of the Risk Faced by Pedestrians with Neurodevelopmental Disorders: A Systematic Review.” Accident Analysis & Prevention 149 (January): 105886. https://doi.org/10.1016/j.aap.2020.105886.
Xiang, Huiyun, Motao Zhu, Sara A. Sinclair, Lorann Stallones, J. R. Wilkins, and Gary A. Smith. 2006. “Risk of Vehiclepedestrian and Vehiclebicyclist Collisions Among Children with Disabilities.” Accident Analysis & Prevention 38 (6): 1064–70. https://doi.org/10.1016/j.aap.2006.04.010.
Yarkoni, Tal, Russell A. Poldrack, Thomas E. Nichols, David C. Van Essen, and Tor D. Wager. 2011. “Large-Scale Automated Synthesis of Human Functional Neuroimaging Data.” Nature Methods 8 (8): 665–70. https://doi.org/10.1038/nmeth.1635.
Zach Noonan, T., Pnina Gershon, Josh Domeyer, Bruce Mehler, and Bryan Reimer. 2023. “Kinematic Cues in Driver-Pedestrian Communication to Support Safe Road Crossing.” Accident Analysis & Prevention 192 (November): 107236. https://doi.org/10.1016/j.aap.2023.107236.
Zeidan, Jinan, Eric Fombonne, Julie Scorah, Alaa Ibrahim, Maureen S. Durkin, Shekhar Saxena, Afiqah Yusuf, Andy Shih, and Mayada Elsabbagh. 2022. “Global prevalence of autism: A systematic review update.” Autism Research: Official Journal of the International Society for Autism Research 15 (5): 778–90. https://doi.org/10.1002/aur.2696.
Zhang, Fen, and Herbert Roeyers. 2019. “Exploring Brain Functions in Autism Spectrum Disorder: A Systematic Review on Functional Near-Infrared Spectroscopy (fNIRS) Studies.” International Journal of Psychophysiology 137 (March): 41–53. https://doi.org/10.1016/j.ijpsycho.2019.01.003.
Zhao, Sijia, Yajie Liu, and Kunlin Wei. 2022. “Pupil-Linked Arousal Response Reveals Aberrant Attention Regulation Among Children with Autism Spectrum Disorder.” Journal of Neuroscience 42 (27): 5427–37. https://doi.org/10.1523/JNEUROSCI.0223-22.2022.
Zimeo Morais, Guilherme Augusto, Joana Bisol Balardin, and João Ricardo Sato. 2018. “fNIRS Optodes Location Decider (fOLD): A Toolbox for Probe Arrangement Guided by Brain Regions-of-Interest.” Scientific Reports 8 (1): 3341. https://doi.org/10.1038/s41598-018-21716-z.

Reuse

Citation

BibTeX citation:
@online{huang2025,
  author = {Huang, Wenjie and Gaujoux, Vivien and Fournel, Arnaud and
    Cegarra, Julien and Reynaud, Emanuelle and Navarro, Jordan},
  title = {Crossing {Differently:} {Divergent} {Cognitive} and {Visual}
    {Strategies} in {Autistic} and {Neurotypical} {Pedestrians} {Facing}
    {Autonomous} {Vehicles}},
  volume = {x},
  number = {x},
  date = {2025-05-05},
  doi = {xxxxx},
  langid = {en},
  abstract = {The rise of autonomous vehicles (AVs) necessitates a
    deeper understanding of how diverse populations, particularly those
    with atypical cognitive profiles, perceive and interact with these
    systems. This study examined how autistic and neurotypical (NT)
    adults approached street crossing in a 3D simulated AV environment.
    Behavioral data, eye-tracking, and functional near-infrared
    spectroscopy (fNIRS) were integrated to evaluate decision-making,
    visual attention, and neural activation. Results revealed
    significantly different success rates and crossing strategies, with
    autistic individuals exhibiting delayed decision initiation and
    reduced autonomic arousal as indexed by pupil dilation. Despite
    these differences, crossing duration post-initiation remained
    similar. Neuroimaging data supported these findings, indicating
    distinct cognitive processing during crossing judgments. This study
    underscores the need for inclusive urban mobility design and
    highlights potential avenues for tailoring AV communication to
    accommodate neurodiversity.}
}
For attribution, please cite this work as:
Huang, Wenjie, Vivien Gaujoux, Arnaud Fournel, Julien Cegarra, Emanuelle Reynaud, and Jordan Navarro. 2025. “Crossing Differently: Divergent Cognitive and Visual Strategies in Autistic and Neurotypical Pedestrians Facing Autonomous Vehicles .” Research in Autism. May 5, 2025. https://doi.org/xxxxx.