How to cite: Galiano-Landeira, J. (2024). Molyneux’s answer: Situated predictive processing. Philosophy and the Mind Sciences, 5. https://doi.org/10.33735/phimisci.2024.11600

Abstract

Molyneux’s problem asks whether a person blind from birth, upon gaining sight, could immediately recognize and distinguish objects by sight alone that were previously known only by touch. Historical and contemporary empirical studies have explored this question with inconclusive results due to empirical limitations. More recently, Held and colleagues (2011) found that treated congenitally blind individuals cannot immediately recognize objects previously familiar through touch. Piller and colleagues (2023) further reported the absence of visual illusions in blind and recently visually-restored individuals. Nevertheless, cross-modal mappings gradually develop post-sight restoration. These findings suggest a reluctance of the mind to make cross-modal inferences, aligning with the predictive processing (PP) framework. PP posits that the mind generates top-down predictions about sensory stimuli, updating internal models through prediction errors when expectations are not met. With no prior visual experience, generative models in congenital blind individuals fail to produce accurate predictions. PP’s representational claims have been challenged by 4E cognitivists, who emphasize embodied, embedded, extended, and enactive aspects of cognition. This paper proposes a Situated Predictive Processing (SPP) framework that integrates PP with 4E cognition through the concept of situated mental representations, offering a new perspective on the Molyneux’s problem and emphasizing the role of experience and situatedness in the gradual development of visual-tactile mappings post-sight restoration.

This article is part of a special issue on “Molyneux’s question today”, edited by Gabriele Ferretti and Brian Glenney.

1 Introduction

Molyneux’s problem, a captivating philosophical puzzle, has intrigued thinkers for centuries. First proposed by the Irish scientist and politician William Molyneux (1656-1698) in the 17th century, this intriguing thought experiment raises intricate questions about the nature of sensory perception and cognition. At its core, Molyneux’s problem poses a deceptively simple question: If a person blind from birth were suddenly granted sight, would they be able to visually recognize, name, and distinguish objects that were previously known only through touch?

The implications of this thought experiment extend far beyond its initial formulation, touching upon fundamental aspects of human experience, multi-modal perception and integration, construction of mental representations, among others. Despite centuries of inquiry, a definitive resolution to the Molyneux’s problem has remained elusive, with various interpretations and approaches yielding inconclusive results.

In recent years, however, advances in cognitive science and neuroscience have offered new insights into the mechanisms underlying sensory perception and cognition. One prominent approach is predictive processing (PP), which proposes that the mind operates by generating and updating internal models of the world to anticipate sensory input (Chanes & Barrett, 2020; Clark, 2013, 2015b; Friston, 2005, 2012; Hohwy, 2013). According to this framework, perception is an active, top-down process in which the mind compares predictions which are generated by the internal model with incoming sensory information. The discrepancies between predictions and sensory information generate prediction errors, which serve to update the internal model, thereby enhancing the correspondence between the internal model and the surroundings.

Alongside PP, situated accounts of cognition, collectively known as ‘4E cognition’ (embodied, embedded, extended, and enacted), have gained traction (Chemero, 2013; Clark, 1996, 1999; Clark & Chalmers, 1998; Fusaroli et al., 2014; Gallagher, 2005, 2017; Menary, 2010; Newen et al., 2018; Telakivi, 2023; Varela et al., 2016). These approaches emphasize the role of the environment, body, and action in shaping cognitive processes, offering a comprehensive framework for understanding cognition beyond purely internal mechanisms

This paper aims to explore the potential of Situated Predictive Processing (SPP) as a novel approach to addressing the Molyneux’s problem. SPP posits that traditional PP is embodied in a brain that is both neuroplastic and sparse, where content arises from the dynamic interaction with the body and environment. By integrating the principles of PP with those of 4E cognition through the concept of situated mental representations, this work seeks to establish a framework that synthesizes theoretical insights with the limited empirical evidence on the Molyneux’s problem.

2 Molyneux’s problem

Molyneux’s problem, initially formulated in the context of Locke’s An Essay Concerning Humane Understanding (Locke republished in 1975), asks whether a person blind from birth could distinguish objects by sight alone if granted vision. This question explored the relationship between visual and tactile sensations and the role of experience. Empiricists like Molyneux, Locke, and Berkeley argued that sensorial cross-modal recognition depends on experience, while rationalists such as Leibniz posited that reasoning could allow recognition without prior visual (Bruno & Mandelbaum, 2010; Glenney, 2012). Neither unanimous solution nor interpretation were drawn in the first theoretical approaches to the problem. Empirical approaches began in the 18^th century, notably with Richard Grant and William Cheselden’s cataract surgeries which suggested that patients could not immediately recognize shapes by sight (Cheselden, 1728; Sassen, 2004; Wade, 2020). However, this empirical paradigm was not free of criticism ranging from moderate claims regarding the experimental design (e.g., the eyes did not have enough time to recover after the surgery) to more sceptical ones about the cataracts per se (e.g., cataracts do not cause a complete blindness in many cases) (Glenney, 2011; Wade, 2020).

In the second half of the 19^th century, Meltzoff and Borton (Meltzoff & Borton, 1979) investigated the cross-modal perceptual abilities of one-month-old infants. The researchers introduced the newborns to two different pacifiers, one with nubs and one without, allowing them to explore the objects tactually for ninety seconds through their mouths. Following this initial tactile exploration, the infants were then presented with visual images of both pacifiers. The researchers measured how long the infants spent looking at each image and found that they tended to spend significantly more time examining the pacifier they had previously explored. This finding implied an early cross-modal representational ability between tactile and visual sensorial modalities. However, subsequent studies which tried to replicate this finding yielded contradictory conclusions (Maurer et al., 1999).

Over the last two decades, research on treated congenital blindness has gained attention, similar to earlier works by Grant and Cheselden (Cheselden, 1728; Degenaar & Collins, 1996; Loaiza, 2020; Sassen, 2004; Wade, 2020). Held and colleagues (2011) examined five individuals with congenital blindness, aged between 8 and 17 years, who had either cataracts or corneal opacities that left them only able to perceive light and dark. Following 48 hours of recovery post-treatment, participants were presented with one object from a pair that featured subtle morphological differences, either through visual-visual, tactual-tactual, or cross-modal (tactual-visual) presentations. The findings revealed that participants could not visually recognize objects they had previously explored solely through touch. However, they demonstrated the ability to gradually establish cross-modal mappings shortly after visual restoration.

Since the publication of Held and colleagues’ (2011) empirically negative response to the Molyneux’s problem, some authors have discussed and criticized their approach. Schwenkler (2012; 2013) argued that Held and colleagues (2011) failed to demonstrate whether their negative results were due to a lack of immediate cross-modal shape recognition capabilities or were indicative of a purely visual deficit, suggesting that the brain is capable of integrating sensory information even without prior visual experience. However, distinguishing how the visual system develops in conjunction with or in the absence of cross-modal integration remains a complex empirical challenge, as these processes are reciprocally and intrinsically connected. Connolly (2013) stressed the need for study more refined study designs that accurately capture the perceptual abilities of newly sighted individuals, implying that previous methodologies may have missed critical aspects of sensory integration. Cheng (2015) and Clarke (2016) took an even more critical stance, rejecting Schwenkler’s second proposal. Cheng argued that while the Molyneux’s problem is empirically approachable, it remains elusive due to significant methodological limitations. Together, these discussions highlight the ongoing challenges and suggest potential strategies for exploring sensory modality integration in newly sighted individuals.

While the Molyneux’s problem focuses on visuo-tactile relationship, other cross-modal integrations might provide new insights worth to consider. For instance, studies employing visuo-haptic illusion paradigms have explored cross-modal perception in treated congenital blind individuals. Pant and colleagues (2021) examined the Size-Weight Illusion (SWI), where smaller objects are perceived as heavier than larger ones of the same weight. They found no significant differences between normally sighted and treated congenital blind individuals, suggesting that early-in-life visual disruptions do not impede later cross-modal visuo-haptic integrations necessary for SWI. Similar conclusions were reached by Piller and colleagues (2023), even though they found more variability in the time required for post-sight restoration adaptation.

Studies examining other cross-modal integrations, such as audio-visual or visuo-motor ones, revealed impairments in treated congenital blind individuals, even after decades of sight recovery (Guerreiro et al., 2016b; Putzar et al., 2007, 2010). However, other studies observed a gradual recovery of these abilities (Ostrovsky et al., 2009; Piller et al., 2023). Interestingly, illusions involving the interpretation of two-dimensional perspective cues as three-dimensional depth (e.g., Ponzo and Müller-Lyer illusions) already arise in treated congenital blind individuals within forty-eight hours post-recovery (Gandhi et al., 2015). This phenomenon could suggest either that rapid development of visual processing occurs post-recovery or that certain cognitive processes are innate, surviving early visual deprivation. If the former holds true, it implies that cross-modal integration requires more time than intra-modal development. Contrarily, Putzar and colleagues (2010) reported that treated congenital blind individuals who had recovered sight for decades still performed worse than normally sighted individuals in an orientation face recognition task. Overall, these findings showcase the significant variability in recovery experiences, indicating that the type of sensory capability being restored, whether cross- or intra-modal, plays a significant role in the process.

The historical journey from early to contemporary studies on treated congenital blindness highlights the complexity of sensory perception development and cross-modal integration. A general analysis of these studies indicates the pivotal role of experience, thereby reinforcing the empiricist stance and its negative response to the Molyneux’s problem. However, existing perceptual theories fall short of fully explaining why this experiential foundation is essential for the proper development of cross-modal mappings. In the following sections, a novel situated predictive processing account will be proposed to address this gap.

3 Situated predictive processing

3.1 Classical predictive processing

PP is a theoretical paradigm in computational and cognitive neuroscience that posits the mind constructs generative models of both its surroundings and the body to predict incoming sensory input during cognition, action, and perception (Clark, 2013; Friston, 2005; Hohwy, 2013). This framework has roots in Helmholtz and Kant’s theories of perception (see for a historical review: Clark, 2013; Swanson, 2016), which argued that perception is a process of probabilistic inference where sensory input is combined with prior knowledge (Helmholz, 1867; Kant & Hatfield, 2005). PP operates through a hierarchically organized multilevel bidirectional cascade of top-down and bottom-up signals. Top-down signals, generated by probabilistic generative models, flow downward to be compared with the upward bottom-up signals generated from sensory receptors (Clark, 2013). At each level of this hierarchy, a matching process occurs between top-down predictions (or ‘priors’) and bottom-up sensory inputs (Dempster et al., 1977; Neal & Hinton, 1998). The discrepancy between the two generates ‘prediction error’, which is transmitted upward to update the generative model, helping the system better represent its surroundings and body. The influence of top-down or bottom-up signals depends on their expected precision: imprecise sensory signals increase the influence of priors, while scenarios with less robust priors rely more on sensory information (Hohwy, 2012). This continuous updating allows the system to make more accurate predictions in the foreseeable future when finding a similar scenario or, alternatively, to actively sample the world in ways that reinforce the current generative model.

Nevertheless, the system still needs a mechanism to account for the variability in the distribution of prediction errors when minimizing them. This is addressed through precision (i.e., the inverse of variability) which is used to weight prediction errors and determine their significance in updating the generative model (Hohwy, 2012). For instance, a noisy prediction error should have less impact on the model update than a solid one, as it may not accurately represent the discrepancies between the model and the surroundings. Overall, higher precision, which corresponds to lower uncertainty, assigns greater weight to the prediction error deemed reliable (Clark, 2013; Friston, 2005, 2009, 2010). However, this process is context-dependent, influenced by factors like sensory modalities (for an example, see Bays & Wolpert, 2007). The precision weighting is based on the priors from the generative model concerning the noise in both the surroundings and the system itself, shaping the expectations for the precision of prediction error. In this way, the system tries to represent the variability in the system-world interaction.

PP offers a powerful theoretical framework for understanding how the mind constructs generative models to predict sensory input and iteratively update these models by matching predictions with actual sensory information. PP also provides valuable insights into how the mind represents and interacts with the surroundings and the body, shedding light on fundamental cognitive processes such as perception and action. Nevertheless, while PP places the brain as the spotlight for all these cognitive processes, other contemporary situated approaches challenge this central role by decentralizing cognition, attributing it not only to the mind but also to external factors such as the body, culture, tools, and the environment. In the next section, 4E cognition, an umbrella term for situated accounts of cognition, will be explored and discussed.

3.2 4E cognition

4E cognition is a framework in cognitive science that emphasizes the embodied, embedded, enacted, and extended nature of cognition (Gallagher, 2017; Menary, 2010; Newen et al., 2018; Noë, 2009; Rowlands, 2010). Embodiment refers to the idea that cognitive processes are shaped by the physical body, suggesting that our bodily experiences significantly influence how we think and perceive the world (Chemero, 2013; Clark, 1996, 1999; Clark & Chalmers, 1998; Gallagher, 2005; Varela et al., 1991). Embeddedness refers to the notion that cognition is deeply intertwined with the physical and social environment in which it occurs (Clark, 2013; De Jaegher et al., 2010; De Jaegher & Di Paolo, 2007; Gallagher, 2008; Hutto et al., 2020). Extension posits that cognitive processes can extend beyond the brain and body, incorporating tools, devices, and technologies that enhance or simulate cognitive abilities (Di Paolo, 2009; Di Paolo & Thompson, 2014; Stewart et al., 2014). Finally, enaction emphasizes the idea that cognition is dynamically shaped by our interactions with the environment, where our actions and perceptions are fundamentally linked (Di Paolo & Thompson, 2014; Gallagher, 2017; Gangopadhyay & Kiverstein, 2009; Hutto, 2022; Hutto & Myin, 2013).

The 4E cognition framework acknowledges that cognition is not something that only occurs within the brain (i.e., internalist accounts), but it is also deeply intertwined with our bodily experiences, the environments in which we live and act, and the tools and technologies that we use (i.e., externalist accounts), leading to a dynamic coupling between the brain-body-world as an autonomous, self-regulating system (Carney, 2020).

Within the 4E cognition framework, strong positions assert that cognition is fundamentally constituted by these external factors. Proponents of strong 4E cognition emphasize that cognitive processes are not merely influenced by these elements; instead, they are intrinsically linked to them, suggesting that our understanding of the mind must encompass the entire system of brain, body, and environment as a unified, self-regulating entity without the brain being on the spotlight (Gallagher, 2005, 2017; Hutto, 2022; Hutto & Myin, 2013; Noë, 2009). For instance, an individual’s ability to perceive and interact with their environment should be considered as a dynamic interplay between the brain, body, and surroundings as a whole cognitive entity. Even though the role of the mind in cognition slightly differs between 4E cognition and PP, both emphasize the enactive feature of cognitive processes, in which action and perception are two sides of the same coin. However, 4E cognition and PP also differ in the use of mental representations. In the following section, mental representations are going to be defined and characterized, before exploring the so-called ‘representations war’ between 4E cognition and PP. A treaty peace between the two is going to be proposed leading to an account of SPP.

Additionally, strong 4E advocates contend that this approach leads to a more comprehensive understanding of cognition than traditional models that rely solely on internal mental representations. They argue that cognitive processes cannot be disentangled from the physical and social environments that shape them, thus calling for a re-evaluation of how we conceptualize mental representations in cognitive science. By recognizing that cognition is deeply situated, strong 4E cognition challenges the notion of a purely internal mind and suggests that understanding cognitive processes requires a broader and more integrative perspective that encompasses all dimensions of human experience.

While strong 4E proponents advocate for a model of cognition that emphasizes the importance of other factors beyond the brain, PP can also align with these views by acknowledging the situatedness of cognitive processes (Clark, 2022; Nave, 2025; Ohata & Tani, 2020). However, PP maintains that mental representations play a crucial role in the way we construct and update our understanding of our surroundings. To bridge the gap between these perspectives, it is essential to invoke the concept of situated mental representations, which recognize the interplay between internal models and the dynamic interactions with our surroundings. This approach can lead to the establishment of an SPP framework that integrates the strengths of both 4E cognition and PP. Before delving into the concept of situated mental representations and how they might reconcile them, it is essential to first explore what is a ‘mental representation’.

3.3 Mental representations

In contemporary cognitive science, mental representations are generally understood as the building blocks of cognition. They serve as internal models of the world, allowing us to perceive, think, and act. These mental entities that ‘contentfully’ stand in for objects, events, properties, and relations of the environment, forming the foundation for cognitive processes such as perception, memory, attention, language, and reasoning (Bermúdez, 2010; Von Eckardt, 2012). The phrase’‘contentfully’ stand in for’ is key here, as it highlights the representational nature of these mental states, i.e., acting as substitutes for things in the world (see Vilarroya, 2017) for a debate that encompasses the concept up to neural representations). However, despite its importance, the concept of mental representation remains elusive as there is no widespread agreement of the implications for one thing to represent another one (Roth, 2010). Different fields, such as cognitive neuroscience and philosophy of mind, often diverge in their views due to varying ontological and epistemological foundations. Although definitions vary (Ramsey, 2007; Roth, 2010; Vilarroya, 2017), it is generally accepted that they are mental objects with semantic properties and that they ‘stand in for’ something else (Ramsey, 2016; Vilarroya, 2017).

This lack of a clear and widely accepted definition partly arises from the functional ambiguity of mental representations, which is why they are often referred to as a ‘cluster concept’ (i.e., a concept that encompasses several different properties) (Cummins, 1995; Ramsey, 2007). Ramsey, in his book ‘Representations reconsidered’ (Ramsey, 2007), elegantly illustrates this debate. However, before delving into this complex debate, it is important to briefly define two basic features and identify the main views on mental representations up to now.

Mental representations involve intentionality, i.e., they have intrinsic meaning or are about something, in contrast to other types of representations (e.g., traffic symbols) whose their meaning derives from the mental states of an agent interpreting them (Egan, 2014; Ramsey, 2007; Von Eckardt, 2012; Williams, 2018). As will be discussed later, the state-of-the-art accepts that the meaning is intrinsic to the mental representation per se, although some authors argue that an interpreter (i.e., an agent using and even creating that representation) is necessary to ascribe meaning. In addition, mental representations may possess causal properties (Egan, 2014; Ramsey, 2007), as exemplified in belief-like representations. For instance, if an agent holds a belief-like mental representation that counting to ten will help them calm down when frustrated, this belief-like mental representation will cause them to count to ten in frustrating situations. Dretske (1997) and Ramsey (2007) stated that their causality is in virtue of their content, even though their content is causally inert.

The intimate connection between intentionality and causality has led to the claim that mental representation is a functional notion (Haugeland, 1991; Pierce, 1931; Ramsey, 2007). This implies that if a cognitive account employs mental representations within its theoretical framework, then it must clearly specify the functional role they play and avoid any type of redundancy (Ramsey, 2007). However, understanding how a mental representation serves as a representation is, as Ramsey puts it, “different from the account of the conditions responsible for its representational content” (Ramsey, 2007, p. 30). While the content is crucial to the representation’s function, it does not fully define or reduce it. The search for the specific role of mental representations within a theoretical cognitive framework is what Ramsey termed the ‘job description challenge’ (Ramsey, 2007). This challenge seeks for an explanatory benefit in describing the internal parts of a system in representational terms, i.e., that their role as representations must not be redundant.

Mental representations have been employed across distinct theoretical frameworks, most notably the classical computational theory of cognition (CCTC) and the connectionist framework. CCTC posits that cognition is grounded in inner computations¹, while the connectionist framework emphasizes the connections within neural networks as the basis for cognitive processes. Mainly within the CCTC framework, but in other novel cognitive frameworks as well, two general types of mental representations have received considerable attention: the Input-Output representations (IO-representations) (Cummins 1991) and the Structural/Simulation representations (S-representations) (Cummins, 1991; see Lee & Calder, 2023 for a recent and elegant review; Swoyer, 1991). The former describe mental representations as the inputs and outputs of either a computational process, according to the CCTC, or a neural network, according to the connectionists (Cummins, 1991; Ramsey, 2007). The latter describe mental representations as sharing structural isomorphism with their target and are exploited for this resemblance. In S-representations, the pattern of relations between the parts of the target is reflected in the representation itself (Cummins, 1991). According to Ramsey (2007), IO-representations are necessary for the function of sub-systems, while S-representations allow the system to exploit the structural similarity between the representations and the target for cognitive purposes. However, some authors pointed out that the distinction between the S- and IO-representations seems blurrier than initially expected, arguing that both types of representations share functional similarities and are more interconnected than initially believed (Facchin, 2021b; Morgan, 2014; Nirshberg & Shapiro, 2021; Shagrir, 2012; Sprevak, 2011). Overall, they proposed that both types involve mapping relationships between inputs and outputs or rely on structural correspondence with the world. Recently, Facchin (2024) suggested that S-representations need to be reclassified as the traditional notion designate a large variety of different and distinct types of representations, which could explain this blurriness. Whether viewed as fundamentally distinct or not, S- and IO-representations play a functional role in CCTC and meet the job description challenge.

On the contrary, other types of mental representations have not successfully met the job description challenge: the receptor/detector-representations (r/d-representations) (Lettvin et al., 1959; and Hubel & Wiesel, 1962; Hubel & Wiesel, 1968 for empirical background) and tacit-representations. R/d-representations refer to the network of neurons responsible for detecting a specific stimulus, leading to the assumption that they carry/represent the information of those stimuli. However, Ramsey (2007) argued that r/d-representations can be explained purely in causal-physical terms without invoking representational terms. For instance, when a network of neurons ‘x’ is activated by the sight of a red apple through a cascade of neural communication from the retina to the visual cortex, it communicates with other neural networks that organize the response, such as grasping and eating the apple. As Ramsey pointed out, this entire process can be understood without the need for representational explanations. On the other hand, tacit-representations apply to neural networks not because they are triggered by a stimulus, but because the entire network encodes the information through its connections (Rumelhart et al., 1986a, 1986b). Even though this might seem familiar to S-representations, tacit-representations do not share structural isomorphism with their target; rather, the representational information is distributed across the network’s connections. Ramsey (2007) argued that these tacit-representations display dispositional properties and constrain the capabilities of the neural network. However, this does not necessarily mean that they require a representational status. Ramsey adverted that accepting tacit-representations as mental representations would imply that anything with a disposition for something could be considered representational (e.g., “[r]ocks are now representational, since, after all, even a rock (in this sense)”knows how” [it has the disposition] to roll down a hill” (Ramsey, 2007, pp. 170–171) (italics are added to emphasize).

Despite their central role in cognition and the fact that their concept can be historically traced back to the philosophies of Aristotle and Aquinas, there is still no widespread agreement on the precise definition of mental representations. Most researchers adopt a working definition that describes mental representations as mental objects that stand in for something else. The majority agree that intentionality and causality are two crucial features of mental representations, often emphasizing a strong connection between the two. Given the importance of their functional role in cognitive frameworks, Ramsey (2007) proposed the job description challenge in order to evaluate whether the notion of mental representations was either redundant or significant within a cognitive framework. Ramsey defended that IO- and S-representations successfully meet the challenge, while r/d- and tacit-representations fall short. However, the ongoing debate over their necessity in cognition has fuelled the so-called ‘representation wars’, where proponents of two leading contemporary cognitive theories, i.e., PP and 4E cognition, dispute the necessity of mental representations for a well-functioning cognitive framework. In the following subsection, representation wars are going to be discussed, outlining the key arguments from both sides of this scaramouche.

3.3.1 Mental representations in PP

The discussion of mental representations in PP was initiated with Clark’s (2013, 2015b) characterization of the PP framework. According to him, mental representations in PP are probabilistic and action-oriented mirrors of the world. These mental representations enable organisms to engage successfully with their environment by minimizing prediction error, a key aspect of PP that ultimately supports survival and autopoiesis. The multi-level probabilistic generative models, in which mental representations play a central role, guide perception and action (Clark, 2013, 2015b; Williams, 2018). According to Clark, these mental representations carry abstract content, serving as causal-loops designed to predict states of the world through their representational properties. Gładziejewski (2016) advanced this view by proposing that the representations in PP align with prototypical S-representations (Cummins, 1991; Ramsey, 2007; Swoyer, 1991). As Gładziejewski put it, “cognitive systems navigate their actions through the use of a sort of causal-probabilistic”maps” of the world” (Gładziejewski, 2016, p. 569). The structural similarity of S-representations within PP can be understood as the brain implementing Bayesian networks (Pearl, 2000), “whose structure resembles the causal-probabilistic structure of our system’s environment” (Gładziejewski, 2016, p. 571; Wiese, 2017; Williams, 2018). These networks also align with the causal loops within PP’s hierarchical predictive structure. However, Gładziejewski (2016) and Williams (2018) clarified that these maps and generative models do not function identically but share key features (e.g., action-guiding through active inference (i.e., active inference Brown et al., 2011), detached/decoupled, unable the detection of representation errors) which help them meet the job description challenge (Ramsey, 2007). Prediction errors play a crucial role by constraining the structural mapping between the hierarchical generative model and the causal-probabilistic structure of the world (Clark, 2012; Gładziejewski, 2016; Hohwy, 2013, 2016). Regarding the feature of detachment/decouplement, Gładziejewski admitted that this is an open debate. Even though he was prone to claim that representational posits in PP work in a completely detached manner, this question is unresolved. Detachment, however, is a pivotal feature in the representation wars, as situated accounts of cognition argue that cognition involves continuous interaction between the mind and external factors. Thus, the role of detachment in PP warrants further exploration in this ongoing debate.

Wiese (2017) took a step further Gładziejewski’s proposal regarding PP’s mental representations by delving into their content. Wiese distinguished between two types of content, following Egan’s (2014) framework: (i) cognitive content and (ii) mathematical content. While the latter refers to the computational system that performs a task, the former refers to the content relative to the context and cannot be derived from the computational aspects. As proposed by Gładziejewski (2016), the structure of models in PP is composed of three elements: likelihoods, dynamic relations, and prior probabilities. Building on this, Wiese (2017) argued that each of these elements are the mathematical content of the mental representations, while the functional relations between the variables at different hierarchical levels account for the cognitive content. In terms of Ramsey’s (2007) classification of representations, Wiese’s approach suggests a combination of S-representations, which carry the cognitive content, and IO-representations, which carry the mathematical content. This combination can be viewed as part of a gradual representationalism framework, as the one proposed by Toribio and Clark (1994), which suggests that representations may vary along a continuum depending on their function and content. This gradualism was further developed by Rutar and colleagues (2022), who identified two gradual features of S-representations: structural similarity and decoupling. Nevertheless, in light of the discussions around the functional similarity between S- and IO-representations (Facchin, 2024; Shagrir, 2012; Sprevak, 2011), if one accepts that these two types of representations are functionally equivalent, it follows that there should not be any distinction between the carriers of mathematical and cognitive content. This means that Wiese’s account requires a slight reformulation, acknowledging that both mathematical and cognitive content refer to different functional aspects or ways of usage of the same mental representations, in line with Sprevak’s (2011) critique.

Both Wiese (2017) and Williams (2018), building on Gładziejewski’s (2016) work, argued that the representation of causal-probabilistic dependencies among variables in the surroundings forms a dynamical model of both the body and its environment. Williams (2018) claimed that the content of mental representations in PP is organism-relative, constructing a model of the world from the perspective of a self-organising entity, shaped by its body’s physiological needs.

More recently, Rutar and colleagues (2022) suggested the idea of gradation in representational features within PP, expanding on the earlier work by Toribio and Clark (1994). Since PP’s mental representations behave similarly to S-representations (Gładziejewski, 2016), Rutar proposed that gradation should be assessed in terms of two key aspects: structural similarity and decoupling. Both features can be consequently gradually decomposed as follows: structural similarity can be broken down into the number of preserved relations (i.e., relations between the parts of the representation) and space granularity (i.e., the information carried besides the relations of the parts). Decoupling, on the other hand, can be understood through the hierarchical level (i.e., from higher to lower levels that are proximal to sensory information) and the precision weighting of prediction error (i.e., adapting the accuracy of the representation).

Anderson and Chemero (2013), Orlandi (2016) and later Downey (2018), van Es (2020) and Facchin (2021a, 2021b), all challenged the representationalist stance in PP. Anderson and Chemero (2013) argued that since the bottom-up and top-down signals central to PP can be interpreted in non-representational terms, there is no need for a representational theoretical framework. Similarly but a step further, Orlandi (2016) and van Es (2020) proposed that the causal loops are better understood as covariations/correlations between two proximal levels in the hierarchy. The same would happen to priors and likelihoods. This view would align PP’s posits with r/d-representations, which fail to meet the job description challenge (as discussed previously in the #3.3. mental representations section). Facchin (2021a, 2021b) adopted a similar position, explicitly rejecting Gładziejewski ‘s claim that mental representations in PP behave as prototypical S-representations. Focusing on sensorimotor contingencies (i.e., refer to the regular ways in which sensory inputs change in response to an agent’s movements), Facchin argued that the role of generative models in PP is primarily to guide an agent’s interactions with the world, rather than to construct internal models that merely represent it. Thus, the processes in PP can be better understood by focusing on the enacted and embodied nature of cognitive processes. Downey (2018) introduced a fictionalist perspective on representationalism in PP. He proposed that although PP entails mental representations as theoretical posits, they play an explanatory role, meeting the job description challenge, without needing to metaphysically exist. Downey claimed that this fictionalist approach could resolve the representation wars. Downey’s argumentation is based on Orlandi’s work (2016), presented above, and Ramsey (2017) refusal of the necessity of cognition to be representationalist. However, this eliminativist -fictionalist perspective presents a contradiction: if mental representations do not ontologically exist, they cannot exert causal powers, thereby failing to meet the job description challenge. Nonetheless, it can be agreed that this fictionalist discourse is a ’weak’ eliminativist position, serving as a transitional framework towards potentially non-representationalist PP accounts.

So far, the consensus is that PP posits mental representations within a multi-level hierarchical generative model that guides both perception and action (Clark, 2013, 2015b). These mental representations are thought to represent the causal relations in worldly states through causal loops (Gładziejewski, 2016). Some authors argued that they meet the job description challenge proposed by Ramsey (2007) because they resemble prototypical S-representations (Gładziejewski, 2016). Wiese (2017) further suggested that the content of these mental representations can be divided into cognitive and mathematical components, while Rutar and colleagues (2022) emphasized the importance of gradation in PP’s representationalism. However, critics like Orlandi (2016), Downey (2018), and van Es (2020) argued against representational posits of PP by claiming that they fail to meet the job description challenge, as they seem to act more like r/d-representations. Despite this, Downey denoted that they still have an explanatory role within PP, proposing a fictionalist perspective. Van Es, meanwhile, leaned toward non-representationalism. In the following section, non-representationalist perspectives advocated by proponents of 4E cognition will be explored.

3.3.2 Representation wars: 4E cognition’s non-representationalism

Ecological psychology, founded by Gibson (2015), along with more recent contributions from Favela (2023), has argued that a paradigm shift is underway in the cognitive sciences, shifting away from a representation-centred framework. Many early proponents of embodied, embedded, extended, and enacted accounts of cognition also advocated for the elimination of mental representations from cognitive accounts (Chemero, 2013; Hutto & Myin, 2013; Shapiro, 2011; Varela et al., 1991). The general upshot is that the body and world themselves serve as representations external to the mind, eliminating the need for internal mental representations (Hutto & Myin, 2013; O’Regan & Noë, 2001). In contrast to representation-centred paradigms, which posit that perception and cognition aim to build objective models of the world in an observer-independent manner (Anderson, 2017; Engel et al., 2016), action-oriented frameworks like 4E cognition advocate for a performative understanding of the mind (Anderson, 2014). Thus, 4E cognition assigns the brain the role of a control system that governs the organism’s interactions with the world, rather than creating internal models of it (Anderson, 2017; Chemero, 2013; Cisek, 1999).

At first glance, the positions of PP representationalists and 4E cognition non-representationalists seem irreconcilable, given that they place the focus of cognition on opposing extremes (for an elegant overview of the debate see Başoğlu, 2021). But is this divide truly irremediable? Clark claimed that peace could be reached if PP was understood under situated features. As Clark put it,

“Dynamically speaking, the whole embodied, active system here self-organizes around the organismically-computable quantity”prediction error”. […] Is this an inner economy bloated with representations, detached from the world? Not at all. This is an inner economy geared for action, whose inner states bear contents in virtue of the way they lock embodied agents onto properties and features of their worlds. But it is simultaneously a structured economy built of nested system, whose communal project is both to model and engage the (organism-relative) world” (Clark, 2015a, p. 6)

In order to establish a situated PP framework, two key elements are necessary: (i) mental representations, as posited within PP frameworks, and (ii) an understanding of cognition that encompasses both internalist and externalist perspectives. This work proposes the concept of situated mental representations as a potential reconciliation between PP and 4E cognition. In the following sections, the notion of situated mental representations will be explored, drawing on recent work by Piccinini (2022), before discussing their explanatory potential to bridge the gap PP and 4E cognition.

3.3.3 Situated mental representations: treaty peace

Situated mental representations began to take shape during the debate between Dokic and Recanati (Dokic, 2007). Dokic argued that certain authors had implicitly tied the notion of ‘situation’ to mental representations, suggesting that their truth conditions could be influenced by context. Dokic emphasized the importance of ad hoc or temporary/occasional concepts, defined as transient constructions held in working memory (Dokic, 2007, p. 205). Dokic set up an example where the concept of ‘dog’ arises distinct mental representations depending on the context (e.g., the mental representation will be different if you are in the Parc Ciutadella or in the Arctic tundra). In this view, a ‘situation’ encompasses the various factors that make mental representation capable of expressing an absolute proposition. Thus, the situation is comprised of, as Dokic put it, “relational facts between a representation and its propositional constituents” (Dokic, 2007, p. 215). However, Dokic’s account seemed to lean towards understanding situational factors as primarily cognitive, a point that Recanati contested (Dokic, 2007, p. 218).

Both Clark (1996) and Miłkowski (2017) examined the possibility of cognition being both representational and situated. Miłkowski (2017) argued that representational computational mechanisms should be understood as embedded within larger mechanisms that dynamically process feedback from the environment. While both authors concurred that cognition should encompass both representational and situated elements, they did not develop a specific cognitive framework to encapsulate this duality.

Recently, Newen and Vosgerau (2020) claimed that mental representations must be understood as “non-static, use-dependent, and situated relative to a certain behaviour or cognitive ability” (Newen & Vosgerau, 2020, p. 2). The functional roles of the mental representations, they proposed, are realized through mechanistic relations that extend beyond the neural level to bodily and even social levels. Situated mental representations are use-dependent, meaning that their content is intimately tied to the purpose for which the representation is employed. Their situatedness directly applies to the fact that the vehicle of the representation can be a combination of neural and bodily states, with the content varying depending on the explanatory level, suggesting a form of gradation (Clark & Toribio, 1994). Through this, Newen and Vosgerau (2020) constructed the first comprehensive framework for situated mental representations.

Finally, Piccinini (2022) proposed a framework for situated mental representations, explaining how situatedness solves issues surrounding the content of mental representations. Piccinini’s framework builds on S-representations and informational teleosemantics (i.e., the semantic content of a mental representation comes from the information that they have regarding their function (e.g., Dretske, 1997). Piccinini argued that a representational account of cognition requires situatedness, i.e., it needs to be understood as embodied, embedded, enacted, and with affect. This necessity originates from the dynamic interactions between the nervous system, the body, and the environment, as well as the system’s use of feedback from these interactions to update its models-it is important to denote that his is similar to the principles of PP-. This situated representationalism leads to representations with (i) original (i.e., not derivative) semantic content, (ii) neural (and probably bodily) vehicles that are coordinated with their content, (iii) a causal role aligned with the system’s purposes, (iv) a distal representation of stimuli, (v) the potential to misrepresent.

According to Piccinini, “the vehicles of neural representations and their semantic content are two sides of the same coin. That is, the same functional properties that turn a system of internal states into a neural representation system are also sufficient to give such internal states their semantic content” (Piccinini, 2022, p. 5). Piccinini defined that their content display three causal processes: the learning process that creates the content, the causal process which creates the content (i.e., classical function of representation in terms of stand in for something), and the process guiding the behaviour of the system (i.e., the other classical function). Thus, Piccinini introduced a new causal process related to the creation and updating of representations: learning. The concept of learning is inherently situated, as neurocognitive systems present plasticity, which is a dynamic response of the system to their surroundings by changing their cellular and molecular structures. According to Piccinini, this active learning is key for generating original semantic content for mental representations and requires embodiment (i.e., the system requires a body to receive information from within and outside and establish real-time feedback loops with its surroundings), embeddedness (i.e., the body and environment provide information sources and form part of the feedback loop), enaction (i.e., dynamism is essential as the sensory information changes over time and the body moves), and affect (i.e., affective states directly influence reinforcement learning, which is linked to active).

Even though Piccinini tried to demonstrate the importance of active learning for understanding situated mental representations, a system without active learning should still be able to generate mental representations with original content. However, this work aligns with Piccinini’s claim that to create original content, systems need to be embodied, embedded, extended, and enacted. While affect undoubtedly influences the content, it does not seem to be a necessary requirement. Active learning serves as an excellent example of how content might be updated and showcases the situatedness of mental representations. However, Piccinini’s argument holds even in the absence of active learning.

In summary, frameworks on the situatedness of mental representations are gradually being established (Heras-Escribano & Martı́nez Moreno, 2024), highlighting the importance of situatedness in addressing the problem of content. Piccinini’s work (2022) explicitly references to 4E cognition, while also implicitly aligning with PP, given its focus on feedback loops and active learning through motion and model updates, both central aspects of PP. Therefore, Piccinini’s framework is suitable to address the next challenge: the development of an SPP account.

4 Situated predictive processing

Situated accounts of PP emphasize the importance of the environment, body, and action in shaping and implementing predictions. These approaches represent a middle ground between traditional cognitive models that treat cognition as a largely internal process and the radical situated views, which emphasize the external dynamic nature of cognitive processes. Most of the existing situated accounts focus on a specific aspect of 4E cognition (i.e., embodied, embedded, extended, or enacted).

Even though many 4E cognitivists might challenge the idea that interoceptive PP accounts are inherently embodied, interoception (i.e., the perception of internal bodily states) places the body at the center of cognition, treating it as a dynamic system rather than merely passive vessel (see Khalsa et al., 2018; Petzschner et al., 2021 for general overviews of interoception and predictive processing). Seth and colleagues (Seth et al., 2012; Seth, 2013; Seth & Tsakiris, 2018; Seth & Friston, 2016) proposed a model in which interoceptive prediction error, which underpins the subjective sense of presence, runs in parallel with exteroceptive prediction error, which underpins the sense of agency. According to this model, subjective feeling states, such as emotions, arise from interoceptive inference (i.e., analogous to active inference but in a bodily manner). In this view, emotions are cognitive evaluations of the body’s physiological states. Barrett and collaborators (Barrett, 2016; Barrett & Simmons, 2015; Kleckner et al., 2017) described the neural underpinnings of the so-called ‘Embodied Predictive Interoception Coding’ (EPIC) model, with the term ‘embodied’ explicitly mentioned, which explains an embodied and constructed account of emotions [i.e., similar to Seth’s proposals mentioned above) and is associated with allostasis (i.e., adaptive processes that maintain stability through change (Schulkin & Sterling, 2019)). Allostasis itself has been associated with interoceptive predictive processing by Shulkin and Sterling (2019). Pezzulo and colleagues (2015, 2021) reviewed the role of interoception and homeostatic regulation in active inference. Owens and colleagues (2018) approached interoceptive inference empirically by examining the connection between cardiac interoception and autonomic cardiac control. Other approaches to PP highlight the role of bodily experiences in shaping sensory processing and prediction-making (Apps & Tsakiris, 2014; Seth & Friston, 2016). More recently, Badcock, Friston, and Ramstead (2019) developed a ‘hierarchically mechanistic mind’ with evolutionary systems theory of psychology, which integrates a situated, embodied, Bayesian brain. They defined the brain as the following,

“[A]n embodied, complex adaptive control system that actively minimises the variational free-energy (and, implicitly, the entropy) of (far from equilibrium) phenotypic states via self-fulfilling action-perception cycles [which might be linked to PP], which are mediated by recursive interactions between hierarchically organised (functionally differentiated and differentially integrated) neurocognitive processes.” (Badcock et al., 2019, p. 17) (italics are added to emphasize).

These approaches suggest that sensory inputs are not processed purely in isolation but are instead modulated by internal bodily signals and states, such as interoceptive and proprioceptive signals. In summary, interoceptive PP accounts propose an embodied form of PP in which predictions serve to minimize the energy required to keep allostasis of the organism or to respond effectively to incoming external signals. At the same time, prediction errors update the internal generative model, refining it for future occasions with similar internal or external signals.

Regarding embeddedness, both social knowledge (Brodski et al., 2015; Brodski-Guerniero et al., 2017; Chanes et al., 2018; Draganov et al., 2023; Ramos-Grille et al., 2022) and environmental factors (Constant et al., 2020) have been proposed to shape perception. Kilner and colleagues (2007) proposed that the mirror neuron system, which is involved in action observation and imitation, can be understood through PP principles. Briefly, the mirror neuron system consists of distinct brain regions that are active not only when a subject executes an action but also when observing the action from others, effectively transforming visual information into knowledge or skills (Bonini et al., 2022; Rizzolatti & Craighero, 2004). Kilner and colleagues (2007) suggested that during action observation, this system tries to infer the most likely cause of an action by minimizing prediction errors. In this scenario, the ‘cause’ refers to the intentional mental states that caused/motivated the action. Thus, when observing two actions that are identical in sequence but with distinct intentions, PP allows to distinguish between them by integrating motor information through the mirror neuron system with additional sensory information from other brain modules. Constant and colleagues (2020) formulated an active inference formulation that “views cognitive niche construction as a cognitive function aimed at optimizing organisms’ generative models” (Constant et al., 2020, p. 1), similar to what is normally understood as a mixture of embedded and extended cognition. Cognitive niche construction involves behaviours and knowledge supported by sociocultural practices, playing a critical role in human evolution and cognition. The reciprocal relationship between individual cognitive processes and collective sociocultural practices means that as individuals interact with their cultural surroundings, they update their generative models to better predict and navigate these sociocultural environments. This dynamic mechanism of updating enhances the ability to function effectively within cultural contexts. As individuals grow up within a particular culture, their generative models develop in tandem with the affordances and practices of that culture, creating a reciprocal relationship between the individual’s internal models and the external social environment. This approach highlights how cultural and environmental factors are not separate from cognition but are intricately woven into the very fabric of how we predict, perceive, and interact with the world.

Extended approaches to PP have been left aside until recently, even though some criticisms against these proposals have already been raised (Facchin, 2023; Hohwy, 2016, 2018). Kirchhoff and Kiverstein (2021) defended that an extended PP is feasible, even when incorporating the Markov blanket formalism. They argued that self-evidencing processes, which contribute to maintaining the organizational integrity of the individual over time and, thus, distinguishing it from the environment, are semipermeable. This permeability allows external elements to be integrated when necessary. Kersten (2022) supported this view, proposing that prediction error minimization can be used to frame extended systems as genuine cognitive systems. According to Kersten, extended systems need to engage in prediction error minimization at an algorithmic level in order to be part of the cognitive process. Similarly, Clark (2022) emphasized the importance of distinguishing between the process of ‘recruitment’ and actual cognitive processing. Clark argued for the continuous flow and transformation of information between the cognizant and extended systems, both working to minimize prediction error. For instance, when someone uses glasses to improve vision, the clarity of the received visual signals increase. This alters the prediction of accuracy concerning exteroception, leading the system to update its generative model and modify its priors about the reliability of this sensory modality. Recently, Kersten (2024) expanded on Clark’s proposal by distinguishing between two important senses of recruitment: ready-to-hand and adaptive recruitments, emphasizing the role of temporality in their functioning. More sophisticated approaches involve neuromodulation, which may influence the neural mechanisms of PP, thereby altering behaviour. Draganov and colleagues (2023) demonstrated how socio-affective predictions could be modified using transcranial alternate current stimulation, implying that brain oscillation modulation can transiently alter the generative model.

Enactive approaches to PP emphasize the importance of action in shaping and implementing predictions (Gallagher, 2017). These approaches suggest that motor control functions as an active inference process, where predictions based on proprioceptive signals are fulfilled through peripheral motor reflexes (Adams et al., 2013; Brown et al., 2011; Friston et al., 2011; Millidge et al., 2021). This mechanism enables organisms to adjust their actions based on their prior knowledge and internal models of the world (e.g., a goalkeeper repositioning to save a goal) to enhance the alignment between predictions and incoming sensory signals, thereby minimizing prediction error. Building on this perspective, Seth (2014) described an enacted and embodied account of PP that explains sensorimotor contingencies and perceptual presence (i.e., way in which objects are experienced as whole and present in the environment). This framework also extends to explaining phenomena such as synaesthesia. Facchin (2021a) further argued that this enacted and embodied account of PP aligns more closely with anti-representationalist views, suggesting that PP can operate without relying heavily on internal representations and instead depends on the dynamic brain-body-environment interaction. Ridderinkhof and Brass (2015) proposed a PP framework to explain kinesthetic motor imagery (i.e., cognitive ability that allows an individual to perform and experience motor actions through the mind, without executing such actions in a first-person perspective). The general idea is that this process facilitates the updating of the generative model, improving predictive motor control when actual actions must be executed. Bruineberg and colleagues (2018) diverged from the traditional Helmholtzian perspective of perception by proposing that the generative model in the context of PP is not a merely source of internal representations, but a tool for guiding an organism’s interactions with the environment to maintain a stable brain-body-environment dynamic system. Similarly, Tschantz and colleagues (2020) developed a framework combining goal-oriented and epistemic behaviours through active inference. This approach generates models that balance action-oriented goals with the need for information gathering, in order to create accurate and detailed models that are relevant to specific actions. While both proposals retain a representational aspect, they shift towards a more situated approach, as the representations that they proposed do not simply mirror the causal probabilistic structure of the environment. Instead, they emphasized the enactive coupling between the brain-body-environment. Tschantz and colleagues (2020) highlighted that these representations may be even less veridical than classical representations but are more functionally useful for a system that is actively engaged with the environment.

Finally, the affective feature of situatedness has been more recently proposed (Thompson, 2010) and, consequently, a comprehensive framework is still under development. As mentioned earlier in the embodied PP frameworks, Seth and collaborators (Seth et al., 2012; Seth, 2013; Seth & Tsakiris, 2018; Seth & Friston, 2016) and Barrett and collaborators (Barrett, 2016; Barrett & Simmons, 2015; Kleckner et al., 2017) have suggested that emotions should be understood as evaluations of the physiological states of the body through interoceptive PP. Additionally, Ridderinkhof (2014, 2017) proposed a PP framework for emotional actions, emphasizing that PP allows to understand the impulsive and purposive features of emotional actions as evaluations and fine-tunings of anticipated action effects based on the predicted sensory consequences. Piccinini, in his work on situated mental representations (Piccinini, 2022), referred to the importance of affect in reinforcement learning, which in turn impacts active learning processes. Since active learning is closely linked to PP, it follows that affect can facilitate the updating of the generative model through this learning mechanism.

Overall, situated accounts of PP provide a more comprehensive and nuanced view by emphasizing the importance of embodiment, embeddedness, extension, enaction, and even affect in shaping and implementing predictions and understanding the myriads of suggested functions of prediction errors. However, these accounts face a challenge concerning their reliance on traditional mental representations. Thus, the situated mental representations proposed in the previous section serve to bridge both PP and 4E cognition, as they (i) introduce mental representations which are essential for contemporary PP frameworks, and (ii) require a combined externalist-internalist approach to cognition. This discussion will pave the way for a future proposal of SPP that integrates all aspects of 4E cognition within a general framework, while explaining how situated mental representations do an explanatory work.

4.1 Situated predictive processing as a framework to understand the Molyneux problem

To this point, this work has set up the foundations for an SPP framework which employs situated mental representations in contrast to traditional ones. However, it remains unclear how SPP can elucidate the mechanisms behind the Molyneux’s problem and help interpret the findings from empirical studies conducted in recent years. In essence, Molyneux’s problem deeply questions about the origin of cross-modal mappings: (i) by experience or (ii) innately. This work argues that neither one nor the other but a combination of both, with experience playing a slightly more significant role.

SPP posits that the brain relies on priors from a generative model that represents the surroundings in a situated manner. In the case of a congenitally blind individual presented with two distinct tactile stimuli, the individual would use their generative model to differentiate between them. If they fail to do so, prediction errors would arise from the mismatch between top-down predictions and bottom-up sensory input. These prediction errors would then guide the update of the generative model. According to SPP, these generative models are inherently embodied; they are shaped by the specific characteristics of the individual’s brain and body and are sensitive to changes in bodily states. For instance, if an individual experiences reduced tactile sensitivity due to a peripheral nervous system injury, their generative model, which was built on the assumption of normal sensory function, would generate top-down predictions based on previous experiences. However, the reduced sensory signals caused by the injury would produce a mismatch between predictions and sensory input, leading to significant prediction errors that update the model accordingly. It is likely that the system weights more precision on the generative model by the time being, but still, it is going to update it to improve the alignment and representation of the surroundings in this new embodiment. But what would happen if the peripheral system recovered, and the bottom-up neural signals got back to the normal activity from before the injury? It would happen the same, but in the opposite direction. The top-down predictions from the generative model would predict lower bottom-up neural activity, creating prediction errors that would go upwards again to update the generative model. While this example illustrates how perception adjusts in response to bodily changes, the shifts involved are not as dramatic as those posed by the Molyneux’s problem. Yet, there is another critical aspect missing from this example, which is highly relevant to the Molyneux’s problem: cross-modal integration.

When thinking about generative models, it is common to focus on a single sensory modality. However, there is no contradiction in considering a more comprehensive generative model that encompasses multiple modalities to account for associations between them. This cross-modality of PP has been empirically analyzed in a few studies (Das et al., 2023; Dercksen et al., 2021; Sánchez-Garcı́a et al., 2011). The findings generally suggest that the system relies primarily on intra-model predictions that match the incoming sensorial information, but cross-modal predictions are also integrated. However, the system prioritizes certain modalities over others based on their reliability in a given context Sánchez-García and colleagues (2011) found that visual predictions tend to have an advantage over auditory predictions in a cross-modal framework. Therefore, PP, and by extension SPP, must accommodate cross-modality within their frameworks. Nonetheless, further empirical investigation is needed to understand how sensory modality weighting occurs and its broader implications.

Returning to a scenario more aligned with the Molyneux’s problem, let us now consider an individual who has suffered congenital blindness. Throughout their life, they have developed generative models across various sensory modalities, even cross-modal maps, except for the visual ones (though some studies, discussed later, question the extent of this). This individual can tactually distinguish between two objects and may also associate their distinct sounds with tactile information in a cross-modal generative model. After sight restoration, they would not have a pre-existing generative model for the visual modality and would thus be unable to make confident predictions that link visual stimuli to tactile information. The system would, therefore, assign low precision to the priors from the visual modality and would refrain from making uncertain predictions, instead placing greater weight on the bottom-up sensory information. However, because generative models for other sensory modalities have already been established, the association between incoming visual information and these stored models could allow the creation of a new associative generative model at a faster pace. The system can leverage the reliability of these pre-existing generative models to guide the active learning process for the visual modality, which may be further accelerated by integrating multiple sensory modalities simultaneously into a larger associative generative model.

Overall, this framework aligns with the findings of Held (2011), Pant (2021), and Piller (2023), and their respective colleagues, who demonstrated that young adults with congenital blindness, after sight restoration, were able to gradually develop cross-modal visuo-tactile mappings within a short period. However, it is important to highlight that the Molyneux’s problem poses a relatively simple task in the context of building a cross-modal generative model, i.e., the visual discrimination of two objects that can be already discriminated tactually. What would happen in more complex scenarios or in cases of visual impairments that cannot be fully reversed? Severe visual deprivation experiments on domestic cats, which consisted of removing or limiting the visual input to the brain through several methods such as dark rearing, and binocular or monocular deprivation (see Kandel, 2013 for a general overview; Wiesel & Hubel, 1965), have provided some insight. These studies revealed that visual input is crucial for the proper development of the visual system during a critical developmental period. Had visual deprivation extended beyond this critical period, cats would have failed to develop a functioning visual system, resulting in significant visual deficits or even complete blindness. These findings are consistent with earlier empiricist perspectives. In a more recent line of thought and drawing from these findings, Gallagher (1996, 2005), one of the main 4E cognition promoters, recently argued that comparing the visual capabilities of blind-recovered individuals to those of control individuals may be problematic, as long-term deprivation can lead to neurodegenerative changes in the visual system that impact the recovery of the visual system.

Similar to the experiments on visually deprived cat, the lack of visual input has significant negative implications for the system’s visual processing, as it has been reported for the audio-visual (Guerreiro et al., 2016a, 2016b; Putzar et al., 2007, 2010), visuo-motor (Ostrovsky et al., 2009), and tactile-propioceptive (Petkova et al., 2012) cross-maps (see Nava et al. 2024 for a review). Thus, cross-modal mappings appear to be both context- and modality-dependent, suggesting that the Molyneux’s problem would be answered even more negatively if the visuo-tactile task involved something more complex than distinguishing two tactile-known objects. Under the SPP framework, this might be explained by the fact that the PP’s generative model is embodied within a nervous system that is reciprocally intertwined with a body and that, during the development of the individual, both systems require the signalling of the other to develop properly.

This system’s enactive ability to gradually create generative models through associations with other sensory modalities can be attributed to two key features of the brain: (i) neural plasticity and (ii) brain sparsity. Neural plasticity, which is important for PP, allows the brain to reorganize itself by forming new or eliminating old connections throughout life. Based on Hebbian learning (Hebb, 1949), it is summarized with its maxim “neurons that fire together, wire together”, as well as synaptic plasticity through long-term potentiation and depression (LTP and LTD, respectively) (Bear & Malenka, 1994; Bliss & Collingridge, 1993; Bliss & Gardner‐Medwin, 1973), which enable the brain to adapt dynamically as needed. Brain sparsity challenges the traditional notion of brain modularity, which suggests that specific functions are localized in discrete brain modules. Instead, brain sparsity suggests that a simple one-to-one mapping between brain areas and functions is an oversimplification, advocating for a network-based perspective (Huntenburg et al., 2018; Pessoa, 2014). In this view, multiple networks dynamically interact to perform functions, thereby challenging rigid modular boundaries. Overall, these features imply that the mind is embodied in a neuroplastic and sparse brain, which constrains its ability to form associative and non-associative generative models. In addition, the brain’s dependence on external stimuli for proper development (i.e., demonstrated in studies of visual deprivation) reinforces the notion that cognition is embodied within an enacted system.

Amedi and colleagues (2005) elegantly reviewed how the occipital areas of blind people, which are normally in charge of visual processing in the visual system hierarchy, are repurposed for other sensory modalities, including tactile (see Sadato et al., 1996 for an example), motor (see Ricciardi et al., 2009 for an example), or even for other cognitive functions such as language and memory. Using fMRI, Peelen and colleagues (2014) found that the occipitotemporal cortex of blind individuals was activated during shape comparisons similarly to sighted individuals. Therefore, when areas once responsible for visual stimuli (e.g., occipital areas) processing are recruited for other functions (e.g., tactile processing) in congenital blindness, these regions may more quickly develop associative generative models after visual restoration, utilizing this neural flexibility.

The Molyneux’s problem has briefly been approached by prosthetic vision strategies, including artificial retinas, sensory substitution devices, and visual prosthetic systems, as noted by Evans (1985), leading to a reformulation of the question: “would a formerly blind individual, after having regained a degree of visual functionality by means of a prosthetic device, pass the Molyneux test?” (Jacomuzzi et al., 2003, p. 270). Artificial retinas, which produce electrical impulses when activated by light to induce phosphene perception (i.e., a luminous sensation produced by mechanical or electrical stimulation of the retina), can slightly improve vision (Chow, 2004; Ramirez et al., 2023). However, artificial retinas have mainly been used for degenerative-caused blindness, which means they do not directly apply to the Molyneux’s problem. Sensory substitution devices (SSDs) provide visual information by stimulating a non-visual modality. Bach-Y-Rita and colleagues (Bach-Y-Rita et al., 1969) used electrical stimulators on body areas with haptic receptors, showing that patterns of visual discrimination can be learnt through haptic stimulation. Their successful approach was later refined (Deroy & Auvray, 2012; Nau et al., 2015; Reich et al., 2012; Ward & Meijer, 2010) leading to striking findings. After brief training with SSDs, blind individuals demonstrated the ability to point at targets, recognize patterns, and perform tasks like motion tracking and object distance estimation. Similar to Amedi and colleagues’ work (Amedi et al., 2005), these studies emphasized the brain’s ability to reorganize and adapt, recruiting the visual system for object recognition while using haptic-derived information. SSDs effectively facilitate cross-modal perception, generating an extrinsic and artificial visuo-tactile associative model, while recruiting visual system areas, which could help enhance post-sight recovery. Finally, regarding cortical prostheses, Dobelle’s pioneering work (2000) demonstrated that visual cortical prostheses could help blind individuals by creating more specific and individualized phosphene perceptions than those created by artificial retinas. Nevertheless, subsequent studies have emphasized the challenges of using these type of prostheses due to high variability in perception and the need for extensive training (Lewis et al., 2015; Najarpour Foroushani et al., 2018). Although these prosthetic devices are still evolving, they highlight an extended feature of the Molyneux’s problem, even suggesting the creation of extrinsic associative generative models and recruiting the visual system after haptic stimulation.

Synaesthesia and the Molyneux’s problem, while differing in their origins and dispositions, both explore the brain’s capacity for cross-modal mapping. Synaesthesia often results from atypical neural connections that result in stable, automatic associations between sensory modalities (Ward, 2013). For example, hearing a sound might consistently evoke a perception of colour. While synaesthesia is commonly acquired, developmental synaesthesia is of particular interest here: if a specific developmental visuo-tactile synaesthesia were present in a congenital blind individual, it could potentially yield a positive answer to the Molyneux’s problem. Even though synaesthetic congenital blindness cases are rare, a case report described an acquired audio-tactile synaesthesia in a congenital blind individual triggered by LSD, resulting in ‘visual-like’ qualia, similar to experiences reported by users of SSDs (Dell’Erba et al., 2018). While this case is not conclusive due to its non-developmental nature, it raises the possibility of visuo-tactile synaesthetic blind individuals providing a positive answer to the Molyneux’s problem.

Synaesthesia has been theoretically approached by Seth’s PP model of sensorimotor contingencies (2014). Seth suggested that, in typical perception, generative models are rich in counterfactuals, contributing to a sense of perceptual presence. However, in synaesthetes, generative models may exhibit unusually high prior precision, leading to a reduced role of counterfactuals. Synaesthesia emerges by drastic changes in neural networks, influenced by neural plasticity. Both SPP and Seth’s enacted and embodied PP (2014) can account for synaesthesia, as they are embodied in a plastic brain. Conversely, a negative answer to the Molyneux’s problem is the rule, as the counterfactually-rich generative model is low in prior precisions due to the absence of previous visual experience.

To sum up, SPP offers a comprehensive framework for understanding how the mind is embodied in a brain that displays neuroplasticity, sparsity, and is enacted with the surroundings. In addition, the brain can even be extended through sensory substitution devices (Bach-Y-Rita et al., 1969; Deroy & Auvray, 2012; Nau et al., 2015; Reich et al., 2012; Ward & Meijer, 2010). Neuroplasticity and brain sparsity are intrinsic features of the brain’s adaptability, suggesting that the answer to Molyneux’s problem may not be entirely negative, being synaesthesia an extreme case. The brain is a highly flexible system capable of adjusting to novel scenarios, within certain limits. In blind individuals, for instance, the brain may repurpose visual networks for other functions that can later facilitate the development of associative generative models post-visual restoration. Nevertheless, experience remains crucial for proper neurodevelopment, establishing the groundwork for associative generative models, and driving neural plasticity. Thus, while the brain’s adaptability offers some potential for cross-modal learning, this process still requires experience, meaning that the answer to the Molyneux’s problem still leans towards a negative conclusion.

5 Concluding remarks

Even though the Molyneux’s problem was first published over three centuries ago, it still remains unsolved. While empirical approaches during the 18^th century (Cheselden, 1728; Sassen, 2004; Wade, 2020) and contemporary studies (Held et al., 2011; Pant et al., 2021; Piller et al., 2023) predominantly suggest a negative answer, recent critiques regarding experimental design (Cheng, 2015; Clarke, 2016; Connolly, 2013; Schwenkler, 2012, 2015) demonstrate that the definitive resolution to the problem is still elusive.

This work describes an SPP framework and suggests that it provides a valuable lens for understanding the Molyneux’s problem. According to traditional PP, brain generates predictions about incoming sensory information based on its internal generative models about the world, which are updated by comparing top-down predictions to bottom-up sensory inputs. Crucially, in contrast to traditional views, SPP posits that these generative models derive their content from dynamic brain-body-environment interactions, as Piccinini (2022) argued, thereby resolving the problem of content in cognitive systems.

SPP explains why individuals born blind, upon sight restoration, struggle to predict or visually distinguish objects previously familiar through touch. However, it also demonstrates how the brain’s intrinsic properties, including neural plasticity and sparsity, allow for a gradual reconfiguration and cross-modal adaptation. As such, SPP offers a ‘moderate’ negative answer to the Molyneux’s problem: while cross-modal predictions without previous visual experience are not immediate and require experience, neuroplasticity and sparsity allow for rapid adaptation once visual experience is gained, leading to the development of associative generative models.

This work sets up the foundations for an SPP framework, but further theoretical and empirical research is needed to fully explore SPP in cross-modal perception. Future studies may also uncover additional implications for the study of sensory substitution and the plasticity of cognitive systems, shedding further light on the intricate Molyneux’s problem.

Acknowledgments

I would like to specifically thank Lorena Chanes for the discussions about mental representations, predictive processing, and 4E cognition that incited the beginning of this manuscript. I am grateful for the indefatigable assistance and discussion of Ube Cisfúgar. I also wanted to express my deep gratitude to Núria Peñuelas for the general discussions that helped me to arrange my ideas. I am grateful to the anonymous reviewers from Philosophy and the Mind Sciences for their insightful feedback, which has significantly enhanced both the flow and content of the paper, transforming it from the rough initial draft into its current form.

Funding

The work for this research was generously funded by the Centro Internacional de Neurociencia y Ética (Spain).

References

Adams, R. A., Shipp, S., & Friston, K. J. (2013). Predictions not commands: Active inference in the motor system. Brain Structure and Function, 218(3), 611–643. https://doi.org/10.1007/s00429-012-0475-5

Amedi, A., Merabet, L. B., Bermpohl, F., & Pascual-Leone, A. (2005). The occipital cortex in the blind: Lessons about plasticity and vision. Current Directions in Psychological Science, 14(6), 306–311. https://doi.org/10.1111/j.0963-7214.2005.00387.x

Anderson, M. L. (2014). After phrenology: Neural reuse and the interactive brain. The MIT Press.

Anderson, M. L. (2017). Of Bayes and bullets: An embodied, situated, targeting-based account of predictive processing. Philosophy and Predictive Processing. https://doi.org/10.15502/9783958573055

Anderson, M. L., & Chemero, A. (2013). The problem with barin GUTs: Conflation of different senses of "perception" threatens metaphysical desaster. Behavioral and Brain Sciences, 36(3), 233–253. https://doi.org/10.1017/s0140525x1200221x

Apps, M. A. J., & Tsakiris, M. (2014). The free-energy self: A predictive coding account of self-recognition. Neuroscience & Biobehavioral Reviews, 41, 85–97. https://doi.org/10.1016/j.neubiorev.2013.01.029

Bach-Y-Rita, P., Collins, C. C., Saunders, F. A., White, B., & Scadden, L. (1969). Vision substitution by tactile image projection. Nature, 221(5184), 963–964. https://doi.org/10.1038/221963a0

Badcock, P. B., Friston, K. J., & Ramstead, M. J. D. (2019). The hierarchically mechanistic mind: A free-energy formulation of the human psyche. Physics of Life Reviews, 31, 104–121. https://doi.org/10.1016/j.plrev.2018.10.002

Barrett, L. F. (2016). The theory of constructed emotion: An active inference account of interoception and categorization. Social Cognitive and Affective Neuroscience, nsw154. https://doi.org/10.1093/scan/nsw154

Barrett, L. F., & Simmons, W. K. (2015). Interoceptive predictions in the brain. Nature Reviews Neuroscience, 16(7), 419–429. https://doi.org/10.1038/nrn3950

Başoğlu, Y. R. (2021). How not to argue about the compatibility of predictive processing and 4E cognition. Organon F, 2021(4), 777–801. https://doi.org/10.31577/orgf.2021.28402

Bays, P. M., & Wolpert, D. M. (2007). Computational principles of sensorimotor control that minimize uncertainty and variability. The Journal of Physiology, 578(2), 387–396. https://doi.org/10.1113/jphysiol.2006.120121

Bear, M. F., & Malenka, R. C. (1994). Synaptic plasticity: LTP and LTD. Current Opinion in Neurobiology, 4(3), 389–399. https://doi.org/10.1016/0959-4388(94)90101-5

Bermúdez, J. L. (2010). Cognitive science: An introduction to the science of the mind (1st ed.). Cambridge University Press. https://doi.org/10.1017/CBO9780511781322

Bliss, T. V. P., & Collingridge, G. L. (1993). A synaptic model of memory: Long-term potentiation in the hippocampus. Nature, 361(6407), 31–39. https://doi.org/10.1038/361031a0

Bliss, T. V. P., & Gardner‐Medwin, A. R. (1973). Long‐lasting potentiation of synaptic transmission in the dentate area of the unanaesthetized rabbit following stimulation of the perforant path. The Journal of Physiology, 232(2), 357–374. https://doi.org/10.1113/jphysiol.1973.sp010274

Bonini, L., Rotunno, C., Arcuri, E., & Gallese, V. (2022). Mirror neurons 30 years later: Implications and applications. Trends in Cognitive Sciences, 26(9), 767–781. https://doi.org/10.1016/j.tics.2022.06.003

Brodski, A., Paasch, G.-F., Helbling, S., & Wibral, M. (2015). The faces of predictive coding. Journal of Neuroscience, 35(24), 8997–9006. https://doi.org/10.1523/JNEUROSCI.1529-14.2015

Brodski-Guerniero, A., Paasch, G.-F., Wollstadt, P., Özdemir, I., Lizier, J. T., & Wibral, M. (2017). Information-theoretic evidence for predictive coding in the face-processing system. The Journal of Neuroscience, 37(34), 8273–8283. https://doi.org/10.1523/JNEUROSCI.0614-17.2017

Brown, H., Friston, K., & Bestmann, S. (2011). Active inference, attention, and motor preparation. Frontiers in Psychology, 2. https://doi.org/10.3389/fpsyg.2011.00218

Bruineberg, J., Kiverstein, J., & Rietveld, E. (2018). The anticipating brain is not a scientist: The free-energy principle from an ecological-enactive perspective. Synthese, 195(6), 2417–2444. https://doi.org/10.1007/s11229-016-1239-1

Bruno, M., & Mandelbaum, E. (2010). Locke’s answer to Molyneux’s thought experiment. History of Philosophy Quarterly, 27(2), 165–180. https://www.jstor.org/stable/27809501

Carney, J. (2020). Thinking avant la lettre: A review of 4E cognition. Evolutionary Studies in Imaginative Culture, 4(1), 77–90. https://doi.org/10.26613/esic.4.1.172

Chanes, L., & Barrett, L. F. (2020). The predictive brain, conscious experience, and brain-related conditions. In The philosophy of science of predictive processing (pp. 159–169). Bloomsbury Academic.

Chanes, L., Baumann Wormwood, J., Betz, N., & Feldman Barrett, L. (2018). Facial expression predictions as drivers of social perception. Journal of Personality and Social Psychology, 114(3), 380–396. https://doi.org/10.1037/pspa0000108

Chemero, A. (2013). Radical embodied cognitive science. Review of General Psychology, 17(2), 145–150. https://doi.org/10.1037/a0032923

Cheng, T. (2015). Obstacles to testing Molyneux’s question empirically. I-Perception, 6(4). https://doi.org/10.1177/2041669515599330

Cheselden, W. (1728). An account of some observations made by a young gentleman, who was blind, or lost his sight so early, that he had no rememberance of ever having seen, and was couch’d between 13 and 14 years of age. Philosophical Transactions of the Royal Society, 35, 447–450.

Chow, A. Y. (2004). The artificial silicon retina microchip for the treatment of vision loss from retinitis pigmentosa. Archives of Ophthalmology, 122(4), 460. https://doi.org/10.1001/archopht.122.4.460

Cisek, P. (1999). Beyond the computer metaphor: Behaviour as interaction. Journal of Consciousness Studies, 6(11-12), 125–142.

Clark, A. (1996). Being there: Putting brain, body, and world together again. The MIT Press. https://doi.org/10.7551/mitpress/1552.001.0001

Clark, A. (1999). An embodied cognitive science? Trends in Cognitive Sciences, 3(9), 345–351. https://doi.org/10.1016/S1364-6613(99)01361-3

Clark, A. (2012). Dreaming the whole cat: Generative models, predictive processing, and the enactivist conception of perceptual experience. Mind, 121(483), 753–771. https://doi.org/10.1093/mind/fzs106

Clark, A. (2013). Whatever next? Predictive brains, situated agents, and the future of cognitive science. Behavioral and Brain Sciences, 36(3), 181–204. https://doi.org/10.1017/S0140525X12000477

Clark, A. (2015a). Predicting peace: The end of the representation wars – a reply to Michael Madary. Open MIND. https://doi.org/10.15502/9783958570979

Clark, A. (2015b). Radical predictive processing. The Southern Journal of Philosophy, 53(S1), 3–27. https://doi.org/10.1111/sjp.12120

Clark, A. (2022). Extending the predictive mind. Australasian Journal of Philosophy, 102(1), 119–130. https://doi.org/10.1080/00048402.2022.2122523

Clark, A., & Chalmers, D. (1998). The extended mind. Analysis, 58(1), 7–19. https://www.jstor.org/stable/3328150

Clark, A., & Toribio, J. (1994). Doing without representing? Synthese, 101(3), 401–431. https://doi.org/10.1007/BF01063896

Clarke, S. (2016). Investigating what felt shapes look like. I-Perception, 7(1), 2041669515627948. https://doi.org/10.1177/2041669515627948

Colombo, M., & Piccinini, G. (2023). The computational theory of mind (1st ed.). Cambridge University Press. https://doi.org/10.1017/9781009183734

Connolly, K. (2013). How to test Molyneux’s question empirically. I-Perception, 4(8), 508–510. https://doi.org/10.1068/i0623jc

Constant, A., Clark, A., Kirchhoff, M., & Friston, K. J. (2020). Extended active inference: Constructing predictive cognition beyond skulls. Mind & Language, 37(3), 373–394. https://doi.org/10.1111/mila.12330

Cummins, R. (1991). The role of representation in connectionist explanations of cognitive capacities. In W. Ramsey, S. Stich, & D. Rumelhart (Eds.), Philosophy and connectionist theory (pp. 91–114). Lawrence Erlbaum.

Cummins, R. (1995). Meaning and mental representation (3. print). MIT Press.

Das, S., Meyyappan, S., Ding, M., & Mangun, G. R. (2023). Top-down effects on cross-modal stimulus processing: A predictive coding framework. Journal of Vision, 23(9), 5801. https://doi.org/10.1167/jov.23.9.5801

De Jaegher, H., & Di Paolo, E. (2007). Participatory sense-making: An enactive approach to social cognition. Phenomenology and the Cognitive Sciences, 6(4), 485–507. https://doi.org/10.1007/s11097-007-9076-9

De Jaegher, H., Di Paolo, E., & Gallagher, S. (2010). Can social interaction constitute social cognition? Trends in Cognitive Sciences, 14(10), 441–447. https://doi.org/10.1016/j.tics.2010.06.009

Degenaar, M., & Collins, M. J. (1996). Molyneux’s problem: Three centuries of discussion on the perception of forms. Kluwer academic publ.

Dell’Erba, S., Brown, D. J., & Proulx, M. J. (2018). Synesthetic hallucinations induced by psychedelic drugs in a congenitally blind man. Consciousness and Cognition, 60, 127–132. https://doi.org/10.1016/j.concog.2018.02.008

Dempster, A. P., Laird, N. M., & Rubin, D. B. (1977). Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society Series B: Statistical Methodology, 39(1), 1–22. https://doi.org/10.1111/j.2517-6161.1977.tb01600.x

Dercksen, T. T., Stuckenberg, M. V., Schröger, E., Wetzel, N., & Widmann, A. (2021). Cross‐modal predictive processing depends on context rather than local contingencies. Psychophysiology, 58(6), e13811. https://doi.org/10.1111/psyp.13811

Deroy, O., & Auvray, M. (2012). Reading the world through the skin and ears: A new perspective on sensory substitution. Frontiers in Psychology, 3. https://doi.org/10.3389/fpsyg.2012.00457

Di Paolo, E. (2009). Extended life. Topoi, 28(1), 9–21. https://doi.org/10.1007/s11245-008-9042-3

Di Paolo, E., & Thompson, E. (2014). The enactive approach. In L. Shapiro (Ed.), The routledge handbook of embodied cognition (pp. 68–78). Routledge.

Dobelle, Wm. H. (2000). Artificial vision for the blind by connecting a television camera to the visual cortex. ASAIO Journal, 46(1), 3–9. https://doi.org/10.1097/00002480-200001000-00002

Dokic, J. (2007). Situated representations and ad hoc concepts. In M. J. Frápolli (Ed.), Saying, meaning and referring: Essays on françois recanati’s philosophy of language (pp. 203–220). Palgrave-Macmillan.

Downey, A. (2018). Predictive processing and the representation wars: A victory for the eliminativist (via fictionalism). Synthese, 195(12), 5115–5139. https://doi.org/10.1007/s11229-017-1442-8

Draganov, M., Galiano-Landeira, J., Doruk Camsari, D., Ramı́rez, J.-E., Robles, M., & Chanes, L. (2023). Noninvasive modulation of predictive coding in humans: Causal evidence for frequency-specific temporal dynamics. Cerebral Cortex, 33(13), 8421–8430. https://doi.org/10.1093/cercor/bhad127

Dretske, F. I. (1997). Explaining behavior: Reasons in a world of causes (5th print). MIT Press.

Egan, F. (2014). How to think about mental content. Philosophical Studies, 170(1), 115–135. https://doi.org/10.1007/s11098-013-0172-0

Engel, A. K., Friston, K. J., & Kragic, D. (Eds.). (2016). The pragmatic turn: Toward action-oriented views in cognitive science. The MIT Press. https://doi.org/10.7551/mitpress/9780262034326.001.0001

Evans, G. (1985). Collected papers. Clarendon Press ; Oxford University Press.

Facchin, M. (2021a). Are generative models structural representations? Minds and Machines, 31(2), 277–303. https://doi.org/10.1007/s11023-021-09559-6

Facchin, M. (2021b). Structural representations do not meet the job description challenge. Synthese, 199(3-4), 5479–5508. https://doi.org/10.1007/s11229-021-03032-8

Facchin, M. (2023). Extended predictive minds: Do Markov blankets matter? Review of Philosophy and Psychology, 14(3), 909–938. https://doi.org/10.1007/s13164-021-00607-9

Facchin, M. (2024). Maps, simulations, spaces and dynamics: On distinguishing types of structural representations. Erkenntnis. https://doi.org/10.1007/s10670-024-00831-6

Favela, L. H. (2023). The ecological brain: Unifying the sciences of brain, body, and environment (1st ed.). Routledge. https://doi.org/10.4324/9781003009955

Friston, K. (2005). A theory of cortical responses. Philosophical Transactions of the Royal Society B: Biological Sciences, 360(1456), 815–836. https://doi.org/10.1098/rstb.2005.1622

Friston, K. (2009). The free-energy principle: A rough guide to the brain? Trends in Cognitive Sciences, 13(7), 293–301. https://doi.org/10.1016/j.tics.2009.04.005

Friston, K. (2010). The free-energy principle: A unified brain theory? Nature Reviews Neuroscience, 11(2), 127–138. https://doi.org/10.1038/nrn2787

Friston, K. (2012). The history of the future of the Bayesian brain. NeuroImage, 62(2), 1230–1233. https://doi.org/10.1016/j.neuroimage.2011.10.004

Friston, K., Mattout, J., & Kilner, J. (2011). Action understanding and active inference. Biological Cybernetics, 104(1-2), 137–160. https://doi.org/10.1007/s00422-011-0424-z

Fusaroli, R., Gangopadhyay, N., & Tylén, K. (2014). The dialogically extended mind: Language as skillful intersubjective engagement. Cognitive Systems Research, 29-30, 31–39. https://doi.org/10.1016/j.cogsys.2013.06.002

Gallagher, S. (1996). First perception: A new solution to the molyneux problem. Proceedings of the New York State Philosophical Association.

Gallagher, S. (2005). How the body shapes the mind (1st ed.). Oxford University PressOxford. https://doi.org/10.1093/0199271941.001.0001

Gallagher, S. (2008). Direct perception in the intersubjective context. Consciousness and Cognition, 17(2), 535–543. https://doi.org/10.1016/j.concog.2008.03.003

Gallagher, S. (2017). Enactivist interventions (Vol. 1). Oxford University Press. https://doi.org/10.1093/oso/9780198794325.001.0001

Gandhi, T., Kalia, A., Ganesh, S., & Sinha, P. (2015). Immediate susceptibility to visual illusions after sight onset. Current Biology, 25(9), R358–R359. https://doi.org/10.1016/j.cub.2015.03.005

Gangopadhyay, N., & Kiverstein, J. (2009). Enactivism and the unity of perception and action. Topoi, 28(1), 63–73. https://doi.org/10.1007/s11245-008-9047-y

Gibson, J. J. (2015). The ecological approach to visual perception: Classic edition. Psychology Press.

Gładziejewski, P. (2016). Predictive coding and representationalism. Synthese, 193(2), 559–582. https://doi.org/10.1007/s11229-015-0762-9

Glenney, B. (2011). Adam Smith and the problem of the external world. Journal of Scottish Philosophy, 9(2), 205–223. https://doi.org/10.3366/jsp.2011.0016

Glenney, B. (2012). Leibniz on Molyneux’s question. History of Philosophy Quarterly, 29(3), 247–264.

Guerreiro, M. J. S., Putzar, L., & Röder, B. (2016a). The effect of early visual deprivation on the neural bases of auditory processing. The Journal of Neuroscience, 36(5), 1620–1630. https://doi.org/10.1523/JNEUROSCI.2559-15.2016

Guerreiro, M. J. S., Putzar, L., & Röder, B. (2016b). Persisting cross-modal changes in sight-recovery individuals modulate visual perception. Current Biology, 26(22), 3096–3100. https://doi.org/10.1016/j.cub.2016.08.069

Haugeland, J. (1991). Representational genera. In W. Ramsey, S. Stich, & D. Rumelhart (Eds.), Philosophy and connectionist theory (pp. 61–89). Lawrence Erlbaum.

Hebb, D. O. (1949). The organization of behavior: A neuropsychological theory. Wiley.

Held, R., Ostrovsky, Y., De Gelder, B., Gandhi, T., Ganesh, S., Mathur, U., & Sinha, P. (2011). The newly sighted fail to match seen with felt. Nature Neuroscience, 14(5), 551–553. https://doi.org/10.1038/nn.2795

Helmholz, H. von. (1867). Treatise on physiological optics. The Optical Society of America.

Heras-Escribano, M., & Martı́nez Moreno, D. (2024). The emergence of ur-intentionality: An ecological proposal. Philosophies, 9(3), 54. https://doi.org/10.3390/philosophies9030054

Hohwy, J. (2012). Attention and conscious perception in the hypothesis testing brain. Frontiers in Psychology, 3. https://doi.org/10.3389/fpsyg.2012.00096

Hohwy, J. (2013). The predictive mind. Oxford University Press. https://doi.org/10.1093/acprof:oso/9780199682737.001.0001

Hohwy, J. (2016). The self‐evidencing brain. Noûs, 50(2), 259–285. https://doi.org/10.1111/nous.12062

Hohwy, J. (2018). The predictive processing hypothesis. In The oxford handbook of 4E cognition (pp. 161–197). Oxford University Press.

Hubel, D. H., & Wiesel, T. N. (1962). Receptive fields, binocular interaction and functional architecture in the cat’s visual cortex. The Journal of Physiology, 160(1), 106–154. https://doi.org/10.1113/jphysiol.1962.sp006837

Hubel, D. H., & Wiesel, T. N. (1968). Receptive fields and functional architecture of monkey striate cortex. The Journal of Physiology, 195(1), 215–243. https://doi.org/10.1113/jphysiol.1968.sp008455

Huntenburg, J. M., Bazin, P.-L., & Margulies, D. S. (2018). Large-scale gradients in human cortical organization. Trends in Cognitive Sciences, 22(1), 21–31. https://doi.org/10.1016/j.tics.2017.11.002

Hutto, D. (2022). Getting real about pretense: A radical enactivist proposal. Phenomenology and the Cognitive Sciences, 21(5), 1157–1175. https://doi.org/10.1007/s11097-022-09826-6

Hutto, D. D., Gallagher, S., Ilundáin-Agurruza, J., & Hipólito, I. (2020). Culture in mind – an enactivist account: Not cognitive penetration but cultural permeation. In L. J. In L. J. Kirmayer, C. M. Worthman, S. Kitayama, R. Lemelson, & C. & Cummings (Eds.), Culture, mind, and brain: Emerging concepts, models, and applications (pp. 163–187). Cambridge University Press.

Hutto, D. D., & Myin, E. (2013). Radicalizing enactivism: Basic minds without content. MIT Press.

Jacomuzzi, A. C., Kobau, P., & Bruno, N. (2003). Molyneux’ question redux. Phenomenology and the Cognitive Sciences, 2(4), 255–280. https://doi.org/10.1023/B:PHEN.0000007370.68536.2d

Kandel, E. R. (Ed.). (2013). Principles of neural science (5th ed). McGraw-Hill.

Kant, I., & Hatfield, G. C. (2005). Prolegomena to any future metaphysics that will be able to come forward as science: With selections from the Critique of pure reason (Rev. ed., reprinted). Cambridge Univ. Press.

Kersten, L. (2022). A new mark of the cognitive? Predictive processing and extended cognition. Synthese, 200(4), 281. https://doi.org/10.1007/s11229-022-03674-2

Kersten, L., & Philosophy Documentation Center. (2024). Recruitment revisited in advance: Cognitive extension and the promise of predictive processing. Thought: A Journal of Philosophy. https://doi.org/10.5840/tht202492035

Khalsa, S. S., Adolphs, R., Cameron, O. G., Critchley, H. D., Davenport, P. W., Feinstein, J. S., Feusner, J. D., Garfinkel, S. N., Lane, R. D., Mehling, W. E., Meuret, A. E., Nemeroff, C. B., Oppenheimer, S., Petzschner, F. H., Pollatos, O., Rhudy, J. L., Schramm, L. P., Simmons, W. K., Stein, M. B., … Zucker, N. (2018). Interoception and mental health: A roadmap. Biological Psychiatry: Cognitive Neuroscience and Neuroimaging, 3(6), 501–513. https://doi.org/10.1016/j.bpsc.2017.12.004

Kilner, J. M., Friston, K. J., & Frith, C. D. (2007). Predictive coding: An account of the mirror neuron system. Cognitive Processing, 8(3), 159–166. https://doi.org/10.1007/s10339-007-0170-2

Kirchhoff, M. D., & Kiverstein, J. (2021). How to determine the boundaries of the mind: A Markov blanket proposal. Synthese, 198(5), 4791–4810. https://doi.org/10.1007/s11229-019-02370-y

Kleckner, I. R., Zhang, J., Touroutoglou, A., Chanes, L., Xia, C., Simmons, W. K., Quigley, K. S., Dickerson, B. C., & Feldman Barrett, L. (2017). Evidence for a large-scale brain system supporting allostasis and interoception in humans. Nature Human Behaviour, 1(5), 0069. https://doi.org/10.1038/s41562-017-0069

Lee, J., & Calder, D. (2023). The many problems with S-representation (and how to solve them). Philosophy and the Mind Sciences, 4. https://doi.org/10.33735/phimisci.2023.9758

Lettvin, J., Maturana, H., McCulloch, W., & Pitts, W. (1959). What the frog’s eye tells the frog’s brain. Proceedings of the IRE, 47(11), 1940–1951. https://doi.org/10.1109/JRPROC.1959.287207

Lewis, P. M., Ackland, H. M., Lowery, A. J., & Rosenfeld, J. V. (2015). Restoration of vision in blind individuals using bionic devices: A review with a focus on cortical visual prostheses. Brain Research, 1595, 51–73. https://doi.org/10.1016/j.brainres.2014.11.020

Loaiza, J. R. (2020). Accessibility and phenomenality: Remarks on solving molyneux’s question empirically. Humanitas Hodie, 2(2), h223. https://doi.org/10.28970/hh.2019.2.a3

Maurer, D., Stager, C. L., & Mondloch, C. J. (1999). Cross‐modal transfer of shape is difficult to demonstrate in one‐month‐olds. Child Development, 70(5), 1047–1057. https://doi.org/10.1111/1467-8624.00077

Meltzoff, A. N., & Borton, R. W. (1979). Intermodal matching by human neonates. Nature, 282(5737), 403–404. https://doi.org/10.1038/282403a0

Menary, R. (2010). Introduction to the special issue on 4E cognition. Phenomenology and the Cognitive Sciences, 9(4), 459–463. https://doi.org/10.1007/s11097-010-9187-6

Miłkowski, M. (2017). Situatedness and embodiment of computational systems. Entropy, 19(4), 162. https://doi.org/10.3390/e19040162

Millidge, B., Seth, A., & Buckley, C. L. (2021). Predictive coding: A theoretical and experimental review. arXiv. https://doi.org/10.48550/ARXIV.2107.12979

Morgan, A. (2014). Representations gone mental. Synthese, 191(2), 213–244. https://doi.org/10.1007/s11229-013-0328-7

Najarpour Foroushani, A., Pack, C. C., & Sawan, M. (2018). Cortical visual prostheses: From microstimulation to functional percept. Journal of Neural Engineering, 15(2), 021005. https://doi.org/10.1088/1741-2552/aaa904

Nau, A., Murphy, M., & Chan, K. (2015). Use of sensory substitution devices as a model system for investigating cross-modal neuroplasticity in humans. Neural Regeneration Research, 10(11), 1717. https://doi.org/10.4103/1673-5374.169612

Nave, K. (2025). A drive to survive: The free energy principle and the meaning of life. The MIT Press.

Neal, R. M., & Hinton, G. E. (1998). A view of the EM algorithm that justifies incremental, sparse, and other variants. In M. I. Jordan (Ed.), Learning in Graphical Models (pp. 355–368). Springer Netherlands. https://doi.org/10.1007/978-94-011-5014-9_12

Newen, A., De Bruin, L., & Gallagher, S. (Eds.). (2018). The Oxford handbook of 4E cognition (1st ed.). Oxford University Press. https://doi.org/10.1093/oxfordhb/9780198735410.001.0001

Newen, A., & Vosgerau, G. (2020). Situated mental representations: Why we need mental representations and how we should understand them. In J. Smortchkova, K. Dołęga, & T. Schlicht (Eds.), What are Mental Representations? (1st ed., pp. 178–212). Oxford University Press New York. https://doi.org/10.1093/oso/9780190686673.003.0007

Nirshberg, G., & Shapiro, L. (2021). Structural and indicator representations: A difference in degree, not kind. Synthese, 198(8), 7647–7664. https://doi.org/10.1007/s11229-020-02537-y

Noë, A. (2009). Out of our heads: Why you are not your brain, and other lessons from the biology of consciousness (1. ed). Hill; Wang.

O’Regan, J. K., & Noë, A. (2001). A sensorimotor account of vision and visual consciousness. Behavioral and Brain Sciences, 24(5), 939–973. https://doi.org/10.1017/S0140525X01000115

Ohata, W., & Tani, J. (2020). Investigation of the sense of agency in social cognition, based on frameworks of predictive coding and active inference: A simulation study on multimodal imitative interaction. Frontiers in Neurorobotics, 14, 61. https://doi.org/10.3389/fnbot.2020.00061

Orlandi, N. (2016). Bayesian perception is ecological perception. Philosophical Topics, 44(2), 327–352. https://www.jstor.org/stable/26529415

Ostrovsky, Y., Meyers, E., Ganesh, S., Mathur, U., & Sinha, P. (2009). Visual parsing after recovery from blindness. Psychological Science, 20(12), 1484–1491. https://doi.org/10.1111/j.1467-9280.2009.02471.x

Owens, A. P., Friston, K. J., Low, D. A., Mathias, C. J., & Critchley, H. D. (2018). Investigating the relationship between cardiac interoception and autonomic cardiac control using a predictive coding framework. Autonomic Neuroscience, 210, 65–71. https://doi.org/10.1016/j.autneu.2018.01.001

Pant, R., Guerreiro, M. J. S., Ley, P., Bottari, D., Shareef, I., Kekunnaya, R., & Röder, B. (2021). The size-weight illusion is unimpaired in individuals with a history of congenital visual deprivation. Scientific Reports, 11(1), 6693. https://doi.org/10.1038/s41598-021-86227-w

Pearl, J. (2000). Causality: Models, reasoning, and inference (Second edition, reprinted with corrections). Cambridge University Press.

Peelen, M. V., He, C., Han, Z., Caramazza, A., & Bi, Y. (2014). Nonvisual and visual object shape representations in occipitotemporal cortex: Evidence from congenitally blind and sighted adults. The Journal of Neuroscience, 34(1), 163–170. https://doi.org/10.1523/JNEUROSCI.1114-13.2014

Pessoa, L. (2014). Understanding brain networks and brain organization. Physics of Life Reviews, 11(3), 400–435. https://doi.org/10.1016/j.plrev.2014.03.005

Petkova, V. I., Zetterberg, H., & Ehrsson, H. H. (2012). Rubber hands feel touch, but not in blind individuals. PLoS ONE, 7(4), e35912. https://doi.org/10.1371/journal.pone.0035912

Petzschner, F. H., Garfinkel, S. N., Paulus, M. P., Koch, C., & Khalsa, S. S. (2021). Computational models of interoception and body regulation. Trends in Neurosciences, 44(1), 63–76. https://doi.org/10.1016/j.tins.2020.09.012

Pezzulo, G., Rigoli, F., & Friston, K. (2015). Active inference, homeostatic regulation and adaptive behavioural control. Progress in Neurobiology, 134, 17–35. https://doi.org/10.1016/j.pneurobio.2015.09.001

Pezzulo, G., Zorzi, M., & Corbetta, M. (2021). The secret life of predictive brains: What’s spontaneous activity for? Trends in Cognitive Sciences, 25(9), 730–743. https://doi.org/10.1016/j.tics.2021.05.007

Piccinini, G. (2022). Situated neural representations: Solving the problems of content. Frontiers in Neurorobotics, 16, 846979. https://doi.org/10.3389/fnbot.2022.846979

Pierce, C. S. (1931). The collected papers of C. S. Pierce. Cambridge University Press.

Piller, S., Senna, I., & Ernst, M. O. (2023). Visual experience shapes the Bouba-Kiki effect and the size-weight illusion upon sight restoration from congenital blindness. Scientific Reports, 13(1), 11435. https://doi.org/10.1038/s41598-023-38486-y

Putzar, L., Goerendt, I., Lange, K., Rösler, F., & Röder, B. (2007). Early visual deprivation impairs multisensory interactions in humans. Nature Neuroscience, 10(10), 1243–1245. https://doi.org/10.1038/nn1978

Putzar, L., Hötting, K., & Röder, B. (2010). Early visual deprivation affects the development of face recognition and of audio-visual speech perception. Restorative Neurology and Neuroscience, 28(2), 251–257. https://doi.org/10.3233/RNN-2010-0526

Ramirez, K. A., Drew-Bear, L. E., Vega-Garces, M., Betancourt-Belandria, H., & Arevalo, J. F. (2023). An update on visual prosthesis. International Journal of Retina and Vitreous, 9(1), 73. https://doi.org/10.1186/s40942-023-00498-1

Ramos-Grille, I., Weyant, J., Wormwood, J. B., Robles, M., Vallès, V., Camprodon, J. A., & Chanes, L. (2022). Predictive processing in depression: Increased prediction error following negative valence contexts and influence of recent mood-congruent yet irrelevant experiences. Journal of Affective Disorders, 311, 8–16. https://doi.org/10.1016/j.jad.2022.05.030

Ramsey, W. (2016). Untangling two questions about mental representation. New Ideas in Psychology, 40, 3–12. https://doi.org/10.1016/j.newideapsych.2015.01.004

Ramsey, W. (2017). Must cognition be representational? Synthese, 194(11), 4197–4214. https://doi.org/10.1007/s11229-014-0644-6

Ramsey, W. M. (2007). Representation reconsidered (1st ed.). Cambridge University Press. https://doi.org/10.1017/CBO9780511597954

Reich, L., Maidenbaum, S., & Amedi, A. (2012). The brain as a flexible task machine: Implications for visual rehabilitation using noninvasive vs. Invasive approaches. Current Opinion in Neurology, 25(1), 86–95. https://doi.org/10.1097/WCO.0b013e32834ed723

Ricciardi, E., Bonino, D., Sani, L., Vecchi, T., Guazzelli, M., Haxby, J. V., Fadiga, L., & Pietrini, P. (2009). Do we really need vision? How blind people “see” the actions of others. The Journal of Neuroscience, 29(31), 9719–9724. https://doi.org/10.1523/JNEUROSCI.0274-09.2009

Ridderinkhof, K. R. (2014). Neurocognitive mechanisms of perception–action coordination: A review and theoretical integration. Neuroscience & Biobehavioral Reviews, 46, 3–29. https://doi.org/10.1016/j.neubiorev.2014.05.008

Ridderinkhof, K. R. (2017). Emotion in action: A predictive processing perspective and theoretical synthesis. Emotion Review, 9(4), 319–325. https://doi.org/10.1177/1754073916661765

Ridderinkhof, K. R., & Brass, M. (2015). How kinesthetic motor imagery works: A predictive-processing theory of visualization in sports and motor expertise. Journal of Physiology-Paris, 109(1-3), 53–63. https://doi.org/10.1016/j.jphysparis.2015.02.003

Rizzolatti, G., & Craighero, L. (2004). The mirror-neuron system. Annual Review of Neuroscience, 27(1), 169–192. https://doi.org/10.1146/annurev.neuro.27.070203.144230

Roth, M. A. (2010). Representation, philosophical issues about. WIREs Cognitive Science, 1(1), 32–39. https://doi.org/10.1002/wcs.31

Rowlands, M. (2010). The new science of the mind: From extended mind to embodied phenomenology. MIT Press.

Rumelhart, D. E., McClelland, J. L., & AU. (1986a). Parallel distributed processing, volume 1: Explorations in the microstructure of cognition: Foundations. The MIT Press. https://doi.org/10.7551/mitpress/5236.001.0001

Rumelhart, D. E., McClelland, J. L., & AU. (1986b). Parallel distributed processing, volume 2: Explorations in the microstructure of cognition: Psychological and biological models. MIT Press.

Rutar, D., Wiese, W., & Kwisthout, J. (2022). From representations in predictive processing to degrees of representational features. Minds and Machines, 32(3), 461–484. https://doi.org/10.1007/s11023-022-09599-6

Sadato, N., Pascual-Leone, A., Grafman, J., Ibañez, V., Deiber, M.-P., Dold, G., & Hallett, M. (1996). Activation of the primary visual cortex by Braille reading in blind subjects. Nature, 380(6574), 526–528. https://doi.org/10.1038/380526a0

Sánchez-Garcı́a, C., Alsius, A., Enns, J. T., & Soto-Faraco, S. (2011). Cross-modal prediction in speech perception. PLoS ONE, 6(10), e25198. https://doi.org/10.1371/journal.pone.0025198

Sassen, B. (2004). Kant on Molyneux’s problem. British Journal for the History of Philosophy, 12(3), 471–485. https://doi.org/10.1080/0960878042000253114

Schulkin, J., & Sterling, P. (2019). Allostasis: A brain-centered, predictive mode of physiological regulation. Trends in Neurosciences, 42(10), 740–752. https://doi.org/10.1016/j.tins.2019.07.010

Schwenkler, J. (2012). On the matching of seen and felt shape by newly sighted subjects. I-Perception, 3(3), 186–188. https://doi.org/10.1068/i0525ic

Schwenkler, J. (2013). Do things look the way they feel? Analysis, 73(1), 86–96. https://doi.org/10.1093/analys/ans137

Schwenkler, J. (2015). Long-term deprivation affects visual perception and cortex. Frontiers in Psychology.

Seth, A. K. (2013). Interoceptive inference, emotion, and the embodied self. Trends in Cognitive Sciences, 17(11), 565–573. https://doi.org/10.1016/j.tics.2013.09.007

Seth, A. K. (2014). A predictive processing theory of sensorimotor contingencies: Explaining the puzzle of perceptual presence and its absence in synesthesia. Cognitive Neuroscience, 5(2), 97–118. https://doi.org/10.1080/17588928.2013.877880

Seth, A. K., Suzuki, K., & Critchley, H. D. (2012). An interoceptive predictive coding model of conscious presence. Frontiers in Psychology, 2. https://doi.org/10.3389/fpsyg.2011.00395

Seth, A. K., & Tsakiris, M. (2018). Being a beast machine: The somatic basis of selfhood. Trends in Cognitive Sciences, 22(11), 969–981. https://doi.org/10.1016/j.tics.2018.08.008

Seth, A., & Friston, K. (2016). Active interoceptive inference and the emotional brain. Philosophical Transactions of the Royal Society B: Biological Sciences, 371(1708). https://doi.org/10.1098/rstb.2016.0007

Shagrir, O. (2012). Structural representations and the brain. The British Journal for the Philosophy of Science, 63(3), 519–545. https://doi.org/10.1093/bjps/axr038

Shapiro, L. A. (2011). Embodied cognition (1. publ). Routledge.

Sprevak, M. (2011). William M. Ramsey representation reconsidered. The British Journal for the Philosophy of Science, 62(3), 669–675. https://doi.org/10.1093/bjps/axr022

Stewart, J., Gapenne, O., & Di Paolo, E. A. (2014). Enaction: Toward a new paradigm for cognitive science. the MIT press.

Swanson, L. R. (2016). The predictive processing paradigm has roots in Kant. Frontiers in Systems Neuroscience, 10. https://doi.org/10.3389/fnsys.2016.00079

Swoyer, C. (1991). Structural representation and surrogative reasoning. Synthese, 87, 449–508.

Telakivi, P. (2023). Extending the extended mind: From cognition to consciousness (1st ed). Springer International Publishing AG.

Thompson, E. (2010). Mind in life: Biology, phenomenology, and the sciences of mind (First Harvard University Press paperback edition). The Belknap Press of Harvard University Press.

Tschantz, A., Seth, A. K., & Buckley, C. L. (2020). Learning action-oriented models through active inference. PLOS Computational Biology, 16(4), e1007805. https://doi.org/10.1371/journal.pcbi.1007805

Van Es, T. (2020). Minimizing prediction errors in predictive processing: From inconsistency to non-representationalism. Phenomenology and the Cognitive Sciences, 19(5), 997–1017. https://doi.org/10.1007/s11097-019-09649-y

Varela, F. J., Rosch, E., & Thompson, E. (1991). The embodied mind: Cognitive science and human experience. The MIT Press. https://doi.org/10.7551/mitpress/6730.001.0001

Varela, F. J., Thompson, E., & Rosch, E. (2016). The embodied mind: Cognitive science and human experience (revised edition). MIT Press.

Vilarroya, O. (2017). Neural representation. A survey-based analysis of the notion. Frontiers in Psychology, 8, 1458. https://doi.org/10.3389/fpsyg.2017.01458

Von Eckardt, B. (2012). The representational theory of mind. In K. Frankish & W. Ramsey (Eds.), The cambridge handbook of cognitive science (pp. 29–50). Cambridge University Press.

Wade, N. (2020). Molyneux’s vision. Routledge.

Ward, J. (2013). Synesthesia. Annual Review of Psychology, 64(1), 49–75. https://doi.org/10.1146/annurev-psych-113011-143840

Ward, J., & Meijer, P. (2010). Visual experiences in the blind induced by an auditory sensory substitution device. Consciousness and Cognition, 19(1), 492–500. https://doi.org/10.1016/j.concog.2009.10.006

Wiese, W. (2017). What are the contents of representations in predictive processing? Phenomenology and the Cognitive Sciences, 16(4), 715–736. https://doi.org/10.1007/s11097-016-9472-0

Wiesel, T. N., & Hubel, D. H. (1965). Comparison of the effects of unilateral and bilateral eye closure on cortical unit response in kittens. Journal of Neurophysiology, 28(6), 1029–1040. https://doi.org/10.1152/jn.1965.28.6.1029

Williams, D. (2018). Predictive processing and the representation wars. Minds and Machines, 28(1), 141–172. https://doi.org/10.1007/s11023-017-9441-6

To define the term ‘computation’ is out of the scope for this manuscript. See Colombo and Piccinini’s (2023) recent work for a general overview on the topic.↩︎