Visualising language similarities without trees

May 1, 2012 in Evolution, Linguistics, Reviews

Gerhard Jäger uses lexostatistics to demonstrate that language similarities can be computed without using tree-based representations (for why this might be important, see Kevin’s post on reconstructing linguistic phylogenies).  On the way, he automatically derives a tree of phoneme similarity directly from word lists.  The result is an alternative and intuitive look at how languages are related (see graphs below).  I review the method, then suggest one way it could get away from prior categorisations entirely.

Jäger presented work at the workshop on Visualization of Linguistic Patterns and Uncovering Language History from Multilingual Resources at the recent EACL conference last month.  He uses the Automated Similarity Judgment Program (ASJP) database, which contains 40 words from the Swadesh-list (universal concepts) for around 5800 languages (including Klingon!).  The words are transcribed in the same coarse transcription.  The task is to calculate the distance between languages based on these lists in a way that they reflect the genetic relationships between languages.

Read the rest of this entry →

Babies know who’s boss, whose boss, and who knows what else.

April 20, 2012 in Evolution, Evolution, Reviews, Science

forthcoming paper (grateful nod to ICCI) in PNAS from Oliver Mascaro and Gergely Csibra presents a series of experiments investigating the representation of social dominance relations in human infants, and it’s excellent news: we’re special.

Social dominance can be inferred in a couple of ways. Causal cues such as age, physical aggression and size can tell us about the dominance status of an individual quite intuitively, so we can make a sensible decision about whether or not we get into a scrap with them. Another way we can establish this is to look for direct realisations of dominance, such as who gets the banana if two hungry chimps both want it; chances are, little Pan Pipsqueak isn’t going to get a look in. In order to be useful, we also have to use this information to expect certain things from the individuals around us, so those representations have some property of stability across time that allows us to have those expectations. The question being explored in this paper is whether the representations we have are about the relationship between the two agents who want the banana, or the individual properties each of them has.

In a series of experiments using preferential looking time as a dependent measure, human infants (9 and 12 month olds) were exposed to videos of geometric figures exhibiting similar goal-directed behaviour. Then they would watch, say, a dominant triangle picking up the last figurative banana when the nondominant pentagon also wanted it. For expository purposes and posterity’s sake, I have constructed an artist’s impression of a dominant triangle and a subordinate pentagon in MSPaint (below, right):

A dominant triangle and subordinate pentagon (artist's impression).

I’m not just showing off my extraordinary artistic talent here; the good thing about these agents is that there are none of the cues like size or aggression that can give rise to the assignment of individual dominance properties. The task also doesn’t indicate anything similar; it’s just about who gets the desired object when there’s only one left. In other words, the goal-directed actions of two agents are in opposition. After seeing a triangle beat a pentagon to an object of ‘banana’ status, 12 month olds looked for longer when they were then presented with an incongruent trial where the pentagon gained over the triangle. 9 month olds (understandably?) couldn’t care less. So, on the basis of this social interaction alone, the 12 month olds were able to notice when something unexpected happened.

To rule out the possibility that this was just the result of some simple heuristic such as “when triangle and pentagon are present, triangle gets the object” and make sure the infants really were assigning some dominance, another experiment (with 12 and 15 month olds) showed the same test video of the two agents collecting little objects. This time, however, the preceding video was of the triangle dominating a little walled-in space that the pentagon also wanted to inhabit. The 12 month olds had no idea what was up, but the 15 month olds generalised from the first “get out of my room” interaction to the “I get the last banana” interaction. So, 15 month olds can extract, just from watching a social interaction, the dominance status of agents and can generalise that information to novel situations. So if a 15 month old watches you lose your favourite seat in front of the TV, they’ll also expect you to miss out on the last slice of pizza, because you’re a loser.

What we still don’t know is whether they think your belly is inherently yellow, or if you’re just a pushover when interacting with a particular person. Is it the relationship between the triangle and pentagon that the babies are tracking, or do they just give each agent some sort of dominance score? This was addressed in experiment 4, where infants were presented with two interactions: one between A and B, where A wins, and then another between B and C, where B wins. If the babies are assigning an individual value to each agent, they should have some sort of linear, transitive representation of dominance like A > B > C. If they’re then presented with a novel interaction between A and C, they would have the expectation that A will beat C. So if they stare in surprise at a trial where C wins, we know it’s violated that kind of expectation, and that they’re representing this stuff linearly – I.E. each agent has a dominance value. In contrast, if the infant is tracking the relations between agents, they can’t really have an expectation of what will happen when A and C both want a banana, because they’ve never seen C before. The results find that the infants look preferentially when they get an incongruent trial using agent pairs they have seen before – as we’d expect from the previous experiment. When they’re presented with a new “I get the last banana” interaction between A and C, however, there’s nothing startling about it when C wins – which means their expectations are not based on something like A > B > C.

The only tiny little harrumph I have about this result is that all it does is falsify the linear representation account. Though I think their account is absolutely right, it’d be nice to see something more predictive come out of the relation-representation hypothesis that is a little more falsifiable. But this result is pretty huge, and stands in contrast with what we know about social cognition in other animals like baboons (Cheney et al, 1995; Bergman et al, 2003), lemurs (Maclean et al., 2008) and even pigeons (Lazareva & Wasserman, 2012), who seem to employ this sort of hierarchical, transitive inference when presented with novel interactions. It may also muddy the waters a little when we want to make the appealing claim that, since language surely emerged in order to enable communication as we navigated a social environment, hierarchical social cognition gives rise to the processing of languagey things like hierarchical syntax or our semantic representation (Hamilton, 2005), which can be characterised as hierarchical (e.g. hyperonym > hyponym). If we consider the nature of the human social environment, though, it should seem more intuitive that something more reliable than simple transitive inference is necessary in order to successfully navigate through our interactions. Due to our prolific production of (and reliance on) culture, humans have a much more diverse range of social currencies, which correspond to values for things like money, intelligence, blackmail information, who your friends are, ad infinitum. That means it’s pretty reasonable that our social cognition needs new strategies in order to get by; we have a little more to consider than just who’s big and angry enough to get all the bananas.

References

Bergman, T., Beehner, J., Cheney, D. & Seyfarth, R. (2003) “Hierarchical Classification by Rank and Kinship in Baboons” Science 14(302), 1234-1236.

Cheney, D., Seyfarth, R. & Silk, J. (1995) “The response of female baboons (Papio cynocephalus ursinus) to anomalous social interactions: evidence for causal reasoning?” Journal of Comparative Psychology 109(2), 134-141.

Hamilton, D.L. (2005) Social Cognition: Key Readings (p. 104) Psychology Press

Lazareva, O. & Wasserman, E. (2012) “Transitive inference in pigeons: measuring the associative values of stimulus B and D” Behavioural Process 89(3), 244-255.

Maclean, E., Merritt, D. & Brannon, E.M. (2008) “Social complexity predicts transitive reasoning in prosimian primates” Animal Behaviour 76(2), 479-486.

Mascaro, O. & Csibra, G. (forthcoming) “Representation of stable dominance relations by human infants” Proceedings of the National Academy of Sciences

 

Language Evolution and Levels of Explanation

April 16, 2012 in Evolution, Evolution, Linguistics, Science, Science News

A somewhat contentious debate among the behavioural sciences is currently underway concerning Mayr’s division of causal explanations in evolutionary theory. Here I’m going to give you a brief rundown of two papers in particular, before I chip in my two-cents about how other insights from the theoretical literature can inform this debate. It seems the discussion is just getting started with respect to cultural evolution, so it’d be interesting to hear other peoples’ comments from either camp.

Over the years, evolutionary theorists have tried to make logical divisions between the kinds of things we can ask about, with a view to making it clear what exactly scientific studies can tell us. A dominant paradigm dividing two levels of causation for biological features we see in the world is Mayr’s distinction between ultimate and proximate causes.  Ultimate causation explains the proliferation of a trait in a population in terms of the evolutionary forces acting on that trait. For example, peahens that prefer peacocks with larger tails (an honest signal of fitness following the handicap principle) will have stronger or more successful offspring, and so this preference proliferates along with larger peacock tails. Proximate causation uses immediate physiological and environmental factors to explain a particular peahen’s penchant for a large-tailed peacock in a mate choice trial, where the signal of the peacock’s large tail elevates the hormone levels in the peahen and copulatory behaviour ensues. Although the behaviour in both of these examples is the same, the levels of explanation are based on different sets of factors.

In Perspectives on Psychological Science last year, a paper by Scott-Phillips, Dickins and West voiced some concerns about these two levels of causation being conflated in the behavioural sciences. In particular, they addressed instances where proximate explanations of traits are being framed as ultimate ones. The paper points specifically to studies of the evolution of cooperation, transmitted culture and epigenetics to illustrate this. Regarding the evolution of cooperation, they point to an instance where ‘strong reciprocity’ (an individual’s propensity to reward cooperative norms and sanction violation of these norms) is purported to be an ultimate explanation of why humans cooperate, rather than a proximate mechanism that enables such cooperation.

Table of proximate and ultimate explanations. For the topic of 'linguistic structure', cultural transmission is listed as a proximate explanation, and the adaptive benefit of cooperative activity through communication is given as an ultimate explanation.,

Table 1 from Scott-Phillips et al. (2011), highlight added.

Among the examples was the feature of linguistic structure (see table 1 from paper above), where several studies pointed to the cultural transmission process as an ultimate explanation of linguistic structure. They suggest that cultural transmission constitutes a proximate process, because it gives the means by which linguistic structure is expressed – and this is how cultural transmission contributes to what the linguistic structure looks like. One analogy might be that the vibrating of my particular vocal cords is a proximate mechanism giving rise, in part, to how my voice sounds, rather than an ultimate explanation of why I vocalise. Since an ultimate account must suggest how a trait contributes to inclusive fitness in order to explain its prevalence in humans, they uncontroversially venture that the ultimate rationale for the ubiquity of linguistic structure is that it greater enables communication (and therefore increases inclusive fitness by enabling cooperative activity).

An opposing view was later published in Science by Laland, Sterelny, Odling-Smee et al., who suggest that the use of Mayr’s division of ultimate and proximate causation is not helpful to all evolutionary investigations, and even hampers progress. The grounds for rejecting Mayr’s paradigm seem to lie largely in what Laland et al. term “reciprocal causation”. That is, that “proximate mechanisms both shape and respond to selection, allowing developmental processes to feature in proximate and ultimate explanations”. After aligning proximate explanations with ontogeny and ultimate explanations with phylogeny, they suggest that what we may have called ultimate and proximate features are no longer sharply delineated, and that these reciprocal processes mean that the source of selection sometimes cannot be separated. They present an idea from the field of evolutionary-developmental biology that, if a developmental process makes some variant of a trait more likely to arise than others, then this proximate mechanism helps to construct an “evolutionary pathway”.

Flow charts presenting alternative processes for Biological evolution, Cultural evolution and Gene-culture coevolution

Figure 2 from Laland et al. (2011)

The paper also highlights developmental plasticity, and gene-environment interaction more broadly (see fig. 2 from paper, above), as a process where reciprocal causation offers an evolutionary explanation conceptually comparable to ultimate causation. Talking specifically on the topic of linguistic structure, they present the debate about whether specific design features of language are attributable to biological or cultural evolution. The paper points out that cultural evolution determines features of linguistic structure – for example, word order – and that the existing word order determines that of future speakers. Indeed, at the Edinburgh LEC we know that transmission by iterated inductive inference under general conditions can explain particular structures in languages. That cultural evolution determines the variation between languages, Laland et al. say, provides evidence that it is an evolutionary force comparable to natural selection (and, therefore, ultimate explanation).

What follows is a collection of my thoughts on the matter, which are (spoiler alert) largely in support of the Scott-Phillips et al. paper. I hope others more experienced in cultural evolution studies than I will contribute their perspective.

It seems to me that there are a few assumptions made in the Laland et al. paper that are not quite in line with how Mayr himself understood the paradigm, and perhaps much can be learned from this debate’s previous incarnation when Richard C. Francis made similar arguments against the ultimate/proximate distinction in 1990. In his critique, he equated ultimate causation with phylogeny and proximate causation with ontogeny – an approach that was rebuked by Mayr in 1993, who made the point that “all physiological activities are proximately caused, but is a reflex an ontogenetic phenomenon?” Mayr’s response is actually rather unhelpful in addressing the arguments fully, and this statement is particularly dense. But what he is getting at here is the idea that interaction with the environment that gives rise to adaptive behaviours (such as recoiling instantly from a hot stove) is itself subject to selection, and thus constitutes a proximate explanation of causation. Relatedly, he points out that most components of the phenotype are indeed the result of genetic contribution and interaction with the environment, which has been successfully explored in biology within the traditional theoretical paradigm.

A perhaps more nuanced account of how we can divide the possible explanations of biological phenomena is offered by Tinbergen in his “four questions”, where ultimate explanations are further subdivided into Function (concerning the adaptive solution to a survival problem favoured by natural selection) and Phylogeny, which is a historical account of when the trait arose in the species, and importantly includes processes other than natural selection that give rise to variation – such as mutation, drift and the constraints imposed by pre-existing traits (see blind spot example below). Proximate explanations are further split into Mechanism (immediate physiological/environmental factors causal in how the trait operates in the individual) and Ontogeny (the way in which this trait develops over the lifetime of the individual). As a simple example, here is the paradigm applied to a trait like mammalian vision that I lifted from Wikipedia:
Ultimate
Function: To find food and avoid danger.
Phylogeny: The vertebrate eye initially developed with a blind spot, but the lack of adaptive intermediate forms prevented the loss of the blind spot.
Proximate
Causation: The lens of the eye focuses light on the retina
Ontogeny: Neurons need the stimulation of light to wire the eye to the brain within a critical period (as those awful studies of blindfolded kittens illustrated).
A schematic below, adapted from Tinbergen (1963) shows how these levels of causation may interact with one another, which appears to communicate something roughly comparable to the importance Laland et al. place on “reciprocal causation” in the formation of adaptive variants:

Flow chart showing the interaction of functional, phylogenetic, mechanistic and ontogenetic explanations

Adapted from Tinbergen (1963); Causal Relationships

Applied the to debate outlined above, it would seem that there is no apparent reason that a process of gene-environment interaction – including the cultural environment – can’t itself be subject to selection, or that developmental plasticity itself is not an adaptation in need of an ultimate explanation. It has long been the case that behaviour is no longer understood as either “nature” or “nurture”, but gene-environment interaction, with varying levels of heredity. The “reciprocal causation” suggested in Laland et al.’s paper, is (as they point out) very common in nature; feedback loops are uncontroversial proximate processes in biology. That a proximate process may give rise to a dominant variant of a trait in a population does not explain why it is adaptive, and this points to another problem with the proposing the abandonment of Mayr’s paradigm: a logical division of levels of explanation doesn’t seem to be the sort of thing that can be rendered outdated by empirical evidence. Indeed, claims about the particulars of traits and processes (and languages) themselves are a matter for empirical data – but the theoretical issue about the level of explanation that data is useful for does not itself seem to be subject to empirical findings.

The finding that a proximate process such as cultural transmission gives rise to a trait that is prolific in a population is itself exciting and surprising, and even shows us that the pressure for making language easier to learn gives us adaptive languages to learn; however, it could be argued that it is this process that is adaptive, and that the reason why humans so heavily rely on this process is an ultimate explanation.

One way of resolving these two perspectives may be to place cultural processes that give rise to variation at the level of what Tinbergen labels Phylogenetic (one subset of ultimate) explanation, as it concerns processes which produce some heightened frequency of traits over a language’s history. An explanation at the level of Phylogeny still must make recourse to natural selection at some point, since variants that result from mutation or drift are retained because of their adaptive value (or an adaptive trade-off). This approach may be a problem for the current understanding, which holds that the features resulting from cultural processes are themselves adaptive and therefore comparable to what Tinbergen labels Function.

The problem with this is that calling particular structures of language ‘adaptive’ obscures what it is about Language that is actually being selected for. To flesh out what I mean, I think it’s useful to consult Millikan’s (1993) distinction between Direct Proper Function and Derived Proper Function (… bear with me, it’ll be worth it, honest). The Direct Proper Function of a given trait T can be thought of as a “reproduction” of an item that has performed the exact same adaptive function F, and T exists because of these historical performances of F. Sperber and Origgi (2000) use the illustrative example of the heart, where the human heart has a bunch of properties (it pumps blood, makes a thumping noise, etc), but only its ability to pump blood is its Direct Proper Function. This is because even a heart that doesn’t work right or makes irregular thumping noises or whatever, still has the ability to pump blood. Hearts that pump blood have been “reproduced through organisms that, thanks in part to their owning a heart pumping blood, have had descendents similarly endowed with blood-pumping hearts”.

The Derived Proper Function, however, refers to a trait T that is the result of some device that, in some environment, has a Proper Function F. In that given environment, F is usually achieved by the production of something like T. If I unpack this idea and apply it to language, we can understand it as the acquisiton and production of a device that, in this environment, leads to, say, a particular SVO language, T. The Proper Function of adaptive communication is performed by T in this case, but could also be performed by any number of SOV, VSO, etc Ts in other cases. In other words, the Proper Function of this language is not the word order itself, but communication. The word order is the realisation of this device that is reproduced because of the performance of T in a particular environment, but does not necessarily lead to T in the next incarnation of that device (i.e. My child, if born and raised in Japan, will speak Japanese). We see, then, that a proximate process resulting in what a particular language spoken by a given population looks like does not necessarily speak to the evolutionary function. In other words, it is the device that allows the performance of Language that is adaptive, not the individual language itself.

One question being asked in the study of cultural transmission is why a particular language looks like it does, while we also know that there are 6000 different versions that perform the same (ultimate) function. I would even argue that asking how proximate processes shape languages is actually the most exciting and interesting avenue of inquiry precisely because it’s so blindingly obvious what the adaptive function of language is. But perhaps the value in this endeavour is somewhat neglected, in part, because of the same impression that Francis (1990) had: “the attitude, implicit in the term ultimate cause, [is] that these functional analyses are somehow superordinate to those involving proximate causes” which would be a shame. It seems to me that the coarse grain of ultimate vs proximate perhaps doesn’t do enough to help complex proximate study to position itself in the wider theoretical framework, and the best way to proceed from this might be to couch explanation in terms of Function, Phylogeny, Ontogeny and Mechanism. I think more fine-grained terminology grants us more explanatory power, in this case.

A final question in this debate that came up too many times during discussions with the LEC is: what does keeping the traditional paradigm “buy us”? Well, the first answer to this is consilience with one of the most successful and robust theories in science. The same sentiment has been communicated by Pinker and Bloom (1990), who said: “If current theory of language is truly incompatible with the neo-Darwinian theory of evolution, one could hardly blame someone for concluding that it is not the theory of evolution that must be questioned, but the theory of language”. Part of the reason this debate may have arisen is that studies of cultural evolution have used evolutionary theory as an incredibly fruitful way of analysing cultural processes, but additional acknowledgement about how cultural adaptation is different to biological adaptation may be necessary. This difference is an aspect of Laland’s paper (shown in Fig 2) that I think is important, as it’s part of the reason that more nuanced frameworks for cultural evolution are now needed. Without this widespread acknowledgement, cultural evolution may be considered an extension of biological evolutionary theory instead of a successfully applied metaphor. It seems to me that the side of this debate one falls on is well predicted by whether one subscribes to the former interpretation of cultural evolution or the latter.

Knowing which level of explanation current work pertains to is a valuable part of evolutionary exploration, and abandoning this in favour of an approach where proximate processes are explanatory ends to themselves may mean the exploration of Function and Phylogeny may suffer. That said, it is telling, I think, that even in seeking to abandon the proximate/ultimate distinction, we must still exploit this existing terminology in order to explain such a position. That natural selection has explained countless adaptations in all living things is certainly not trivial, and to reject the theory giving rise to ultimate explanations as they’re currently defined is to reject this fundamental aspect of evolutionary theory. The big problem seems to be that we’re coming to understand proximate processes as so elaborate and complex, that a more nuanced framework is needed to deal with the dynamics of those processes. I reckon, however, that such a framework can be developed within the traditional paradigms of evolutionary theory.

 

References

Francis, R.C. (1990) – “Causes, Proximate and Ultimate” Biology and Philosophy 5(4) 401-415.

Laland, K., Sterelny, K., Odling-Smee, J., Hoppitt, W. & Uller, T. (2011) – “Cause and Effect in Biology Revisited: Is Mayr’s Proximate-Ultimate Distinction Still Useful?” Science 334, 1512-1516.

Mayr, E. (1993) – “Proximate and Ultimate Causations” Biology and Philosophy 8: 93-94.

Millikan, R. (1993) – White Queen Psychology and Other Essays for Alice, Cambridge, Mass: MIT Press.

Pinker, S. & Bloom, P. (1990) – “Natural language and natural selection” Behaviour and Brain Sciences 13, 707-784.

Scott-Phillips, T. Dickins, T. & West, S. (2011) – “Evolutionary Theory and the Ultimate-Proximate Distinction in the Human Behavioural Sciences” Perspectives on Psychological Science 6(1): 38-47.

Sperber, D. & Origgi, G. (2000) – “Evolution, communication and the proper function of language” In P. Carruthers and A. Chamberlain (Eds.) Evolution and the Human Mind: Language, Modularity and Social Cognition (pp.140-169) Cambridge: Cambridge University Press.

Tinbergen, N. (1963) “On Aims and Methods in Ethology,” Zeitschrift für Tierpsychologie, 20: 410–433.

 

 

Wild Replicator’s Got Funky Rhythm, Part 2

March 27, 2012 in Evolution

As its name indicates, this post builds on Wild Replicator’s Got Funky Rhythm, Part 1. I want to call your attention, in particular, to the next to the last section, Becoming Memetic. There I trace, albeit sketchily, the history of Rhythm Changes. The point is that Rhymthm Changes didn’t exist as a memetic entity in 1930, when George Gershwin wrote “I Got Rhythm.” Just when the chord changes had become differentiated from the song itself is not clear. But it had certainly happened, at least in the jazz world, by the mid 1940s. Thus, it is not as though certain patterns are essentially memetic while others are not. It’s a question of how the patterns function in the cultural system.

* * * * *

In the previous post I took a look at Rhythm Changes, a memetic entity that has played an important role in jazz and, in particular, in bebop. FWIW, Rhythm Changes has also been used in the theme song for well-known some well-known cartoons, Woody Woodpecker and The Flintstones. In this post I want to do several things:

  • consider all the elements of “I Got Rhythm,” rather than just the chord changes,
  • think briefly about how pools of memetic elements function in defining musical styles, and
  • look briefly at how the chord changes to Gershwin’s tune became memetically active.

Taken together those discussions flesh out the role of memetic elements in music systems in the large. I conclude by

  • examining this discussion of memes in music in the context of a recent article by Evelyn Fox Keller and David Harel, Beyond the Gene, and not some broad thematic similarities between their discussion and mine.

I Got Rhythm, Whole

As I’ve indicated, Rhythm Changes is derived from, abstracted from, George Gershwin’s “I Got Rhythm.” Now let’s think about the whole tune, not just its harmonic trajectory, i.e. Rhythm Changes. In addition to that trajectory we also have a specific melody, the lyrics, the rhythmic framework, and the arrangement. The lyrics are optional; the tune can be performed without them, and among jazz musicians that is the typical, if not universal, performance practice. Note, however, that any consideration of the lyrics brings a whole other memetic field into consideration, that of language. Read the rest of this entry →

Nothing in Language Makes Sense…

March 20, 2012 in Evolution, Linguistics, Science

… Except in the Light of Biological and Cultural Evolution

Sean mentioned in one of his many Evolang posts that, based on de Boer’s talk, the real audience for researchers of cultural evolution should be biologists. Well, deciding that actions plus words can work far better together, I decided to get in contact with Jeremy Yoder of the excellent group blog, Nothing in Biology Makes Sense. The result: an introductory post on the biological and cultural evolution of language called Crossing Those Curious Parallels (after Darwin’s famous passage describing the similarities between linguistic and biological change). Most regular readers will be familiar with the content and argument as the article is a pastiche of earlier pieces I wrote on this blog, but there is a sprinkling of some original paragraphs here and there. So feel free to go over, leave a comment and help foster some cross-disciplinary discussions. Actually, on cross-disciplinary note: since physicists seem so keen to solve problems in linguistics, maybe we should lend them a hand and run a corpus analysis to discover that elusive mass of the Higgs boson.

 

 

In Search of the Wild Replicator

March 15, 2012 in Evolution, Linguistics


The key to the treasure is the treasure.
– John Barth

In view of Sean’s post about Andrew Smith’s take on linguistic replicators I’ve decided to repost this rather longish note from New Savanna. I’d orignally posted it in the Summer of 2010 as part of a run-up to a post on cultural evolution for the National Humanities Center (USA); I’ve collected those notes into a downloadable PDF. Among other things the notes deal with William Croft’s notions (at least as they existed in 2000) and suggests that we’ll find language replicators on the emic side of the emic/etic distinction.

I’ve also appended some remarks I made to John Lawler in the subsequent discussion at New Savanna.

* * * * *
There’s been a fair amount of work done on language from an evolutionary point of view, which is not surprising, as historical linguistics has well-developed treatments of language lineages and taxonomy, the “stuff” of large-scale evolutionary investigation. While this work is directly relevant to a consideration of cultural evolution, however, I will not be reviewing or discussing it. For it doesn’t deal with the theoretical issues which most concern me in these posts, namely, a conceptualization of the genetic and phenotypic entities of culture. This literature is empirically oriented in a way that doesn’t depend on such matters.

 

The Arbitrariness of the Sign

In particular, I want to deal with the arbitrariness of the sign. Given my approach to memes, that arbitrariness would appear to eliminate the possibility of word meanings could have memetic status. For, as you may recall, I’ve defined memes to be perceptual properties – albeit sometimes very complex and abstract ones – of physical things and events. Memes can be defined over speech sounds, language gestures, or printed words, but not over the meanings of words. Note that by “meaning” I mean the mental or neural event that is the meaning of the word, what Saussure called the signified. I don’t mean the referent of the word, which, in many cases, but by no means all, would have perceptible physical properties. I mean the meaning, the mental event. In this conception, it would seem that that cannot be memetic.

That seems right to me. Language is different from music and drawing and painting and sculpture and dance, it plays a different role in human society and culture. On that basis one would expect it to come out fundamentally different on a memetic analysis.

This, of course, leaves us with a problem. If word meaning is not memetic, then how is it that we can use language to communicate, and very effectively over a wide range of cases? Not only language, of course, but everything that depends on language. Read the rest of this entry →

Evolang coverage: Animal Communication and the Evolution of Language

March 13, 2012 in Evolution, Science

Are there more differences or more similarities between human language and other animal communication systems? And what exactly does it tell us if we find precursors and convergent evolution of aspects similar to human language? These were some of the key questions at this year’s Evolang’s Animal Communication and Language Evolution Workshop (proceedings for all workshops here).

As Johan Bolhuis pointed out, ever since Darwin (1871), comparing apes and humans’ seemed like the most logical thing to do when trying to find out more about the evolution of traits presumed to be special to humans. Apes and especially chimpanzees, so the reasoning goes, are after all our closest relatives and serve as the best models for the capacities of our prelinguistic hominid ancestors. The comparative aspects of language have gained new attention since the controversial Hauser, Chomsky, Fitch (2002) paper in Science. For example, their claim that the capacity for producing and understanding recursive embedding of a certain kind is uniquely human was taken up by some researchers (including Hauser and Fitch themselves) who looked for syntactic abilities in other animals. More recently, songbirds have also become a centre of attention in the animal communication literature, with pretty much everything being quite controversial, however.

What is important here, according to the second workshop organizer Kazuo Okanoya, is that when doing research and theorizing, we should not treat humans as a special case, but as on a continuum with animals. And this also holds for language. In explaining language evolution, we don’t want to speak of a sudden burst that gave us something that is wholly different from anything else in the animal kingdom, but more of a continuous transition and emergence of language. For this it is, important to study other animals in closer details if we are to arrive at a continuous explanation of language emergence. Granted, humans are special. But simply saying they are special isn’t scientific. We need to detail in what ways humans are special.

Regarding the central question whether there are more differences or similarities between language and animal communication, and what exactly these similarities and differences are, opinions of course differ. After the first speaker didn’t turn up Irene Pepperberg gave an impromptu talk on her work with parrots. Taking the example of a complex exclusion task, she argued that symbol-trained animals can do things other animals simply cannot, and that this might be tied to the complex cognitive processing that occurs during language (and vocal) learning. She also stressed that birds can serve as good models for the evolution of some aspects underlying language because they developed broadly similar vocal learning capacities like humans in a process referred to as parallel evolution, convergence, or analogy. Responding to other prevalent criticism, Pepperberg counters the view that animals like Alex and Kanzi are simply exceptional and unique, just like not every human is a Picasso or a Beethoven. What Picasso and Beethoven show us is what humans can be capable of, and the same holds for animals and Alex and Kanzi. No one would argue that animals have language in the sense that humans do. But given that they have the brain structures and cognitive capacities to allow a more complicated vocal learning and complicated cognitive processing means we can use them as a model of how these processes might have got started. There is still much work to be done, especially questions like what animals like parrots actually need and use these complex vocal and cognitive capacities for in the wild.

Whereas Dominic Mitchell argued in his talk that there is indeed a discontinuity between animal communication and human language with reference to animal signaling theory (e.g. Krebs & Dawkins 1984), Ramon Ferrer-i-Cancho after him focused more on the similarities. Specifically, he showed quite convincingly that statistical patterns in language, like Zipf’s law, the law of brevity, the law that more frequent words are shorter, and the Menzerath-Altmann law (the longer the words the shorter the syllables) can also be found in the communicative behaviours of other animals. Zipf’s law for word frequencies, for example, can also be observed in the whistles of bottlenose dolphins. A criticism of Zipf’s law in the Chomskyan tradition holds that it just as well applies to random typing and rolling the dice, but Ferrer-i-Cancho showed that it is simply not the case by plotting the actual distribution of random typing and rolling the dice which is actually quite different from the logarithmic distribution of Zipf’s law if you look at it in any detail. The law that more frequent words are shorter can also be found in Chickadee calls, Formosan macaques and Common marmosets. There is some controversy whether this law really holds for all of these species, especially common marmosets, but Ferrer-i-Cancho presented a reanalysis of criticism in which he showed that what there are no “true exceptions” to the law. He proposes an information theoretic explanation for these kinds of behavioural universals where communicative solutions converge on a local optimum of differing communicative demands. He also proposes that considerations like this should lead us to change our perspective and concepts of universals quite radically, and that instead of looking only for linguistic universals we should also look for universals of communicative behavior and universal principles beyond human language such as cognitive effort minimization and mean code length minimization.

Returning to birds, Johan J. Bolhius picked the issue of similarities and differences up again and showed that there is in fact a staggering amount of similarities between birds and humans. For example, songbirds also learn their songs from a tutor (most often their father) and make almost perfect copies of their songs. As Hauser, Chomsky, Fitch 2002 have already pointed out, this signal copying seems not to be present in apes and monkeys. But the similarities go even further than that: Songbirds “babble” before they can sing properly (a period called ‘subsong’) and they also have a sensitive period for learning. And there are not only behavioural, but also neural similarities. In fact, songbirds seem to have a neural organization, broadly similar to the human separation between Broca’s area (mostly concerned with production, although this simple view of course is not the whole story, as James, for example, has shown) and Wernicke’s area (mostly concerned with understanding). So there seem to be regions that are exclusively activated when animals hears songs (kinda Wernicke-Type region) and regions with neuronal activation when animals sing, something which is called the ‘song system. Interestingly, this activation is also related to how much the animal has learned about that particular song it is hearing, so the better it knows the song the more activation is there. This means that this regions might be related to song memory. In lesion studies, where these regions involved in listening to a known song were damaged, recognition of the songs were indeed impaired but not wholly wiped out. Song production, on the other hand was completely unimpaired, mirroring the results from patients with lesions to either Broca’s or Wernicke’s areas. Zebra finches also show some degree of lateralization in that there is stronger activation in the left hemisphere when they hear the song they know, but not when the song they hear is unfamiliar. Although FOXP2 is not a “language gene”, which can’t be stressed enough, it is interesting that songbirds in which the bird-FOXP2-gene was “knocked out” show incomplete learning of the tutor songs.

Overall, Bolhuis concludes that what we can learn from looking at birdsong is that there are three significant factors evolved in the evolution of language:

Homology in the neural and genetic mechanisms due to our shared evolutionary past with birds.

Convergences or parallel evolution of auditory-vocal learning

And last specialisations, specifically human language syntax, which as Bolhuis argued in a paper with Bob Berwick and Kazuo Okanoya is still vastly different in complexity and hierarchical embedding from everything in songbird vocal behavior.

This focus on syntactic ability stems of course from a generativist perspective on these issues, and future research, especially from new and up-and-coming linguistic schools like Cognitive Linguistics and Construction Grammar (cf. Hurford 2012) is sure to bring more light into the matter of how exactly human language works, what kinds of elements and constructions it is made of, and how these compare to what is found in animals, and whether there really a single unitary thing like the fabled “syntactic ability” of humans (cf. e.g.work by Ewa Dabrowska)

 

Evolang Previews: Cognitive Construal, Mental Spaces, and the Evolution of Language and Cognition

March 10, 2012 in Evolution, Linguistics, Science

Evolang is busy this year – 4 parallel sessions and over 50 posters. We’ll be posting a series of previews to help you decide what to go and see. If you’d like to post a preview of your work, get in touch and we’ll give you a guest slot.

Michael Pleyer Cognitive Construal, Mental Spaces, and the Evolution of Language and Cognition Poster Session 1, 17:20-19:20, “Hall” (2F), 14th March

Perspective-taking and -setting in language, cognition and interaction is crucial to the creation of meaning and to how people share knowledge and experiences. As I’ve already written about on this blog (e.g. herehere, here), it probably also played an important part in the story of how human language and cognition came to be. In my poster presentation I argue that a particular school of linguistic thought, Cognitive Linguistics (e.g. Croft & Cruse 2004; Evans & Green 2006; Geeraerts & Cuyckens 2007; Ungerer & Schmid 2006), has quite a lot to say about the structure and cognitive foundations of perspective-taking and -setting in language.

Therefore an interdisciplinary dialogue between Cognitive Linguistics and research on the evolution of language might prove highly profitable. To illustrate this point, I offer an example of one potential candidate for such an interdisciplinary dialogue, so-called Blending Theory (e.g. Fauconnier & Turner 2002), which, I argue,  can serve as a useful model for the kind of representational apparatus that needed to evolve in the human lineage to support linguistic interaction. In this post I will not say much about Blending Theory (go see my poster for that ;-) or browse here ), but I want to  elaborate a bit on Cognitive Linguistics and why it is a promising school of thought for language evolution research, something which I also elaborate on in my proceedings paper.

So what is Cognitive Linguistics?

Evans & Green (2006: 50), define Cognitive Linguistics as

“the study of language in a way that is compatible with what is known about the human mind, treating language as reflecting and revealing the mind.”

Cognitive Linguistics sees language as tightly integrated with human cognition. What is more, a core assumption of Cognitive Linguistics is that principles inherent in language can be seen as instantiations of more general principles of human cognition. This means that language is seen as drawing on mechanisms and principles that are not language-specific but general to cognition, like conceptualisation, categorization, entrenchment, routinization, and so forth.

From the point of view of the speaker, the most important function of language is that it expresses conceptualizations, i.e. mental representations. From the point of view of the hearer, linguistic utterances then serve as prompts for the dynamic construction of a mental representation. Crucially, this process of constructing a mental representation is fundamentally tied to human cognition and our knowledge of the world around us. Read the rest of this entry →

Everett, Pirahã and Recursion: The Latest

February 1, 2012 in Evolution, Genetics, Linguistics

Discussing the concept of recursion is like a rite of passage for anyone interested in language evolution: you go through it once, take a position and hope it doesn’t come back to haunt you.  As Hannah pointed out last year, there are two definitions of recursion:

(1) embeddedness of phrases within other phrases, which entails keeping track of long-distance dependencies among phrases;

(2) the specification of the computed output string itself, including meta-recursion, where recursion is both the recipe for an utterance and the overarching process that creates and executes the recipe.

The case of grammatical recursion (see definition 1) is perhaps most famously associated with Noam Chomsky. Not only does he claim all human languages are recursive, but also that this ability is biologically hardwired as part of our genetic makeup. Countering Chomsky’s first claim is the debate surrounding a small Amazonian tribe called the Pirahã: even though they show signs of recursion, such as the ability to recursively embed structures within stories, the Pirahã grammar is claimed not to recursively embed phrases within other phrases. If true, then are numerous implications for a wide variety of fields in linguistics, but this is still an unsubstantiated claim: for the most part, we are relying on one specific researcher (Daniel Everett) who, despite having dedicated a large portion of his life to studying the tribe, could very well have been misled. That said, I retain a large amount of respect for Everett, having watched him speak at Edinburgh a few years ago and read his book on the topic: Don’t Sleep, There are Snakes: Life and Language in the Amazonian Jungle.

So, why am I rambling on about recursion? Well, besides its obvious relevance, — and perhaps under-representation on this blog (deserved or not, I’ll let you decide) — Everrett has recently published a series of slides about a corpus study of Pirahã grammar (see below).

His tentative conclusion: there is no strong evidence for recursion among relative clauses, complement clauses, possessive structures and conjunctions/disjunctionsHowever, there is possible evidence of recursive structure in topics/repeated arguments. He also posits cultural pressures for longer or shorter sentences, such as writing systems (as I mentioned way back in 2009).

I’m sure this debate will be brought to the fore at this year’s EvoLang, with Chomsky Berwick Piattelli-Palmarini and many of the Biolinguistic crowd in attendance, and it’s a shame I’ll almost certainly miss it (unless someone wants to pay for my ticket… Just hit the donate button in the left-hand corner ;-) ).

Read the rest of this entry →

Animal Cognition & Consciousness (II): Metacognition & Mentalizing

January 12, 2012 in Evolution, Science

As I wrote in my last post, three kinds of behaviours are most often discussed in debates about animal consciousness and cognition:

“1. Mirror self-recognition

2. Tests of metacognition;

3. Metacognition of others’ mental states” (Gómez 2009: 45)

After having discussed the first capacitiy in my previous post, I will discuss the latter two in this post, starting with metacognition, that is being aware of one’s own knowledge states, and then turn to being aware of other’s mental states.

Metacognition.

Being aware of one’s own mental states, i.e., reflective consciousness, surely seems to be one of the most crucial components of self-awareness. In one paradigm used to test for metacognitive awareness, monkeys were trained to select, out of a number of two or more images, the one that is identical to an image they have been shown earlier. As is to expected, the monkeys’ performance progressively deteriorated the longer the delay was between the sample image and the selection task.

 

Read the rest of this entry →