Language Evolution and Levels of Explanation

April 16, 2012 in Evolution, Evolution, Linguistics, Science, Science News

A somewhat contentious debate among the behavioural sciences is currently underway concerning Mayr’s division of causal explanations in evolutionary theory. Here I’m going to give you a brief rundown of two papers in particular, before I chip in my two-cents about how other insights from the theoretical literature can inform this debate. It seems the discussion is just getting started with respect to cultural evolution, so it’d be interesting to hear other peoples’ comments from either camp.

Over the years, evolutionary theorists have tried to make logical divisions between the kinds of things we can ask about, with a view to making it clear what exactly scientific studies can tell us. A dominant paradigm dividing two levels of causation for biological features we see in the world is Mayr’s distinction between ultimate and proximate causes.  Ultimate causation explains the proliferation of a trait in a population in terms of the evolutionary forces acting on that trait. For example, peahens that prefer peacocks with larger tails (an honest signal of fitness following the handicap principle) will have stronger or more successful offspring, and so this preference proliferates along with larger peacock tails. Proximate causation uses immediate physiological and environmental factors to explain a particular peahen’s penchant for a large-tailed peacock in a mate choice trial, where the signal of the peacock’s large tail elevates the hormone levels in the peahen and copulatory behaviour ensues. Although the behaviour in both of these examples is the same, the levels of explanation are based on different sets of factors.

In Perspectives on Psychological Science last year, a paper by Scott-Phillips, Dickins and West voiced some concerns about these two levels of causation being conflated in the behavioural sciences. In particular, they addressed instances where proximate explanations of traits are being framed as ultimate ones. The paper points specifically to studies of the evolution of cooperation, transmitted culture and epigenetics to illustrate this. Regarding the evolution of cooperation, they point to an instance where ‘strong reciprocity’ (an individual’s propensity to reward cooperative norms and sanction violation of these norms) is purported to be an ultimate explanation of why humans cooperate, rather than a proximate mechanism that enables such cooperation.

Table of proximate and ultimate explanations. For the topic of 'linguistic structure', cultural transmission is listed as a proximate explanation, and the adaptive benefit of cooperative activity through communication is given as an ultimate explanation.,

Table 1 from Scott-Phillips et al. (2011), highlight added.

Among the examples was the feature of linguistic structure (see table 1 from paper above), where several studies pointed to the cultural transmission process as an ultimate explanation of linguistic structure. They suggest that cultural transmission constitutes a proximate process, because it gives the means by which linguistic structure is expressed – and this is how cultural transmission contributes to what the linguistic structure looks like. One analogy might be that the vibrating of my particular vocal cords is a proximate mechanism giving rise, in part, to how my voice sounds, rather than an ultimate explanation of why I vocalise. Since an ultimate account must suggest how a trait contributes to inclusive fitness in order to explain its prevalence in humans, they uncontroversially venture that the ultimate rationale for the ubiquity of linguistic structure is that it greater enables communication (and therefore increases inclusive fitness by enabling cooperative activity).

An opposing view was later published in Science by Laland, Sterelny, Odling-Smee et al., who suggest that the use of Mayr’s division of ultimate and proximate causation is not helpful to all evolutionary investigations, and even hampers progress. The grounds for rejecting Mayr’s paradigm seem to lie largely in what Laland et al. term “reciprocal causation”. That is, that “proximate mechanisms both shape and respond to selection, allowing developmental processes to feature in proximate and ultimate explanations”. After aligning proximate explanations with ontogeny and ultimate explanations with phylogeny, they suggest that what we may have called ultimate and proximate features are no longer sharply delineated, and that these reciprocal processes mean that the source of selection sometimes cannot be separated. They present an idea from the field of evolutionary-developmental biology that, if a developmental process makes some variant of a trait more likely to arise than others, then this proximate mechanism helps to construct an “evolutionary pathway”.

Flow charts presenting alternative processes for Biological evolution, Cultural evolution and Gene-culture coevolution

Figure 2 from Laland et al. (2011)

The paper also highlights developmental plasticity, and gene-environment interaction more broadly (see fig. 2 from paper, above), as a process where reciprocal causation offers an evolutionary explanation conceptually comparable to ultimate causation. Talking specifically on the topic of linguistic structure, they present the debate about whether specific design features of language are attributable to biological or cultural evolution. The paper points out that cultural evolution determines features of linguistic structure – for example, word order – and that the existing word order determines that of future speakers. Indeed, at the Edinburgh LEC we know that transmission by iterated inductive inference under general conditions can explain particular structures in languages. That cultural evolution determines the variation between languages, Laland et al. say, provides evidence that it is an evolutionary force comparable to natural selection (and, therefore, ultimate explanation).

What follows is a collection of my thoughts on the matter, which are (spoiler alert) largely in support of the Scott-Phillips et al. paper. I hope others more experienced in cultural evolution studies than I will contribute their perspective.

It seems to me that there are a few assumptions made in the Laland et al. paper that are not quite in line with how Mayr himself understood the paradigm, and perhaps much can be learned from this debate’s previous incarnation when Richard C. Francis made similar arguments against the ultimate/proximate distinction in 1990. In his critique, he equated ultimate causation with phylogeny and proximate causation with ontogeny – an approach that was rebuked by Mayr in 1993, who made the point that “all physiological activities are proximately caused, but is a reflex an ontogenetic phenomenon?” Mayr’s response is actually rather unhelpful in addressing the arguments fully, and this statement is particularly dense. But what he is getting at here is the idea that interaction with the environment that gives rise to adaptive behaviours (such as recoiling instantly from a hot stove) is itself subject to selection, and thus constitutes a proximate explanation of causation. Relatedly, he points out that most components of the phenotype are indeed the result of genetic contribution and interaction with the environment, which has been successfully explored in biology within the traditional theoretical paradigm.

A perhaps more nuanced account of how we can divide the possible explanations of biological phenomena is offered by Tinbergen in his “four questions”, where ultimate explanations are further subdivided into Function (concerning the adaptive solution to a survival problem favoured by natural selection) and Phylogeny, which is a historical account of when the trait arose in the species, and importantly includes processes other than natural selection that give rise to variation – such as mutation, drift and the constraints imposed by pre-existing traits (see blind spot example below). Proximate explanations are further split into Mechanism (immediate physiological/environmental factors causal in how the trait operates in the individual) and Ontogeny (the way in which this trait develops over the lifetime of the individual). As a simple example, here is the paradigm applied to a trait like mammalian vision that I lifted from Wikipedia:
Ultimate
Function: To find food and avoid danger.
Phylogeny: The vertebrate eye initially developed with a blind spot, but the lack of adaptive intermediate forms prevented the loss of the blind spot.
Proximate
Causation: The lens of the eye focuses light on the retina
Ontogeny: Neurons need the stimulation of light to wire the eye to the brain within a critical period (as those awful studies of blindfolded kittens illustrated).
A schematic below, adapted from Tinbergen (1963) shows how these levels of causation may interact with one another, which appears to communicate something roughly comparable to the importance Laland et al. place on “reciprocal causation” in the formation of adaptive variants:

Flow chart showing the interaction of functional, phylogenetic, mechanistic and ontogenetic explanations

Adapted from Tinbergen (1963); Causal Relationships

Applied the to debate outlined above, it would seem that there is no apparent reason that a process of gene-environment interaction – including the cultural environment – can’t itself be subject to selection, or that developmental plasticity itself is not an adaptation in need of an ultimate explanation. It has long been the case that behaviour is no longer understood as either “nature” or “nurture”, but gene-environment interaction, with varying levels of heredity. The “reciprocal causation” suggested in Laland et al.’s paper, is (as they point out) very common in nature; feedback loops are uncontroversial proximate processes in biology. That a proximate process may give rise to a dominant variant of a trait in a population does not explain why it is adaptive, and this points to another problem with the proposing the abandonment of Mayr’s paradigm: a logical division of levels of explanation doesn’t seem to be the sort of thing that can be rendered outdated by empirical evidence. Indeed, claims about the particulars of traits and processes (and languages) themselves are a matter for empirical data – but the theoretical issue about the level of explanation that data is useful for does not itself seem to be subject to empirical findings.

The finding that a proximate process such as cultural transmission gives rise to a trait that is prolific in a population is itself exciting and surprising, and even shows us that the pressure for making language easier to learn gives us adaptive languages to learn; however, it could be argued that it is this process that is adaptive, and that the reason why humans so heavily rely on this process is an ultimate explanation.

One way of resolving these two perspectives may be to place cultural processes that give rise to variation at the level of what Tinbergen labels Phylogenetic (one subset of ultimate) explanation, as it concerns processes which produce some heightened frequency of traits over a language’s history. An explanation at the level of Phylogeny still must make recourse to natural selection at some point, since variants that result from mutation or drift are retained because of their adaptive value (or an adaptive trade-off). This approach may be a problem for the current understanding, which holds that the features resulting from cultural processes are themselves adaptive and therefore comparable to what Tinbergen labels Function.

The problem with this is that calling particular structures of language ‘adaptive’ obscures what it is about Language that is actually being selected for. To flesh out what I mean, I think it’s useful to consult Millikan’s (1993) distinction between Direct Proper Function and Derived Proper Function (… bear with me, it’ll be worth it, honest). The Direct Proper Function of a given trait T can be thought of as a “reproduction” of an item that has performed the exact same adaptive function F, and T exists because of these historical performances of F. Sperber and Origgi (2000) use the illustrative example of the heart, where the human heart has a bunch of properties (it pumps blood, makes a thumping noise, etc), but only its ability to pump blood is its Direct Proper Function. This is because even a heart that doesn’t work right or makes irregular thumping noises or whatever, still has the ability to pump blood. Hearts that pump blood have been “reproduced through organisms that, thanks in part to their owning a heart pumping blood, have had descendents similarly endowed with blood-pumping hearts”.

The Derived Proper Function, however, refers to a trait T that is the result of some device that, in some environment, has a Proper Function F. In that given environment, F is usually achieved by the production of something like T. If I unpack this idea and apply it to language, we can understand it as the acquisiton and production of a device that, in this environment, leads to, say, a particular SVO language, T. The Proper Function of adaptive communication is performed by T in this case, but could also be performed by any number of SOV, VSO, etc Ts in other cases. In other words, the Proper Function of this language is not the word order itself, but communication. The word order is the realisation of this device that is reproduced because of the performance of T in a particular environment, but does not necessarily lead to T in the next incarnation of that device (i.e. My child, if born and raised in Japan, will speak Japanese). We see, then, that a proximate process resulting in what a particular language spoken by a given population looks like does not necessarily speak to the evolutionary function. In other words, it is the device that allows the performance of Language that is adaptive, not the individual language itself.

One question being asked in the study of cultural transmission is why a particular language looks like it does, while we also know that there are 6000 different versions that perform the same (ultimate) function. I would even argue that asking how proximate processes shape languages is actually the most exciting and interesting avenue of inquiry precisely because it’s so blindingly obvious what the adaptive function of language is. But perhaps the value in this endeavour is somewhat neglected, in part, because of the same impression that Francis (1990) had: “the attitude, implicit in the term ultimate cause, [is] that these functional analyses are somehow superordinate to those involving proximate causes” which would be a shame. It seems to me that the coarse grain of ultimate vs proximate perhaps doesn’t do enough to help complex proximate study to position itself in the wider theoretical framework, and the best way to proceed from this might be to couch explanation in terms of Function, Phylogeny, Ontogeny and Mechanism. I think more fine-grained terminology grants us more explanatory power, in this case.

A final question in this debate that came up too many times during discussions with the LEC is: what does keeping the traditional paradigm “buy us”? Well, the first answer to this is consilience with one of the most successful and robust theories in science. The same sentiment has been communicated by Pinker and Bloom (1990), who said: “If current theory of language is truly incompatible with the neo-Darwinian theory of evolution, one could hardly blame someone for concluding that it is not the theory of evolution that must be questioned, but the theory of language”. Part of the reason this debate may have arisen is that studies of cultural evolution have used evolutionary theory as an incredibly fruitful way of analysing cultural processes, but additional acknowledgement about how cultural adaptation is different to biological adaptation may be necessary. This difference is an aspect of Laland’s paper (shown in Fig 2) that I think is important, as it’s part of the reason that more nuanced frameworks for cultural evolution are now needed. Without this widespread acknowledgement, cultural evolution may be considered an extension of biological evolutionary theory instead of a successfully applied metaphor. It seems to me that the side of this debate one falls on is well predicted by whether one subscribes to the former interpretation of cultural evolution or the latter.

Knowing which level of explanation current work pertains to is a valuable part of evolutionary exploration, and abandoning this in favour of an approach where proximate processes are explanatory ends to themselves may mean the exploration of Function and Phylogeny may suffer. That said, it is telling, I think, that even in seeking to abandon the proximate/ultimate distinction, we must still exploit this existing terminology in order to explain such a position. That natural selection has explained countless adaptations in all living things is certainly not trivial, and to reject the theory giving rise to ultimate explanations as they’re currently defined is to reject this fundamental aspect of evolutionary theory. The big problem seems to be that we’re coming to understand proximate processes as so elaborate and complex, that a more nuanced framework is needed to deal with the dynamics of those processes. I reckon, however, that such a framework can be developed within the traditional paradigms of evolutionary theory.

 

References

Francis, R.C. (1990) – “Causes, Proximate and Ultimate” Biology and Philosophy 5(4) 401-415.

Laland, K., Sterelny, K., Odling-Smee, J., Hoppitt, W. & Uller, T. (2011) – “Cause and Effect in Biology Revisited: Is Mayr’s Proximate-Ultimate Distinction Still Useful?” Science 334, 1512-1516.

Mayr, E. (1993) – “Proximate and Ultimate Causations” Biology and Philosophy 8: 93-94.

Millikan, R. (1993) – White Queen Psychology and Other Essays for Alice, Cambridge, Mass: MIT Press.

Pinker, S. & Bloom, P. (1990) – “Natural language and natural selection” Behaviour and Brain Sciences 13, 707-784.

Scott-Phillips, T. Dickins, T. & West, S. (2011) – “Evolutionary Theory and the Ultimate-Proximate Distinction in the Human Behavioural Sciences” Perspectives on Psychological Science 6(1): 38-47.

Sperber, D. & Origgi, G. (2000) – “Evolution, communication and the proper function of language” In P. Carruthers and A. Chamberlain (Eds.) Evolution and the Human Mind: Language, Modularity and Social Cognition (pp.140-169) Cambridge: Cambridge University Press.

Tinbergen, N. (1963) “On Aims and Methods in Ethology,” Zeitschrift für Tierpsychologie, 20: 410–433.

 

 

Wild Replicator’s Got Funky Rhythm, Part 1

March 23, 2012 in Evolution

Now that the replicator meme is out and about I’ve got more to say. I’m going to republish two more posts from my 2010 cultural evolution series. These posts are about music. I have various reasons for using music as my cultural evolution conceptual sandbox. For one thing, it means that I don’t have to contend with semantic meanings arbitrarily associated with bits of music. In music, all we’ve got is the physical signal.

In these two posts I choose, not a simple musical example but, rather, a complex one, something jazz musicians know as Rhythm Changes. While I could talk about the four-note motif Beethoven used to construct the first movement of his Fifth Symphony, which is a memetic favorite, that’s too easy. Thinking about it won’t stretch our intuitions about the memetic properties of mere physical things. That motif has four notes, with specific durations and specific note-to-note pitch relationships.

Rhythm Changes isn’t like that. It’s an abstract property of a sound stream. There is now specific number of notes, no specific durations, and no specific note-to-note pitch relationships. Thousands upon thousands of specific musical streams, many quite different from one another, have exemplified the properties of Rhythm Changes.

In the previous post (in this series) I argued memes, the cultural parallel to the biological gene, are those physical properties of objects, events, and processes that allow different individuals to coordinate their participation in those things. In this view, memes are not physical objects, like genes, that spread through a population. Rather, memes are about sharability; they are physical properties that can easily be identified by human nervous systems and thus be the basis for shared (cultural) activity.

In that post I considered a very basic case, people making noise at regular intervals. In that case we have two memes, period (the interval between “hits”) and phase (the relationship between streams of hits by different individuals). Now I want to consider a considerably more complex case, the entity jazz musicians know as Rhythm Changes. This entity assumes that, for a given performance, period length and phase value are agreed upon. In fact it assumes a lot more. We’re dealing with a whole lot of memes here.

But I don’t want to get hung up in those details. I just want to characterize Rhythm Changes in a reasonable way and explain just why I insist that we regard Rhythm Changes as a structured collection of physical properties that can be ascribed to a stream of sound. While it would be nice to characterize Rhythm Changes using the language of acoustics, it’s not at all clear to me that we’ve got the necessary concepts. In any event, if we do, I don’t know them. Instead, I’ll couch my description in the schematic terms jazz musicians tend to use when talking about their craft; these terms are derived, in part, from descriptive and analytic concepts developed for European art music (i.e. classical music).

I’m going do this in two posts, the first will be confined to Rhythm Changes itself. The second will consider how Rhythm Changes came into being and how it functions in the popular music system. Read the rest of this entry →

In Search of the Wild Replicator

March 15, 2012 in Evolution, Linguistics


The key to the treasure is the treasure.
– John Barth

In view of Sean’s post about Andrew Smith’s take on linguistic replicators I’ve decided to repost this rather longish note from New Savanna. I’d orignally posted it in the Summer of 2010 as part of a run-up to a post on cultural evolution for the National Humanities Center (USA); I’ve collected those notes into a downloadable PDF. Among other things the notes deal with William Croft’s notions (at least as they existed in 2000) and suggests that we’ll find language replicators on the emic side of the emic/etic distinction.

I’ve also appended some remarks I made to John Lawler in the subsequent discussion at New Savanna.

* * * * *
There’s been a fair amount of work done on language from an evolutionary point of view, which is not surprising, as historical linguistics has well-developed treatments of language lineages and taxonomy, the “stuff” of large-scale evolutionary investigation. While this work is directly relevant to a consideration of cultural evolution, however, I will not be reviewing or discussing it. For it doesn’t deal with the theoretical issues which most concern me in these posts, namely, a conceptualization of the genetic and phenotypic entities of culture. This literature is empirically oriented in a way that doesn’t depend on such matters.

 

The Arbitrariness of the Sign

In particular, I want to deal with the arbitrariness of the sign. Given my approach to memes, that arbitrariness would appear to eliminate the possibility of word meanings could have memetic status. For, as you may recall, I’ve defined memes to be perceptual properties – albeit sometimes very complex and abstract ones – of physical things and events. Memes can be defined over speech sounds, language gestures, or printed words, but not over the meanings of words. Note that by “meaning” I mean the mental or neural event that is the meaning of the word, what Saussure called the signified. I don’t mean the referent of the word, which, in many cases, but by no means all, would have perceptible physical properties. I mean the meaning, the mental event. In this conception, it would seem that that cannot be memetic.

That seems right to me. Language is different from music and drawing and painting and sculpture and dance, it plays a different role in human society and culture. On that basis one would expect it to come out fundamentally different on a memetic analysis.

This, of course, leaves us with a problem. If word meaning is not memetic, then how is it that we can use language to communicate, and very effectively over a wide range of cases? Not only language, of course, but everything that depends on language. Read the rest of this entry →

Imitation and Social Cognition in Humans and Chimpanzees (I): Imitation, Overimitation, and Conformity

January 17, 2011 in Evolution, Science

ResearchBlogging.org

Imitation is often seen as one of the crucial foundations of culture because it is the basis of  social learning and social transmission. Only by imitating others and learning from them did human culture become cumulative, allowing humans to build and improve on the knowledge of previous generations. Thus, it may be one of the key cognitive specializations that sparked the success of the human evolutionary story:

Much of the success of our species rests on our ability to learn from others’ actions. From the simplest preverbal communication to the most complex adult expertise, a remarkable proportion of our abilities are learned by imitating those around us. Imitation is a critical part of what makes us cognitively human and generally constitutes a significant advantage over our primate relatives (Lyons et al. 2007: 19751).

Indeed, there have been some interesting experiments suggesting that the human capacity -and, above all, motivation – for imitation is an important characteristic that separates us from the other great apes.

In a series of intriguing experiments by Victoria Horner and Andrew Whiten from the University of St. Andrews in Scotland, and Derek Lyons and his colleagues from Yale University,  young wild-born chimpanzees and Children aged 3 to 4 were shown how to get a little toy turtle/ a reward out of a puzzle box. In the first condition of the experiment the puzzle box was transparent, whereas in the second condition the puzzle box was opaque.

And here’s the catch: both chimpanzees and children were not shown the ‘right’ or ‘simple’  solution to how to get the reward but one that was actually more complicated and involved unnecessary steps.

Read the rest of this entry →

Futurama Supports Cultural Evolution

September 27, 2010 in Uncategorized

A recent episode of Futurama – ‘A clockwork Origin‘ – sees the Professor go head to head with Creationists, and makes a good point about the difference between Creationism and Evolution.

After his work on the origin of man is mocked, the Professor decides to leave Earth for a desolate planet. However, the nano-bots which he designs to purify the water evolve overnight into a full-scale ecosystem, complete with robo-dinosaurs that are eventually wiped out by a solar flare.

At first, the Professor is unwilling to see the change in the robots as Evolution, saying ‘Those robots didn’t evolve by themselves, I put them there – I’m a genius, get over it’. However, after the crew is captured by robo-anthropologists (anthrobopologists?), he’s forced to admit that they really have Evolved. Further, he’s forced to admit that he has no problem with the idea of a creator playing a small part in the origins of Humans, just as he started the race of robots.

This is all good news for those studying Cultural Evolution – people are coming round to the idea that Evolution is an abstract process rather than the theory of how humans evolved from apes.  Perhaps us researchers will be spared in the robot uprising, which is sure to come:  As Bender says “Robots do everything faster, including evolving”.

Bleg: Why CULTURAL Evolution?

September 16, 2010 in Evolution

That is to ask: Why has there been so much interest in cultural evolution in the last two decades or so? It seems to me that a lot of this thinking is just messing around, seeing if evolutionary ideas can somehow be attached to cultural phenomena in a coherent way. It seems more motivated by a desire to extend evolutionary thinking than by a desire to understand culture. And it’s not obvious to me that anyone has actually explained anything in this process, not so far.

In particular, has anyone used some theory of cultural evolution to explain some phenomenon of culture as well as, and ideally, better than competing non-evolutionary accounts? It’s not at all obvious to me that the answer to that question is “Yes.”

Note that I don’t exempt my own efforts from this criticism, which is why, on the whole, I’ve devoted more time to examining and analyzing cultural phenomenon than I have conceptualizing cultural evolution. In particular, I’ve spent a lot of time thinking about popular music in America, and the interaction of African-derived and European-derived styles (see, e.g. this longish paper) and, more recently, I’ve been looking at graffiti, which I’ll address later on.

Evolution?

One caveat: A lot depends on just what one means by cultural evolution. If one is just using ‘evolution’ as a substitute for ‘change,’ then the question has little meaning. It seems to me that much of memetics is like this, with the added innovation of attributing agency to the memes, rather than to people.

And then there’s gene-cultural coevolution (GCCE). Those folks may well have succeeded in coming up with useful explanations, e.g. lactose tolerance. But it’s not at all clear to me that GCCE can work with the kinds of phenomena that most interest me and that do constitute a great deal of cultural activity. As I’ve explained here, it’s not clear to me that GCCE has anything to say, for example, about something like the growth of graffiti in the last 40 years.

Graffiti 1

By ‘graffiti’ I don’t mean any writing on walls, but the specific practice that originated on the East Coast of the USA in Philadelphia and New York City in the late 1960s and early 1970s. The practice seems to have been pretty much confined to those cities by the early 1980s. But it had gone world-wide by the start of the current millennium. How did that happen? And why?

I don’t see that GCCE has any tools to answer that question. The spread is too fast for any biological changes to have been involved. Whatever’s going on has been going on purely within the cultural sphere. There are obvious things to point to concerning how it happened: 1) Press coverage of early graffiti made the activity more visible. 2) When graffiti became associated with hip-hop, it followed hip-hop in its spread through world pop culture. 3) Photography, books, and films (Style Wars, Wild Style) spread the word. 4) The emergence of the world-wide-web created a new means by which photos of graffiti could made instantly available around the world.

But none of that explains why the practice spread. What made graffiti so attractive to so many people in so many different places around the world? And why has it been, on the whole, so conservative, so that the themes and motifs that originated in the East Coast of the USA in the late 1970s and early 1980s are showing up in Japan in then 2000s? On one level that question answers itself. If the designs changed rapidly, so that putting any old design up on the walls counted as graffiti, then the activity would loose its identity, its genealogical connection with those first writers in New York City and Philadelphia. It would just be painting on walls, illegally. Big deal.

The genealogical connection IS important. Why? Note that, while stylistic conservatism maintains that identity, we also have to allow for the identity of individual writers within the tradtion. The tradition has to have enough internal variety to allow for that.

There are ways of talking about those questions, and you’ll find some of them in the literature, but the question I’m asking is this: Can a strong theory of cultural evolution do a better job of accounting for this spread than any other theory? If so, what would that theory look like?

Read the rest of this entry →

Population Size and Rates of Language Change

August 8, 2010 in Linguistics, Science

In previous posts, I’ve looked at the relationship between cultural evolution and demography (see here, here and here). As such, it makes sense to see if such methods are applicable in language which is, after all, a cultural product. So, having spent the last few days looking over the literature on language and demography, I found the following paper on population size and language change (free download). In it, the authors, Søren Wichmann and Eric Holman, use lexical data from WALS to test for an effect of the number of speakers on the rate of language change. Their general findings argue against a strong influence of  population size, with them instead opting for a model where the type of network influences change at a local level, through different degrees of connectivity between individuals. Here is the abstract:

Previous empirical studies of population size and language change have  produced  equivocal  results. We  therefore  address  the  question  with  a new set of lexical data from nearly one-half of the world’s languages. We first show that relative population sizes of modern languages can be extrapolated to ancestral languages, albeit with diminishing accuracy, up to several thousand years into the past. We then test for an effect of population against the null hypothesis that the ultrametric inequality is satisified by lexical distances among triples of related languages. The test shows mainly negligible effects of population, the exception being an apparently faster rate of change in the larger of two closely related variants. A possible explanation for the exception may be the influence on emerging standard (or cross-regional) variants from speakers who shift from different dialects to the standard. Our results strongly indicate that the sizes of speaker populations do not in and of themselves determine rates of language change. Comparison of this empirical  finding with previously published computer simulations suggests that the most plausible model  for  language  change  is  one  in  which  changes  propagate  on  a  local level in a type of network in which the individuals have different degrees of connectivity.

As I’m in the middle of several other things at the moment I don’t really have time to provide a thorough review of this paper. Having said that, I agree with their claim of population size being unlikely to account for rates of language change. I reckon their results would be stronger if they factored in population density. So those that are dense and large will change faster than those which are large and distributed. The main point being that population size and population density influence the degree of social interconnectivity. Nettle (1999), for instance, argues that “spreading an innovation over a tribe of 500 people is much easier and takes much less time than spreading one over five million people.” This is fairly reasonable if we are looking at the generation of a single innovation within each of these populations. However, if those 500 people are spread across a large distance, then their transmission chain is going to be stretched: effectively lowering the rate of transmission. The same applies for a population of five million individuals who are packed into a small area: Arguably, given the right conditions, we can arrive at a situation where a population of five million show greater levels of interconnectivity than 500. I think it’s this aspect, the level of social interconnectivity, which may be more relevant to the rate of language change (other things to test for, include: writing systems/literacy and inter-language contact).

Language About Language

August 8, 2010 in 101, Evolution

How is it, then, that we can talk about talking? If you are willing to assume the existence of basic perceptual and cognitive capacities, a relatively simple answer follows immediately. The sounds of talk are, after all, sounds like any other sounds. We can perceive them in the same way we perceive the sound of a waterfall or a bird’s song, a thunderclap or the rustling of leaves in the wind, a cricket’s chirp or the breaking of waves on a beach. All are things we can hear, easily and naturally, and so it is with the sound of the human voice.

Roman Jakobson famously theorized that language has six functions: referential, emotive, poetic, conative, phatic, and the metalingual function. That’s the function we’re interested in, our capacity to speak about speech. Jakobson talked of the metalingual function as an orientation toward the language code, which seems just a bit grand. For I’m led to believe that many languages lack terms for explicitly talking about the ‘code.’ Thus, in The Singer of Tales (Atheneum 1973, orig. Harvard 1960), Albert Lord attests (p. 25):

Man without writing thinks in terms of sound groups and not in words, and the two do not necessarily coincide. When asked what a word is, he will reply that he does not know, or he will give a sound group which may vary in length from what we call a word to an entire line of poetry, or even an entire song. [Remember, Lord is writing about oral narrative.] The word for “word” means an “utterance.” When the singer is pressed then to way what a line is, he, whose chief claim to fame is that he traffics in lines of poetry, will be entirely baffled by the question; or he will say that since he has been dictating and has seen his utterances being written down, he has discovered what a line is, although he did not know it as such before, because he had never gone to school.

While I’m willing to entertain doubts about the full generality of this statement – “man without writing” – I assume the it is an accurate report about the Yugoslavian peasants among whom Milman Parry and Albert Lord conducted their fieldwork and that it also applies to other preliterate peoples, though not necessarily to all.

Given those caveats, the paragraph is worth re-reading. Before doing so, recall how casually we have come to see language as a window on the workings of the mind in the Chomskyian and post-Chomskyian eras. If that is the case, then what can one see through a window that lacks even a word for words, that fails to distinguish between words and utterances? And what of the poets who don’t know what a line is? The lack of such knowledge does not stand in the way of the poeticizing, no more than the lack of knowledge of generative grammar precludes the ability to talk intelligently on a vast range of subjects.

Read the rest of this entry →

Time Travel, Dreams and The Origin of Knowledge

July 29, 2010 in Evolution, Irreverant and Irrelevant

I’ve been attending a weekly seminar on the Metaphysics of Time Travel, given by Alasdair Richmond.  Yesterday, he was talking about the way knowledge arises in causal chains.  Popper (1972 and various others) argues that “Knowledge comes into existence only by evolutionary, rational processes” (quoted from Paul Nahin, ‘Time Machines: Time Travel in Physics, Metaphysics and Science Fiction, New York, American Institute of Physics, 1999: 312).  Good news for us scholars of Cultural Evolution.  However, Richmond also talked about the work of David Lewis on the nature of causality.  There are three ways that causal chains can be set up:

The first is an infinite sequence of events each caused by the previous one.  For example, I’m typing this blog because my PhD work is boring, I’m doing a PhD because I was priced in by funding, I applied for funding because everyone else did … all the way back past my parents meeting and humans evolving etc.

The second option is for a finite sequence of events – like the first option, but with an initial event that caused all the others, like the big-bang.

The third option is a circular sequence of events.  In this, A is caused by B which is caused by A.  For instance, I’m writing doing a PhD because I got funding and I got funding because I’m doing a PhD, because I got funding.  There is no initial cause, the states just are. This third option seems really odd, not least because it involves time-travel.  Where do the states come from?  However, argues Lewis, they are no more odd than any of the other two options.  Option one has a state with no cause and option two has a cause for every event but no original cause.  So, how on earth can we get at the origin of knowledge if there is no logical possibility of determining the origin of any sequence of events?

One answer is just to stop caring after a certain point.  Us linguists are unlikely to get to the point where we’re studying vowel shifts in the first few seconds of the big bang.

The other answer is noise.  Richmond suggested that ‘Eureka’ moments triggered by random occurrences, for instance (Nicholas J. J. Smith, ‘Bananas Enough for Time Travel?’, British Journal for the Philosophy of Science, Vol. 48, 1997: 363-89). mishearing someone or a strange dream, could create information without prior cause.

Spookily, the idea I submitted for my PhD application came to me in a dream.

Culture-driven population dynamics: sustainable or unsustainable?

July 27, 2010 in Research Blogging, Science

ResearchBlogging.org This post was chosen as an Editor's Selection for ResearchBlogging.orgWhen looking at culture-driven population dynamics, a common assumption is that there’s a positive feedback between cultural evolution and demographic growth. The general prediction, then, is for unlimited growth in population and culture. Yet models based on these assumptions tend to ignore important aspects of cultural evolution, namely: (1) cultural transmission is not perfect; (2) culture does not always promote population growth. Ghirlanda et al (2010) incorporate these two features into a model, and arrive at some interesting conclusions. In particular, they argue those populations maintaining large amounts of culture may run the risk of extinction rather than stability or growth.

Read the rest of this entry →