<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments for Replicated Typo</title>
	<atom:link href="http://replicatedtypo.com/comments/feed" rel="self" type="application/rss+xml" />
	<link>http://replicatedtypo.com</link>
	<description>Culture, its evolution and anything inbetween</description>
	<lastBuildDate>Thu, 17 May 2012 21:53:25 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
	<item>
		<title>Comment on Having more children affects your basic word order by Claire</title>
		<link>http://replicatedtypo.com/having-more-children-affects-your-basic-word-order/5252.html/comment-page-1#comment-31022</link>
		<dc:creator>Claire</dc:creator>
		<pubDate>Thu, 17 May 2012 21:53:25 +0000</pubDate>
		<guid isPermaLink="false">http://www.replicatedtypo.com/?p=5252#comment-31022</guid>
		<description>You&#039;ve made my day with this! However, I wonder about your information structure story. Free word order languages (the languages correlated with the fewest number of children here) usually put the new/important information first, whether it&#039;s a noun phrase or verb. So your explanation should apply better to non-dominant word order languages than to SOV languages.</description>
		<content:encoded><![CDATA[<p>You&#8217;ve made my day with this! However, I wonder about your information structure story. Free word order languages (the languages correlated with the fewest number of children here) usually put the new/important information first, whether it&#8217;s a noun phrase or verb. So your explanation should apply better to non-dominant word order languages than to SOV languages.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Having more children affects your basic word order by John Pate</title>
		<link>http://replicatedtypo.com/having-more-children-affects-your-basic-word-order/5252.html/comment-page-1#comment-30912</link>
		<dc:creator>John Pate</dc:creator>
		<pubDate>Tue, 15 May 2012 07:42:37 +0000</pubDate>
		<guid isPermaLink="false">http://www.replicatedtypo.com/?p=5252#comment-30912</guid>
		<description>sigh, I really need to proofread my comments. The second model contains the predictors of interest (and any control random effects), and takes as its response variable the residuals of the control model.</description>
		<content:encoded><![CDATA[<p>sigh, I really need to proofread my comments. The second model contains the predictors of interest (and any control random effects), and takes as its response variable the residuals of the control model.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Having more children affects your basic word order by John Pate</title>
		<link>http://replicatedtypo.com/having-more-children-affects-your-basic-word-order/5252.html/comment-page-1#comment-30911</link>
		<dc:creator>John Pate</dc:creator>
		<pubDate>Tue, 15 May 2012 07:40:04 +0000</pubDate>
		<guid isPermaLink="false">http://www.replicatedtypo.com/?p=5252#comment-30911</guid>
		<description>It has transformed it into many binary variables. As I said before, this is a genuine ambiguity in your data, so there isn&#039;t a single straightforward response. One thing you can do is divide your predictors into control predictors (whose values you do not care about) and predictors of interest. Since you don&#039;t seem to be especially interested in the effect of religion on word order, it seems more natural to treat religion as a random effect with many levels rather than an explosion of binary variables. This is like using a hierarchical bayesian model; mixed effects regression can be viewed as the frequentist version of a hierarchical bayesian model.

Another thing you can do (and this is what I did in my cog-sci paper last year http://homepages.inf.ed.ac.uk/s0930006/pate-goldwater_predictability-effects-IDS-ADS_2011.pdf) is fit two models. The first model contains *only* your control predictors. The parameters of this model will not be stable, but the predicted values will be, which means that the residuals will be. You then fit a second model containing your predictors of interest (and any control random effects). If you have a predictor of interest that is correlated with a control predictor, this approach explains as much variation as possible using the control predictor, and the predictor of interest is significant iff it can explain variation that the control predictor cannot. This is conservative (it&#039;s possible that, under the true generative story we&#039;re trying to discover, both the response variable and the control variable are influenced by the predictor of interest), but I think it&#039;s the best you can do without gathering less ambiguous data.</description>
		<content:encoded><![CDATA[<p>It has transformed it into many binary variables. As I said before, this is a genuine ambiguity in your data, so there isn&#8217;t a single straightforward response. One thing you can do is divide your predictors into control predictors (whose values you do not care about) and predictors of interest. Since you don&#8217;t seem to be especially interested in the effect of religion on word order, it seems more natural to treat religion as a random effect with many levels rather than an explosion of binary variables. This is like using a hierarchical bayesian model; mixed effects regression can be viewed as the frequentist version of a hierarchical bayesian model.</p>
<p>Another thing you can do (and this is what I did in my cog-sci paper last year <a href="http://homepages.inf.ed.ac.uk/s0930006/pate-goldwater_predictability-effects-IDS-ADS_2011.pdf" rel="nofollow">http://homepages.inf.ed.ac.uk/s0930006/pate-goldwater_predictability-effects-IDS-ADS_2011.pdf</a>) is fit two models. The first model contains *only* your control predictors. The parameters of this model will not be stable, but the predicted values will be, which means that the residuals will be. You then fit a second model containing your predictors of interest (and any control random effects). If you have a predictor of interest that is correlated with a control predictor, this approach explains as much variation as possible using the control predictor, and the predictor of interest is significant iff it can explain variation that the control predictor cannot. This is conservative (it&#8217;s possible that, under the true generative story we&#8217;re trying to discover, both the response variable and the control variable are influenced by the predictor of interest), but I think it&#8217;s the best you can do without gathering less ambiguous data.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Having more children affects your basic word order by Sean</title>
		<link>http://replicatedtypo.com/having-more-children-affects-your-basic-word-order/5252.html/comment-page-1#comment-30902</link>
		<dc:creator>Sean</dc:creator>
		<pubDate>Mon, 14 May 2012 16:53:51 +0000</pubDate>
		<guid isPermaLink="false">http://www.replicatedtypo.com/?p=5252#comment-30902</guid>
		<description>Ah.  The religion variable was entered as a single categorical predictor, the table is how R presents the results, so has it transformed it into many binary variables?  But I take your point.  How do you get around this?  Could you use something like a Bayesian causal graph model?</description>
		<content:encoded><![CDATA[<p>Ah.  The religion variable was entered as a single categorical predictor, the table is how R presents the results, so has it transformed it into many binary variables?  But I take your point.  How do you get around this?  Could you use something like a Bayesian causal graph model?</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Having more children affects your basic word order by John Pate</title>
		<link>http://replicatedtypo.com/having-more-children-affects-your-basic-word-order/5252.html/comment-page-1#comment-30901</link>
		<dc:creator>John Pate</dc:creator>
		<pubDate>Mon, 14 May 2012 16:42:36 +0000</pubDate>
		<guid isPermaLink="false">http://www.replicatedtypo.com/?p=5252#comment-30901</guid>
		<description>Ack! In the first paragraph, I meant to say the first scenario concludes the second predictor is significant but not the first, the second scenario concludes the first predictor is significant but not the second, and the third scenario could come out either way.</description>
		<content:encoded><![CDATA[<p>Ack! In the first paragraph, I meant to say the first scenario concludes the second predictor is significant but not the first, the second scenario concludes the first predictor is significant but not the second, and the third scenario could come out either way.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Having more children affects your basic word order by John Pate</title>
		<link>http://replicatedtypo.com/having-more-children-affects-your-basic-word-order/5252.html/comment-page-1#comment-30900</link>
		<dc:creator>John Pate</dc:creator>
		<pubDate>Mon, 14 May 2012 16:41:14 +0000</pubDate>
		<guid isPermaLink="false">http://www.replicatedtypo.com/?p=5252#comment-30900</guid>
		<description>interactions are potentially interesting, but not what I was pointing out. Consider a case where you have perfectly correlated predictors. You can obtain exactly the same predicted values by giving the first predictor a coefficient of 0 and giving a very high coefficient to the other, or giving the second predictor a coefficient of 0 and giving very high coefficients to the first, or giving moderate coefficients to either. In the first case, you&#039;re likely to conclude the first predictor is significant but the second is not, in the second case you&#039;re likely to conclude the second predictor is significant but the first is not, and in the third case you might conclude neither or both of them are significant. When you have highly correlated predictors, you can explain your response variable using any subset of them: it&#039;s a genuine ambiguity in your data.


Your religion variables, for example, are probably highly (negatively) correlated, since someone is unlikely to be both Bahai and Evangelical Christian. If, in the true generative process we&#039;re trying to discover, Bahai religion has a strong positive effect, you could explain that by either giving Bahai a large coefficient *or* by giving Evangelical Christian a smaller coefficient.</description>
		<content:encoded><![CDATA[<p>interactions are potentially interesting, but not what I was pointing out. Consider a case where you have perfectly correlated predictors. You can obtain exactly the same predicted values by giving the first predictor a coefficient of 0 and giving a very high coefficient to the other, or giving the second predictor a coefficient of 0 and giving very high coefficients to the first, or giving moderate coefficients to either. In the first case, you&#8217;re likely to conclude the first predictor is significant but the second is not, in the second case you&#8217;re likely to conclude the second predictor is significant but the first is not, and in the third case you might conclude neither or both of them are significant. When you have highly correlated predictors, you can explain your response variable using any subset of them: it&#8217;s a genuine ambiguity in your data.</p>
<p>Your religion variables, for example, are probably highly (negatively) correlated, since someone is unlikely to be both Bahai and Evangelical Christian. If, in the true generative process we&#8217;re trying to discover, Bahai religion has a strong positive effect, you could explain that by either giving Bahai a large coefficient *or* by giving Evangelical Christian a smaller coefficient.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Having more children affects your basic word order by Sean</title>
		<link>http://replicatedtypo.com/having-more-children-affects-your-basic-word-order/5252.html/comment-page-1#comment-30899</link>
		<dc:creator>Sean</dc:creator>
		<pubDate>Mon, 14 May 2012 16:16:35 +0000</pubDate>
		<guid isPermaLink="false">http://www.replicatedtypo.com/?p=5252#comment-30899</guid>
		<description>@John Pate:  That&#039;s a good point, I didn&#039;t explore this too much.  The point of the big stepwise regression model was to see whether the linguistic factors would usefully remain in the model.  Part of the point is that we should expect colinearity, but it&#039;s a good idea to actually quantify it.  I could use a better model selection process to find the optimal model that included interactions, but the whole thing would take a while to compute!  Another problem is that the sub-sample of the data that each linguistic variable is tested on will be different, due to missing values in the WALS, so the sample run on the stepwise regression was only about 30% of the whole database.</description>
		<content:encoded><![CDATA[<p>@John Pate:  That&#8217;s a good point, I didn&#8217;t explore this too much.  The point of the big stepwise regression model was to see whether the linguistic factors would usefully remain in the model.  Part of the point is that we should expect colinearity, but it&#8217;s a good idea to actually quantify it.  I could use a better model selection process to find the optimal model that included interactions, but the whole thing would take a while to compute!  Another problem is that the sub-sample of the data that each linguistic variable is tested on will be different, due to missing values in the WALS, so the sample run on the stepwise regression was only about 30% of the whole database.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Having more children affects your basic word order by John Pate</title>
		<link>http://replicatedtypo.com/having-more-children-affects-your-basic-word-order/5252.html/comment-page-1#comment-30897</link>
		<dc:creator>John Pate</dc:creator>
		<pubDate>Mon, 14 May 2012 15:32:38 +0000</pubDate>
		<guid isPermaLink="false">http://www.replicatedtypo.com/?p=5252#comment-30897</guid>
		<description>That&#039;s a lot of predictors! How much collinearity exists in your model? Your parameters may not be stable.</description>
		<content:encoded><![CDATA[<p>That&#8217;s a lot of predictors! How much collinearity exists in your model? Your parameters may not be stable.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Spurious correlation bonanza to mark Replicated Typo 2.0 reaching 100,000 hits by Podcast on spurious correlations between social structures and linguistic structures &#124; Replicated Typo</title>
		<link>http://replicatedtypo.com/spurious-correlation-bonanza-to-mark-replicated-typo-2-0-reaching-100000-hits/4374.html/comment-page-1#comment-30888</link>
		<dc:creator>Podcast on spurious correlations between social structures and linguistic structures &#124; Replicated Typo</dc:creator>
		<pubDate>Sun, 13 May 2012 20:39:35 +0000</pubDate>
		<guid isPermaLink="false">http://www.replicatedtypo.com/?p=4374#comment-30888</guid>
		<description>[...] me about my work on spurious correlations between social structures and linguistic structures (see my overview post here).  Christos Christodoulopoulos challenges me to find a link between the number of children a [...]</description>
		<content:encoded><![CDATA[<p>[...] me about my work on spurious correlations between social structures and linguistic structures (see my overview post here).  Christos Christodoulopoulos challenges me to find a link between the number of children a [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Having more children affects your basic word order by Sean</title>
		<link>http://replicatedtypo.com/having-more-children-affects-your-basic-word-order/5252.html/comment-page-1#comment-30887</link>
		<dc:creator>Sean</dc:creator>
		<pubDate>Sun, 13 May 2012 20:37:51 +0000</pubDate>
		<guid isPermaLink="false">http://www.replicatedtypo.com/?p=5252#comment-30887</guid>
		<description>A longer version of my interview at EU:Sci is now available online:
&lt;a href=&quot;http://www.eusci.org.uk/podcasts/eusci-podcast-extra-sean-roberts-spurious-correlation&quot; rel=&quot;nofollow&quot;&gt;Listen here!&lt;/a&gt;</description>
		<content:encoded><![CDATA[<p>A longer version of my interview at EU:Sci is now available online:<br />
<a href="http://www.eusci.org.uk/podcasts/eusci-podcast-extra-sean-roberts-spurious-correlation" rel="nofollow">Listen here!</a></p>
]]></content:encoded>
	</item>
</channel>
</rss>

