Sunday, August 02, 2009
My post below alluded to the fact that there seems to be a non-trivial between region difference in male life expectancy, even controlling for race, in the United States. From what I can tell Americans seem to have a somewhat schizophrenic attitude toward the reality of regionalism. On the one hand we are a relatively mobile people, and the original social-political aspect of states has been superseded by states as simply arbitrary sub-national units. And yet regional identities are still alive, most notably in the case of Southerners (with Texas as perhaps a special particular case even in the South, along with other areas such as Cajun Country). The differences are obvious in the case of accent and dialect, but one might think of these as simply indicators of a host of implicit underlying variables which are often imperceptible until one takes oneself "out of region." In Albion's Seed David Hackett Fisher explored the possible cultural roots of American regionalism as a work of history, while in The Nine Nations of North America Joel Garreau treated the subject in the manner of contemporary human geography.
These works paint with a broad brush, and explore the variation on a relatively coarse scale. Most Americans are aware of local religionalisms to a far greater level of detail, something which they are often not explicitly cognizant of. As a personal example I spent my adolescence in an area of the Intermontane West where both Mormons and "cowboys" were well represented. Though both groups were politically conservative, culturally there were stark differences which everyone was implicitly aware of. It was only later on that I learned that this region had experienced an influx of people from the Upper South in the 19th century, and later "Okies", which was evident in the speech patterns of some individuals. On the other hand many of the Mormons had roots in Utah and eastern Idaho, and were cultural descendants of New England Yankees or later Northwest European converts who emigrated to Utah (the Mormon fixation on genealogy meant that if you had Mormon friends you would usually find out where their family was from through casual conversation since they knew). Last fall Steve Sailer pointed out that the counties where Barack Obama underperformed John Kerry, against the national trend, were those settled by and dominated by the Scots-Irish in the 18th century. Greg Cochran has told me that he was aware as a child the differences between Midwesterners whose origins were in the Upper South, and those who were Yankees. Why does this matter? Because American public policy is often predicated on ceteris paribus assumptions once race and income are accounted for. Public policy prescriptions generated on the federal level will make the nod to race and class as interaction effects, but rarely allude to the possibility that white Americans even controlling for class may behave differently because of distinct cultural traditions. American regionalism is often conceived of as how you speak and what you eat, but I believe that these are simply the most obvious aspects of whole folkways, which are often assumptions and behaviors we take for granted. But I come here not to talk, but to explore. The paper Eight Americas: Investigating Mortality Disparities across Races, Counties, and Race-Counties in the United States has the data for the white male longevity for each county in the United States. The Census has data on median household income, as well as the proportion of non-Hispanic whites in each county, or at least a subset. Unfortunately the tables I found had many counties missing for income and the proportion non-Hispanic white, so when I merged them with the one from the supplemental data from the paper above I was left with far fewer counties. I invite readers to point to better data sets in the comments than what I found poking through the Census website. There are certainly many likely variables which might explain longevity differences between regions, from climate to military service to participation in risky behaviors (the Mormon ban on alcohol probably means fewer men die of stupid acts at younger ages). But income is the primary predictor people think of, so it is what I focused on. Below are a set of charts and maps where I try and tease out regional variation. The x-axis is always median household income, while the y-axis is male life expectancy. Keep in mind that I filtered and constrained the data set in various ways when viewing the results, as my choices naturally have an effect. My point in presenting these results is to leverage reader knowledge about local variation. I am not interested in offering general explanations of why variation exists within the United States, rather, I am interested in outliers, and sharp local gradients. As the data was limited to counties which are at least 80% or more non-Hispanic white, there is a strong skew toward some regions, rural areas and less populous counties. This is not optimal, but I think it does the trick for this cursory examination. All counties where non-Hispanic whites are 80% or more, male life expectancy vs. median household ![]() All counties where non-Hispanic whites are 80% or more, male life expectancy vs. median household, labeled only with states ![]() What I'm really interested in is the middle of the distribution, not the really rich or really poor counties. So I limited to incomes between $35,000 and $65,000 dollars. So the same as above, but now constrained as noted. ![]() ![]() Focus on the outliers. What is going on in Baker County, Florida? Raw data is below, but I want to map these results above. Again these are the counties from the chart above (income between $35 and $65 K) shaded in proportion to the value of of the residual. In other words, a "dark" blue county is far deviated from the trendline by being above it, while a "dark" red county is deviated by being below it. Being above the trendline means that the county has a high life expectancy for its income, while below means it has one below what one would expect for income. ![]() As I said above, there are constraints with these data. Some counties are missing from the source tables which I used, and only those counties present in all of the source datasets remain. Additionally, the map excludes very wealthy areas (parts of New England) and very poor ones (much of Appalachia), as well as those areas where less than 80% of the population is non-Hispanic white. The income data here surely exaggerations differences in real consumption; it isn't taking into account cost of living. But, I think the general insight from the earlier map remains: being close to Canada is good for a county's average life expectancy. Here are the counties 2 or more years above the trendline: FL - Charlotte 2.008085 ND - Ward 2.015647 SD - Lawrence 2.050383 MT - Gallatin 2.058849 ND - Cass 2.079347 WI - Marathon 2.112865 WA - Kittitas 2.146117 WI - Dunn 2.153582 MN - Steele 2.156426 IA - Bremer 2.200543 TX - Bandera 2.202348 MN - Stearns 2.262364 WA - Whatcom 2.280879 MN - Winona 2.289539 MN - Crow Wing 2.296179 ID - Kootenai 2.319326 WI - Wood 2.365409 NE - Madison 2.386554 MN - Martin 2.407487 MI - Emmet 2.418643 NY - Tompkins 2.437007 NY - Seneca 2.546057 PA - Union 2.568985 CO - Larimer 2.582040 NE - Buffalo 2.583082 IA - Henry 2.662992 MN - Freeborn 2.683949 MN - Mower 2.770022 KS - Douglas 2.811094 CO - La Plata 2.815263 WI - Eau Claire 2.821042 WI - Clark 2.920767 MN - Brown 2.980544 MN - Kandiyohi 3.064475 WA - Island 3.071250 IA - Mahaska 3.080397 UT - Iron 3.114267 WA - Jefferson 3.229158 PA - Centre 3.274080 IA - Winneshiek 3.305467 MI - Leelanau 3.378293 ID - Latah 3.605875 IA - Johnson 3.618503 OR - Polk 3.661479 MO - Nodaway 3.750706 IA - Story 3.761283 KS - Riley 3.812826 UT - Washington 3.857329 MN - Douglas 3.871383 SD - Brookings 3.893517 ID - Madison 4.116757 UT - Cache 4.261088 IA - Sioux 4.312095 OR - Benton 4.544464 And 2 or more years below: FL - Baker -7.775926 AL - Walker -4.976348 AR - Greene -4.273662 MD - Cecil -3.862680 TX - Hardin -3.856310 TN - Carroll -3.577800 GA - Bartow -3.459386 IN - Starke -3.429478 WV - Berkeley -3.344611 GA - Jackson -3.282126 MS - George -3.254395 TN - Wilson -3.244293 AL - Chilton -3.208314 TX - Orange -3.199165 AL - Marshall -3.176659 OK - Garvin -3.107875 TN - Henry -3.006837 NC - Currituck -2.953442 WV - Jefferson -2.951280 GA - Walker -2.876994 VA - Warren -2.823080 AL - St. Clair -2.801636 TX - Fannin -2.779233 AR - Lonoke -2.673197 MS - Hancock -2.639797 FL - Nassau -2.636036 KY - Scott -2.605132 TN - Robertson -2.579745 GA - Murray -2.565466 TN - Lawrence -2.544601 TN - Maury -2.534866 MO - Jefferson -2.503979 TN - Dickson -2.490682 GA - Walton -2.475931 GA - Gordon -2.433042 MI - Osceola -2.378020 FL - Clay -2.370529 GA - Paulding -2.369467 TX - Wise -2.366306 IA - Marshall -2.331662 MS - Pearl River -2.283195 OK - Grady -2.256928 340 MO - St. Francois -2.224602 WY - Sweetwater -2.212283 IL - Lee -2.204632 AZ - Mohave -2.203554 TX - Van Zandt -2.147798 MI - Calhoun -2.143441 TN - Obion -2.138999 KY - Kenton -2.124380 WV - Kanawha -2.121422 OH - Madison -2.115574 IN - Dearborn -2.089985 GA - Oconee -2.077321 KY - Nelson -2.059997 TN - Rhea -2.056843 TN - Cheatham -2.053162 WV - Raleigh -2.006031 (all these are the counties between $35 and $65 K in median household income. The trendline was generated from this constrained sample as well) |