Temperature Trends

# Did Federal Climate Scientists Fudge Temperature Data to Make It Warmer?

## Practicing the Dark Art of Trend Adjustment

"Right after the year 2000," climate change skeptic Tony Heller claimed last month, federal climate scientists "dramatically altered US climate history, making the past much colder and the present much warmer….This alteration turned a long term cooling trend since 1930 into a warming trend." Heller (nom de blog Steven Goddard) says that these adjustments " cooled 1934 and warmed 1998, to make 1998 the hottest year in US history instead of 1934."

Heller's assertions induced a frenzy of commentary, attracting the attention of The Drudge Report, the Telegraph, The Daily Caller, and Fox News. A few days later, the hullabaloo was further stoked by reports that scientists at the National Climatic Data Center (NCDC) had quietly reinstated July 1936 as the hottest month on record in the continental U.S. instead of July 2012. (For the record, the National Oceanic and Atmospheric Administration—the NCDC's parent agency—has declared 2012 the hottest year on record for the lower 48 states, and the months between August 2011 and July 2012 as the hottest 12-month period on record. The year 2012 was also the warmest year in the 36-year satellite temperature record.)

In response to the brouhaha, the NCDC press office sent out a rather defensive statement noting that its new U.S. temperature dataset based on climate division adjustments has, indeed, restored July 1936 to its hellish pinnacle. "We recalculate the entire period of record to ensure the most up-to-date information and to ensure proper comparison over time," said the press release (which, oddly, is not available online). "In this improved analysis, July 1936 is now slightly warmer than July 2012, by a similar very small margin." It added that this "did not significantly change overall trends for the national temperature time series" and that the "year 2012 is still easily the warmest on record."

But never mind the quibbling over which month in the past century was the hottest. Is Heller right when he claims that NCDC scientists are retrospectively fiddling with the national thermostat to bolster the case for man-made global warming?

The answer is complicated.

When Heller produced his temperature trend for the continental United States, he basically took the raw temperature data from the U.S. Historical Climatology Network from 1895 to the present and averaged them. He made no adjustments to the data to take into account such confounders as changes in location, equipment, time of observation, urban heat island effects, and so forth. Heller argues that these changes more or less randomly cancel out to reveal the real (and lower) trend in average U.S. temperatures.

In contrast, the researchers at the NCDC have spent years combing through U.S. temperature data records trying to figure out ways to adjust for confounders. In 2009, the NCDC researchers detailed how they go about adjusting the temperature data from the 1,218 stations in the Historical Climatology Network (HCN). They look for changes in the time of observation, station moves, instrument changes, and changes in conditions near the station sites (e.g., expanding cities). They filter the data through various algorithms to detect such problems as implausibly high or low temperatures or artifacts produced by lazy observers who just keep marking down the same daily temperatures for long periods.
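To make those quality-control filters concrete, here is a minimal sketch of two such checks: one for implausible readings and one for long runs of identical values. It is not the NCDC's code. The function names, the temperature bounds, and the run length are all hypothetical, chosen only for illustration.

```python
# Hypothetical thresholds for illustration only; these are NOT the NCDC's values.
PLAUSIBLE_MIN_C = -60.0
PLAUSIBLE_MAX_C = 60.0
MAX_REPEAT_RUN = 10  # flag runs of identical readings longer than this


def flag_implausible(temps_c):
    """Return one flag per reading: True if it falls outside the plausible range."""
    return [not (PLAUSIBLE_MIN_C <= t <= PLAUSIBLE_MAX_C) for t in temps_c]


def flag_repeated_runs(temps_c, max_run=MAX_REPEAT_RUN):
    """Return one flag per reading: True if it belongs to a run of identical
    values longer than max_run (the 'same number every day' pattern)."""
    flags = [False] * len(temps_c)
    start = 0  # index where the current run of equal values began
    for i in range(1, len(temps_c) + 1):
        run_ended = i == len(temps_c) or temps_c[i] != temps_c[start]
        if run_ended:
            if i - start > max_run:
                for j in range(start, i):
                    flags[j] = True
            start = i
    return flags


if __name__ == "__main__":
    print(flag_implausible([20.0, 75.0, -80.0, 5.0]))
    print(flag_repeated_runs([1, 1, 1, 2, 2], max_run=2))
```

A real pipeline would presumably compare flagged readings against neighboring stations before discarding anything. This sketch only marks suspects.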

They've clarified a lot this way. For example, simply shifting from liquid-in-glass thermometers to electronic maximum-minimum temperature systems "led to an average drop in maximum temperatures of about 0.4°C and to an average rise in minimum temperatures of 0.3°C." In addition, observers switched their time of observation from afternoon to morning. Both of these changes would tend to artificially cool the U.S. temperature record.

Urban areas are warmer than the countryside, so previous NCDC researchers had to adjust temperature datasets to account for the effects of urban growth around weather stations. The center's 2009 study conceded that many HCN stations are not ideally situated: they now sit near parking lots, say, or building HVAC exhausts. Such effects tend to boost recorded temperatures. The researchers argue that they do not need to make any explicit adjustments for such effects because their algorithms can identify and correct for those errors in the temperature data.

Once all the calculating is done, the 2009 study concludes, the new adjusted data suggests that the "trend in maximum temperature is 0.064°C per decade, and the trend in minimum temperature is 0.075°C per decade" for the continental U.S. since 1895. The NCDC folks never rest in their search for greater precision. This year they recalculated the historical temperatures, this time by adjusting data in each of the 344 climate divisions into which the coterminous U.S. is divvied up. They now report a temperature trend of 0.067°C per decade.

The NCDC has also developed a procedure for infilling missing station data by comparing temperatures reported from nearby stations. Why? Because as many as 25 percent of the original stations that made up the HCN are no longer running. Essentially, the researchers create a temperature trend for each missing station by interpolating temperature data from nearby stations that are still operating. Skeptics like Heller argue that the virtual "zombie stations" that infill missing data have been biased to report higher than actual temperatures.

Some sort of infilling procedure needs to be done. Let's say that there are records from five stations, all of which report time series of 1, 2, 3, 4, and 5. The average of each therefore comes to 3. If two stations fail to report on the second day, missing records of 2, then the average of their remaining four records is now 3.25 instead of 3. In trying to address the problem of missing data from closed stations, the NCDC folks average other stations to fill in the absent 2s. According to climate change skeptic blogger Brandon Shollenberger, what Heller does is the equivalent of averaging the raw data from the notional five stations to report 3, 3, 3, 3.25, and 3.25. "He'd then accuse the people of fraud if they said the right answer was 3, 3, 3, 3, 3," Shollenberger writes.
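The five-station arithmetic above can be checked in a few lines. This is a toy sketch of the example only, not the NCDC's infilling procedure: it assumes the simplest possible rule, filling a missing reading with the average of the stations that did report that day, and the names `station_mean` and `infill` are made up for illustration.

```python
from statistics import mean

FULL_SERIES = [1, 2, 3, 4, 5]

# Five notional stations; the last two fail to report on day 2 (index 1).
stations = [list(FULL_SERIES) for _ in range(5)]
for broken in stations[3:]:
    broken[1] = None


def station_mean(series):
    """Average a station's record, skipping missing readings."""
    return mean(v for v in series if v is not None)


def infill(records):
    """Fill each missing reading with that day's average over reporting stations.
    (A deliberately naive rule, assumed here only to mirror the example.)"""
    filled = [list(r) for r in records]
    for day in range(len(records[0])):
        reported = [r[day] for r in records if r[day] is not None]
        if not reported:
            continue  # nobody reported that day; leave the gap
        day_avg = mean(reported)
        for r in filled:
            if r[day] is None:
                r[day] = day_avg
    return filled


raw_means = [station_mean(s) for s in stations]
infilled_means = [station_mean(s) for s in infill(stations)]

if __name__ == "__main__":
    print(raw_means)       # averages of the raw records, gaps ignored
    print(infilled_means)  # averages after the gaps are filled
```

Averaging the raw records gives three stations at 3 and two at 3.25. After infilling, all five come back to 3, which is the point of Shollenberger's example.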

Let's assume that all of the NCDC's adjustments are correct. What do they reveal? The center's 2009 study concluded, "Overall, the collective effect of changes in observation practice in the U.S. HCN stations is the same order of magnitude as the background climate signal (e.g., artificial bias in maximum temperatures is about -0.04°C per decade compared to the background trend of about 0.06°C per decade). Consequently, bias adjustments are essential in reducing the uncertainty in climate trends." In other words, the asserted bias is almost as big as the asserted trend. Even with the best intentions in the world, how can the NCDC be sure that it has accurately sorted the climate signal from the data noise such that it has in fact reduced the uncertainty in climate trends?

Well, for one thing, other scientists have found a similar trend. Another group of researchers at Berkeley Earth use a different statistical method in which any significant changes to the temperature record of any station are treated as though a new station had been created. They use eight times more data than the NCDC does. Via email, Berkeley Earth researcher Zeke Hausfather notes that Berkeley Earth's breakpoint method finds "U.S. temperature records nearly identical to the NCDC ones (and quite different from the raw data), despite using different methodologies and many more station records with no infilling or dropouts in recent years." He is also quite critical of Heller's simple averaging of raw data.
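The breakpoint idea Hausfather describes, treating a station as a new station after any significant change, can be illustrated with a short sketch. This is not Berkeley Earth's code. It assumes the breakpoints have already been detected by some other means, and `split_at_breakpoints` and `slope` are hypothetical helpers.

```python
def split_at_breakpoints(record, breakpoints):
    """Cut one station record into segments at the given indices, so that each
    segment can be treated as if it came from a separate station."""
    inner = sorted(b for b in set(breakpoints) if 0 < b < len(record))
    cuts = [0] + inner + [len(record)]
    return [record[a:b] for a, b in zip(cuts, cuts[1:])]


def slope(ys):
    """Ordinary least-squares slope of ys against 0, 1, 2, ... (needs len >= 2)."""
    n = len(ys)
    x_mean = (n - 1) / 2
    y_mean = sum(ys) / n
    num = sum((i - x_mean) * (y - y_mean) for i, y in enumerate(ys))
    den = sum((i - x_mean) ** 2 for i in range(n))
    return num / den


# Made-up record: steady warming of 0.1 per step, with an artificial 0.5 drop
# at index 3 (say, a station move). The values are invented for illustration.
record = [10.0, 10.1, 10.2, 9.7, 9.8, 9.9]
segments = split_at_breakpoints(record, [3])

if __name__ == "__main__":
    print(slope(record))                 # negative: the step masks the warming
    print([slope(s) for s in segments])  # each segment still warms
```

Fitted as one record, the invented series appears to cool. Fitted segment by segment, each piece shows the underlying warming. How the segments are then recombined into a regional average is a separate step that this sketch does not attempt.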

The NCDC also notes that all the changes to the record have gone through peer review and have been published in reputable journals. The skeptics, in turn, claim that a pro-warming confirmation bias is widespread among orthodox climate scientists, tainting the peer review process. Via email, Anthony Watts (proprietor of Watts Up With That, a website popular with climate change skeptics) tells me that he does not think that NCDC researchers are intentionally distorting the record. But he believes that the researchers have likely succumbed to this confirmation bias in their temperature analyses. In other words, he thinks the NCDC's scientists do not question the results of their adjustment procedures because they report the trend the researchers expect to find. Watts wants the center's algorithms, computer coding, temperature records, and so forth to be checked by researchers outside the climate science establishment.

Clearly, replication by independent researchers would add confidence to the NCDC results. In the meantime, if the Heller episode proves nothing else, it is that we can continue to expect confirmation bias to pervade nearly every aspect of the climate change debate.

1. We live in a climate with an unpredictable past.

1. In the Soviet Union, you can’t change the future, but you can change the past!

1. In Soviet Russia, past changes you!

What seems to be missing in all this are re-estimates based on assigned missing values (where values are actually not missing). This gives you a range of values for each data point… you know what the value "ought to be"… you know what your estimates are… you can then build a model to predict "ERROR". This lets you independently estimate the sources of error in your RAW values.

Well, if you've got a Wayback Machine, couldn't you just remeasure the temperature in 1934 for these guys and solve the question once and for all?

Ron, why did you decide to exclude Watts, Pielke, et al.’s identification of significant siting issues through the Surface Stations project from this overview? Why also not acknowledge that over the past week there has been a shift towards people accepting Goddard’s claims of serious defects in the value added data set, when you originally published Watts’s initial argument that he didn’t think Goddard was right?

1. What is going on is that the USHCN code is that while the RAW data file has the actual measurements, for some reason the final data they publish doesn’t get the memo that good data is actually present for these stations, so it “infills” it with estimated data using data from surrounding stations. It’s a bug, a big one. And as Zeke did a cursory analysis Thursday night, he discovered it was systemic to the entire record, and up to 10% of stations have “estimated” data spanning over a century:

2. Assuming Watts is being accurate here, this really merits an ‘update’ extension by Bailey to the original article. Thanky Tarran.

3. Watts and others have written a mea culpa for their initial assessment of Goddard’s work.

4. Like I said when Watts originally threw the flag, I find him the most honest broker in the market. He was extremely forthright yet again about why he has changed his mind. If only everyone in this debate were equally dedicated to science.

1. BL: they are saying that Goddard uncovered a problem, not that the way he parsed the data is right.

5. t: I have actually queried all the folks you mention. May I also suggest you click on the “ideally situated” link? Happy 4th y’all!

9. And seriously Reason, Climate Change Chicanery? Since this morning’s links there has been an article on stupid regulations, the unfolding Obamacare disaster, the recent Hobby Lobby ruling, another article on stupid regulations, and the free(-ish) press being on life support in the UK.

I can almost hear the writers of the next season of 24 furiously storyboarding.

2. Cool. When’s he going to run for president? LePage seems like a pretty OK guy for a successful politician. Even though I’ve known about him for several years now, I still hear “Paula Page” every time I hear his name.

3. Our current Lt. Gov. ran an ad in the Tribune that her Team Red opponent supported the beltway sniper (because NRA). I ceased to be amazed at the left’s ability to point to an nth degree of separation as proof of guilt while denying direct personal relationships as anything but coincidence.

4. Sure, if by “is directly responsible,” you mean, has indirect links with alleged terrorists, of the sort that George W. Bush cited regarding Saddam Hussein when he was contriving excuses to invade Iraq.

5. “Therefore the republican governor of Maine is directly responsible for killing six cops since 2000.” Exactly like Obama is responsible for Bill Ayers’ youthful indiscretions… it would seem… Except I don’t think Ayers killed anyone…

13. CDC tells drowsy drivers to not hit the road. I was unaware that being sleepy was a disease. We’re all fucked.

1. It’s a chronic ailment which has a 100% morbidity rate.

2. People get hurt. People who get hurt end up getting medical care. People who get medical care are in the ‘health system’. Anyone in the ‘health system’ is therefore subject to “public health” scrutiny. Centers for Disease Control therefore has a say in the matter of your driving habits.

3. Carbon dioxide is now a pollutant so why not sleepiness a disease?

4. Is this really something people need to be told by anyone?

5. There’s an epidemic of sleepy truckers.

1. Well, that’s what happens when you start testing for amphetamines.

2. You can get sleep trucked in?

1. Yup, Sandman was getting back problems, and his cousin Louie is in the Teamsters.

14. As a Master of Science (puffs out chest) I have to say I find the practice of ‘infilling’ both disturbing and offensive. No, you can’t just extrapolate your raw data. GO MEASURE IT. Station not working? FIX IT. If I had ever tried to ‘extrapolate’ some of my raw data set from other parts of my raw data set my supervisor would have raped me…more than he already did.

1. Sounds like you work for a bitter-clinging denier to me.

1. Haha no he was only sceptical of things that went against his preconceived notions. He could be wildly credible one day and a sceptic hard-ass later that same day. I’m glad I only did a MSc, but that’s another story.

1. “I’m glad I only did a MSc, but that’s another story.” I’ve lived through a story like that.

1. Brothers in arms lab coats. Wanna compare scars?

2. Sounds to me like he works for Warty.

1. Warty would be a hell of a lot more self-aware than the guy I worked for. We got along in the end but there were rough times.

1. In my experience most students don’t care for their advisers. Count me amongst them.

2. The software development, configuration management, and verification practices at Climatic Research Unit, University of East Anglia, are so bad as to be nearly criminal in my professional opinion.

1. Better or worse than the IRS?

1. The IRS email scandal is just a straightforward case of obstruction of justice. As far as I know, the IRS software that runs the income tax system is generic bloatware produced by the federal procurement process and mostly works the way it is intended to work. The software models developed for measuring and predicting climate change are fundamentally fucked (as indicated by the leaked emails). Coded by people not qualified to do the work; hacked to produce the desired outcome; funded by government contracts. Borderline fraud in my opinion.

1. Yeah. I agree with you I was just being an ass.

2. Where is the borderline part?

1. mens rea. Zealots that truly believe their own crap are actually more dangerous than mere criminals. The destruction of data to avoid FOIA requests should have brought jail time. But I’m not convinced the original fuckups were intentional instead of just gross incompetence.

2. I downloaded one of the older models about 5 years ago, Model E. The coding in it was atrocious…even for Fortran. I would fire one of my developers if they wrote shit like I saw in that lump of spaghetti.

1. My first job out of college was writing Fortran for the AV-8 flight simulator. What a mess. I could only take it for 6 months before fleeing to the private sector. I don’t know that all government funded software is that bad but I have my suspicions.

1. Fortran 4 and assembly, but not government code 😉

3. “you can’t just extrapolate your raw data.” No shit.

4. I would say, go ahead and infill it if that’s the accepted protocol, and then put in the paper “one drawback is that we had to infill, making the conclusions less certain”. Don’t infill and then shout, OMG my paper proves WE ALL GONNA DIE!!!

5. I agree fully. As a building designer I often do site/land surveys based on a 10×10 grid and then interpolate the topography in between. However, when in the field a 10×10 grid can go completely around a boulder or a group of trees, so you have to physically measure these outside items. I’ve seen site plans by others that use infill and they completely miss banks and scarfs, you name it. I can make mountains and valleys disappear from any site. Real measurements for real world results.

6. Did you not have a full functioning force fielded extrapolator?

7. Interpolating can be OK. The real problem here is that some stations had valid data and they STILL interpolated. Then they claim that their algorithm is working properly. Incredible.

15. Hot enough for ya?

16. Bullshit corporate middle-management lingo word of the day: Upgradation. A middle manager’s love isn’t like a square’s love.

1. Any company wherein jargon like that isn’t laughed out of the building (with the speaker booted too) has grown too large for efficiency and should start pruning managers.

1. “has grown too large for efficiency and should start pruning managers.” *nods knowingly*

2. I think they need to do a Deep-Dive SWOT Analysis, some Blue Sky thinking, then set a new Cadence for upgrades based on the Voice of the Customer.

17. Everyone talks about the weather, but Obama is doing something about it!

1. *golf clap*

18. We have a consensus among ‘scientists’ about climate change. It seems they are now desperately trying to get the climate on board with them.

19. 0.07 degrees/decade? What’s the measurement error on the equipment?

1. I read somewhere that mercury thermometers have an accuracy of +/-2 degrees, for a 4-degree swing. That is not even a decimal point, but somehow they believe they can make a math program that literally has to guess where it swings and how far, down to the 100th of a degree. I don’t buy it for a minute. Let’s go beyond the accuracy of the thermometer itself: if you are short and looking up at it, it reads hotter; if you are tall, it reads lower; and if you’re right on level with it, you still only have a 4° swing. Things are even worse for metal coil thermometers.

20. “In the meantime, if Heller episode proves nothing else, it is that we can continue to expect confirmation bias to pervade nearly every aspect of the climate change debate.” So both sides are at fault, eh, Ron? I’m glad I didn’t pay to read this. What I hear you thinking, rather than saying, is “Heller is 90% out to lunch, but I don’t quite have the balls to say so.”

1. Re: “The center’s 2009 study conceded that many HCN stations are not ideally situated - that they now sit near parking lots, say, or building HVAC exhausts. Such effects tend to boost recorded temperatures. The researchers argue that they do not need to make any explicit adjustments for such effects because their algorithms can identify and correct for those errors in the temperature data.”

I’m no expert, but consider:

Since the 1930s, possibly billions of heat-absorbing/radiating structures (including dark roads, parking lots, etc.) have been added to the earth’s surface, replacing an unfathomable number of acres of CO2-drinking vegetation. The ever-spreading out of the added heat from these structures might, I suspect, make it very difficult for the algorithms (processing rules created by humans and subject to interpretation and error) to successfully separate all this heat out so that accurate calculations can be made.

Could someone please enlighten this layperson?

“Does the ‘fireplace-brick effect’ contribute to global warming?” http://relevantmatters.wordpre…..l-warming/