28 Oct, 2020 15:59

The use and abuse of polls in US elections

By David A. Schultz, Hamline Distinguished University Professor in the Departments of Political, Science and Legal Studies, professor of Law at the University of Minnesota. He is the author of more than 35 books and 200 articles on various aspects of American politics, most recently Encyclopedia of Money in American Politics (2018) and Presidential Swing States (2018). Follow Prof. Schultz on Tiwtter @profDschultz

Among the single most frequent questions I am asked every election cycle, but especially this one, is “Are the polls accurate?” The answer is: they can be, but not for what most people think.

This question is generally preceded with the statement “The polls were entirely wrong in 2016, they said Clinton would win and she did not.” Are the polls accurate, is there a problem with them? Are the polls in 2020, as in 2016, missing hidden Trump voters? To answer this question one needs to understand some basic points about polling.

Good polls are sort of accurate: But know what you are surveying

First, when it comes to 2016, the good national polls were entirely accurate. They said that Hillary Clinton would win the national popular vote by about two-three percentage points, with a margin of error or about three points. These polls were dead on score. The problem was not the polling but their relevance.

We do not elect the president by the national popular vote and instead it is the electoral college which is essentially 50 separate state elections (plus the District of Columbia). National polls for the purposes of predicting presidential winners are entirely irrelevant. Ignore them all because they are looking at the wrong unit of analysis.

The Biden-Trump race is both over and too close to call

Polls are not predictors but statistical snapshots in time

But additionally, remember first that polls are supposed to be statistical profiles of a population. This means that a good poll is a small sample of a larger population that resembles the latter in all relevant characteristics. Polls are only as good as the assumptions that go into them. Good pollsters accurately reflect who is likely to vote, the partisan, geographic, or other make up of the electorate. If you make bad assumptions, you get bad results. This is the old “garbage in, garbage out” theory.

Polls also are not predictors – they are snapshots in time. Lots of things can happen between the time a poll is done and an election occurs. Candidate strategies matter, as do messaging, and other intervening variables. Thinking that polls are predictors is the root of many problems.

The flaws in FiveThirtyEight

Consider Nate Silver and FiveThirtyEight. Four years ago they predicted an 80%+ chance Clinton would win. As of October 26, 2020 the prediction is an 88% chance of a Biden victory. The model used here is based on polls – using them as predictors of what will happen on election day.

If the polls on which they are based are wrong, the predictions will be wrong, even if we still concede that polls are not predictors. FiveThirtyEight’s predictive model is premised on a way of thinking about polls that is simply wrong.

An example of bad polling: Minnesota US Senate race

It is possible that Biden will win, but the polls are very close in the critical swing states such as Pennsylvania, Michigan, and Wisconsin. But accepting everything I said in this essay, there is also a difference between good and bad polls.

What if neither Democrats nor Republicans want to win in 2020? No one wants the task of changing the full diaper of US Empire

Nerd warning: Confidence levels versus credibility intervals

Finally, there is one last problem that only nerds like me can appreciate. The survey did not employ confidence levels but instead a credibility interval to determine the accuracy of the poll. Why is this important?

When polls are done the question to be asked is what is the probability that the sample is a good representation of the entire relevant population. The smaller the confidence level, statistically the better the chances it is a good survey. The gold standard for survey research is a confidence level of .05. This means there is a 95% chance that the sample is an accurate representation of the entire population. This .05 also means there is still a 5% chance the sample is skewed and therefore the poll is bad.

Over 52 million early votes cast in US election, putting country on pace for biggest turnout in over a century

Conclusion: Polls can be useful, but analysis based on them often isn't

The morale of the story is that polls done well can be good and accurate and accurate snapshots in time. But there is a lot of bad polling. Even worse, there is a lot of bad analysis based on polling. Four years ago analysts got it wrong when they let the disbelief of a Trump victory cloud their thinking. They also failed to understand the proper level of analysis to do presidential polling and how to understand whether a poll is valid or reliable.

This article was first published on Prof. Schultz's blog.

The statements, views and opinions expressed in this column are solely those of the author and do not necessarily represent those of RT.

Opinion