Hidden Brain - Our Noisy Minds

Episode Date: May 18, 2021

Psychologist Daniel Kahneman says there are invisible factors that distort our judgment. He calls these factors “noise.” The consequences can be found in everything from marriage proposals to medical diagnoses and prison sentences. This week on Hidden Brain, we consider how to identify noise in the world, and in our own lives. If you like our work, please consider supporting it! See how you can help at support.hiddenbrain.org. And to learn more about human behavior and ideas that can improve your life, subscribe to our newsletter at news.hiddenbrain.org.

Transcript
Starting point is 00:00:00 This is Hidden Brain, I'm Shankar Vedantam. We're going to start today with a little experiment. I'll be the guinea pig. I'm going to open the stopwatch app on my phone. I'll hit start and count off 5 seconds while looking at the phone. 1, 2, 3, 4, 5. Okay, let me do that again. 1, 2, 3, 4, 5.
Starting point is 00:00:32 Okay, now I'm going to hit start and count off 5 seconds without looking at the phone. 1, 2, 3, 4, 5. It was 5.43 seconds. Let's do it again. 1, 2, 3, 4, 5. Much better. 5.2 seconds. Last time. 1, 2, 3, 4, 5.
Starting point is 00:01:10 5.59 seconds. The errors I made seem trivial. But it turns out they are not. Multiply the small mistakes I made in milliseconds over all the countless decisions I make every day, and you can end up with a serious problem. Multiply the errors I make as an individual by an entire society made up of other error-prone humans and you can get disaster. What makes these mistakes insidious is that they are rarely the result of conscious decision-making. Human judgment is imprecise.
Starting point is 00:01:59 An imprecise judgment produces unwanted variability, what the Nobel Prize-winning psychologist Daniel Kahneman calls noise. Wherever there is judgment, there is noise, and there is more of it than you think. This week on Hidden Brain, the gigantic effect of inadvertent mistakes in business, medicine, and the criminal justice system, and how we think have revolutionized many areas of the social sciences. He was my guest on Hidden Brain for our 100th episode. We talked about his early research and his first book, Thinking, Fast and Slow. As we close in on our 200th episode, we wanted to bring him back to talk about a set of ideas
Starting point is 00:03:04 he's been working on for several years. They're described in his new book, Noise: A Flaw in Human Judgment. Daniel Kahneman, welcome to Hidden Brain. Glad to be here. I want to begin by exploring what you mean by the term noise. You spent some time studying an insurance company, and one of the things an insurance company needs to do is to tell prospective clients how much their premiums are going to cost. So an
Starting point is 00:03:32 underwriter says, if you want us to cover you against this loss, here's the quote. From the insurance company's point of view, Danny, what is the risk of offering quotes that are too high and also quotes that are too low? Well, with a quote that is too high, you are very likely to lose the business because there are competitors and they'll offer a better price. With a quote that is too low, you're leaving money on the table and you may not be covering your losses if you do that a great deal. So errors in both directions are costly. We define noise as unwanted variability
Starting point is 00:04:08 in judgments or decisions. That is, if the same client would get different quotes from different underwriters in the same company, this is bad for the company. And variability is a basic component of error. So I think of the insurance business as being driven by mathematics. That's my stereotype that there are hard-nosed statisticians who work at these companies. So I would not expect a quote from one underwriter to be widely different from the next. You asked executives at this insurance company how much variability they expected between underwriters.
Starting point is 00:04:43 What was their estimate of this kind of subjective variability? I mean, it turns out that there is a very general answer to that question and people have a very general idea about what that number will be, and it's around 10%. Now, when we actually measured that in an insurance company, the answer was 55%. And that was an amount of variability, as we call it, an amount of noise, that no one expected. And that really is what set me off on this journey that led to this book. Now, the difference between 10% and 55% might seem trivial. Who cares? Well, the consequences of this variability were anything but trivial.
Starting point is 00:05:32 I mean, I asked people what actually would be the cost of setting a premium that is too high or too low. And when they carried out that exercise, they thought that the overall cost of these mistakes was in the billions of dollars. Now, what was in some sense saving that company was that probably other companies were noisy as well. But if you have a company that is noisy, while others are noise-free, the noisy company is going to lose a lot of money very quickly. So, with the insurance company, it's not just that the insurance company is losing money.
Starting point is 00:06:12 There is also a cost that's being paid by all the people who are trying to get insurance. It might be that if you happen to get a quote that's too high, you might end up being uninsured or you might be spending more on insurance than you need to be spending. There is sort of a general human cost to these errors, not just in terms of the bottom line for the insurance company. Well, of course, when you have a noisy underwriting system, then the customer is facing a lottery that the customer has not signed up for. And that is true everywhere.
Starting point is 00:06:42 That is wherever people reach a judgment or a decision by using their mind, rather than computing. Wherever there is judgment, there is noise, and there is more of it than you think. I want to look at a few other places, because in some ways what's striking about your book is both a number of different domains where you see noise and the extent of noise in those different domains including in places where you really feel this should be a setting where noise does not play a role. out there who found that in asylum cases, this was a courtroom in Miami, one judge would grant asylum to 88% of the applicants and another granted asylum to only 5% of the applicants. So this is more than a lottery, this is like playing roulette. This is a scandal, clearly the system isn't operating well. In many situations it's just that when people look at the same
Starting point is 00:07:45 data, they see them differently. They see them more differently than they expect. They see them more differently than anyone would expect. That's the basic phenomenon of what we call system noise. That is, when you have a system that ought to be producing judgments or decisions that are predictable, they turn out not to be predictable and that's noise. You also describe in some ways there are different kinds of noise. So if you're an asylum judge and I'm an asylum judge and we have very different subjective readings that can produce very different answers. But it could also be that if you are reviewing a case in the morning and you are reviewing a case in the afternoon, it's possible that just within yourself your own judgments can be noisy. Can you talk about that
Starting point is 00:08:39 idea as well? I mean, it's not only possible, it actually is the case, that when people are asked the same question or evaluate the same thing on multiple occasions, they do not reach the same answers. For example, radiologists who are shown the same image on two separate occasions, and are not reminded that it's the same image, with distressing frequency reach different diagnoses on the two occasions. That, we know, is true even for fingerprint examiners, whom we really would not expect to be noisy at all, but actually they vary when you show them the same fingerprints twice.
Starting point is 00:09:24 By the way, that's important. They do not vary in the sense that somebody would make a match on one occasion and would positively say it is not a match on the other. But fingerprint examiners are allowed to say, I'm not sure. And between I'm not sure and I am sure that it's a match, or it's not a match, there is variability. One of the things that you point out is that you don't expect that the lottery of who is reviewing your file is going to make a huge difference or that extraneous factors would play a huge role. The researcher Uri Simonsohn found that college admissions officers pay more attention to the academic attributes of candidates on cloudy days
Starting point is 00:10:06 and to non-academic attributes when the weather is sunny. He titled this paper "Clouds Make Nerds Look Good." Talk about this idea that extraneous factors, whether someone's hungry, what the weather is like, can affect people's judgment, too. Indeed, it's been established in the justice system. If you're a defendant, you have to hope for good weather, because on very hot days, judges assign more severe sentences. And that is true even though judges are air-conditioned; it's the outside temperature that nevertheless seems to have an effect.
Starting point is 00:10:41 It's been established in at least one study that for judges who are keen on football, the result of their team on Sunday or Saturday, depending on whether it's professional or college, will affect the judgment they make on the Monday. And they will be more severe if their team lost. No! He missed the extra point wide-right!
Starting point is 00:11:08 That's a terrifying idea, isn't it, Danny, that you're sort of hoping that your judge's football team wins the Sunday before your case is heard. Oh, yes, absolutely. And you're also hoping to find a judge who is in a good mood, to find a judge who is rested, has had a good night, who is not too tired. And your chances of being prescribed antibiotics or painkillers differ
Starting point is 00:11:34 in the course of the day. So doctors tend to prescribe more antibiotics toward the end of the day when they are tired than earlier in the day when they are fresh, and they are more likely to prescribe painkillers later in the day, simply because it's an effort to resist the patient who wants painkillers, and when you're very tired and depleted, that effort becomes more difficult. So completely extraneous factors have a distressing effect. Noise in medicine often shows up under a different name.
Starting point is 00:12:10 Medical mistakes. Stunning medical news tonight about how many Americans have something go wrong when they go to the hospital. The astronomical number, one in three patients, will face a mistake during a hospital stay. And these are costly errors. One study estimates medical mistakes cost the US more than 17 billion dollars a year. The doctors had discovered that Sarah didn't have cancer in the first place. She'd been misdiagnosed and all the pain and treatment that she went through was for absolutely nothing.
Starting point is 00:12:47 So, Danny, can you talk about these two different dimensions of noise in the medical sphere, the ways in which it might cause us to get diagnosed with conditions we might not have, but also for doctors to miss conditions and problems that we actually do have? The contribution of noise is that which physician looks at the data makes a difference. And there is a lot of that, that is, we know that physicians disagree on diagnosis and they also disagree on treatment. And that is a little shocking, that there is that element of lottery. So errors could happen for many reasons, including luck,
Starting point is 00:13:27 which is not an error in judgment, but where information was missing. But in some cases, the errors cannot be described in any other way than noise, that is, different doctors looking at the same case, reaching different conclusions. It might seem obvious from these examples that noise is a big problem and that combating noise makes a lot of sense. Who could argue against reducing arbitrary decisions and inconsistent rules? It turns out a lot of people have a problem with doing just that, and one of those people might be you.
Starting point is 00:14:08 You're listening to Hidden Brain, I'm Shankar Vedantam. This is Hidden Brain, I'm Shankar Vedantam. We've seen how noise pervades many aspects of our personal and social lives. It can lead to wildly different estimates on our insurance premiums. It affects judgments doctors make about our health. It can determine whether we get a job or a promotion. In their new book, Noise: A Flaw in Human Judgment, Daniel Kahneman and his co-authors, Olivier Sibony and Cass Sunstein, show that noise also shapes what happens in the criminal justice system. It affects decisions
Starting point is 00:14:53 that send people to prison or sentence them to execution. Danny, Judge Marvin Frankel worked as a United States District Judge and he made a name for himself by pointing out inconsistencies in the criminal justice system. He once wrote about a case of two men convicted for cashing counterfeit checks; both amounts were for less than $60. One man got a sentence of 30 days in prison. The other got 15 years. What did Judge Frankel make of such disparities? I mean, he thought it's unjust.
Starting point is 00:15:27 He thought it's extraordinarily unfair, I mean, which it seems to be on the face of it. So he really felt that the justice system should be reformed to avoid this role of completely unpredictable, unreasonable factors that determine the fate of defendants. You know, Danny, I feel like in the last year, I've seen dozens of stories that talk about disparities of all kinds, including disparities in the criminal justice system. And invariably, when I read these stories about disparities, they talk about the idea that it's about bias,
Starting point is 00:16:03 that it's about racial bias or gender bias or some other kind of bias. So when Judge Frankel comes along and says, you know, defendants are being given vastly different sentences, the very first thing that pops in my head is maybe these defendants were of different races and what we're really seeing is racial bias at play rather than noise. How can we tell the difference between racial bias and noise? It's actually easy to do because when you want to measure noise,
Starting point is 00:16:32 you can conduct a kind of study that we call the noise audit. And so you take professionals, for example, judges, and you show them a fictitious case, and you ask them to make judgments as they would normally. Now, you know that it's the same case, they've all been given the same information. They should give you the same judgment. The differences among them cannot be attributed to bias. And indeed, Judge Frankel
Starting point is 00:17:03 caused many noise audits to be performed. He actually conducted some himself. And in the most famous one, 208 federal judges evaluated 16 cases and assigned sentences to 16 cases. And this gives you an idea of the lottery that a defendant would face, in that where the average sentence is seven years in jail, the probable difference between two judgments is over three years. So that seems to be unacceptable.
Starting point is 00:17:47 So, based on the work of Judge Frankel and others, Congress eventually passed a law that basically limited the amount of discretion that judges had. Talk about the effects that this law had on reducing noise. Were there studies conducted to actually figure out if these were reducing noise? Yes, studies were conducted, and actually you can look at many cases and look at the variability of judgments, and you find that the variability significantly diminished, which indicates that noise was in fact reduced. However, something else happened. The judges hated it. They hated this restriction on their ability to make free decisions, and they felt that justice was not being served.
Starting point is 00:18:37 So even as the data was showing that the noise was reducing in sentencing, in other words, sentencing was becoming more consistent, many judges were upset that their discretion was being taken away. And Judge José Cabranes was one of those who spoke up. And I want to play you a clip of something he said in 1994. This was a discussion at Harvard University where they were talking about these guidelines that were aimed to reduce ethnic disparities
Starting point is 00:19:03 in sentencing by limiting the amount of discretion that judges had. Here is Judge Cabranes. These arcane and mechanistic computations are intended to produce a form of scientific precision, but in practice they generate a dense fog of confusion that undermines the legitimacy of the judges' sentencing decisions. Danny, I want to draw your attention to what Judge Cabranes is saying. When you limit the variability of sentencing, you're telling judges,
Starting point is 00:19:33 for this offense, you have to do X, for that offense, you have to do Y, a lot of judges feel their hands are tied and they feel the art of law is being reduced to a mechanistic science. Well, you know, if it takes a mechanistic science to produce justice, then I think we should seriously consider some mechanistic science. And what seems to be happening is that from the perspective of the judge, they feel that they're evaluating every detail of the case and that they are producing a just judgment because they are convinced that what they are doing is a just
Starting point is 00:20:12 judgment. And somehow it's very difficult to convince judges that another judge whom they respect a great deal, presented with the same case, would actually pass a different sentence. That argument doesn't seem to have penetrated when Judge Cabranes made that assertion, that in fact there is a problem and there is a problem to be resolved.
Starting point is 00:20:39 He was, in effect, as I hear him, denying the existence of a problem. Hmm. Psychologists talk about a phenomenon called naïve realism that in some ways explains why it is that I am bewildered that you would not see the world exactly the way that I see the world. Can you explain what naïve realism is and how it speaks to the question we just discussed about judges not just reaching different conclusions but being bewildered that anyone would reach a different conclusion than them?
Starting point is 00:21:06 Well, you know, we feel that we see the world as it is. It's the only way we see it, and what we see is real, what we see is true. And it makes it very difficult to believe and to imagine that someone else looking at the same reality is going to see it differently. But in fact, we are struck by how different they are. In the context of criminal justice, the variability in sentences is shocking. But when you're looking at it from the perspective of a judge who looks at cases individually and feels that he or she is making correct judgments for every case individually, then it looks as if any attempt to restrict their freedom is going to cause injustice to be performed. But they are simply not accepting, I think, the statistics that
Starting point is 00:22:06 tell them that another judge looking at the same case would actually pass a different sentence. So these debates about sentencing reform raged in the 1980s and 1990s, and eventually in the early 2000s, the Supreme Court struck down the guidelines that bound the way judges were operating, and sentencing reform essentially went away, giving discretion back to judges. Is what happened, what I fear happened? Did noise come back into the system? Oh yes, I mean there is evidence that noise came roaring back and there is also evidence that judges were a lot happier
Starting point is 00:22:49 without the guidelines than they had been earlier. One of the ironic things that you and others have found is that even though there is this distinction between noise and bias, when the noise came back after the Supreme Court ruling, black defendants were actually among those who were the most severely harmed by this. Is it possible in some ways there can be intersections between noise and bias? In other words, they can amplify one another? Certainly.
Starting point is 00:23:15 I mean, when you are constraining people and reducing noise, you're reducing the opportunities for bias to take place. So attempts to reduce noise and attempts to control noise are going to, in general, not invariably, but are very likely to control and reduce bias as well. If noise produces many of the adverse outcomes we see, if noise produces much of the unfairness we see, why is it that critiques of disparities invariably talk about bias? Turns out that's because of the way our minds work. As we discussed in a recent series of episodes, the brain is a storytelling machine, and the story of bias caters to our hunger for simple explanations. I mean, clearly, bias in general is a better story. That is, you see something happening.
Starting point is 00:24:12 It had the character of an event. It had the character of something that is caused by a psychological force of some kind. Variability, noise, is uncaused. Noise doesn't lend itself to a causal story, and really the mind is hungry for causes, and that leads us very naturally to thinking in terms of biases, that errors must be explainable. So if I get a misdiagnosis because a doctor doesn't like the color of my skin, that might not make me feel good, but at least I can make sense of what happened. Once I settle on an explanation of racism or sexism
Starting point is 00:24:52 or homophobia, I tell myself I have every right to get angry. When I discuss what happened with others, they'll get angry too. By contrast, a misdiagnosis produced by noise is, by definition, no one's fault. The error may have harmed me, but I can't lay the blame on someone's evil intentions. Noise is the very opposite of a good story. It's meaningless, and that can make me feel even worse. Here's another problem. When I see a judge pass a really harsh sentence or a very light sentence, I can come up with
Starting point is 00:25:31 a story of bias to explain this individual case. You cannot do that with noise. You cannot spot noise by looking at any individual case. You have to measure it in the aggregate. It shows up only when you look at the statistics and many of us are uncomfortable turning to data as our guide to the truth. We prefer stories and anecdotes, and stories and anecdotes are better at illustrating the problem of bias. Stories and anecdotes are what the mind is prepared for. Statistical thinking is alien to us.
Starting point is 00:26:12 And statistical thinking is the only way to detect noise because it's variability. It's sort of absurd to say about any single case that it is noisy. You say that if you have no idea of how it came about. But noise is a phenomenon that you observe statistically and that you can analyze only statistically. And that is not appealing. So there's an even deeper problem than the fact that noise is detectable only through statistics,
Starting point is 00:26:42 whereas bias, you can tell a story about bias. For many people making decisions, the data is simply not even available. So at a statistical level, you can see an insurance company is demonstrating noise, but many of the decisions we are making are decisions we make as individuals. So if I want to propose marriage, and I feel like proposing marriage on a moonlit night in the springtime, I have no idea if my decision to propose marriage on that evening is being shaped by noise or not. I don't have a statistical set of how I would behave under different circumstances. You know, the truth of the matter is that no one can tell you that this decision was noisy.
Starting point is 00:27:20 What you can tell is that when you look at the collection of decisions of people deciding to get married, that collection is noisy. There is no reason to believe that these steps which improve judgments in the statistical case do not apply when somebody decides to get married. If noise is present in the decisions where you can observe it, it's also present when you cannot observe it. Some years ago, I interviewed the researcher Berkeley Dietvorst. He talked about how people respond when a mistake has been made by a human
Starting point is 00:27:58 versus an algorithm. I want to play you a short excerpt of something he told me. People fail to use the algorithm after they'd seen the algorithm perform and make mistakes, even though they typically saw the algorithm outperform the human. In our studies, the algorithms outperform people by 25 to 90%. So he's basically saying the algorithms are significantly better than the humans, but when a mistake is made, and algorithms, of course, can make mistakes, and humans can make mistakes, he's saying that you prefer the human to make the mistake. And I think intuitively that feels correct to me. If I'm going to get a misdiagnosis when I go to a doctor, I would feel better if it's the doctor who's made
Starting point is 00:28:36 the mistake than an unfeeling, unthinking algorithm. I think that's absolutely true. And, you know, when we're looking at a road accident, we somehow feel less bad about it if it was a driver error than if it was a self-driving car that caused the accident. Algorithms, they make errors. The errors they make, by the way, are different from the errors that people would make,
Starting point is 00:29:03 and they look stupid to people. Algorithms make errors that people think are ridiculous. Now we don't get to hear what algorithms think of the errors that people make. And we do know that algorithms just make far fewer of them in many cases, and you have to trade off the higher overall accuracy against the discomfort of abandoning human judgment and trusting an algorithm. Yeah.
Starting point is 00:29:33 You know, this might actually be a subtext of much of your lifetime's work, Danny, but it seems to me that fighting noise requires a certain amount of humility, and it seems to me that humans are not humble. Well, they're not humble for a fairly straightforward reason. We do not go through life imagining different ways of seeing what we see. We see one thing at a time, and it feels right to us. And you know, that is really the source of the problem of ignoring noise. This is why it is so difficult to imagine it. I want to talk just for a brief moment about places where noise can potentially be useful.
Starting point is 00:30:17 So let's say, for example, you have a company that's trying to innovate and come up with new ideas, or you're in a creative enterprise where you're going to pitch different ideas for movies. In some ways you might want to actually maximize the variability of the ideas you get. So noise is not always bad, sometimes it can actually lead to good things. Yeah, we don't call it noise in those cases. So we reserve the term noise for undesirable variability. There are indeed many situations in life in which variability is a blessing, certainly in creative enterprises, also evolution. So anything that allows you to select the better one of multiple responses wherever there
Starting point is 00:30:59 is a selection mechanism, variability is a good thing. But variability in the absence of a selection mechanism is a sheer loss of accuracy. And those are the cases that we talk about. So if you had a way when you have multiple underwriters of finding out who is doing a better job than whom, and using that in order to improve their training, that would be a case where you could make positive use of variability, but in the absence of such a mechanism,
Starting point is 00:31:33 that variability just is a sheer loss. When we come back, how to fight noise? You're listening to Hidden Brain, I'm Shankar Vedantam. This is Hidden Brain, I'm Shankar Vedantam. Noise is endemic. It's also very difficult to fight, in part because judges and doctors and police officers don't like to think of themselves as capricious. We don't think of our judgments as being arbitrary, certainly not when it comes to really important decisions.
Starting point is 00:32:14 Even when we are told about how noise is affecting our judgments and decisions, we hate to be shackled by rules. Danny, in 1907, Charles Darwin's cousin, Francis Galton, asked 787 villagers at a county fair to estimate the weight of a prize ox. None of the villagers guessed the right answer, but then Galton did something with their answers that got him very close to the correct answer. What did he do, Danny? Well, he simply took the average, and the average, I think, was within two pounds of the correct weight. And that led to a lot of research that was summarized in a recent book by James Surowiecki on the wisdom of crowds, and the fact that when you take multiple judgments, independent judgments, and average them, you eliminate noise. This, by the way, is guaranteed to eliminate noise. So if you take multiple judgments, there is no guarantee that it will reduce bias
Starting point is 00:33:21 because if the judges agree on the bias, then the bias will remain when you take the average. Indeed, it will be even more salient. But what is absolutely guaranteed is that when you average independent judgments, you are eliminating noise. When you take four judges, you are reducing noise by one half. When you take a hundred, you're reducing it by 90%. So there is some mathematics of noise that lends itself to analysis that doesn't apply to bias. So it's really remarkable. The correct weight of that ox was 1,198 pounds. And as you said, that was one or two pounds off the correct weight. And I want to point out that the reason averaging the responses produces a better answer
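The arithmetic behind "four judges, one half" and "a hundred judges, 90%" can be sketched as follows. This is a minimal illustration, not something from the episode: it assumes each judgment is the true value plus independent, unbiased random error, in which case the spread (standard deviation) of an average of n judgments shrinks by a factor of the square root of n. The function names and the noise level of 100 pounds are hypothetical; only the ox weight of 1,198 pounds comes from the conversation.

```python
import math
import random
import statistics


def expected_noise_reduction(n_judges: int) -> float:
    """Fraction by which the spread shrinks when n independent judgments are averaged.

    Assumes independent, unbiased errors, so the standard deviation of the
    average is the individual standard deviation divided by sqrt(n).
    """
    return 1.0 - 1.0 / math.sqrt(n_judges)


def simulated_noise(n_judges, true_value=1198.0, noise_sd=100.0, trials=5000, seed=0):
    """Estimate the spread of an n-judge average by simulation.

    true_value is the ox weight quoted in the episode; noise_sd is an
    arbitrary, hypothetical amount of individual noise.
    """
    rng = random.Random(seed)
    averages = [
        statistics.fmean(rng.gauss(true_value, noise_sd) for _ in range(n_judges))
        for _ in range(trials)
    ]
    return statistics.pstdev(averages)


if __name__ == "__main__":
    for n in (1, 4, 100):
        print(n, round(expected_noise_reduction(n), 2), round(simulated_noise(n), 1))
```

Under these assumptions the formula gives a reduction of one half for four judges and 90% for a hundred, matching the figures quoted. The sketch says nothing about bias: a shared error added to every judgment would pass through the average unchanged.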
Starting point is 00:34:09 is that noise is random. You're taking advantage of the fact that various estimates will be randomly high or low and that's why when you average them out, you're going to get closer and closer to the correct answer. And what happens when you have different people making the same judgment of the same objects and then you are going to average them, then the errors they make cancel each other out. But when people make judgments about different cases, errors don't cancel them out. If you set too high a premium in one case and too low a premium in the other case, that doesn't make you right. That just makes things worse. So this idea that errors cancel out, you have to apply it quite precisely.
Starting point is 00:34:53 They cancel out when you average judgments are the same thing. And also the judgments have to come from people who in some ways who are independent of one another. If I am seeing the judgment you make and then I make my judgment afterwards, my judgment really is just a reflection of your judgment, not an independent one. That's right. And, you know, what happens, basically, is when you have witnesses who talk to each other, the value of their testimony is sharply reduced. Because in effect, in the extreme, if you have one witness who is very assertive, all the other witnesses fit their story to his, then you have one witness, regardless of how many testify.
Starting point is 00:35:35 One of the most remarkable aspects of the wisdom of the crowd that you describe in the book has to do with how you can elicit the wisdom of the crowd just from yourself. You cite research by Edward Vul and Harold Pashler that asked people to make judgments about the same thing separated by a certain amount of time. What do they find when you average out these different estimates? Well, for example, you know, if you ask people what the population of London is and you ask it once and then you wait a couple of weeks, say, and you ask it again, the striking thing is that most people will not give you the same number on the two occasions. And the second striking thing is that the average of the two responses is more likely to
Starting point is 00:36:23 be accurate than either of the responses. The first response is better than the second, but the average is better than both. In one of the studies they conducted, they actually asked people to make estimates that were different than their initial estimates, and then they averaged out the estimates, and they found that noise was reduced even further. Why would this be the case, Danny? Well, here what you're trying to do, and you can do it within an individual, is you're leaning against yourself.
Starting point is 00:36:54 You made one judgment, and then you ask people to think, how could that judgment be wrong and then make another? And that turns out to be indeed better than merely asking the same question twice. In some ways, this provides a solution to the conundrum I posed to Danny. If noise is detectable only by studying statistical averages, how do I reduce noise in decisions I am making as an individual?
Starting point is 00:37:26 The answer? Try to make the same decision over and over under different conditions. One way to tell if noise is behind my decision to propose marriage is to ask myself whether I would make the same decision under different circumstances, not just on a moonlit night in the springtime, but in the heat of summer or in the dead of winter. If I reach the same answer in these different settings, it's possible I could still be making a mistake, but at least I can be somewhat reassured that my decision is not the result
Starting point is 00:37:59 of random extraneous factors. Scientists are exploring lots of ways to reduce noise. The researcher Sendhil Mullainathan and his colleagues devised an algorithm to advise judges on whether to grant bail to suspects. These are people who have been arrested, but who have not yet been put on trial. Keeping them in jail can cause all kinds of hardship.
Starting point is 00:38:25 People can lose jobs or lose custody of their children while they're incarcerated awaiting trial. It's costly for taxpayers to keep people in jail. But letting someone dangerous out of jail can cause harm. Maybe they go on to commit other crimes. The researchers had the algorithm offer advice to judges about whether to grant bail. They found that if judges incorporated the recommendations, this could reduce the number of people in jail by 42 percent without increasing the risk of crime.
Starting point is 00:39:01 The research goes further than that, in that allowing the algorithm to inform the judge is actually not the best way of doing it. The research suggests quite strongly that when you have a judge and an algorithm that are looking at the same data, with some exceptions it's better to have the algorithm have the last word, and this is very non-intuitive. Besides being actually superior in some ways in terms of judgment, one of the things that algorithms do better than people is that they're not noisy. They're actually much more consistent.
Starting point is 00:39:35 Can you talk about this, that in some ways one of the advantages that algorithms have is that even when their judgments might not be as good as humans', because they have less noise than humans, you're able to get better outcomes? Well, noise is a source of inaccuracy. And algorithms, by their nature, are noise-free. That is, when you present the same problem to two computers running the same software, they are going to give you the same answer, which is not true of different bail judges.
Starting point is 00:40:08 So that advantage is, in many cases, sufficient to make algorithms superior to people. But I don't want to create the impression that our solution to the problem of noise is algorithms, because even if it were the solution, there's just too much opposition to algorithms. So ultimately, we're talking about improving judgments. In some domains, algorithms can be used, and I think where they can be used, they should be used. But this is a long process, a slow process, because human judgment is going to make the important decisions for quite a while. Isn't it interesting, though, Danny,
Starting point is 00:40:52 that when you look at the news and you see the news coverage of algorithms, I feel like just in the last year, I've seen dozens of articles talking about algorithmic bias, about how algorithms in some ways can make judgments worse. And it is the case that you can have poorly designed algorithms. You can argue that the old sentencing rules that we had, three strikes and you're out, in some ways, that is an algorithm.
Starting point is 00:41:14 But you could argue the algorithm in some ways was too crude to capture what actually needed to be done. But isn't it striking that there's so little attention that's paid by contrast to the potential good that algorithms can do? Because again, we're so focused on the story of intent, of saying a bad outcome happened and an algorithm caused it, clearly algorithms need to be thrown out the window. I mean, we do not want to accept the errors that blind rules will make.
Starting point is 00:41:38 You know I was talking to someone who designed self-driving cars. And they realize that self-driving cars, it's not enough for them to be a hundred times safer than regular drivers. They effectively have to be almost perfect before they will be admitted. And it's that kind of bias that is completely human and natural. We like the natural over the unnatural.
Starting point is 00:42:08 We prefer human drivers and human doctors to make mistakes rather than self-driving cars and medical algorithms. And that's just the fact of psychology. You talk in the book about something that you call decision hygiene. And others have talked about this idea as well. What is decision hygiene and why the analogy to public health? When you're thinking of dealing with biases like a specific disease, you can think of a vaccine or you can think of medication,
Starting point is 00:42:45 which is specific to that disease. But when you're washing your hands, you're doing something entirely different. You have no idea what germs you might be killing. And if you're good at it, you will never know because the germs are dead. And a similar distinction can be drawn between different ways of fighting errors. There is a difference between procedures that are specifically aimed at particular biases
Starting point is 00:43:15 and procedures that are intended generally to improve the quality of the judgment and decisions. And the way that this feeds back on the individual is that if there are procedures that are good for organizations and for repeated decisions, they should be good for individuals and for singular decisions. So if I'm a CEO of a corporation or if I'm a policy maker and I'm hearing this conversation about noise, can you give me two or three really specific suggestions on ways that I can reduce noise in my decision-making or in my company's decision-making or in my organization or community? Well, I think the first step would be to
Starting point is 00:43:59 ask whether you have a task in the organization that is carried out by interchangeable functionaries like underwriters or emergency room physicians. They're carrying out the same task, making the same kinds of judgments, and you would like those judgments to be noise-free, to be uniform. So first of all, identify whether you have that case in your organization. If you do, we strongly recommend you measure noise. That is, you actually take those individuals, present them with similar cases, and observe the variability in the judgments. And possibly that may lead you to want to do something about it. But the first step is just to measure noise because our intuitions about the magnitude of
Starting point is 00:44:50 noise are systematically wrong. Danny thinks we should learn from the saga of the rise and fall of sentencing reform. Once you detect noise in an organization, it may be wiser to avoid trying to fix the problem by asking everyone to follow rigid rules. As we've seen, people hate to have their judgment questioned, they hate to have their discretion limited, and they detest anything that smacks of mechanistic rules. The main thing to do, if you're attempting to improve the judgment of people in an organization, is to convince those people that they want their judgments to be better. If you impose it as a set of rules that all of them will follow, they will resist it, they will feel they are being robotized
Starting point is 00:45:40 and they're likely to sabotage whatever you propose. I mean, this is well known in insurance companies that provide the underwriters, in many cases, with information or even with a technical price, with the suggestion about what premium should be assigned and underwriters are very likely to completely ignore those and to follow their judgment. And basically, I would think, you know, it's obvious advice. If you have a group of people who are noisy, have that group try to find the solution to the noise,
Starting point is 00:46:18 have them develop procedures that will make them uniform, do not impose procedures on them, but work with them to make them more uniform, because actually they will recognize that they would like to be in agreement with each other. But letting them feel that what they are doing is what they want to do, rather than what they are being forced to do. That is clearly a very important step if people really want to have organizations that improve their judgments. Daniel Kahneman, Olivier Sibony and Cass Sunstein are the authors of Noise: A Flaw in Human Judgment.
Starting point is 00:47:00 Danny, thank you for joining me today on Hidden Brain. It was really my pleasure. Hidden Brain is produced by Hidden Brain Media. Our production team includes Brigid McCarthy, Laura Kwerel, Kristin Wong, Ryan Katz, Autumn Barnes, and Andrew Chadwick. Tara Boyle is our executive producer. I'm Hidden Brain's executive editor. Our unsung hero today is Rosalind Tardisilius. She's a producer in New York City
Starting point is 00:47:35 who helped us record this interview with Danny Kahneman. Rosalind got to Danny's place early to set up for the interview and she was incredibly kind, conscientious and patient. At various points in my conversation with Danny, sirens blared outside. At one point, a refrigerator in Danny's apartment woke up and started making noise. Through all of it, Rosalind figured out how to get a crystal clear recording. Thank you Rosalind, you are a true unsung hero. If you like this episode and like our show, please consider supporting us.
Starting point is 00:48:13 Go to support.hiddenbrain.org to learn how you can help. Every little bit makes a difference, and it means a lot to us to see you step forward to help. I'm Shankar Vedantam, see you next week.
