Suhas Thejaswi | Aalto University

Why human-in-the-loop is not enough for high-stakes decision making
June 02, 2026

Click to read more

In recent months, I have attended several talks on AI ethics, policy, and governance for automated decision-making systems. Many of these discussions focus on applications where the stakes are high: health care, law enforcement, welfare allocation, hiring, military decision support, and other settings where a wrong decision can seriously affect a person's life, as well as society more broadly.

A recurring recommendation in these conversations is: keep a human in the loop.

The recommendation sounds reasonable, right. If an AI system is only used to provide information such as predictions, risk scores, or recommendations, and if a human expert retains final authority, then perhaps the system is not really making the decision. The human can review the output, correct the system when it is wrong, and remain accountable for the final outcome. So what is the problem here?

But is human-in-the-loop decision-making actually enough to address the issue?

I do not think it is, at least not by itself.

The problem is not that human oversight is useless. In many domains, human judgment is useful and necessary. The problem is that simply placing a human after an algorithmic recommendation does not guarantee meaningful oversight. A human decision-maker who is shown an AI recommendation is not making the same decision they would have made without it. The recommendation changes the decision environment. It changes what information is salient, which options feel natural, what evidence receives attention, and which course of action appears easier to justify.

In other words, the AI/algorithmic system does not need to make the final decision in order to influence or shape the final decision.

Decision support is also choice architecture

Work by Richard Thaler and Cass Sunstein popularized the idea of choice architecture: the way options are presented can predictably influence what people choose. Defaults, rankings, warnings, labels, and framing effects can all guide behavior without formally removing freedom of choice. This is the central idea behind their work on nudges and decision environments ¹ ².

A decision-support system is also a form of choice architecture. It may not force the human to accept its recommendation, but it structures the human's attention. It can make some options appear more credible, more reliable or simply easier to select.

This matters because high-stakes decisions are often made under uncertainty, time pressure, incomplete information, and institutional constraints. A doctor may not have unlimited time to question every diagnostic suggestion. A judge or caseworker may face a heavy workload. A military analyst may operate under intense pressure and need to make decisions in the order of seconds. In such settings, an AI-generated recommendation is not merely "additional information." It can become an anchor.

Research in judgment and decision-making has long shown that humans rely on heuristics when reasoning under uncertainty. Daniel Kahneman and Amos Tversky argued that people often use mental shortcuts such as representativeness, availability, and anchoring ³ ⁴. These shortcuts can be useful, but they can also prompt us make systematic errors that are predictably irrational. The point is, when we give a human an AI recommendation, we are not adding information to a neutral decision-maker. We are adding information to a human decision-maker with limited attention, limited time, and predictable cognitive vulnerabilities.

The danger of automation bias

One risk is automation bias: the tendency to over-rely on automated systems, especially when they are usually correct. If a system is more accurate than the average human most of the time, it becomes psychologically and institutionally difficult to question it. Over time, the human may stop treating the system as an input among many and begin treating it as the default answer.

This is especially dangerous because many AI systems fail unevenly. They may perform well on average while failing for particular subgroups, rare cases, distribution shifts, or unusual contexts. A human reviewer may not know when they are looking at one of those failure cases. If the system is usually right, why should the human override it now? What evidence would be enough? Who bears responsibility if the override is wrong?

This creates a paradox. The more useful and accurate a decision-support system becomes in ordinary cases, the harder it may become for humans to detect the exceptional cases where they should resist the advice from these systems.

A simple analogy is search recommendations. Most people rarely look beyond the first page of search results. This does not mean they are forbidden from doing so. The option is available. But the ranking strongly shapes our attention. The first page (or sometimes even the top results) becomes, in practice, the information environment within which many users make judgments and take decisions. Similarly, a decision-support system may preserve formal human authority while still strongly shaping the range of decisions that feel reasonable.

Formal control is not the same as meaningful control

This is why I am skeptical of policy proposals that treat "human-in-the-loop" as a sufficient safeguard. The phrase hides several very different arrangements.

A human may be asked to approve an AI recommendation without enough time to review it. A human may see a risk score without understanding how it was produced. A human may technically have authority to override the system, but face institutional pressure not to do so. A human may be blamed for accepting a bad recommendation even though the system was designed in a way that made meaningful scrutiny unrealistic.

In such cases, human oversight becomes procedural. The human is present, but not necessarily empowered.

For human oversight to be meaningful, the human decision-maker must be able to answer basic questions such as:

When is this system likely to fail?
What kinds of cases are outside its reliable domain?
What evidence should justify overriding it?
Will I be penalized for disagreeing with it?
Do I have enough time, training, and information to evaluate its recommendation?
Is the system's performance known across most relevant contexts?
Are overrides monitored only as individual deviations, or also as signals of possible system failure?

Without answers to these questions, "human-in-the-loop" can become a comforting slogan rather than a real safeguard.

Humans can improve decisions, but only under the right conditions

The conclusion should not be that humans are always better than machines. That would also be wrong. In many settings, statistical models and automated decision-making systems can outperform unaided human judgment. Indeed, one motivation for introducing automated decision-making systems is that human decisions are often affected by biases, inconsistency, and seemingly irrelevant factors. Kahneman, Sibony, and Sunstein discuss this problem in Noise, where they argue how judgments can vary substantially across decision-makers even when they are evaluating similar cases⁵. The issue, therefore, is not whether humans or machines are superior in general. The more important question is how the combined human-machine decision-making system behaves.

Sometimes, AI support may improve decisions. Sometimes, humans may correctly reject bad recommendations. But sometimes, AI may introduce new errors, amplify existing biases, or make human decision-makers less vigilant. The performance of the overall system depends not only on model accuracy, but also on interface design, incentives, training, workload, accountability, and the distribution of errors.

This is the central point: a human-in-the-loop system is not automatically a safer system. It is a new decision-making system, and it must be studied as such.

What should policy require instead?

If policymakers want to require human oversight in high-stakes AI systems, the requirement should be more specific than "put a human in the loop." A stronger policy framework would ask whether the human has genuine capacity to intervene.

This could include requirements such as:

Human reviewers should be trained on known failure modes of the system.
AI recommendations should be accompanied by uncertainty estimates and clear limits of applicability.
Systems should be evaluated not only for model accuracy, but also for their effect on human decisions.
Institutions should measure whether humans actually override the system in appropriate cases.
Decision-makers should not be punished simply for disagreeing with an algorithm.
High-stakes deployments should be tested under realistic workloads, time pressure, and institutional incentives.
There should be auditing for subgroup-specific failures and rare but consequential errors.

The key shift is to evaluate the human-AI decision process together, not just the AI model in isolation.

Where our understanding is still limited

There is still much we do not fully understand. We need better evidence about when human oversight improves outcomes and when it merely creates the appearance of accountability. We need to know how professionals in different domains respond to AI recommendations under real institutional pressures. We need to understand how expertise, training, workload, explanations, uncertainty displays, and liability rules affect the willingness and ability to override automated suggestions.

We also need more domain-specific research. The right form of oversight in medical diagnosis may not be the same as in military targeting, welfare administration, or hiring. A generic human-in-the-loop requirement may be too blunt for the complexity of these settings.

The deeper problem is that high-stakes decision-making is not only a technical problem. It is a problem of human judgment under uncertainty, institutional design, incentives, accountability, and authority. AI systems enter into this already complicated environment and reshape it.

So yes, humans should often remain involved in high-stakes decisions. But we should not mistake human presence for human control.

A human-in-the-loop is not enough unless the loop itself is carefully designed, tested, and governed.

References

Richard H. Thaler and Cass R. Sunstein, Nudge: Improving Decisions About Health, Wealth, and Happiness, Yale University Press, 2008. ↩
Richard H. Thaler, Cass R. Sunstein, and John P. Balz, "Choice Architecture," 2010. ↩
Daniel Kahneman and Amos Tversky, "Judgment under Uncertainty: Heuristics and Biases," Science, 185(4157), 1124--1131, 1974. ↩
Daniel Kahneman, Thinking, Fast and Slow, Farrar, Straus and Giroux, 2011. ↩
Daniel Kahneman, Olivier Sibony, and Cass R. Sunstein, Noise: A Flaw in Human Judgment, Little, Brown Spark, 2021. ↩

Attention is all you need
January 25, 2026

Click to read more

I recently came across an essay by Prof. C. Sheshadri on why email destroys his productivity by breaking focus and fragmenting attention: "About students emailing me". It is blunt but uncomfortably relatable. Reading it made me realize that over the last couple of years, I have slowly trained myself into a habit I do not even like.

I check email far more often than I used to. Even when I am on leave or on vacation, even when I have no intention of replying, I still end up checking "just in case" something important came in. That habit has occasionally been bad enough to spill into sleep: wake up in the middle of the night, reach for the phone, check email. Nothing urgent, nothing that can not wait… and yet there I am, fully awake. And then there is the modern upgrade to email: instant messaging.

Some collaborations I am involved use tools like Google Chat or Slack for immediacy. I understand the appeal. But the design of these platforms nudges you toward constant availability. The "seen" indicator is the worst part: people know you have read a message, and suddenly there is an implicit expectation that you should respond now. Even when you are travelling. Even when you are on vacation. Even when you are trying to live like a normal human. It took me longer than it should have to admit that this was unhealthy.

So I changed something small, but surprisingly effective.

What I changed?

I removed work email and work messaging apps from my phone.

Not "turned off notifications." Not "I will check less." Actually removed them. Now I check email and messages only on my laptop, when I am intentionally in "work mode." I also started setting clearer expectations with my colleagues: I may not respond outside work hours unless it is genuinely necessary. If something is truly time-sensitive, they can reach me via my personal email. This felt slightly awkward at first, like I was being "less responsive" or "not a team player." But here is what I learned: being always reachable is not the same as being reliable.

What happened afterwards?

Two things changed almost immediately:

1. My brain stopped splitting itself into tiny fragments. I could focus for longer stretches without that background anxiety of "what did I miss?"

2. My responses improved. When I do reply now, I am more thoughtful and less reactive. I am not firing back half-baked answers just to clear a notification.

Also, and this one surprised me, I have not experienced any disasters because I did not respond immediately. Nothing 'went terribly wrong.' No major opportunities vanished. No collaborations collapsed. The world kept spinning. What did change is that I felt calmer, more in control, and (ironically) more productive.

The uncomfortable truth

Email and chat are not just communication tools. They are attention-extraction machines.

They pull me out of deep focus. They keep me in a low-level state of alertness. They make you feel like I am always behind. And they blur the boundary between work and life until there is no boundary left. So now I treat boundaries as part of the job.

If you are someone who finds yourself constantly checking email and messages, or feeling pressured to respond the moment a message arrives, I would strongly recommend trying a version of this. You do not have to go extreme. Start small. But protect your attention.

"Attention is all you need" for being productive. And it is way too valuable to donate to a never-ending inbox.

Hello World ...
June 15, 2025

Click to read more

For years, I have thought about starting a blog—a space to document day-to-day questions that wander in my mind and to reflect about things that I find interesting/relevant today and dismiss as irrelevant tomorrow. I have also wanted to jot down summaries of the books I read. But every time I consider doing it, I end up procrastinating—telling myself I will get to it after I finish this or that deadline. Of coarse, that mythical moment of being "done with deadlines" never seems to arrive. The only constant is the fact that I am never truly free. At some point, I have to admit to myself: if I keep waiting for the perfect, interruption-free time to start anything new, I will be waiting forever. So, I have decided to stop waiting—and just make time for it.

Why do I want to do this? Well, partly because writing helps me think more clearly. Case in point: while writing this very paragraph, I found myself wondering why I even want to write a blog in the first place. I often have vague ideas—about research questions, life, or what is happening in the world—and I can ramble on about them endlessly without ever arriving at a concrete conclusion. But writing forces me to slow down, think deeply, and articulate my thoughts precisely. It helps me channel my inner introvert to question my assumptions, and resist the urge to jump to convenient conclusions. Writing, for me, is a way to challenge myself abount my presumptions, to examine how/why I could be wrong.

Also, I read many books and articles (though not as much as I would like to or once hoped to). Not just research related ones, but also books that help me grow and think beyond research—from fields like (behavioural) economics, behavioural psychology, and (auto)biographies. It is a habit that has shaped me both personally and professionally, often pushing me to question my own thoughts. But, like most people, I am blessed with the human gift of forgetfulness (which, to be fair, probably keeps us all sane). As a result, I tend to forget many things I once thought I understood well. So perhaps this blog can also serve as a kind of personal notebook—a place to capture key takeaways, so I do not have to reread entire books just to recall a few important points. It might also encourage me to take better notes while reading, especially when I come across something meaningful or worth remembering for the future.

Finally, what I write here reflects my personal opinions. Everything I write comes from a specific context—one shaped by my own experiences, the work I do, the life I live and have lived, the air I breathe, the food I eat ... you get the idea. My views may not align with yours, and that is okay. I respect your right to your opinions—just allow me the same courtesy. I will try not to be a radical, so do not expect anything wild. What I am doing is making a promise to myself: to write something meaningful, something I will be proud to share—something I will not regret writing, and will enjoy coming back to read when I am older and, hopefully, wiser.

June 15, 2025
Helsinki, Finland