Journal

If every choice you made paid off, would you stick with one routine or keep switching things up just in case? Researchers gave pigeons that exact scenario, rewarding them no matter which sequence they completed.

Rather than settling into a predictable pattern, the birds kept exploring, favoring some options for a while before drifting to others.

Even under constant success, their behavior never fully stabilized, revealing a surprising mix of habit and curiosity.

What the pigeons actually did

Inside small test chambers, six birds worked through five illuminated buttons that could be pecked in 120 different orders.

By tracking their choices over time, Ed Wasserman at the University of Iowa showed that rewards reduced variety but did not produce a fixed sequence.

Across months of repetition, each pigeon shifted which patterns it favored, with preferred sequences rising and falling rather than stabilizing.

That instability defines the core tension of the result, where success strengthens behavior while still preserving ongoing change that demands further explanation.

What happens without guidance

More than a century ago, the Law of Effect – the idea that rewarded actions tend to return – became a foundation of behavioral science.

Here, however, the payoff never favored one route over another, because every five-peck path ended with food.

Without a best shortcut to discover, the birds still trimmed some options while continuing to sample many others.

The combination matters because it separates two linked ideas: choosing a reward more often and squeezing out alternatives over time.

Pigeon habits that never stuck

Over eight months and roughly 30,000 trials, the pigeons did become more focused – but never fully predictable. Their behavior tightened, yet it never settled into a single fixed routine.

Reward still played its expected role, reinforcing whatever had just worked and making it more likely to be repeated next time. But that effect didn't last forever.

Instead, the birds kept shifting their preferences. A sequence that dominated one day could fade the next, with no penalty for moving on.

That constant rise and fall of "favorite" patterns kept any one habit from taking over completely.

Today's favorite, gone tomorrow

Most reward theories expect a stable winner to emerge once practice gradually strips away wasted motions. Public comments from the team made clear how surprising the continued churn felt inside the lab.

"Such dramatic behavioral instability is most definitely not consistent with the Law of Effect," said Wasserman.

Those swings in preferred sequences suggest that success was steering behavior, but not steering hard enough to erase curiosity.

Order near chaos

Wasserman described that balance as the edge of chaos, where order holds but room for change remains.

Within this balance between stability and variation, a useful behavior can repeat often enough to earn food while backup options stay alive.

"Maybe it's in their best interest to keep some variability in their behavior," said Wasserman.

The edge of chaos idea reaches beyond pigeon pecks, but this experiment keeps the claim narrow and tied to one simple task.

Trying new things still worked

Across species and in human decisions, the exploration-exploitation tradeoff, balancing new options against known ones, keeps reappearing.

Animals forage, search, and learn by mixing both tendencies, because a fixed rule can waste opportunities when conditions change.

The pigeon results fit that wider pattern, though they show it in a stripped-down task with little real-world clutter.

This wider connection makes the birds useful not because they mimic people, but because they expose a clean decision problem.

Implications beyond pigeons

The study also points toward more intricate behavior, where success may depend on repeating a pattern without repeating it exactly.

Skilled performance in music, sports, and art often preserves a goal while letting timing, force, or sequence vary a little.

Controlled looseness can help a system absorb noise, fatigue, or changing tools before performance breaks.

Still, pigeons pecking buttons do not explain creativity, and this experiment never tried to settle that larger issue.

Limitations of the study

There's an important catch in how this experiment was designed. Every sequence earned a reward, and none came with a penalty, so the pigeons never had to deal with real consequences for a "bad" choice.

Out in the real world, that's rarely the case. Animals face delays, dead ends, predators, and competition, all of which make trial-and-error far more costly.

There's also the question of scale. A small group of pigeons working through a touchscreen task can reveal clean patterns, but it doesn't capture the full complexity of how different animals behave in the wild.

These limits don't weaken the findings. Instead, they help define what comes next, pointing researchers toward tougher, more realistic tests that can show how far this balance between habit and exploration really goes.

Why reward doesn't lock habits

Behavioral science has long treated reward as a force that strengthens whatever action just succeeded.

These pigeons suggest that process works in two directions at once, reinforcing what works while still leaving space to explore.

That shift matters for any theory that reduces learning to simple habit lock-in or mechanical obedience. Instead, reward seems to narrow behavior without closing it off entirely – helping explain when persistence pays off and when continued search protects success.

That balance may be the real takeaway. Reward made the pigeons more focused, but never so narrow that they stopped exploring.

Future studies can test how far this pattern extends, from more complex animal settings to human skills that still depend on variation and flexibility.