In 2011, Howard Marks — co-founder of Oaktree Capital — circulated a memo that became widely read across investing circles. Its thesis was simple: first-level thinking is available to everyone. Second-level thinking is rare and that's why it creates advantage.
First-level thinking: "This company has bad news — sell." Second-level thinking: "This company has bad news, everyone will sell, the price will drop below fair value, and in six months it will recover. Buy while everyone else is selling."
First-level thinking asks: what will happen? Second-level thinking asks: what will happen after that? And who doesn't see it coming? The difference isn't intelligence. It's the habit of thinking one move further than the moment.
For practitioners: first-level thinking designs a session and asks "will this produce learning today?" Second-level thinking asks "if this produces learning today, what will participants do differently next week — and will their environment support that or resist it?"
British India. The colonial administration was alarmed by the density of cobras in Delhi. Solution: pay a bounty for every dead cobra. First-order result: people killed cobras and collected the bounty. Second-order result: people began breeding cobras to kill and collect the bounty. When the administration discovered this and canceled the program, the breeders released their now-worthless snakes. The cobra population was higher than before the program started.
The bounty solved the symptom and created the condition. This is the cobra effect: an incentive that reliably produces the opposite of its intended outcome because the second-order effects were never examined.
Training programs run cobra effects constantly. The mandatory compliance training that teaches employees to click through acknowledgment screens without reading them. The skills certification that measures test-completion rather than skill application. The leadership development program that increases leaders' vocabulary about leadership while leaving their behavior unchanged — and which now makes them better at explaining why the unchanged behavior is acceptable.
In 1975, economist Sam Peltzman published a study of US mandatory seatbelt laws. The prediction was straightforward: seatbelts save lives. The finding was less comfortable: occupant fatalities fell while non-occupant fatalities did not fall in step — and on some measures rose — consistent with drivers, feeling safer, driving faster and less carefully. The methodology remains contested, and replication efforts since have produced mixed results.
This is risk compensation: when we reduce one source of danger, we often take on more risk elsewhere, returning to roughly our prior level of perceived risk exposure. The safety system doesn't eliminate risk; it relocates it.
The practitioner's Peltzman question: when a learning environment offers extensive structure, scaffolding, and support, does the participant develop a corresponding tolerance for ambiguity — or does the safety system absorb the cognitive work that was supposed to build it? Practitioners running long-form cohorts often see a version of this pattern: learners who flourish under heavy scaffolding falter when the scaffolding is removed. The mechanism deserves naming even where the formal training-context study base is thin.
The refresh module paradox: when training completion rates are low, the standard response is to add a refresher module — making the content shorter, simpler, more accessible. First-order effect: completion rates rise. Second-order effect: the simplification removes the cognitive effort that produces retention. Completion goes up; behavior change goes down.
The onboarding acceleration paradox: when new hires feel unproductive, companies compress onboarding. First-order effect: faster time-to-productivity on measured tasks. Second-order effect: higher attrition at 12–18 months, when the unaddressed gaps in tacit knowledge become visible. The cost of the second-order effect typically exceeds the cost of the extended onboarding by 3–5x.
Daniel Kahneman's research is relevant here: fast thinking (System 1) feels right but generates first-order answers. Slow thinking (System 2) is effortful and generates second-order analysis. Organizations under pressure systematically reward System 1 decisions, even when they produce second-order damage.
Second-order thinking becomes a justice question when we ask: whose second-order effects get examined, and whose don't?
A training intervention designed to develop "executive presence" produces first-order results: participants receive coaching on communication, gravitas, and visibility. Second-order result: the definition of "executive presence" is implicitly white, male, and senior — meaning women and people of color who adopt the prescribed behaviors are often penalized for them ("too aggressive," "not collegial") while receiving the same advice that rewards white men. The training fixes nothing and may make it worse.
The generational version: L&D programs designed for "digital natives" embed an assumption that younger workers learn better with technology and older workers learn better with print or in-person instruction. First-order: sessions are differentiated by age. Second-order: we reinforce a generational hierarchy that treats younger workers as technologically sophisticated but experientially shallow, and older workers as wise but unable to adapt. Both groups are damaged by the frame.
The second-order question for every training design: if this works as intended, who benefits, who is burdened, and what does it teach people about who belongs?
The practitioner's second-order question is not "will this work?" It's "if this works exactly as designed, what does the world look like six months later — and is that the world we want?"
Donella Meadows identified leverage points in systems — places where a small shift produces large change. The highest leverage point is not changing parameters or flows; it's changing the goals of the system. Most training interventions change behaviors. Second-order practitioners ask: what goal does this behavior serve, and is that goal the right one?
The practitioner who teaches "how to have difficult conversations" is changing a behavior. The second-order question: in this organization, what happens to people who have difficult conversations? If they get managed out, the training is preparing people to be managed out more articulately. The intervention needed isn't communication skills — it's the goal of the system that penalizes honesty.
Sources — Bibliography
- Marks, H. (2011). The Most Important Thing: Uncommon Sense for the Thoughtful Investor. Columbia University Press. — The practitioner-language account of first vs. second-level thinking.
- Bastiat, F. (1850). "That Which Is Seen and That Which Is Not Seen." FEE.org. — The original unseen consequences argument. One essay. Read in 20 minutes.
- Peltzman, S. (1975). The effects of automobile safety regulation. Journal of Political Economy, 83(4). — The empirical foundation for risk compensation as a second-order effect.
- Meadows, D.H. (2008). Thinking in Systems: A Primer. Chelsea Green Publishing. — The best single source for understanding cascades, feedback loops, and leverage points.
- Kahneman, D. (2011). Thinking, Fast and Slow. Farrar, Straus and Giroux. — System 1 vs. System 2 as the cognitive mechanism behind first-order default thinking.
- Rivera, L.A. (2015). Pedigree: How Elite Students Get Elite Jobs. Princeton University Press. — The second-order effects of "culture fit" and "executive presence" in professional development.
- Senge, P. (1990). The Fifth Discipline. Doubleday. — "Fixes that fail" and "shifting the burden" as systems archetypes; the organizational-learning version of second-order thinking.
- Taleb, N.N. (2012). Antifragile: Things That Gain from Disorder. Random House. — The second-order argument at full scale: systems that are fragile to second-order effects vs. those that gain strength from them. Chapter 2 is directly relevant to training design.