Executive Thesis
The mainstream story says developers fail to automate their core tasks because they are too busy, because the backlog is too large, or because management has not given them enough tools. That story is convenient because it turns automation failure into a scheduling problem. It is also incomplete. The hidden psychological reason developers avoid automating core work is that automation threatens the identity, status system, and safety contract that made them valuable in the first place.
Most experienced engineers learned to win by being the person who knows the build incantation, the release ritual, the migration sequence, the fragile production dashboard, the undocumented edge case, or the manual recovery path at 2:00 a.m. That knowledge is useful, but it can become a trap. When a task is painful and important, the person who can perform it manually gains leverage. When that task is automated, the leverage moves from the individual hero to the system. The business wins, but the local social hierarchy changes. The engineer who was rewarded for being indispensable may experience automation as self-erasure.
This is not an argument that developers are lazy, irrational, or anti-automation. It is the opposite. The behavior is often rational inside the incentive structure leaders created. If promotion, trust, urgency, and recognition flow toward people who personally rescue systems rather than people who remove future rescue work, then the organization should expect toil to survive. If managers praise velocity but punish production risk, engineers will preserve manual gates. If executives demand AI adoption while making headcount reduction the loudest subtext, engineers will quietly avoid building tools that make their own roles look smaller. If architecture decisions are judged by visible feature delivery while platform leverage is treated as invisible overhead, automation becomes politically expensive.
The Serious CTO thesis is this: automation is not primarily a tooling rollout. It is a status redesign. The technical question is whether a task can be automated; the leadership question is whether the people closest to the task can afford to make themselves less locally indispensable. Until that second question is answered, the first question is theater.
A serious engineering organization should treat unautomated core tasks as signals, not merely inefficiencies. They reveal where knowledge is trapped, where incentives reward heroics, where psychological safety is weak, where managers measure the wrong output, and where developers are protecting identity under the language of pragmatism. The control framework is not “buy more AI tools.” It is: make automation count as production work, reward leverage over heroics, measure toil explicitly, protect people from automation-driven status loss, and require every recurring manual workflow to have an owner, a risk classification, and a retirement plan.
The Narrative Conflict: Mainstream Belief vs. Reality
The mainstream belief has three versions.
The first is the backlog version: developers do not automate because they are overloaded. Automation is treated as a luxury that can happen after features ship, incidents calm down, and the team has a mythical quiet quarter. This belief sounds reasonable because many engineering teams are indeed overloaded. Production support, roadmap pressure, hiring gaps, security reviews, customer escalations, and codebase decay all compete for attention. In that environment, manual work looks like the fastest path through the day.
The second is the tooling version: developers do not automate because they lack the right platform. If the organization buys better CI/CD, a workflow engine, internal developer platform tooling, GitHub Copilot, better observability, or an AI agent framework, automation will naturally happen. This belief sounds reasonable because tools genuinely matter. Bad tooling raises the cost of automation; good tooling lowers it.
The third is the cultural version: developers do not automate because the culture is immature. This is closer to the truth, but often still vague. Leaders say they need a “DevOps culture,” “ownership,” or “continuous improvement,” then continue rewarding the same visible emergency labor that keeps manual work alive.
Reality is sharper. Developers often do not automate core tasks because manual work can be a rational strategy for preserving safety and status. Manual deployment rituals keep decision control in a small group. Manual debugging procedures make senior engineers visibly necessary. Manual data fixes maintain privileged knowledge. Manual testing steps allow teams to say “we were careful” even when the system is not actually safer. Manual approval chains protect managers from accountability. Manual internal reports keep business context concentrated. The organization calls these behaviors prudence; sometimes they are. But when repeated tasks stay manual for months or years, prudence has usually become an incentive system.
This is why automation conversations so often degrade into surface debates. Someone proposes automating a recurring release task. Another engineer says the edge cases are too complex. A staff engineer says the manual step catches subtle failures. A manager says the team cannot risk destabilizing the process right now. Each statement may contain truth. But the deeper question is: what would happen to local power, recognition, and blame if the manual ritual disappeared?
The same pattern now appears around AI-assisted development. The public narrative says AI will make developers more productive. Research and industry reports show real productivity gains in some settings, especially for bounded coding tasks and less experienced workers. But inside teams, adoption is often uneven. Some engineers use AI tools aggressively; others avoid them or use them only at the edges. The hidden reason is not merely skepticism about model quality. It is that AI assistance changes the basis of expertise. If the old status system rewarded memorized syntax, library familiarity, and speed at producing boilerplate, a tool that compresses those advantages can feel threatening. Engineers who built identity around being fast implementers may resist tools that make implementation less scarce.
A CTO should not mock this fear. They should design around it. People do not abandon identity just because a dashboard says productivity increased. They need a new status ladder: from implementer to system designer, from task owner to leverage creator, from incident hero to reliability architect, from code producer to judgment amplifier.
Quantitative / Evidence Base
Several evidence lanes matter: automation economics, developer productivity research, organizational safety research, and DevOps/reliability practice.
First, automation and AI can create real productivity improvements, but those improvements are uneven and context-dependent. In a field experiment on generative AI in customer support, Brynjolfsson, Li, and Raymond found that access to AI assistance increased worker productivity on average, with the largest gains for less experienced and lower-skill workers. That does not map one-to-one to software engineering, but it does show an important mechanism: automation can transfer tacit patterns from high performers into tools that help others. That is precisely why incumbents may feel exposed. The task knowledge that once made a person unusually valuable can become system knowledge.
Second, software-specific studies also show task-level productivity gains. GitHub reported controlled research in which developers using GitHub Copilot completed a programming task faster than developers without it. The paper “The Impact of AI on Developer Productivity: Evidence from GitHub Copilot” likewise found measurable effects in a controlled setting. These studies should not be stretched into a universal claim that AI doubles engineering output. They show something narrower and still important: tools can reduce friction in certain coding tasks, and the value depends on task type, workflow, review discipline, and developer judgment.
Third, developer productivity is multidimensional. The SPACE framework argues that productivity cannot be reduced to lines of code, commits, or speed alone; it includes satisfaction and well-being, performance, activity, communication and collaboration, and efficiency and flow. The DevEx framework similarly emphasizes feedback loops, cognitive load, and flow state. This matters because automation avoidance is not just lost hours. Manual core tasks increase cognitive load, break flow, create bottlenecks, and make performance depend on hero availability. The cost appears in delivery predictability, onboarding time, incident recovery, and morale before it appears in a neat accounting ledger.
Fourth, the DORA research program repeatedly ties strong software delivery outcomes to capabilities such as continuous delivery, loosely coupled architecture, fast feedback, and generative organizational culture. Google Cloud’s explanation of Westrum organizational culture emphasizes that high-performing technology organizations make information flow easy, encourage cooperation, and treat failures as opportunities to improve systems rather than assign blame. Recurring manual work is often the opposite: information is trapped in people, cooperation depends on personal availability, and failure becomes a search for the person who skipped the ritual.
Fifth, psychological safety is central. Google’s Project Aristotle work popularized the finding that psychological safety was the most important dynamic in its study of effective teams. Amy Edmondson’s broader research argues that psychologically safe environments allow people to speak up about risk, uncertainty, and mistakes. Automation requires exactly that kind of speech. A developer must be able to say: “This deployment ritual is fragile,” “I am the only one who knows this process,” “This manual approval is theater,” or “If we automate this, my role needs to evolve.” In low-safety cultures, those statements are dangerous. People will instead produce technical objections that sound safer than identity or status concerns.
Sixth, research on employee silence is relevant. Morrison and Milliken described how organizational forces can create climates where employees withhold information about problems. That silence is expensive in engineering. The people doing manual toil often know which tasks should be automated first. If they believe that speaking up will make them look replaceable, disloyal, slow, or difficult, the automation roadmap will be built from management guesses rather than operational reality.
Seventh, reliability practice treats toil as a first-class concern. Google’s SRE material defines toil as manual, repetitive, automatable, tactical work that grows with service size. The SRE tradition does not merely say toil is annoying; it says unmanaged toil crowds out engineering work and should be bounded. Postmortem culture makes the same leadership point: resilient organizations examine systems and incentives, not just individual mistakes. If a manual step prevents a failure, the serious question is not “who remembered the step?” It is “why does safety depend on memory?”
Eighth, workforce and labor-market reports show why fear around automation is not imaginary. The World Economic Forum’s Future of Jobs reports and OECD employment analyses repeatedly discuss changing skill demands, job transformation, and worker exposure to automation. Even when technology augments more than replaces, workers hear a threat: the skills that got them here may not be the skills that keep them valuable. For developers whose identity is tied to direct implementation, this is personal.
The evidence does not support the simplistic claim that developers refuse automation because they are irrational. It supports a more operational claim: automation changes the distribution of knowledge, power, risk, and recognition. Organizations that ignore those forces will under-automate even when the tooling is excellent.
Technical and Operational Consequences
The first consequence is knowledge concentration. When a recurring core task stays manual, the real system boundary moves out of code and into memory. The source of truth becomes “ask Priya,” “ask Marc,” “follow the release checklist in the old doc,” or “remember the one customer exception.” That may work at ten people. It fails at scale. It slows onboarding, increases bus factor, and makes every reorg or resignation a reliability event.
The second consequence is brittle execution. Manual work is not automatically unsafe, but repeated manual work creates variable execution. People get tired. Context changes. Checklists drift. Edge cases multiply. A manual process that worked for one service becomes folklore when applied to twenty. The organization accumulates invisible branching logic: if customer A, do this; if migration B, call that person; if region C, wait for a Slack confirmation. Eventually the process cannot be reasoned about because it is not a process. It is a social dependency graph.
The third consequence is fake risk management. Teams often defend manual gates as safer. Sometimes they are right: a human review can catch contextual issues that automation misses. But many manual gates are not real controls. They are comfort rituals. A person clicking approve after scanning a noisy dashboard is not the same as an automated invariant, a canary analysis, a rollback mechanism, or a clear SLO violation gate. The manual step creates the feeling of safety while preserving the underlying fragility.
The fourth consequence is poor debugging. When core workflows are manual, failures are harder to reconstruct. Logs may show what the system did, but not what the operator thought, skipped, inferred, or changed outside the system. Incident review becomes an archeological exercise through Slack threads, screenshots, shell history, and memory. Automated workflows can still fail, but they leave artifacts: code, configuration, run history, inputs, outputs, and repeatable states.
The fifth consequence is distorted engineering economics. Manual work consumes senior attention in small increments that rarely appear as roadmap items. Ten minutes here, thirty minutes there, a release afternoon every week, a production support ritual every morning. Because the cost is fragmented, it survives budgeting scrutiny. Meanwhile, the opportunity cost is enormous: senior engineers spend their scarce judgment on task execution rather than system improvement.
The sixth consequence is slower AI adoption. Teams that fear automating core tasks will also struggle to adopt AI responsibly. AI tools require explicit workflows, review gates, evaluation criteria, and changed definitions of competence. If the organization cannot convert a manual deployment ritual into an automated pipeline, it will not magically convert engineering work into reliable AI-augmented systems. It will either ban the tools, use them chaotically, or pretend adoption is happening because licenses were purchased.
The Hidden CTO / Engineering Leadership Failure
The hidden failure is that leaders often ask for automation while preserving the reward structure that makes automation unattractive.
A CTO says, “We need to eliminate toil,” but the promotion packet celebrates the engineer who saved the release at midnight. A VP says, “We need platform leverage,” but the quarterly review asks every team to defend feature output. A manager says, “We need better documentation,” but the highest-status engineers are the ones people cannot operate without. An executive says, “AI will make us more productive,” while simultaneously hinting at hiring freezes. The organization’s words say automate; its nervous system says stay indispensable.
This is why automation programs fail quietly. Nobody openly says, “I am afraid that automating this will reduce my value.” Instead, the objections become technical:
- The edge cases are too complex.
- The workflow changes too often.
- The tooling is not ready.
- We cannot risk it this quarter.
- It will take longer to automate than to do manually.
- Only senior people have the judgment for this.
Some of these objections are valid. A serious CTO should steel-man them. But they should also ask what incentive would make the same engineer excited to automate. If the answer is unclear, the organization has a leadership problem, not a tooling problem.
The leadership failure is also visible in measurement. Many teams do not measure toil directly. They measure sprint points, incidents, uptime, cycle time, or feature count. Those metrics matter, but they can miss the recurring manual tax. A team can hit sprint goals while senior engineers quietly burn hours on release coordination. A service can meet uptime goals while relying on a tiny group of humans to keep it stable. A roadmap can look healthy while internal developer experience decays.
The CTO’s job is to make invisible leverage visible. Automation should be treated as production work because it changes production capacity. If the organization only funds automation as leftover time, it is choosing manual dependency as strategy.
The Practical Control Framework
The control framework starts with a simple inventory: identify recurring manual tasks that are important, repeated, automatable, and tied to production, delivery, compliance, customer support, or internal developer flow. Do not outsource the inventory to management alone. Ask the people doing the work. Specifically ask: “Which tasks would you automate if you knew it would not reduce your standing here?”
Then classify each task across five dimensions.
First, recurrence: how often does it happen? Daily work deserves different urgency than quarterly work, but rare high-risk work may still matter.
Second, blast radius: what breaks if the task is done wrong or late? A release step, data correction, access change, or customer migration can carry large risk even if it is not frequent.
Third, knowledge concentration: how many people can perform it correctly without private coaching? If the answer is one or two, the task is a leadership risk.
Fourth, automation readiness: is the task deterministic, rule-based, observable, and testable? If not, can the team first create better logs, checklists, invariants, or APIs?
Fifth, status sensitivity: who gains recognition from doing the manual work today, and what new recognition will replace it after automation?
The last dimension is the one leaders skip, and it is the one that prevents theater. For every automation initiative, define the new status ladder. The engineer who automates a painful release process should receive visible credit not just for a tool, but for increasing organizational leverage. The person who documents and transfers tribal knowledge should not be treated as less essential; they should be treated as someone who converted personal expertise into institutional capability. Promotion criteria should explicitly reward reducing toil, shrinking bus factor, improving developer experience, and making safe behavior the default.
Next, require an automation retirement plan for recurring manual workflows. Not every task must be automated immediately. Some manual steps are legitimate transitional controls. But every recurring core manual task should have one of three labels:
- Keep manual because human judgment is essential and the control is real.
- Automate now because the task is repeated, risky, and deterministic enough.
- Prepare for automation by improving observability, APIs, test coverage, data quality, or policy clarity.
This prevents the common limbo where everyone agrees automation would be good someday but no one owns the path.
Next, separate automation from headcount threat. Leaders cannot promise that technology will never change roles. But they can be honest about the operating model. The message should be: “We automate toil so engineers move up the value chain, not so the organization can preserve broken workflows with fewer people.” Then prove it through assignments. Move people from manual execution into platform design, reliability engineering, developer experience, security automation, architecture governance, or customer-facing technical judgment.
Next, install feedback loops. Track toil hours, manual deployment steps, after-hours interventions, repeated support scripts, onboarding blockers, and number of single-owner workflows. Review them like operational risk, not like hygiene chores. Tie them to business outcomes: release frequency, incident recovery, time to onboard, feature lead time, customer escalation time, audit readiness, and engineering retention.
Finally, treat AI adoption as a forcing function for this discipline. AI will expose which parts of engineering work are explicit enough to augment and which parts are trapped in undocumented judgment. Before buying more AI tooling, ask: where are our workflows already explicit, evaluated, and reviewable? Where do we depend on private memory? AI amplifies systems; it does not fix leadership ambiguity.
The Steel-Man Argument
The strongest counterargument is that some developer tasks should remain manual because automation can encode bad assumptions, hide failure modes, and create a dangerous illusion of safety. This is true. Automation is not inherently good. A bad automated process can fail faster, at larger scale, with fewer humans noticing. Some tasks require contextual judgment, ethical review, customer nuance, regulatory interpretation, or architectural discretion. Some workflows change so frequently that premature automation creates maintenance burden. Some teams are so early in product discovery that manual work is the correct way to learn.
There is also a legitimate career concern. If leaders use automation primarily as a cost-cutting weapon, developers are rational to protect themselves. No amount of inspirational language will overcome evidence that the organization treats automation as a path to layoffs. In that environment, resistance is not psychological weakness; it is self-defense.
The steel-man position is therefore not “automate everything.” It is “automate only when the workflow is stable enough, the control can be verified, and the people affected have a credible path to higher-value work.” A human-in-the-loop approval may be appropriate if the human is adding judgment rather than rubber-stamping fear. A manual step may be appropriate if the team is still discovering the right invariants. The CTO’s job is to distinguish real judgment from disguised toil.
Strategic Path Forward
The strategic move is to reframe automation from a productivity side project into an organizational design problem. Start with one high-friction workflow that everyone knows is painful. Map the current human steps. Identify what knowledge is explicit, what knowledge is tribal, what risk the manual step allegedly controls, and what status the current process confers. Then automate the smallest safe slice and publicly credit the people who converted their expertise into leverage.
Do not begin with a grand platform mandate. Begin with a status signal. Make the first visible win show that the organization values people who remove toil more than people who repeatedly survive it. Use the win to update promotion examples, performance language, architecture review expectations, and team dashboards. Ask managers to report not only what shipped, but what recurring work was eliminated.
For AI specifically, stop framing adoption as “developers versus tools.” Frame it as a shift from implementation scarcity to judgment scarcity. The engineers who thrive are not the ones who manually type the most code. They are the ones who can define the problem, constrain the system, evaluate outputs, protect reliability, understand customers, and turn repeated work into durable capability.
The hidden psychological reason developers fear automating their core tasks is that they learned, correctly, that indispensability is safer than leverage in many organizations. The CTO’s job is to make that lesson obsolete.
Works Cited
- Erik Brynjolfsson, Danielle Li, and Lindsey R. Raymond, “Generative AI at Work,” NBER Working Paper 31161. https://www.nber.org/papers/w31161
- Peng et al., “The Impact of AI on Developer Productivity: Evidence from GitHub Copilot,” arXiv. https://arxiv.org/abs/2302.06590
- GitHub, “Research: quantifying GitHub Copilot’s impact on developer productivity and happiness.” https://github.blog/news-insights/research/research-quantifying-github-copilots-impact-on-developer-productivity-and-happiness/
- McKinsey Global Institute, “The economic potential of generative AI: The next productivity frontier.” https://www.mckinsey.com/capabilities/mckinsey-digital/our-insights/the-economic-potential-of-generative-ai-the-next-productivity-frontier
- World Economic Forum, “The Future of Jobs Report 2025.” https://www.weforum.org/publications/the-future-of-jobs-report-2025/
- OECD, “OECD Employment Outlook 2023: Artificial Intelligence and the Labour Market.” https://www.oecd.org/en/publications/oecd-employment-outlook-2023_08785bba-en.html
- Nicole Forsgren, Margaret-Anne Storey, Chandra Maddila, Thomas Zimmermann, Brian Houck, and Jenna Butler, “The SPACE of Developer Productivity,” ACM Queue / Microsoft Research. https://queue.acm.org/detail.cfm?id=3454124
- Abi Noda, Margaret-Anne Storey, Nicole Forsgren, and Michaela Greiler, “DevEx: What Actually Drives Productivity,” ACM Queue. https://queue.acm.org/detail.cfm?id=3595878
- Microsoft Research, “The SPACE of Developer Productivity.” https://www.microsoft.com/en-us/research/publication/the-space-of-developer-productivity/
- DORA / Google Cloud, “2024 Accelerate State of DevOps Report.” https://dora.dev/research/2024/dora-report/
- Google Cloud Architecture Center, “DevOps culture: Westrum organizational culture.” https://cloud.google.com/architecture/devops/devops-culture-westrum-organizational-culture
- Google re:Work, “Guide: Understand team effectiveness.” https://rework.withgoogle.com/print/guides/5721312655835136/
- Amy C. Edmondson, “The Fearless Organization” overview and related work. https://amycedmondson.com/books/the-fearless-organization/
- Gary P. Pisano, “The Hard Truth About Innovative Cultures,” Harvard Business Review. https://hbr.org/2019/01/the-hard-truth-about-innovative-cultures
- Elizabeth Wolfe Morrison and Frances J. Milliken, “Organizational Silence: A Barrier to Change and Development in a Pluralistic World,” Academy of Management Review. https://www.researchgate.net/publication/258183815_Employee_silence
- Google SRE Book, “Eliminating Toil.” https://sre.google/sre-book/eliminating-toil/
- Google SRE Book, “Postmortem Culture: Learning from Failure.” https://sre.google/sre-book/postmortem-culture/
- NIST, “AI Risk Management Framework.” https://www.nist.gov/itl/ai-risk-management-framework
- Gene Kim, Jez Humble, Patrick Debois, John Willis, and Nicole Forsgren, “The DevOps Handbook, Second Edition.” https://itrevolution.com/product/the-devops-handbook-second-edition/
- Nicole Forsgren, Jez Humble, and Gene Kim, “Accelerate.” https://itrevolution.com/product/accelerate/
- Robert Kegan and Lisa Lahey, “Immunity to Change,” Harvard Business Press. https://www.gse.harvard.edu/ideas/usable-knowledge/09/03/immunity-change
- John D. Sterman, “Learning from Evidence in a Complex World,” American Journal of Public Health. https://pmc.ncbi.nlm.nih.gov/articles/PMC1447696/
Comments
Post a Comment