Worker-level · Automation
Agents do junior tasks end-to-end. Demands more-skilled employees to validate fallible output; associated with the documented decline in junior hiring.15
The dominant vendor framing for agentic AI is no longer a tool you buy but a workforce you manage.
A compiled, source-verified research digest — every claim cites a downloaded source, every figure is drawn from the data behind it. Not a personal essay.
Within roughly a year, the leading enterprise-software vendors re-described agentic AI not as a feature but as labour. Salesforce calls Agentforce a “digital labor platform” for a “limitless workforce”;1 Marc Benioff told Davos that today’s CEOs “will be the last to lead all-human workforces”;2 Microsoft built its 2025 thesis around the “Frontier Firm” and the “agent boss”;910 and NVIDIA’s Jensen Huang predicted that “the IT department of every company is going to be the HR department of AI agents.”34 The investment thesis underneath it — “services-as-software,” reaching for the trillions enterprises spend on salaries and outsourced services rather than the ~$200–300B software market5612 — reframes software as capital that produces labour, uncoupling a firm’s output capacity from its headcount.12 If labour becomes a thing you provision rather than only hire, span of control and the number of management layers stop being inherited constraints and become design choices,1516 which is why a new operating role — the orchestrator, or “agent manager” — is emerging across the vendor and analyst literature.10112122 The deployment reality is far behind the rhetoric: ~13% of organisations have agents in production,20 Gartner expects over 40% of agentic projects to be cancelled by 2027,8 and the cleanest early labour signal — augmentation overtaking automation on Claude.ai17 — argues that, so far, AI is more co-worker than replacement.
The defining rhetorical move of the 2024–25 enterprise cycle was to stop selling AI as software and start selling it as workers. Salesforce announced Agentforce 2.0 as “the digital labor platform for enterprises, enabling a limitless workforce through AI agents for any department,” and Benioff framed the company as “the leader in digital labor solutions,” letting “any company … build a limitless workforce.”1 A year later, the Agentforce 360 launch generalised the move into a stated vision of the “Agentic Enterprise … where AI doesn’t replace people, it elevates them,” connecting “humans, agents, and data on one trusted platform.”7 The language is deliberate: a workforce, not a product; build and manage, not deploy and use.
The framing is not confined to one vendor. At Davos in January 2025, Benioff put it in generational terms — “Today’s cohort of CEOs will be the last to lead all-human workforces” — and described leaders as soon “managing not only human workers but also digital workers,” while insisting the technology is “supporting employees, not replacing them.”2 Microsoft’s 2025 Work Trend Index introduced the Frontier Firm, an organisation “built around … intelligence on tap and human-agent teams,” and the agent boss: someone who “builds, delegates to and manages agents to amplify their impact.”10 Microsoft frames digital labour for these firms as being as critical to the business model as human labour.10 NVIDIA’s Jensen Huang, in his CES 2025 keynote, gave the management analogy its sharpest form: “In a lot of ways, the IT department of every company is going to be the HR department of AI agents in the future,” with agents that must be recruited, trained on company-specific vocabulary and policy, and supervised.34
Three flagged claims were checked against primary or near-primary capture. (1) The Salesforce “digital labor platform … limitless workforce” framing and Benioff’s quotes are verbatim from the Agentforce 2.0 release;1 the salesforce.com newsroom returned HTTP 403 to the fetcher, so text was taken from a verbatim reprint and corroborated by search. (2) Huang’s “HR of AI agents” line is anchored by two independent press captures of the CES 2025 keynote (Fortune and The Drum);34 NVIDIA publishes no verbatim transcript of the keynote video, so no transcript was fabricated. (3) A widely-circulated “$3–12T digital labor revolution” figure attributed to Benioff’s Davos remarks does not appear in the Fortune report of that speech and is therefore not cited here.2
Today’s cohort of CEOs will be the last to lead all-human workforces. — Marc Benioff, Salesforce, World Economic Forum (Davos), January 20252
Under the marketing sits a venture thesis that explains why the labour framing is more than branding. Foundation Capital’s Joanne Chen and Jaya Gupta named it a shift from software-as-a-service to service-as-software: where SaaS sells a tool and leaves the customer responsible for the outcome, the new model has the vendor assume outcome responsibility — “Instead of QuickBooks, you offer tax services … conducted by an AI accountant.”5 The size of the prize is the whole point. Foundation Capital frames a $4.6 trillion opportunity — roughly $2.3T of global salaries (sales and marketing, software engineering, security, HR) plus ~$2.3T of outsourced IT and business-process services — and is explicit that “the real prize isn’t the $200B SaaS market: it’s the $4.6T enterprises spend on salaries and services.”56 The unit they keep returning to is the gap between a software vendor’s revenue and the wage bill it could displace: Salesforce earns ~$35B a year against the ~$1.1T companies spend annually on sales and marketing salaries.5
The term’s origin is contested and the corpus does not collapse it. Foundation Capital popularised the VC thesis and the $4.6T sizing in its April 2024 essay and 2025 retrospective.56 Separately, HFS Research’s Phil Fersht is independently credited with coining “Services-as-Software (SaS)” and frames a distinct ~$1.5T-by-2035 market.23 These are two different theses with different numbers and different originators; this paper attributes each figure only to its own source. The closely related “software becomes labor” articulation is Alex Rampell’s at a16z.12 (The a16z podcast “Software is Eating Labor” was not transcribed — audio only — so the written essay anchors the argument.)
Foundation Capital’s year-one retrospective is useful precisely because it is less triumphant. The firms separating from the hype share three traits: forward-deployed engineers became “one of the most strategic assets,” because “integration is not a post-sale activity. It is the product surface”; the customer “expects to experience functionality, integration, and outcome before a contract is signed”; and pricing is moving from seats toward outcomes.6 The thesis, in other words, only pays off where the vendor can actually deliver an outcome — a caveat that matters when we reach the deployment data.
The economic engine of the thesis is a reclassification: software stops being a tool that humans operate and becomes capital that produces labour directly. Rampell’s a16z essay states it as “software becomes labor.” The mechanism is that traditional enterprise software digitised offline processes — filing cabinets into Workday, tickets into Zendesk, ledgers into QuickBooks — but those systems “required human ‘users.’” The breakthrough is that “the ‘users’ of the digitized filing cabinet do not have to be humans.”12 Once the user can be a machine, the relevant market is no longer the software budget but the labour budget — and the comparison is stark: ~$300B of enterprise software a year against a white-collar labour market of “many, many trillions of dollars a year.”12
This breaks the pricing model that built the incumbents. Workday, Intuit, Zendesk and Salesforce are per-seat businesses, and “a business will need fewer (if any!) seats as the system … takes actions on its own.”12 Foundation Capital’s outcome-based-pricing argument is the same coin’s other face: charging for results “creates a much more scalable pricing model” and gives the vendor an incentive to expand usage rather than ration seats.5 Salesforce’s Agentforce skills are already priced per action — Sales Development and Sales Coaching at “$2 per conversation” — which is a per-unit-of-work price, not a per-seat one.1 The labour-economics case is that capital scales in ways skilled labour cannot: “AI will always show up to work, can be trained instantly,” whereas nursing takes years and mortgage brokers could not be conjured when rates dropped in 2021.12
There is a serious academic articulation of this reclassification. Farach models AI as “agent capital” — a distinct production input that “reduces the friction of managing workers, expanding spans of control, compressing hierarchies.”16 Acemoglu, the leading sceptic, also frames AI in capital-vs-labour terms but predicts the gains accrue to capital while labour’s share erodes, with only “modest” aggregate productivity effects (below).14 The reclassification is widely accepted; what it produces is contested.
The operational consequence vendors emphasise is the breaking of the link between how much a firm can produce and how many people it employs. Microsoft’s framing of the capacity problem is the demand-side case: 80% of the global workforce reports too little time or energy for the job, employees are interrupted “every 2 minutes,” and 53% of leaders say productivity simply must increase — so 45% of leaders name “expanding team capacity with digital labor” a top priority.910 The supply-side promise is the vendor outcome data: Salesforce reports its own support agents resolving 83% of customer queries without human intervention, and customer deployments resolving large shares of work autonomously.17
Two structural signals support genuine uncoupling, both from Microsoft’s data. AI-native startups grew headcount 20.6% year-over-year against 10.6% for Big Tech — but the more telling number is forward-looking: 33% of leaders are considering headcount reductions, even as 78% consider hiring for new AI-specific roles (95% at Frontier Firms).910 Capacity is being decoupled from bodies in both directions at once — fewer of some roles, net-new categories of others. But the honest empirical question is whether output-without-headcount is actually showing up in employment, and here the corpus is careful.
The strongest empirical signal of labour substitution is concentrated, not economy-wide. Stanford’s “Canaries in the Coal Mine” finds a relative employment decline for workers aged 22–25 in the most AI-exposed occupations over Oct-2022 to Jul-2025, using ADP payroll microdata. This corpus flags an ambiguity in the headline figure: the most-cited number (firm-controlled) is ~13%, while some cuts report 16%, and the software-developer-specific figure is ~20%.13 Crucially, employment for workers 30+ in the same fields “remained stable or continued to grow,” wages were largely stable, and declines concentrated where AI automates rather than augments.13 Anthropic’s independent measure finds a consistent ~14% decline in young-worker hiring into exposed occupations (marginally significant) and no significant unemployment rise for highly exposed workers overall.18 The defensible claim is “an early, age-concentrated effect on entry-level hiring,” not “AI is replacing workers.”
If output capacity uncouples from headcount, two pillars of organisation design — the span of control (how many reports one manager can handle) and the number of layers between the front line and the top — stop being fixed by human cognitive limits and become parameters a firm can set. This is where the academic literature is most useful, because it complicates the glib “AI flattens the org” claim in both directions.
Farach’s model is the clearest statement of the design-choice view: because “agent capital reduces the friction of managing workers,” it directly expands spans of control and compresses hierarchies — and, importantly, the same technology forks into different outcomes depending on parameters. Under positive “task-creation elasticity,” AI “expand[s] the frontier of feasible work” rather than only cutting jobs; and an “elite-complementarity” parameter determines whether AI is general infrastructure (broad-based gains) or a tool that “amplifies top managers selectively” (superstar concentration). Structure becomes a chosen regime, and that parameter “becomes a policy lever.”16 The org chart is no longer downstream of human limits; it is a decision with distributional consequences.
Xu, Hou, Chen and Xie sharpen the nuance and puncture two simplifications. First, the much-discussed decline in junior employment “reflects deployment choices favoring automation over augmentation, not an inevitable consequence of GenAI itself” — directly echoing the Stanford finding that effects concentrate where AI automates.1513 Second, span of control does not simply widen: “across all four deployment architectures, the span of control initially contracts before eventually expanding” as the technology improves — because early, fallible AI needs more skilled oversight before it needs less.15 The design space they model is itself a 2×2 — automation vs augmentation, worker- vs expert-level deployment — which is exactly the set of choices a firm now has to make deliberately.
Agents do junior tasks end-to-end. Demands more-skilled employees to validate fallible output; associated with the documented decline in junior hiring.15
Agents assist junior staff. Allows relaxed entry requirements — the choice that preserves the early-career ladder.15
Agents take on expert tasks. Expert-level deployment “uniformly lowers entry-level skill requirements,” broadening access to knowledge work.15
Agents amplify experts. Also lowers entry requirements; span of control “initially contracts before eventually expanding.”15
The moment a firm runs more than one agent, it inherits a coordination problem that looks exactly like management. IBM’s reference definition is explicit that the dominant solution is a literal hierarchy: in hierarchical orchestration, “a manager agent decomposes a complex goal into sub-tasks and delegates each to a specialist agent. The manager monitors progress, handles failures, and reassembles results into a final output.”22 The other patterns — sequential (each agent passes output to the next) and concurrent (agents work independent sub-goals, merged by an orchestrator) — are the same primitives an operations manager already knows.22 Deloitte describes the enterprise versions as supervisor-agent models, adaptive networks and hybrids, and predicts firms will move along an “autonomy spectrum” — human in-the-loop, on-the-loop, out-of-the-loop — beginning the shift toward human-on-the-loop orchestration in 2026.21
The human role this creates is the through-line of the whole vendor literature. Microsoft’s agent boss — “someone who builds, delegates to and manages agents” — is the named version, and it explicitly applies “at any organizational level,” with new job categories already cited: AI Trainer, Data Specialist, Security Specialist, Agent Specialist, ROI Analyst.910 Huang’s “IT as the HR of AI agents” is the same role located at the infrastructure layer — recruiting, onboarding and supervising a digital workforce.34 And Microsoft’s Agent 365 is that HR function turned into a product: a “control plane for AI agents” with a registry providing a “comprehensive inventory of all agents,” unique agent IDs operating under “principle of least privilege,” role-based dashboards, and “unified observability across your entire agent fleet.”11 The vocabulary — fleet, registry, least-privilege, observability — is borrowed equally from HR and from IT, which is precisely Huang’s point.
The scale that makes orchestration a first-order problem rather than a curiosity comes from the market projections: Deloitte sizes the autonomous-agent market at $8.5B in 2026 rising to $35B by 2030 (or $45B “with improved orchestration”), and Microsoft cites an IDC projection of 1.3 billion agents by 2028.2111 A firm managing thousands of agents will need the same things a firm managing thousands of people needs — identity, access control, performance monitoring, and a manager — which is why “agent sprawl” and the registry that contains it are now explicit product categories.11
The corpus would be dishonest if it presented the labour framing without the gap between rhetoric and deployment, which is wide. On adoption, the surveys are consistent that production use is the exception: BCG finds only 13% of organisations have deployed agents integrated into workflows, with 56% experimenting and 31% having done nothing; only 33% of workers even claim a clear understanding of what an agent is.20 McKinsey, on a different base, finds 88% of organisations use AI somewhere but only 23% are scaling agents in at least one function, and “nearly two-thirds … have not yet begun scaling AI across the enterprise.”19
Gartner supplies the sharpest counterpoint: “More than 40% of agentic AI projects will be canceled by the end of 2027,” on a poll of 3,400+ organisations, because “most agentic AI projects right now are early-stage experiments or proof of concepts that are mostly driven by hype.”8 The firm coined “agent washing” for the rebranding of chatbots and RPA as agents, and estimates that “of the thousands of vendors claiming agentic solutions … only around 130 offer real agentic features.”8 Deloitte independently echoes the >40% cancellation figure.21 The labour framing, in other words, is running well ahead of the deployable technology.
The macroeconomic sceptic completes the picture. Acemoglu’s task-based model predicts AI delivers “no more than a 0.71% increase in total factor productivity over 10 years,” revised “below 0.55%” once hard-to-learn tasks are counted — a direct deflation of multi-trillion-dollar transformation narratives — and warns the capital–labour income gap may widen even so.14 Set against the vendor framing, this is the strongest reminder that “digital labour” is a thesis about the future, not a measured fact about the present.
Reconciled honestly, the corpus supports a narrower and more interesting claim than the marketing. The reframing is real and near-universal across vendors and investors: agentic AI is being built, priced and governed as labour, with a coherent economic logic — software as capital that produces work, sized against the wage bill rather than the software budget.1512 The organisational implication is also real and is the part most likely to outlast the hype cycle: once managing workers gets cheaper, span of control and layer count become design variables, and a management role for agents — orchestrator, agent boss, “IT as HR” — emerges by necessity rather than fashion.15161022
But the substitution claim — that AI is replacing human labour at scale now — is not what the evidence shows. Production deployment is a minority,2019 a plurality of projects is expected to fail,8 the employment effect so far is concentrated in entry-level hiring rather than broad displacement,1318 and the single cleanest behavioural signal points the other way: on Claude.ai, augmentation has just overtaken automation, 52% to 45%, reversing the August-2025 position — having moved 41% → 42% → 49% → 45% on the automation side over the year.17 For now, the modal interaction is a person working with an agent, not an agent working instead of a person.
The sources disagree on whether the augmentation/automation balance is a durable property of how AI is used or an artefact of current capability and cost. The Anthropic data shows it has already flipped twice and the split is close;17 Xu et al. argue the balance is a deployment choice firms make, not a fixed ratio;15 and Farach’s “regime fork” implies the same technology can resolve either way depending on who benefits.16 Whether “digital labour” ends as augmentation-at-scale or substitution-at-scale is, on this evidence, undetermined — and may be a matter of choice rather than destiny.