Paper 05

AI as a New Kind of Labour

The dominant vendor framing for agentic AI is no longer a tool you buy but a workforce you manage.

23 verified sources A — The nature of the shift

A compiled, source-verified research digest — every claim cites a downloaded source, every figure is drawn from the data behind it. Not a personal essay.

Abstract

Within roughly a year, the leading enterprise-software vendors re-described agentic AI not as a feature but as labour. Salesforce calls Agentforce a “digital labor platform” for a “limitless workforce”;1 Marc Benioff told Davos that today’s CEOs “will be the last to lead all-human workforces”;2 Microsoft built its 2025 thesis around the “Frontier Firm” and the “agent boss”;910 and NVIDIA’s Jensen Huang predicted that “the IT department of every company is going to be the HR department of AI agents.”34 The investment thesis underneath it — “services-as-software,” reaching for the trillions enterprises spend on salaries and outsourced services rather than the ~$200–300B software market5612 — reframes software as capital that produces labour, uncoupling a firm’s output capacity from its headcount.12 If labour becomes a thing you provision rather than only hire, span of control and the number of management layers stop being inherited constraints and become design choices,1516 which is why a new operating role — the orchestrator, or “agent manager” — is emerging across the vendor and analyst literature.10112122 The deployment reality is far behind the rhetoric: ~13% of organisations have agents in production,20 Gartner expects over 40% of agentic projects to be cancelled by 2027,8 and the cleanest early labour signal — augmentation overtaking automation on Claude.ai17 — argues that, so far, AI is more co-worker than replacement.

The reframing: from software you use to labour you manage

The defining rhetorical move of the 2024–25 enterprise cycle was to stop selling AI as software and start selling it as workers. Salesforce announced Agentforce 2.0 as “the digital labor platform for enterprises, enabling a limitless workforce through AI agents for any department,” and Benioff framed the company as “the leader in digital labor solutions,” letting “any company … build a limitless workforce.”1 A year later, the Agentforce 360 launch generalised the move into a stated vision of the “Agentic Enterprise … where AI doesn’t replace people, it elevates them,” connecting “humans, agents, and data on one trusted platform.”7 The language is deliberate: a workforce, not a product; build and manage, not deploy and use.

The framing is not confined to one vendor. At Davos in January 2025, Benioff put it in generational terms — “Today’s cohort of CEOs will be the last to lead all-human workforces” — and described leaders as soon “managing not only human workers but also digital workers,” while insisting the technology is “supporting employees, not replacing them.”2 Microsoft’s 2025 Work Trend Index introduced the Frontier Firm, an organisation “built around … intelligence on tap and human-agent teams,” and the agent boss: someone who “builds, delegates to and manages agents to amplify their impact.”10 Microsoft frames digital labour for these firms as being as critical to the business model as human labour.10 NVIDIA’s Jensen Huang, in his CES 2025 keynote, gave the management analogy its sharpest form: “In a lot of ways, the IT department of every company is going to be the HR department of AI agents in the future,” with agents that must be recruited, trained on company-specific vocabulary and policy, and supervised.34

Verification note — sourcing the headline quotes

Three flagged claims were checked against primary or near-primary capture. (1) The Salesforce “digital labor platform … limitless workforce” framing and Benioff’s quotes are verbatim from the Agentforce 2.0 release;1 the salesforce.com newsroom returned HTTP 403 to the fetcher, so text was taken from a verbatim reprint and corroborated by search. (2) Huang’s “HR of AI agents” line is anchored by two independent press captures of the CES 2025 keynote (Fortune and The Drum);34 NVIDIA publishes no verbatim transcript of the keynote video, so no transcript was fabricated. (3) A widely-circulated “$3–12T digital labor revolution” figure attributed to Benioff’s Davos remarks does not appear in the Fortune report of that speech and is therefore not cited here.2

Today’s cohort of CEOs will be the last to lead all-human workforces. — Marc Benioff, Salesforce, World Economic Forum (Davos), January 20252

The investment thesis: “services-as-software”

Under the marketing sits a venture thesis that explains why the labour framing is more than branding. Foundation Capital’s Joanne Chen and Jaya Gupta named it a shift from software-as-a-service to service-as-software: where SaaS sells a tool and leaves the customer responsible for the outcome, the new model has the vendor assume outcome responsibility — “Instead of QuickBooks, you offer tax services … conducted by an AI accountant.”5 The size of the prize is the whole point. Foundation Capital frames a $4.6 trillion opportunity — roughly $2.3T of global salaries (sales and marketing, software engineering, security, HR) plus ~$2.3T of outsourced IT and business-process services — and is explicit that “the real prize isn’t the $200B SaaS market: it’s the $4.6T enterprises spend on salaries and services.”56 The unit they keep returning to is the gap between a software vendor’s revenue and the wage bill it could displace: Salesforce earns ~$35B a year against the ~$1.1T companies spend annually on sales and marketing salaries.5

The prize is the wage bill, not the software bill (USD, log scale)$100B$1T$10TSaaS market ~$200BEnterprise software ~$300BSales & mktg salaries $1.1THFS Services-as-Software $1.5T (by 2035)FC Services-as-Software $4.6TWhite-collar labour: “many, many trillions / yr” (a16z)
Figure 1.The services-as-software case rests on an order-of-magnitude gap: software markets in the hundreds of billions sit beneath labour and services pools in the trillions. Bars are positioned on a log axis; the white-collar-labour bar is illustrative (the source gives no single figure).Source: a16z (2024); Foundation Capital (2024–25); HFS Research (2024).
Verification note — who coined “services-as-software,” and which trillion

The term’s origin is contested and the corpus does not collapse it. Foundation Capital popularised the VC thesis and the $4.6T sizing in its April 2024 essay and 2025 retrospective.56 Separately, HFS Research’s Phil Fersht is independently credited with coining “Services-as-Software (SaS)” and frames a distinct ~$1.5T-by-2035 market.23 These are two different theses with different numbers and different originators; this paper attributes each figure only to its own source. The closely related “software becomes labor” articulation is Alex Rampell’s at a16z.12 (The a16z podcast “Software is Eating Labor” was not transcribed — audio only — so the written essay anchors the argument.)

Foundation Capital’s year-one retrospective is useful precisely because it is less triumphant. The firms separating from the hype share three traits: forward-deployed engineers became “one of the most strategic assets,” because “integration is not a post-sale activity. It is the product surface”; the customer “expects to experience functionality, integration, and outcome before a contract is signed”; and pricing is moving from seats toward outcomes.6 The thesis, in other words, only pays off where the vendor can actually deliver an outcome — a caveat that matters when we reach the deployment data.

Software as capital that produces labour

The economic engine of the thesis is a reclassification: software stops being a tool that humans operate and becomes capital that produces labour directly. Rampell’s a16z essay states it as “software becomes labor.” The mechanism is that traditional enterprise software digitised offline processes — filing cabinets into Workday, tickets into Zendesk, ledgers into QuickBooks — but those systems “required human ‘users.’” The breakthrough is that “the ‘users’ of the digitized filing cabinet do not have to be humans.”12 Once the user can be a machine, the relevant market is no longer the software budget but the labour budget — and the comparison is stark: ~$300B of enterprise software a year against a white-collar labour market of “many, many trillions of dollars a year.”12

This breaks the pricing model that built the incumbents. Workday, Intuit, Zendesk and Salesforce are per-seat businesses, and “a business will need fewer (if any!) seats as the system … takes actions on its own.”12 Foundation Capital’s outcome-based-pricing argument is the same coin’s other face: charging for results “creates a much more scalable pricing model” and gives the vendor an incentive to expand usage rather than ration seats.5 Salesforce’s Agentforce skills are already priced per action — Sales Development and Sales Coaching at “$2 per conversation” — which is a per-unit-of-work price, not a per-seat one.1 The labour-economics case is that capital scales in ways skilled labour cannot: “AI will always show up to work, can be trained instantly,” whereas nursing takes years and mortgage brokers could not be conjured when rates dropped in 2021.12

There is a serious academic articulation of this reclassification. Farach models AI as “agent capital” — a distinct production input that “reduces the friction of managing workers, expanding spans of control, compressing hierarchies.”16 Acemoglu, the leading sceptic, also frames AI in capital-vs-labour terms but predicts the gains accrue to capital while labour’s share erodes, with only “modest” aggregate productivity effects (below).14 The reclassification is widely accepted; what it produces is contested.

Uncoupling output capacity from headcount

The operational consequence vendors emphasise is the breaking of the link between how much a firm can produce and how many people it employs. Microsoft’s framing of the capacity problem is the demand-side case: 80% of the global workforce reports too little time or energy for the job, employees are interrupted “every 2 minutes,” and 53% of leaders say productivity simply must increase — so 45% of leaders name “expanding team capacity with digital labor” a top priority.910 The supply-side promise is the vendor outcome data: Salesforce reports its own support agents resolving 83% of customer queries without human intervention, and customer deployments resolving large shares of work autonomously.17

Vendor-reported agent outcomes (% — treat as marketing claims)0%50%100%1-800Accountant — tax-season deflection90%Reddit — response-time reduction84%Salesforce — queries resolved, no human83%OpenTable — inquiries handled, no human70%Adecco — conversations after hours51%Indeed — time-to-hire reduction (goal)50%Reddit — support cases automated46%Engine — call-handling-time reduction15%
Figure 2.Customer-reported outcomes cited in Salesforce releases. Metrics are not directly comparable (deflection, response time, case share and after-hours share are different measures) and are vendor-reported, not independently audited. Engine also reported $2M+ in annual savings.Source: Salesforce, Agentforce 2.0 (2024) and Agentforce 360 (2025).

Two structural signals support genuine uncoupling, both from Microsoft’s data. AI-native startups grew headcount 20.6% year-over-year against 10.6% for Big Tech — but the more telling number is forward-looking: 33% of leaders are considering headcount reductions, even as 78% consider hiring for new AI-specific roles (95% at Frontier Firms).910 Capacity is being decoupled from bodies in both directions at once — fewer of some roles, net-new categories of others. But the honest empirical question is whether output-without-headcount is actually showing up in employment, and here the corpus is careful.

Verification note — the early employment evidence is real but narrow

The strongest empirical signal of labour substitution is concentrated, not economy-wide. Stanford’s “Canaries in the Coal Mine” finds a relative employment decline for workers aged 22–25 in the most AI-exposed occupations over Oct-2022 to Jul-2025, using ADP payroll microdata. This corpus flags an ambiguity in the headline figure: the most-cited number (firm-controlled) is ~13%, while some cuts report 16%, and the software-developer-specific figure is ~20%.13 Crucially, employment for workers 30+ in the same fields “remained stable or continued to grow,” wages were largely stable, and declines concentrated where AI automates rather than augments.13 Anthropic’s independent measure finds a consistent ~14% decline in young-worker hiring into exposed occupations (marginally significant) and no significant unemployment rise for highly exposed workers overall.18 The defensible claim is “an early, age-concentrated effect on entry-level hiring,” not “AI is replacing workers.”

$4.6T
Services-as-software opportunity (salaries + outsourced services)
Foundation Capital (2024–25)
82%
Leaders expecting to use digital labour within 12–18 months
Microsoft WTI (2025)
~13%
Relative employment decline, ages 22–25 in most-exposed jobs (16% in some cuts)
Stanford Digital Economy Lab (2025)
>40%
Agentic AI projects predicted cancelled by end of 2027
Gartner (2025)

Org structure becomes a design choice, not an inherited constraint

If output capacity uncouples from headcount, two pillars of organisation design — the span of control (how many reports one manager can handle) and the number of layers between the front line and the top — stop being fixed by human cognitive limits and become parameters a firm can set. This is where the academic literature is most useful, because it complicates the glib “AI flattens the org” claim in both directions.

Farach’s model is the clearest statement of the design-choice view: because “agent capital reduces the friction of managing workers,” it directly expands spans of control and compresses hierarchies — and, importantly, the same technology forks into different outcomes depending on parameters. Under positive “task-creation elasticity,” AI “expand[s] the frontier of feasible work” rather than only cutting jobs; and an “elite-complementarity” parameter determines whether AI is general infrastructure (broad-based gains) or a tool that “amplifies top managers selectively” (superstar concentration). Structure becomes a chosen regime, and that parameter “becomes a policy lever.”16 The org chart is no longer downstream of human limits; it is a decision with distributional consequences.

Xu, Hou, Chen and Xie sharpen the nuance and puncture two simplifications. First, the much-discussed decline in junior employment “reflects deployment choices favoring automation over augmentation, not an inevitable consequence of GenAI itself” — directly echoing the Stanford finding that effects concentrate where AI automates.1513 Second, span of control does not simply widen: “across all four deployment architectures, the span of control initially contracts before eventually expanding” as the technology improves — because early, fallible AI needs more skilled oversight before it needs less.15 The design space they model is itself a 2×2 — automation vs augmentation, worker- vs expert-level deployment — which is exactly the set of choices a firm now has to make deliberately.

The deployment design space (and what it does to entry-level work)

Worker-level · Automation

Agents do junior tasks end-to-end. Demands more-skilled employees to validate fallible output; associated with the documented decline in junior hiring.15

Worker-level · Augmentation

Agents assist junior staff. Allows relaxed entry requirements — the choice that preserves the early-career ladder.15

Expert-level · Automation

Agents take on expert tasks. Expert-level deployment “uniformly lowers entry-level skill requirements,” broadening access to knowledge work.15

Expert-level · Augmentation

Agents amplify experts. Also lowers entry requirements; span of control “initially contracts before eventually expanding.”15

Junior-employment decline is a deployment choice (automation vs augmentation), not a technological inevitability15

Orchestration and the emerging “agent manager”

The moment a firm runs more than one agent, it inherits a coordination problem that looks exactly like management. IBM’s reference definition is explicit that the dominant solution is a literal hierarchy: in hierarchical orchestration, “a manager agent decomposes a complex goal into sub-tasks and delegates each to a specialist agent. The manager monitors progress, handles failures, and reassembles results into a final output.”22 The other patterns — sequential (each agent passes output to the next) and concurrent (agents work independent sub-goals, merged by an orchestrator) — are the same primitives an operations manager already knows.22 Deloitte describes the enterprise versions as supervisor-agent models, adaptive networks and hybrids, and predicts firms will move along an “autonomy spectrum” — human in-the-loop, on-the-loop, out-of-the-loop — beginning the shift toward human-on-the-loop orchestration in 2026.21

Hierarchical orchestration: a management hierarchy of agentsHuman “agent boss”sets goals · on-the-loopManager / orchestrator agentdecomposes goal · delegates · monitors · reassemblesSpecialist agent ASpecialist agent BSpecialist agent Coutputs reassembled → result
Figure 3.The dominant multi-agent pattern reproduces an org chart: a human on the loop, a manager agent delegating to specialist sub-agents, with monitoring and reassembly. This is why the operating analogy is management, not tooling.Source: IBM, “What is AI Agent Orchestration?” (2025); Deloitte, TMT Predictions 2026 (autonomy spectrum).

The human role this creates is the through-line of the whole vendor literature. Microsoft’s agent boss — “someone who builds, delegates to and manages agents” — is the named version, and it explicitly applies “at any organizational level,” with new job categories already cited: AI Trainer, Data Specialist, Security Specialist, Agent Specialist, ROI Analyst.910 Huang’s “IT as the HR of AI agents” is the same role located at the infrastructure layer — recruiting, onboarding and supervising a digital workforce.34 And Microsoft’s Agent 365 is that HR function turned into a product: a “control plane for AI agents” with a registry providing a “comprehensive inventory of all agents,” unique agent IDs operating under “principle of least privilege,” role-based dashboards, and “unified observability across your entire agent fleet.”11 The vocabulary — fleet, registry, least-privilege, observability — is borrowed equally from HR and from IT, which is precisely Huang’s point.

The scale that makes orchestration a first-order problem rather than a curiosity comes from the market projections: Deloitte sizes the autonomous-agent market at $8.5B in 2026 rising to $35B by 2030 (or $45B “with improved orchestration”), and Microsoft cites an IDC projection of 1.3 billion agents by 2028.2111 A firm managing thousands of agents will need the same things a firm managing thousands of people needs — identity, access control, performance monitoring, and a manager — which is why “agent sprawl” and the registry that contains it are now explicit product categories.11

The reality check: hype, cancellation risk, and “agent washing”

The corpus would be dishonest if it presented the labour framing without the gap between rhetoric and deployment, which is wide. On adoption, the surveys are consistent that production use is the exception: BCG finds only 13% of organisations have deployed agents integrated into workflows, with 56% experimenting and 31% having done nothing; only 33% of workers even claim a clear understanding of what an agent is.20 McKinsey, on a different base, finds 88% of organisations use AI somewhere but only 23% are scaling agents in at least one function, and “nearly two-thirds … have not yet begun scaling AI across the enterprise.”19

Agent adoption is mostly pilots, not production0%50%100%BCG (10,635 employees)1356 experimenting / piloting31 noneMcKinsey (orgs)23 scaling39 experimenting38 not scalingDeloitte readiness (550 leaders)28 matureclaim maturity with automation + agent effortsdeployed / scalingexperimenting / pilotingnone / not scaling
Figure 4.Two surveys on different bases agree on the shape: a small production minority, a large experimenting middle. McKinsey’s “experimenting” segment is the 62% experimenting less the 23% scaling; the bases (employees vs organisations) differ and the bars are not strictly comparable.Source: BCG, AI at Work 2025; McKinsey, State of AI 2025; Deloitte, TMT Predictions 2026.

Gartner supplies the sharpest counterpoint: “More than 40% of agentic AI projects will be canceled by the end of 2027,” on a poll of 3,400+ organisations, because “most agentic AI projects right now are early-stage experiments or proof of concepts that are mostly driven by hype.”8 The firm coined “agent washing” for the rebranding of chatbots and RPA as agents, and estimates that “of the thousands of vendors claiming agentic solutions … only around 130 offer real agentic features.”8 Deloitte independently echoes the >40% cancellation figure.21 The labour framing, in other words, is running well ahead of the deployable technology.

The macroeconomic sceptic completes the picture. Acemoglu’s task-based model predicts AI delivers “no more than a 0.71% increase in total factor productivity over 10 years,” revised “below 0.55%” once hard-to-learn tasks are counted — a direct deflation of multi-trillion-dollar transformation narratives — and warns the capital–labour income gap may widen even so.14 Set against the vendor framing, this is the strongest reminder that “digital labour” is a thesis about the future, not a measured fact about the present.

What the evidence actually supports

Reconciled honestly, the corpus supports a narrower and more interesting claim than the marketing. The reframing is real and near-universal across vendors and investors: agentic AI is being built, priced and governed as labour, with a coherent economic logic — software as capital that produces work, sized against the wage bill rather than the software budget.1512 The organisational implication is also real and is the part most likely to outlast the hype cycle: once managing workers gets cheaper, span of control and layer count become design variables, and a management role for agents — orchestrator, agent boss, “IT as HR” — emerges by necessity rather than fashion.15161022

But the substitution claim — that AI is replacing human labour at scale now — is not what the evidence shows. Production deployment is a minority,2019 a plurality of projects is expected to fail,8 the employment effect so far is concentrated in entry-level hiring rather than broad displacement,1318 and the single cleanest behavioural signal points the other way: on Claude.ai, augmentation has just overtaken automation, 52% to 45%, reversing the August-2025 position — having moved 41% → 42% → 49% → 45% on the automation side over the year.17 For now, the modal interaction is a person working with an agent, not an agent working instead of a person.

Automation vs augmentation share of Claude.ai usage55504540Jan ‘25Mar ‘25Aug ‘25latest4149454752Automation shareAugmentation share (published Aug ‘25 onward)
Figure 5.The cleanest behavioural signal of “labour vs co-worker”: automation share rose through August 2025, then augmentation overtook it (52% vs 45%). Augmentation share is only published for the two most recent points.Source: Anthropic Economic Index (2026).
Open question

The sources disagree on whether the augmentation/automation balance is a durable property of how AI is used or an artefact of current capability and cost. The Anthropic data shows it has already flipped twice and the split is close;17 Xu et al. argue the balance is a deployment choice firms make, not a fixed ratio;15 and Farach’s “regime fork” implies the same technology can resolve either way depending on who benefits.16 Whether “digital labour” ends as augmentation-at-scale or substitution-at-scale is, on this evidence, undetermined — and may be a matter of choice rather than destiny.

References

  1. Salesforce Newsroom (2024). Introducing Agentforce 2.0: The Digital Labor Platform for Building a Limitless Workforce. Salesforce. Accessed 2026-06-16.
  2. Burleigh, E. (2025). Marc Benioff: today’s CEOs will be the last to manage all-human workforces (Davos 2025). Fortune. Accessed 2026-06-16.
  3. Morse, B. (2025). NVIDIA’s Jensen Huang says that IT will ‘become the HR of AI agents’. Fortune via Yahoo Finance (CES keynote, Jan 6 2025). Accessed 2026-06-16.
  4. Young, G. (2025). Nvidia’s Jensen Huang predicts IT will morph into AI HR — and the crowd goes wild. The Drum. Accessed 2026-06-16.
  5. Chen, J., & Gupta, J. (2024). AI leads a service-as-software paradigm shift. Foundation Capital. Accessed 2026-06-16.
  6. Chen, J., & Gupta, J. (2025). The $4.6T Services-as-Software opportunity: Lessons from the first year. Foundation Capital. Accessed 2026-06-16.
  7. Salesforce Newsroom (2025). Welcome to the Agentic Enterprise: With Agentforce 360, Salesforce Elevates Human Potential in the Age of AI. Salesforce. Accessed 2026-06-16.
  8. Gartner (2025). Gartner Predicts Over 40% of Agentic AI Projects Will Be Canceled by End of 2027. Gartner. Accessed 2026-06-16.
  9. Microsoft / Edelman Data x Intelligence (2025). 2025: The Year the Frontier Firm Is Born (Work Trend Index Annual Report). Microsoft WorkLab. Accessed 2026-06-16.
  10. Spataro, J. (2025). The 2025 Annual Work Trend Index: The Frontier Firm is born. Microsoft. Accessed 2026-06-16.
  11. Lamanna, C. (2025). Microsoft Agent 365: The control plane for AI agents. Microsoft 365 Blog. Accessed 2026-06-16.
  12. Rampell, A. (2024). Input Coffee, Output Code: How AI Will Turn Capital into Labor. Andreessen Horowitz. Accessed 2026-06-16.
  13. Brynjolfsson, E., Chandar, B., & Chen, R. (2025). Canaries in the Coal Mine? Six Facts about the Recent Employment Effects of AI. Stanford Digital Economy Lab. Accessed 2026-06-16.
  14. Acemoglu, D. (2024). The Simple Macroeconomics of AI. MIT Shaping the Future of Work / Economic Policy (Oxford). Accessed 2026-06-16.
  15. Xu, F., Hou, J., Chen, W., & Xie, K. (2025). Generative AI and Organizational Structure in the Knowledge Economy. arXiv:2506.00532. Accessed 2026-06-16.
  16. Farach, A. (2026). AI as Coordination-Compressing Capital: Task Reallocation, Organizational Redesign, and the Regime Fork. arXiv:2602.16078v3. Accessed 2026-06-16.
  17. Anthropic (2026). The Anthropic Economic Index report: New building blocks for understanding AI use. Anthropic. Accessed 2026-06-16.
  18. Anthropic (2026). Labor market impacts of AI: A new measure and early evidence. Anthropic. Accessed 2026-06-16.
  19. McKinsey & Company / QuantumBlack (2025). The State of AI in 2025: Agents, innovation, and transformation. McKinsey. Accessed 2026-06-16.
  20. Boston Consulting Group (2025). BCG AI at Work 2025: Momentum Builds, But Gaps Remain (Third Edition). BCG. Accessed 2026-06-16.
  21. Deloitte (2025). Unlocking exponential value with AI agent orchestration (TMT Predictions 2026). Deloitte Insights. Accessed 2026-06-16.
  22. IBM (2025). What is AI Agent Orchestration? IBM Think. Accessed 2026-06-16.
  23. Fersht, P. / HFS Research (2024). Seizing the $1.5 Trillion Services-as-Software Opportunity. HFS Research. Accessed 2026-06-16.