📊 Full opportunity report: The deployment. How the AI labs verticallyintegrated into the serviceslayer — the Palantir modelat scale. on ThorstenMeyerAI.com — validation score, market gap, and execution plan.

TL;DR

In early May 2026, the world’s leading AI labs announced a strategic shift to embed engineers directly into client operations, aiming to dominate the enterprise AI deployment market. This move replicates Palantir’s model, blending software with operational services to deepen client dependency and capture larger revenue streams.

In early May 2026, Anthropic and OpenAI announced major initiatives to embed their engineers directly into enterprise client operations, marking a significant shift in how AI models are deployed at scale. This strategy aims to move beyond model performance, focusing instead on operational integration and dependency, which are seen as the new battleground for enterprise AI dominance.

Within 72 hours, Anthropic revealed a $1.5 billion enterprise-services venture with firms including Blackstone, Hellman & Friedman, and Goldman Sachs, to embed Claude into mid-market companies. Simultaneously, OpenAI announced its $4 billion Deployment Company, ‘DeployCo,’ valued at $10 billion pre-money, with 19 investment partners and immediate acquisition of the consulting firm Tomoro, deploying 150 engineers on day one.

Both labs are adopting a model closely modeled on Palantir’s forward-deployed engineer (FDE) approach, where engineers work directly within client workflows, building and deploying operational AI systems rather than just providing recommendations. This approach aims to embed dependency, create switching costs, and generate recurring revenue tied to ongoing deployment work.

The move reflects a recognition that model quality alone no longer drives enterprise AI success; integration, security, workflow redesign, and change management are now the primary bottlenecks, as evidenced by research indicating 95% of generative AI pilots fail to move beyond experimentation.

Experts note that this vertical integration transforms AI deployment from a consulting service into a product formation process, with embedded engineers acting as both builders and operators, increasing client lock-in and revenue potential. However, it also introduces significant risks related to labor intensity and scalability, raising questions about whether margins will expand or compress as deployment scales.

The Deployment — Thorsten Meyer AI
DEPLOY
● DISPATCH / MAY 2026
THORSTEN MEYER AI · ENTERPRISE REORG · § 03
ENTERPRISE REORG · 03
FDE / DEPLOY
Essay · Deployment-Architecture Forensic · 2026-05-29

The deployment.
How the AI labs vertically
integrated into the services
layer — the Palantir model
at scale.

In seventy-two hours, the two largest labs made the same move: embed engineers inside companies, the way Palantir does — because the model isn’t the bottleneck, deployment is.
Anthropic launched a $1.5B venture with Blackstone, H&F, and Goldman; hours later OpenAI launched its $4B Deployment Company (19 partners, $10B pre-money) and bought Tomoro for 150 forward-deployed engineers. The structure is copied from Palantir “almost line for line” — the engineer flies to the client, learns the workflow, ships software that wraps a model around the problem, and stays until production works. The reason is a ratio: for every $1 on software, companies spend $6 on services. The labs sold the software dollar; the services dollar is six times larger. The structural argument: the labs are vertically integrating into the services layer because the model commoditizes, the services layer is six times larger, and the FDE is not a consulting arm but a product-formation mechanism that converts deployment into uncapped, token-metered, operationally-locked revenue. The risk: the FDE resembles consulting more than software — and whether it scales is the open Palantir question they have all inherited.
72 hrs
Between the two labs making
the identical structural move
$1 : $6
Software dollar vs services dollar ·
the labs had the smaller half
~70%
Anthropic inference margin (from 38%) ·
why the embedded customer is rational
18-20%
Palantir services as % of revenue ·
the unresolved scalability question
THE DEPLOYMENT· ANTHROPIC $1.5B JV · BLACKSTONE / H&F / GOLDMAN· OPENAI DEPLOYCO $4B · $10B PRE-MONEY · 19 PARTNERS· TOMORO ACQUI-HIRE · 150 FDEs DAY ONE· COPIED FROM PALANTIR ALMOST LINE FOR LINE· $1 SOFTWARE : $6 SERVICES· THE MODEL IS NOT THE BOTTLENECK · DEPLOYMENT IS· 95% OF GENAI PILOTS FAIL TO LEAVE PILOT· FDE JOB POSTINGS +800% IN 2025· FDE = PRODUCT FORMATION, NOT SERVICES ARM· OPERATIONAL DEPENDENCY, NOT CONTRACTUAL LOCK-IN· SEAT PRICING → TOKEN PRICING · UNCAPPED CEILING· TOKENS ARE THE NEW COAL · PALANTIR IS THE TRAIN· BULL · PRODUCT FORMATION AT SOFTWARE MARGINS· BEAR · LABOR-BOUND SERVICES AT CONSULTING MARGINS· BECOMING THE CONSULTANTS THEY COMPRESS· THE DEPLOYMENT· ANTHROPIC $1.5B JV · BLACKSTONE / H&F / GOLDMAN· OPENAI DEPLOYCO $4B · $10B PRE-MONEY · 19 PARTNERS· TOMORO ACQUI-HIRE · 150 FDEs DAY ONE· COPIED FROM PALANTIR ALMOST LINE FOR LINE· $1 SOFTWARE : $6 SERVICES· THE MODEL IS NOT THE BOTTLENECK · DEPLOYMENT IS· 95% OF GENAI PILOTS FAIL TO LEAVE PILOT· FDE JOB POSTINGS +800% IN 2025· FDE = PRODUCT FORMATION, NOT SERVICES ARM· OPERATIONAL DEPENDENCY, NOT CONTRACTUAL LOCK-IN· SEAT PRICING → TOKEN PRICING · UNCAPPED CEILING· TOKENS ARE THE NEW COAL · PALANTIR IS THE TRAIN· BULL · PRODUCT FORMATION AT SOFTWARE MARGINS· BEAR · LABOR-BOUND SERVICES AT CONSULTING MARGINS· BECOMING THE CONSULTANTS THEY COMPRESS·
FIG. 01 — THE SIMULTANEOUS MOVE · TWO LABS, ONE STRUCTURE, 72 HOURS
When the two fiercest competitors make the identical move in three days, it is not a bet — it is a recognition
Both read the same constraint and reached the same answer: the model is not enough
Anthropic · May 4
PE-portfolio distribution
$1.5B
  • Blackstone, H&F, Goldman ($300M / $300M / $150M)
  • Apollo, General Atlantic, Leonard Green, GIC, Sequoia
  • Embed Claude in PE portfolio companies — hundreds of mid-market firms
  • Aligned with ~80% enterprise mix
OpenAI · May 11
Acqui-hire and scale
$4B
  • $10B pre-money · 19 partners (TPG, Bain, Advent, Brookfield)
  • Bought Tomoro — 150 FDEs day one (Tesco, Virgin Atlantic, Red Bull)
  • Builds the enterprise depth it lacked
  • ~2.7x the capital of Anthropic’s vehicle
OpenAI did not build the FDE org from scratch — it bought one (Tomoro) to start with 150 engineers already operating, a statement that the deployment work matters enough that building it organically was too slow. When competitors converge this precisely — standalone services entity, embedded engineers, investor-network distribution, FDE model — the move is not a differentiated bet; it is both companies concluding there is only one answer. Both labs are now, in addition to model companies, deployment companies — and they became so in the same week.
FIG. 02 — THE SIX-TO-ONE RATIO · WHY THE SERVICES LAYER IS THE PRIZE
The labs had been competing for one-seventh of the value their own technology unlocks
For every dollar on software, companies spend six on services
$1
Software
(the labs sold this)
$6
Services — implementation, integration, change management
(the deployment move claims this)
The ratio exists because making software work inside a real organization is harder than building it. For enterprise AI, the labs say model performance is no longer the bottleneck — integration, security review, evaluation harnesses, and workflow redesign are. MIT: 95% of GenAI pilots fail to leave the experimental phase. The scarce input is the engineer who understands both the technology and the business — FDE job postings rose 800% in 2025. The labs are reaching past the software dollar they own toward the services dollar they did not, by fielding the engineers who earn it.
FIG. 03 — THE PALANTIR MODEL · THE FDE IS PRODUCT FORMATION, NOT A SERVICES ARM
The most misread point — and the whole bet rests on it
Consultants operate downstream of the contract; FDEs operate upstream of the roadmap
The consultant
Delivers a recommendation — a deck, downstream of the contract. Accountable for the advice, not the outcome.
vs
recommend

build &
own
The forward-deployed engineer
Builds the production system, upstream of the roadmap. Accountable for whether it works. The bespoke build becomes the product.
The FDE is not a revenue-generating services business — it is the product-discovery and product-formation engine. The bespoke systems built inside clients become the patterns generalized into the product. Treating early deployment cost as a permanent margin drag rather than a product-formation investment is the systematic misread that has fooled Palantir’s investors for years. The dependency it creates is operational, not contractual — the system becomes woven into the institution’s operating fabric, a deeper lock than a license. Palantir’s answer to scale: the boot camp (12-18 month sales cycle → 5 days, >75% conversion, >$1M initial deal).
FIG. 04 — THE TOKEN ECONOMICS · WHY THE EMBEDDED CUSTOMER IS UNCAPPED
The FDE acquires an uncapped, token-metered annuity — which is why the high-touch cost is rational
A seat-based customer is capped by headcount; a token-based customer is bounded only by the work the AI does
The old unit · seat-based
Capped by headcount
A developer = a $20/month subscription. Revenue ceiling fixed by the number of seats. The deployment cost could never be justified against it.
The new unit · token-based
Bounded only by the work
That same developer = hundreds-to-thousands/month in tokens, scaling with the value the AI generates. The FDE’s job is to put the AI on more of the work.
Front-loaded deployment cost buys a recurring, expanding, uncapped token annuity — and with Anthropic’s inference margins reported at ~70% (up from 38% a year earlier), a high-margin one. That is what makes the high-touch acquisition cost rational: the labs are not buying a seat-capped subscription; they are buying an uncapped consumption stream and paying an engineer to maximize it. Palantir’s Shyam Sankar: “Tokens are the new coal. Palantir is the train.” The FDE is infrastructure for the token economy.
FIG. 05 — THE SCALABILITY QUESTION · WHAT DECIDES WHETHER IT WORKS
The whole vertically-integrated structure rests on whether the FDE scales — and that is genuinely unresolved
The FDE resembles consulting more than software · Palantir runs services at 18-20% of revenue after years
The bull case
The bear case
Product formation that scales. Token economics + boot-camp standardization make the FDE acquire uncapped, high-margin annuities; margins expand as the platform matures.
Labor-bound services that drag. Standardization lags the customer base; each new client needs proportional FDE hours; margins compress as it scales.
The labs capture the six-to-one services dollar at software margins — becoming something larger than software companies.
The labs run large, capital-intensive services operations at consulting margins — having become the consultants they set out to compress.
The token-economy tailwind (uncapped consumption, ~70% inference margins) genuinely differentiates the labs’ FDE from Palantir’s per-seat-era version — but it offsets the labor-cost question, by an amount not yet measured. Palantir, after years, runs services at 18-20% of revenue and a 50% adjusted operating margin — neither pure software nor pure services. The labs inherit that exact ambiguity, at larger scale and with less operating history. The bet is that the FDE is product formation that scales. The risk is that they have rebuilt consulting and called it product.
The labs have concluded the model is not the product — the deployment is — and moved, in the same week, to own the layer where the model meets the operation. Whether that makes them something larger than software companies or merely rebuilds a labor-bound consulting business at consulting margins is the Palantir question they have all inherited.
Thorsten Meyer · The Deployment · Enterprise Reorg 03

Implications of Embedding Engineers in Enterprise AI Deployment

This strategic shift signifies a move by leading AI labs to control not just the models but the entire deployment process, aiming to dominate the enterprise AI market. By embedding engineers directly into client workflows, they can create operational dependencies, increase switching costs, and capture a larger share of the value chain. This approach could reshape the enterprise software and consulting landscape, potentially leading to a new standard where AI providers are also integral to ongoing operations, not just initial deployment.

Furthermore, the model’s success hinges on whether the embedded engineering approach can be scaled profitably. If margins remain high as deployment grows, these labs could establish a dominant, durable position. Conversely, if labor costs and operational complexity cause margins to shrink, the strategy might face significant challenges, making this a critical inflection point for the industry.

Autonomous AI-Driven Enterprise Software From Development to Deployment

Autonomous AI-Driven Enterprise Software From Development to Deployment

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Background on the Palantir-Inspired Deployment Model

The deployment approach adopted by Anthropic and OpenAI is directly inspired by Palantir’s FDE model, developed over years in defense and intelligence sectors. Palantir’s engineers work closely with clients, building operational systems that become integral to their workflows, creating high switching costs and operational dependencies.

This model contrasts with traditional consulting, where recommendations are separated from implementation. Instead, the embedded engineer is responsible for both building and maintaining the system, fostering ongoing revenue streams. The labs’ adoption of this approach signals a shift from model-centric to deployment-centric enterprise AI strategies.

The move also reflects broader industry recognition that model performance improvements are no longer the primary barrier to AI adoption; integration, security, and workflow redesign are now the key challenges, with research indicating a high failure rate for pilots beyond initial testing phases.

“The labs are applying Palantir’s FDE model to the broader enterprise market, transforming AI deployment into a continuous, embedded process that deepens client dependency and revenue.”

— Thorsten Meyer

NVD RTX PRO 6000 Blackwell Professional Workstation Edition Graphics Card for AI, Design, Simulation, Engineering - 96GB DDR7 ECC Memory - 4th Gen RT/5th Gen Tensor Core GPU - OEM Packaging

NVD RTX PRO 6000 Blackwell Professional Workstation Edition Graphics Card for AI, Design, Simulation, Engineering – 96GB DDR7 ECC Memory – 4th Gen RT/5th Gen Tensor Core GPU – OEM Packaging

[NVIDIA Blackwell Streaming Multiprocessor] The new SM features increased processing throughput, and new neural shaders that integrate neural…

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Unanswered Questions About Scalability and Margins

It remains unclear whether the embedded engineer model can be scaled profitably as deployment grows. The approach resembles consulting, which can be labor-intensive, potentially causing margins to compress instead of expand. The labs’ confidence that this is a product formation strategy rather than a service overhead is still under evaluation, and whether margins will sustain or diminish with scale is an open question.

AI for Project Managers: A Desk Reference & Field Guide: Use Artificial Intelligence to Streamline Workflows, Automate Tasks, and Make Smarter Decisions with Practical Tools and Ethical Insights

AI for Project Managers: A Desk Reference & Field Guide: Use Artificial Intelligence to Streamline Workflows, Automate Tasks, and Make Smarter Decisions with Practical Tools and Ethical Insights

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Next Steps in Enterprise AI Deployment Strategy

In the coming months, the labs are expected to expand their deployment efforts, potentially setting industry standards. Monitoring the financial performance of these embedded-engineer initiatives and their impact on client retention will be critical. Further, industry observers will watch for signs of whether the model can be scaled efficiently or if operational costs will limit growth.

Additionally, regulatory and security considerations around embedded engineers working within client systems may influence the pace and scope of adoption, as well as the development of best practices for managing operational dependencies.

Fortinet FortiWiFi 30G Next-Gen Wireless Firewall and 3 Year Unified Threat Protection License Plus FortiCare Premium | Secure Wi-Fi 6 SD-WAN Network Appliance for SMB Offices (FWF-30G-A-BDL-950-36)

Fortinet FortiWiFi 30G Next-Gen Wireless Firewall and 3 Year Unified Threat Protection License Plus FortiCare Premium | Secure Wi-Fi 6 SD-WAN Network Appliance for SMB Offices (FWF-30G-A-BDL-950-36)

FortiWiFi-30G Hardware plus 3 Year FortiCare Premium and FortiGuard Unified Threat Protection (UTP) (SKU: FWF-30G-A-BDL-950-36)

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Key Questions

Why are AI labs focusing on embedding engineers into client companies?

Labs believe that operational integration is the key to successful enterprise AI deployment. Embedding engineers ensures models are effectively integrated into workflows, creating dependency and recurring revenue opportunities.

What risks are associated with the embedded engineer model?

The main risks include high labor intensity, scalability challenges, and potential margin compression as deployment expands. It resembles consulting work, which can be costly and difficult to standardize.

How does this move compare to traditional consulting?

Unlike traditional consulting, where recommendations are separate from implementation, embedded engineers build and operate the systems, making them responsible for outcomes and ongoing support, leading to higher client lock-in.

Will this strategy lead to higher or lower margins for AI labs?

It is uncertain. If the embedded-engineer approach scales efficiently, margins could expand. However, if operational costs grow faster than revenues, margins may shrink, which is a key risk for this strategy.

Source: ThorstenMeyerAI.com

You May Also Like

AMÁLIA · The Three Hard Questions.

Portugal’s €5.5M AMÁLIA project, a European Portuguese LLM, has delivered a base version. Key questions about openness, native data, and goals remain.

The Twelve Real Complaints About AI Tools in 2026 — A Reddit, Twitter, and GitHub Synthesis

A detailed report on the top twelve user complaints about AI tools in 2026, based on Reddit, Twitter, and GitHub discussions, highlighting real-world issues.

Security Surveillance Drones: Patrolling Properties From Above

Just imagine how security drones can revolutionize property patrols; discover the benefits and considerations of aerial surveillance today.

Building Your Own X-Ray Detector Screen

A researcher has synthesized a new phosphor material for homemade X-ray detector screens, potentially enabling low-cost imaging solutions.