What is 'agent washing' and how do I avoid it?

Agent washing is the practice of rebranding existing products such as chatbots, RPA scripts and AI assistants as 'agents' without real agentic capability. Gartner has warned that of the thousands of vendors claiming to sell agentic AI, only around 130 are genuine. To avoid it, ask whether the system can plan multi-step tasks, choose among tools at runtime, recover from errors and act across systems toward a goal. If it only answers questions or follows a fixed script, it is an assistant or a workflow, not an agent, and you should price it as one.

How does outcome-based pricing for AI agents work?

Outcome-based pricing charges you only when the agent delivers a defined result rather than per seat or per message. Intercom's Fin popularised it at $0.99 per resolution, HubSpot moved to $0.50 per resolved conversation in April 2026, and Zendesk priced committed automated resolutions at $1.50 each. The model aligns cost with value, but the contract definition of a 'resolution' or 'outcome' is everything: read it closely, because a generous definition that counts deflections you did not really want can quietly inflate your bill.

Why do so many AI agent projects get cancelled?

Gartner predicts that over 40% of agentic AI projects will be cancelled by the end of 2027, citing escalating costs, unclear business value and inadequate risk controls. Separately, MIT found that 95% of enterprise generative AI pilots delivered no measurable financial return, with the main barrier being organisational rather than technical: teams could not integrate the tool into real workflows. The lesson for buyers is to start from a specific, measurable process and a clear payback target, not from the technology.

What questions should I ask an AI agent vendor before buying?

Ask what specific outcome the agent is accountable for, how 'success' is defined and billed, what happens when it fails or is uncertain, which of your systems it can read and write, where your data is processed and retained, what audit logs you receive, how you cap spend, and how you exit the contract and export your data and configuration. Insist on a paid pilot with a written success metric before any annual commitment, and treat any reluctance to define failure behaviour as a red flag.

How do I estimate the true cost of an AI agent?

Look beyond the headline price. Total cost of ownership includes the platform or licence fee, usage or outcome charges that scale with volume, integration and onboarding work, ongoing maintenance and prompt tuning, human review of the agent's output, and the cost of failures such as wrong actions or escalations. A $0.99-per-resolution agent can be excellent at high volume and poor at low volume, so model your real monthly volume across at least two pricing tiers before committing.

Should I sign an annual contract for an AI agent right away?

Rarely. Given how fast pricing and capabilities are changing in 2026, start with a short paid pilot scoped to one workflow and a measurable target, then convert to a longer term only after the agent hits that target in production. Favour usage or outcome pricing over large prepaid seat commitments early on, keep an exit and data-export clause, and re-test the market at renewal because per-resolution prices have been falling as competition increases.

How to Buy an AI Agent Without Getting Burned

Q: Is it better to buy an AI agent or build one?

For most teams in 2026 the data favours buying from a specialized vendor. MIT's 'State of AI in Business 2025' study found that purchasing tools from specialized vendors and building partnerships succeeded about 67% of the time, while internal builds succeeded only about a third as often. Building makes sense when the workflow is a genuine competitive differentiator, when your data cannot leave your environment, or when no vendor fits the process. For everything else, buying a proven agent and adapting it around your tools is usually faster, cheaper and more likely to reach production.

Q: Does the EU AI Act affect buying an AI agent?

Yes, if you operate in or sell into the EU. Obligations for general-purpose AI models have applied since 2 August 2025, and the EU AI Office's enforcement powers activate on 2 August 2026. Obligations for high-risk systems were deferred from August 2026 to 2 December 2027 under the provisional Digital Omnibus agreement. As a buyer you are a 'deployer', so you should ask vendors for technical documentation, training-data summaries, logging and human-oversight features before you sign, because their compliance becomes part of yours.

Eighteen months ago, "AI agent" was a research term. Today it is on the pricing page of almost every business tool you already pay for, and the sales pressure to buy one is relentless. Yet the numbers underneath the hype are sobering: Gartner predicts that more than 40% of agentic AI projects will be cancelled by the end of 2027, and an MIT study found that 95% of enterprise generative AI pilots produced no measurable financial return. Buying an agent is no longer the hard part — buying the right one, on terms that will not surprise you, is. This is a buyer's guide for doing exactly that.

Why buying an AI agent got harder, not easier

On paper, the AI-agent market in 2026 looks like a buyer's paradise. There are thousands of vendors, prices are falling, and every demo ends with a flawless automated resolution. The problem is that the demo is rarely the product you receive, and the headline price is rarely the bill you pay. Two pieces of research published over the last year explain why caution is warranted.

First, in June 2025 Gartner predicted that over 40% of agentic AI projects would be scrapped by the end of 2027, citing escalating costs, unclear business value and inadequate risk controls. Gartner also coined a useful term for part of the cause — "agent washing" — and estimated that of the thousands of vendors claiming to offer agentic AI, only around 130 are the real thing. Second, MIT's "State of AI in Business 2025" report, based on 300 deployments and interviews with 150 executives, found that 95% of enterprise generative AI pilots delivered no measurable return. Crucially, MIT concluded the barrier was organisational rather than technical: companies could not integrate the tools into their actual workflows.

Read together, those findings carry a clear message for buyers. The technology mostly works; the purchase is what goes wrong. Teams buy on capability and hype, not on a specific, measurable process, and the project quietly dies when the agent never connects to the work. Before you evaluate a single vendor, write down the one process you want automated, the metric that will prove it worked, and the payback period you need. If you cannot fill in those three blanks, you are not ready to buy yet.

The pricing models you are actually choosing between

The most consequential decision you make is not which vendor you pick but which pricing model you sign up for, because it determines how your bill behaves as you grow. In 2026 the market is mid-migration from per-seat subscriptions toward usage and outcome-based billing, and many vendors now run several models at once. Salesforce Agentforce, for example, simultaneously offers conversation-based pricing, per-action Flex Credits and traditional per-user licensing — and reportedly reached roughly $800 million in annual recurring revenue in fiscal 2026, so this is not a niche experiment. Here is how the main models compare.

Pricing model	You pay for	Best when	Hidden risk
Per seat / licence	Each named user, monthly	Predictability matters and usage is steady	You pay for seats whether or not the agent does work
Usage / per action	Each API call, token or action executed	Volume is spiky or hard to forecast	Costs scale with activity, not with results
Outcome / per resolution	Each defined successful result	The "outcome" is clean and easy to verify	The vendor's definition of "success" controls your bill
Hybrid	A base platform fee plus variable consumption	You want a floor on capability and a ceiling on surprise	Two meters to watch instead of one

Outcome-based pricing is the headline trend of 2026, and the numbers show how fast it is moving. Intercom's Fin agent popularised charging $0.99 per resolution and reached nine-figure revenue doing so. In April 2026 HubSpot undercut that with a $0.50 per resolved conversation price, while Zendesk's outcome-based tier sits at $1.50 per committed automated resolution, or $2.00 pay-as-you-go. The direction is unmistakable: when an agent does measurable work, vendors increasingly bill for the result, and competition is pushing those per-result prices down.

Read the definition of "resolution" before anything else. Outcome pricing only protects you if the outcome is defined the way you would define it. Check exactly what counts as billable: a genuinely solved ticket, or any conversation the agent touched? Are escalations, abandoned chats and "talk to a human" requests excluded? A single generous clause can turn a $0.99 headline into a much larger invoice.

How to spot an "agent" that is really a chatbot

Because the word "agent" now sells, plenty of products wear the label without earning it. The distinction matters commercially: a true agent that plans and acts across your systems is worth outcome pricing, whereas a rebranded chatbot that answers questions on a fixed script is not — and you should refuse to pay agent prices for it. If you are still nailing down the definition, our explainer on what agentic automation actually is draws the line clearly. When you are in a sales call, use these four tests.

Can it plan a multi-step task? A real agent breaks a goal into steps and sequences them itself. A chatbot answers one turn at a time and forgets the objective.
Does it choose among tools at runtime? Agents decide which system or action to use based on the situation. Scripted bots follow a branch you configured in advance.
Can it recover from an error? Ask what happens when an API call fails or data is missing. Agents retry, re-plan or escalate; assistants simply stop or hallucinate.
Does it act, or only answer? A genuine agent writes to your systems — books the meeting, issues the refund, updates the record. If it only returns text, you are buying an assistant.

None of this means assistants and rule-based workflows are bad purchases. They are often the smarter, cheaper choice for well-defined tasks. The point is to pay the right price for the right category, and not to fund an ambitious outcome-pricing contract for software that cannot actually deliver outcomes on its own.

A due-diligence checklist before you sign

Treat an AI-agent purchase like hiring a contractor who will act on your behalf, because that is functionally what it is. The questions below separate vendors who have thought about production from those who have only built a demo. Walk through them in order and write down the answers; the ones a vendor cannot answer cleanly are usually the ones that hurt later.

Accountability: What single outcome is this agent responsible for, and how is success measured and billed?
Failure behaviour: What happens when the agent is uncertain or wrong? Does it stop, escalate to a human, or act anyway?
System access: Exactly which of your tools can it read from and write to, and can you scope those permissions narrowly?
Data handling: Where is your data processed and stored, how long is it retained, and is it used to train shared models?
Auditability: What logs do you receive of every decision, input and action, and can you export them?
Spend controls: Can you cap monthly spend or volume so a runaway agent cannot generate an unbounded bill?
Exit: How do you cancel, and can you export your data, configuration and prompts to leave cleanly?
Proof: Will the vendor agree to a paid pilot with a written success metric before any annual commitment?

If you want a more structured way to assess whether your own organisation is ready for an agent at all, our agentic AI readiness checklist covers the internal preparation that a vendor cannot do for you. The two checklists work best together: one vets the seller, the other vets the buyer.

Build, buy, or adapt: how to decide

Once you know what you need, the next question is whether to buy a finished agent, build your own, or take a middle path and adapt a proven template to your stack. The MIT research offers an unusually direct data point here: buying from specialized vendors and forming partnerships succeeded about 67% of the time, while internal builds succeeded only about a third as often. For most businesses, that tilts the default toward buying — but the decision still depends on the workflow.

Approach	Choose it when	Watch out for
Buy a packaged agent	A vendor already solves your exact process and integrates with your tools	Lock-in, opaque pricing, and limited control over behaviour
Adapt a template or marketplace workflow	You want control and faster time-to-value without starting from scratch	You own the maintenance and the guardrails
Build from scratch	The workflow is a competitive differentiator or your data cannot leave your environment	High failure rate, long timelines, and ongoing upkeep

The middle path is underrated. Buying a ready-made automation and shaping it to your own systems often captures most of the speed of buying with much of the control of building. Our guide to buying a ready-made automation walks through how to evaluate a packaged workflow, what to check before you deploy it, and where adaptation beats a from-scratch build. Whichever route you take, remember MIT's underlying finding: success correlates with integration into real work, not with how the software was sourced.

The compliance question buyers keep skipping

If your business touches the EU, an AI-agent purchase now carries regulatory weight, and the timeline is no longer theoretical. Obligations for general-purpose AI models under the EU AI Act have applied since 2 August 2025, and the EU AI Office's enforcement powers — including fines and information requests — activate on 2 August 2026. The stricter obligations for high-risk systems were pushed back from August 2026 to 2 December 2027 under the provisional Digital Omnibus agreement reached in May 2026, but the direction of travel is fixed.

As a buyer you are usually a "deployer" of the agent, which means the vendor's compliance becomes part of your own. Before you sign, ask whether the provider maintains the technical documentation the Act requires, publishes a summary of training-data content, supports the logging and human-oversight features you will need, and can tell you which risk tier their system falls into. A vendor who treats these questions as a nuisance is telling you something. For a fuller treatment of the controls that keep automated systems defensible, see our guide to automation security and compliance.

One practical rule: never let a bought agent take an irreversible or sensitive action — moving money, deleting records, messaging customers at scale — without a human approval step or a deterministic check. This is both good engineering and, increasingly, a compliance expectation. It also protects you on the days the vendor ships a model update you did not ask for.

Modelling the true cost before you commit

Headline prices for AI agents are designed to look small, and at the right volume they genuinely are. The danger is comparing a $0.50 or $0.99 per-result sticker against your gut feeling instead of against your real numbers. Total cost of ownership includes far more than the per-unit price, and the components below routinely double or triple the figure a buyer first has in mind.

Platform or licence fee: the fixed base you pay before the agent does anything.
Usage or outcome charges: the variable meter that scales with your real volume, not the demo's.
Integration and onboarding: the engineering work to connect the agent to your systems, where MIT says most projects actually fail.
Maintenance and tuning: ongoing prompt adjustments, monitoring and updates as your processes change.
Human review: the staff time to check, approve and correct the agent's output, especially early on.
Cost of failure: the price of wrong actions, bad customer interactions and escalations the agent could not handle.

The practical move is to model your expected monthly volume across at least two pricing tiers and two scenarios — a quiet month and a peak — before you sign anything. A per-resolution agent that is a bargain at 10,000 monthly resolutions can be poor value at 500, and a per-seat licence can flip the other way. If you want a deeper framework for putting numbers on automation decisions, our breakdown of what it costs to automate a business process shows how to build the model and where buyers usually underestimate.

A sane buying process, start to finish

Putting the pieces together, a low-risk way to buy an AI agent in 2026 looks less like a software purchase and more like a controlled experiment. The goal is to reach the point where you have proof, not a hunch, before money and reputation are on the line.

Write down one process, one success metric, and one payback target. If you cannot, stop here.
Shortlist vendors who solve that specific process, and apply the four agent-versus-chatbot tests to each.
Run the due-diligence checklist, paying special attention to failure behaviour, spend caps and exit terms.
Model total cost across realistic volumes and two pricing tiers, not just the headline per-unit price.
Negotiate a short paid pilot scoped to your one process with a written success metric.
Keep a human approval gate on every sensitive or irreversible action during the pilot.
Only convert to an annual term after the agent hits the metric in production — and re-test the market at renewal, because prices keep falling.

This sequence will not guarantee success, but it directly targets the reasons agent purchases fail: vague goals, unverified vendors, runaway costs and pilots that never reach real work. If you are weighing several agent platforms specifically, our side-by-side of AI agent builders compared is a useful companion once you have decided what you actually need.

Buy an automation you can actually verify

Browse vetted, ready-to-run workflows and agents you can pilot on your own stack before committing — with the source and logic in plain view.

See how to buy a ready-made automation

FAQ

Is it better to buy an AI agent or build one?

For most teams, buying from a specialized vendor wins: MIT found vendor purchases and partnerships succeeded about 67% of the time versus roughly a third as often for internal builds. Build only when the workflow is a true differentiator or your data cannot leave your environment.

What is "agent washing"?

It is rebranding chatbots, RPA scripts and assistants as "agents" without real agentic capability. Gartner estimates only around 130 of the thousands of self-described agentic vendors are genuine, so test for planning, tool choice, error recovery and the ability to act before paying agent prices.

How does outcome-based pricing work?

You pay per defined result rather than per seat. Intercom's Fin charges $0.99 per resolution, HubSpot moved to $0.50 per resolved conversation in April 2026, and Zendesk lists $1.50 per committed resolution. The contract definition of a billable "outcome" matters more than the headline number.

Why do so many agent projects get cancelled?

Gartner expects over 40% of agentic AI projects to be scrapped by end of 2027 due to cost, unclear value and weak controls, while MIT found 95% of GenAI pilots returned nothing measurable — mostly because teams never integrated the tool into real workflows.

Does the EU AI Act affect my purchase?

If you operate in the EU, yes. GPAI obligations have applied since August 2025, enforcement powers begin in August 2026, and high-risk obligations were deferred to December 2027. As a deployer, ask vendors for documentation, logging and human-oversight features before signing.

What should I ask a vendor first?

What outcome the agent is accountable for, how success is billed, what it does when it fails, which systems it can touch, where your data goes, what logs you get, how you cap spend, and how you exit. Insist on a paid pilot with a written success metric.

How do I estimate the real cost?

Add platform fees, usage or outcome charges, integration, maintenance, human review and the cost of failures — then model your real monthly volume across two pricing tiers and a quiet-versus-peak scenario before committing.

Should I sign an annual contract straight away?

Rarely. Start with a short paid pilot on one workflow, prefer usage or outcome pricing over large prepaid seat commitments, keep an exit clause, and re-test the market at renewal because per-result prices keep falling.