Agentic AI is mostly 'agent washing.' Here's what we build instead.
By Aaron McClendon, Founder & CTO, Arkitekt AI

Every vendor pitch this year sounds the same. Drop in an agent, walk away, watch the work happen. Then you talk to someone who actually deployed one and the story changes fast.
Gartner predicts more than 40% of agentic AI projects will be canceled by the end of 2027, citing escalating costs, unclear business value, and inadequate risk controls. They also flag a problem they call agent washing: rebranding chatbots, RPA scripts, and basic assistants as "agents" because the word sells. Most of what's marketed as agentic isn't.
MIT's NANDA initiative found something similar from the buyer side: 95% of enterprise generative AI pilots are failing to produce measurable P&L impact. The gap between the demo and the production system is wider than most teams budget for.
The demo is the easy part
A demo shows the agent doing the interesting 10% of the work. Production is the other 90%: edge cases, permissions, audit trails, what happens when an upstream system returns a weird value, and how a human notices when something silently went wrong.
theCUBE Research points at the same wall: integration, governance, and orchestration gaps are what keep enterprise agents from running autonomously. None of those are model problems. They're plumbing problems. You don't fix them by switching from one frontier model to another.
What we actually build
When a client comes to us asking for "an agent that does X," we usually end up building something more boring and more useful. A few patterns that keep showing up:
- AI drafts, a human approves. The model writes the first version of the quote, the email, the coverage summary, the categorization. A person clicks approve or edits. Throughput goes up 3–5x. Errors stay rare because the human is still the last step. - AI flags, a human decides. The model reads everything and surfaces the 4 things out of 400 that look off. The human spends their attention where it matters. - Deterministic code does the work, AI handles the messy input. The hard logic is plain old software you can test. The LLM just turns unstructured input (a PDF, an email, a voice note) into structured data the rest of the system can use.
None of this is glamorous. None of it makes a good keynote. But it ships, it stays shipped, and you can explain to your auditor what it does.
The implication
If you're scoping an AI project for next quarter, the question isn't "how autonomous can we make this?" It's "where in this workflow does a human add the most value, and how do we give them better tools?" Start there and you're already ahead of the 40% that will get canceled.
The agents will get better. Eventually they'll handle more of the loop. But betting your operation on that timeline today is how you end up in the cancellation column.
Arkitekt AI builds production-grade custom software on managed infrastructure, delivered autonomously at AI speed. If you're paying for tools that almost fit, let's talk.
Source: “Inside Big Software's fight for its life,” Ashley Stewart, Business Insider, April 7, 2026.