AI Agents as Managers

Which AI Agents are Valuable in a 100% Automated Company in June 2025?

Last week I posted about how all of this automating I’m doing for this challenge doesn’t feel like AI. The reddit comments after that post agreed with me almost unanimously:

"...agents right now are just scripted chains with some wrappers. it works, but doesn’t feel like they’re making decisions or adapting independently"
"In my experience working with automation setups, it’s still about combining the right tools and APIs, not about having a fully autonomous agent that runs wild on its own. I think for now it’s really about focusing on narrow specific use cases and refining them , maybe the more real AI vibe will come as the tech matures."

AI Agents are currently just scripted chains.

This poses the question; where do we best leverage them? Can AI “Agents” act as managers?

AI Agents are Chained Prompts

Quick Update on AI Directory Experiment

Last week I got the semblance of Agent 1 and 2 (Prospector & Builder) stood up and working.

That’s 50% of the way to having an automated web Directory business:

4 Stages of building an online directory with AI

This week I’m going to spend a lot of time refining the first two flows to get the output up to scratch.

The progress so far is both exciting, and demoralising.

E.g. I love the combo of Business API + Make.com + my Human Queue. It’s so sweet to have entered only a keyword, and having had AI’s gather keywords, generate brand ideas, and even make basic logos; then letting me pick with 1 click:

My Human Queue lets me give highly efficient input to my AI Workflows
My Human Queue lets me give highly efficient input to my AI Workflows

But it really doesn’t feel like AI. It feels like a very slick automation which contains lots of AI calls. When I reply to my AI manager agent prototypes I can feel their hollowness; not to mention their cringe-worthy levels of sycophancy.

AI isnt being very AI

You can read more about this first experiment here, (full rundown on this first proto-swarm soon.)

New here? Welcome! This is the journey of building a 100% automated AI business in 2025. You’re jumping in after we’ve already kicked things off, so you might want to catch up first.

Check out these key posts to get the full story—and don’t forget to subscribe for updates and exclusive perks:

How AI Agents Currently Work

At their core, AI agents are sophisticated orchestrations of prompts; essentially, structured instructions guiding LLMs to perform tasks. While it sometimes seems like these agents are more complex, they are mostly executing well-designed prompt architectures.

The Anatomy of an AI Agent

AI Agents currently seem to operate on one of these:

  1. Simple Prompt Execution: One or two straightforward levels of nested instruction, e.g. System Prompt + User Prompt
  2. Chained Prompts: A sequence where the output of one prompt becomes the input for the next. For instance, extracting key points from a text, then using those points to generate a summary, (automatic chain-of-thought prompting)
  3. Conditional Prompting: Dynamic branching versions of the above, where the next prompt depends on the previous output. For example, if sentiment analysis detects negativity, so the agent might initiate a prompt to draft a response addressing concerns
AI Agents are basically chained conditional prompts

This is likely what’s driving nearly all of the ‘AI Agent’ tools you may have used – to a more or less decent outcome. They’re pretty much just prompts.

Where I’m using AI Agents Currently

The best 2 examples I’ve seen of close-to-expectation “AI Agents” are in Relevance AI & Make. These two AI Agent setups seem most robust for workflow automation, and I could get the agents to do some work with a depth I’ve not seen in other tools, (when they worked – I’m looking at you Relevance).

But as per the Reddit chorus, more often than not, these agents have to stay flat to the floor, or risk falling over.

  • Give them one tool – (e.g. an API to call) and a fairly shallow task (query keywords, return the low-hanging fruit) – they’ll do okay
  • Give them several tools – you’re less likely to be able to get a consistent output
  • Give them handfuls of tools and a complex task – even if you’re methodical with your prompts, you’re likely going to get frustratingly close-to-great answers, then absolute clangers, in no rational sequence

…I digress.

I’m currently using AI agents on Make, to do keyword research and assist with domain prospecting:

Example of an AI Agent which works reliably (and the silly silly prompt that was needed to make it play ball)
  • Keyword research = 1 tool (Google Keyword API) + 1 task (find me the 20 lowest competition keywords as close to the provided keyword as possible)
  • Domain Prospecting = 0 tools + 1 task (invent some madcap-but-audience-appropriate domain name ideas)

… both of which I suppose I could recreate by chaining a few prompts together outside of Make.

Where I’m not using AI Agents

I’m not using AI Agents as managers anymore.

See the following ridiculous Make scenario workflow:

Example of a silly router in Make - because AI Agents are worse
Example of a silly router in Make

This basic workflow outperforms a prompted agent…because where you can use conditional statements it’s still so much more reliable than LLM based agents (which, reflecting historic human nature, are overall inconsistent).

I’m not using AI Agents with 3+ tools anymore

I’ve dabbled with fuller agents but they just don’t seem ready. They time out and do nothing. They blow API budgets in milliseconds. They work randomly which gives you false hope. If you have any success with multi-tool agents, please do comment below and point me in a good direction!

AI Agents are just Chained Prompts - Change my mind

When will AI Agents be Smart Enough to be Managers?

As artificial intelligence transitions from AI hype to AI augmentation to realistic AI Agents I hope to be able to leverage them reliably as manager agents.

I suspect there has been progress on this already this year, and I will re-hunt for it when I hit the need to build a ‘Product manager’ for this directory business experiment.

Certainly as we get more and more capable models, I’d hope that true AI Agents become an easy go-to (alongside some form of biz API and human queue). If they don’t, this challenge is going to be bloody difficult to achieve!

Me wondering if we will have good enough AI for this challenge by the end of the year
Me wondering if we will have good enough AI for this challenge by the end of the year

What’s your take? Are any AI agent frameworks smart enough to bother with yet? How are you using “AI Agents”? Let me know in the comments.

Comments

Leave a Reply

Your email address will not be published. Required fields are marked *