Since 1994

Choose the right AI tool.

Use it right.

STRIVE Forum · Part 2 · Myles Bai

*Tonight is a workshop,

not a lecture.*


Two short concept blocks.
Two hands-on labs.
One take-home checklist.


Bring one real task from your business to test against.

REX BLACK · STRIVE FORUM · PART 2
Plain English

What we're answering tonight.

  • Which AI tool do I pick for which task?
  • Why does the same prompt give me four different answers from four different tools?
  • How do I tell if an output is good enough for what I need?
  • When is it worth paying ten times more for a smarter model?

Who this is for

Solo founders and small teams choosing between ChatGPT, Claude, Gemini, Copilot, and the dozen other AI tools landing in your inbox every week, without an engineering team to tell you which one to use when.

REX BLACK · STRIVE FORUM · PART 2
Sixty-second recap of Part 1

Green. Yellow. Red.

The six flaws that never go away

  1. Confidently wrong
  2. Drift
  3. No memory, no paper trail
  4. Cost scales
  5. Doesn't know your business
  6. Privacy is your job

Green, Yellow, Red

Green · You can catch all six. Go.
Yellow · Gaps in one or two. Add a habit, then go.
Red · Too many gaps. Wait.


If you missed Part 1, the deck is at rexblack.com/resources/talks/strive-apr-2026.

REX BLACK · STRIVE FORUM · PART 2
The question we're answering tonight

Same task. Five tools.

Which one do I pick?


A clean answer means less money wasted, better output, and fewer surprises at the end of the month.

REX BLACK · STRIVE FORUM · PART 2
Part 1 of 2 · Concepts

The four questions

every tool has to answer.

Capability

Is it smart enough for this task?

Speed

Is it fast enough for how I'll use it?

Cost

Is the price reasonable for the value?

Trust

Can I rely on it, or do I have to re-check every output?


Every AI tool is a tradeoff across these four. Nothing is best on all of them at once.

REX BLACK · STRIVE FORUM · PART 2
The three tiers, in plain English

Three buckets. Pick the cheapest one that works.

Tier When to use it Real tools today Roughly
Quick & cheap Drafts, summaries, FAQs, routine email, simple data cleanup, social posts. Things where a bad answer costs you nothing. ChatGPT (free / 4o-mini), Claude Haiku, Gemini Flash Cents per task
Stronger thinker Strategy, complex analysis, proposal writing, business planning, "help me think through this." ChatGPT Plus (4o / 5), Claude Sonnet, Gemini Pro A few cents to a quarter per task
Top-tier or verified Customer-facing, revenue-impacting, compliance-sensitive, legal review, anything that goes out under your name. Claude Opus, GPT-5 with retrieval, specialist tools (Harvey, Casetext), or human-checked output Dollars per task

The rule: start with the cheapest tier that does the job. Move up only when the cheap tier visibly fails on your task.

REX BLACK · STRIVE FORUM · PART 2

30x

Cost difference between the cheapest model and the most expensive, on the same task, with the same prompt.
REX BLACK · STRIVE FORUM · PART 2
Side by side · An example before you try it yourself

Same prompt. Different output.

Prompt: "Write a follow-up email to a prospect who went quiet for two weeks."

Quick & cheap

"Hi {Name}, following up on our conversation from two weeks ago. Wanted to see if you had any questions. Let me know!"

Generic. Fine. Sounds like AI wrote it.

Stronger thinker

"{Name}, you mentioned {concern} last time. I came across {data point} this week, made me think of you. Worth a 15-minute call, or want the one-page version?"

Specific. References your earlier note. Two ways to say yes.


Same prompt, same word count. Completely different output. Worth the extra cost on your task?

REX BLACK · STRIVE FORUM · PART 2
Lab 1 of 2 · Hands-on, ~15 minutes

Compare the models yourself.


rexblack.com/labs/compare-models

Pick a task from your business. Run the same prompt against three models. Look at the cost, the time, and the output side by side.

REX BLACK · STRIVE FORUM · PART 2
Lab 1 debrief · Discussion prompts

What did you see?

Talk us through

  • Which tools did you compare?
  • What was the cheapest that did your task acceptably?
  • Where did the expensive tool actually add value?
  • Where did the cheap tool quietly fail?

Patterns we usually see

  • For drafts, summaries, and routine writing, the cheap tier is good enough 80 percent of the time.
  • For strategy, analysis, and anything with judgment, the stronger tier is noticeably better and worth the 5x to 10x cost.
  • For revenue-impacting output, you almost always need human review, regardless of tier.

"Good enough" is not the same as "best." For most small-business tasks, good enough is the right target.

REX BLACK · STRIVE FORUM · PART 2
Part 2 of 2 · The grading rubric

Four questions

for every AI output.


  1. Useful · Does it actually do my specific task? Not "in general." For this task.
  2. Fast enough · Can I get the answer in the time I have? Two seconds, two minutes, two hours, they're all valid, but I have to know which.
  3. Worth it · Is the cost reasonable for what I get back?
  4. Trustworthy · Can I send this output without checking, or do I need to read every line first?

The fourth one is the most important and the most ignored.

REX BLACK · STRIVE FORUM · PART 2
The rubric applied · Three worked examples

Three real tasks. Graded.

Task Useful? Fast enough? Worth it? Trustworthy? Recommendation
Draft a weekly customer newsletter Yes, cheap tier Yes, seconds Yes, cents You read it before sending Green · Quick & cheap tier. Five-second review.
Write a $50K proposal for a new client Yes, stronger tier needed Yes, minutes Yes, a quarter Heavy edit pass required Yellow · Stronger thinker. Two-hour review. Use as a structured first draft, not final.
Auto-reply to a customer complaint Risky across all tiers Yes, seconds Yes, cents No, not without review Red until reworked · Don't auto-send. Use it to draft a reply for you, then send manually.

Same rubric. Three different answers. Each one tells you something specific to do.

REX BLACK · STRIVE FORUM · PART 2
Real-world examples by department

Where each tier actually shows up in a small business.

Quick & cheap, the workhorse

  • Routine customer email drafts
  • Meeting and call summaries
  • Internal SOP first drafts
  • Cleaning up notes or transcripts
  • Naming an offer or product
  • Reformatting data
  • "Translate this into plain English for me"

Stronger thinker, the analyst

  • Proposal writing
  • Strategy and positioning
  • Pricing exercises
  • "What questions am I not asking?"
  • Business plan sections
  • Competitive analysis
  • Job description writing

Top-tier or verified, the specialist

Customer-facing chat or voice agents · contract or compliance review · medical / legal / financial content · anything that goes out under your name without your eyes on it first. This tier needs engineering or verification you don't have yet. That's fine. Phase 2.

REX BLACK · STRIVE FORUM · PART 2
Lab 2 of 2 · Hands-on, ~8 minutes

What is an AI agent?


rexblack.com/labs/agent-anatomy

Click through the six layers that make up an agent. See what each one does, what breaks without it, and which tools cover which piece.

REX BLACK · STRIVE FORUM · PART 2
Lab 2 debrief · What you should now be able to tell

"Is this an agent, or is it a clever workflow?"

A workflow

The steps are pre-written. The AI fills in one or two of them. Predictable. Cheap to run. Easy to debug. Most "AI tools" you'll buy are this.

An agent

The AI itself decides what to do next, using tools it picks. Powerful. Expensive. Harder to predict. Worth it only when the workflow can't handle the variability.


For most small-business tasks, a workflow with one good prompt is the right answer, not an agent.

REX BLACK · STRIVE FORUM · PART 2
Take it home

The decision checklist.


Tape this to your monitor. Run it against every task you're tempted to automate.

  1. Color it. Green, Yellow, or Red? (Part 1 framework.) Stop here if Red.
  2. Pick the tier. Quick & cheap, stronger thinker, or top-tier verified?
  3. Grade the output. Useful? Fast enough? Worth it? Trustworthy?
  4. Pick the review step. Auto-send, quick read, heavy edit, or human-only?
  5. Set a spend cap. Day one. In the tool's billing settings. Don't skip this.
  6. Two-week tally. How often does the draft go out clean? Where does it fail?
  7. Reassess monthly. New model dropped? Free tier got smarter? Cheaper tier may now beat the one you're paying for.

If you can do steps 1 to 4 in under sixty seconds, you've internalized the framework. If you can't, run it again on the next task until you can.

REX BLACK · STRIVE FORUM · PART 2
Share-out · Two minutes each

One task. This week.


Pick one task from your business you're going to run through the checklist this week.

  • What is the task?
  • What tier did you decide on?
  • What's the human review step?
  • What does success look like at two weeks?

We'll go around. Two minutes a person. Not for show, this is how the framework gets sticky.

REX BLACK · STRIVE FORUM · PART 2
Remember

Three things from tonight.

One

Every AI tool trades capability, speed, cost, and trust. Nothing is best on all four. Pick the cheapest tier that works.

Two

Run every output through four questions. Useful, fast enough, worth it, trustworthy. The fourth one is the one most people skip.


Three

Most "AI agents" you'll be pitched are workflows with one prompt. They're fine, they're just not magic, and they shouldn't cost agent-level prices. Ask why a workflow isn't enough before you say yes.

REX BLACK · STRIVE FORUM · PART 2
Keep going

Resources we built for you.

Hands-on labs

  • Compare 13 models side by side · rexblack.com/labs/compare-models
  • What is an AI agent? · rexblack.com/labs/agent-anatomy

This talk + Part 1

  • Tonight's talk · rexblack.com/resources/talks/strive-may-2026
  • Part 1, the six flaws · rexblack.com/resources/talks/strive-apr-2026

The article behind the framework

  • Choosing the right model (and knowing when to switch) · rexblack.com/resources/writing/ai-model-selection-framework
  • Workflow or agent? · rexblack.com/resources/writing/workflow-or-agent
  • Starting AI adoption (six stages) · rexblack.com/resources/writing/ai-adoption-sequence

Real production examples

  • AI-powered services intake · rexblack.com/case-studies/ai-veteran-intake
REX BLACK · STRIVE FORUM · PART 2
Since 1994

Thank you.

Myles Bai · Rex Black, Inc. · rexblack.com

Open warmly. "Welcome back. April was the conceptual frame, the six flaws, Green / Yellow / Red. Tonight is the hands-on lab. You're going to leave with a four-question rubric and a tool tier you can apply to any task in your business this week." Beat. "We're going to talk for about fifteen minutes, then you're going to use a lab, then we'll talk about what you saw. Twice. That's the shape of the night."

Set expectations. This is not a sit-and-listen session. You will be on your phone or laptop running real prompts at two points tonight. Pick one repetitive task in your business now, drafting a customer email, summarizing a meeting, writing a job description, naming an offer, and hold it in your head. We'll keep coming back to it.

Orient the room. Four questions everyone has had at some point in the last sixty days. By the end of the hour and a half, you should have an answer to all four. The "who this is for" line is for the new faces in the room: this is a small business AI talk, not a developer talk.

Sixty-second recap, do not relitigate. This is the trellis for tonight. Once you know whether a task is Green, Yellow, or Red, the next question is, "of the AI tools out there, which one do I actually use for this Green task?" That's tonight.

Reframe. Last month was "should I use AI for this at all." Tonight is "given that I should, which one." That's the question on every founder's screen right now and the answer is not "the most famous one." It's the one that matches the four things we're about to walk through.

This is the heart of the talk. The technical version of this list is "capability, latency, cost, reliability." We're using the plain-English version because that's how a small business owner actually thinks about it. The key insight, and the one that costs money when people miss it, is that no tool wins on all four. The ones that are "smart" are slow and expensive. The ones that are "fast and cheap" make more mistakes. You're always trading.

This is the Benjamin framework, made concrete. Three tiers. Real product names so people leave the room knowing what to actually open tomorrow. The pricing column is the part that surprises people. The difference between tier one and tier three is roughly 30 to 100 times in cost per task. That's not a rounding error. If you default to the fanciest tool for everything, you're spending 50 to 100 dollars where you should be spending one.

Land it. "Same task. Same prompt. The cheapest tool costs about a thirtieth of what the most expensive one costs. That doesn't mean cheap is always better, sometimes you need the expensive one. It means *defaulting* to the expensive one for every task is the single most common waste of money in small-business AI adoption."

This is James's note made literal. Side-by-side on a real task they all do every day. The point is not "the expensive one wins." The point is "the expensive one wins on this particular kind of task." On a different task, say cleaning up a list of email addresses, the cheap one wins because there's no nuance to add. Setting up the lab.

Walk them to the URL. Hold the slide up. "Open this on your phone or your laptop. The lab lets you pick up to three models from a list of about fifteen, paste a prompt once, and see all three answers next to each other along with what each one cost and how long each one took. Take fifteen minutes. Use a real task. We'll regroup at the bottom of the hour and you'll tell me what surprised you." Stay on this slide while they work. Mute yourself. Pop back on at the ten-minute mark to give a two-minute warning.

Open mic. Get three or four people to share what they tried and what they saw. Listen for the moment someone says "I was paying for X but the free version did the same thing." Underline it. That's the highest-leverage realization in this entire session.

This is Benjamin's rubric, in the form a founder can actually use. Four yes/no questions. If you can answer "yes, yes, yes, yes" to a tool on a specific task, you've got a green light to automate that task with that tool. If any of them is a "no," you either pick a different tier, change your workflow to add a human review step, or skip the automation for that task.

This is the moment Benjamin asked for. The four-axis framework, ground to a useful business example. Walk through each row slowly. The point is not "AI does this for me." The point is "the rubric tells me how much trust I extend, and where I add a human." That's what's missing in 90% of what they're being sold by vendors right now.

Reinforce the tiers with real departments. Founders see themselves in this list. The biggest payoff is the left column, the "workhorse" tasks where the cheap tier gets you most of the way and the only review needed is a quick read-through. That's where you'll get the most hours back per dollar.

Shorter lab. This one is more "click through and read" than "type a prompt." The reason it's worth eight minutes is that "agent" is the most overused word in this space right now and most of what's being sold as an agent is actually a workflow with one AI call in it. The lab gives them the vocabulary to push back when a vendor pitches them an "agent" for $5K a month that should cost $50.

The takeaway from the agent lab. Most of what your vendors are selling you as "AI agents" do not need to be agents. Workflows are cheaper, more predictable, and easier to operate. When someone pitches you an "agent" for a task that has a clear five-step structure, ask why a workflow isn't enough. Save yourself thousands.

The take-home. Read it slowly. This is the deliverable from tonight. Every person in the room should be able to run this list on a real task by the end of the week. The reassess-monthly step is the one most people skip and it's the one that compounds. The cheap tier in 2026 is doing things the expensive tier did in 2024.

Reserve eight to ten minutes for this. Three or four people share, two minutes each. The act of saying it out loud commits them to running it this week. The followup, even informally, is worth more than another slide of content.

Three takeaway cards. Read each. Beat between them. These are the lines you want them quoting on a sales call next month when a vendor tries to upsell them on something they don't need.

Last functional slide. Everything from tonight, in one place. Bookmark this URL, the talk page itself, and you can come back to all the labs and articles whenever you want.

Hold the contact slide while you take questions. Encourage people to grab the labs URL on their way out. The conversation that happens after the formal Q and A is usually the most useful one.