@karl.yeh_ai_explorer: Got access to ChatGPT Agent Mode and tested it against other AI Agents, Genspark and Manus for real business use cases. Here's the what I found. First, I hate the examples used by AI Agent live demos because they're so simplistic that it's faster for you to do them manually or unrealistic for most people...and why is it always booking travel??? I tested these three AI Agents for two business problems. One web based and the other using the data connectors. Desired Outcome: Can the agent do task either faster than doing ir manually OR can multiple agents be run at the same time and get accurate (or close enough) so the user can do other things. Client business problem #1 A municipal client visits other municipal websites to find job information to compare roles and salary. They do this bi-weekly/monthly. AI Agent Task: Visit a municipal website, find all posted jobs and extract the job title, job description and salary. Put into a spreadsheet and create a presentation. NOTE: We tried using typical web scrapers but how job posting are set up are so different based on website CMS and if they are using HRIS software. Client business problem #2 A construction client regularly compares supplier invoice statements against their internal invoice list contain hundreds of invoices. Currently being done manually. Important because any discrepancies can lead to hundreds of thousands of wasted dollars in duplicate payments or late fees. We shifted this task to using o3 but wanted to test a faster method. AI Agent Task: Leverage the Agents their connector to Google Drive, compare an attached internal invoice lists against four PDF supplier invoice statements and check for discrepancies. Create a report based on an example provided. RESULTS Problem 1 ChatGPT Agent Mode: - Provided the best and most accurate results. Followed the instructions the cleanest. - Spreadsheet was on point. - No additional prompting required - Presentation was the WORST and took the longest time creating it too Genspark: - Eventually completed the task but needed multiple prompting to clarify end goal. - Fast web search - Hands down the best designed and relevant presentation. Manus: - Could not complete the task even with multiple prompting (no job description or salary information) - Fast web search - Excellent designed presentation Business Problem 2 ChatGPT Agent Mode: Completed the task and provide accurate discrepancies in the report. Genspark: Couldn't find the Google Drive folder. Connector didn't work Manus: Couldn't find the Google Drive folder. Connector didn't work - I read I could only add files so maybe this task was beyond it's capabilities? Note on these test: It's easy to find use cases that each Agent will succeed or fail in so you have to test on your intended to use than just take my results or any other results as final review. I've used Genspark and Manus for other business cases and they performed well. #aiagent #aibusiness #aibusinesstools #aiagents #openaiagents @Genspark.ai @Manus AI

Karl Yeh
Karl Yeh
Open In TikTok:
Region: CA
Saturday 19 July 2025 13:09:18 GMT
4488
140
28
28

Music

Download

Comments

brandnat
Brand Nat | AI Automation :
LOVE this!! real life biz use cases!!
2025-07-21 12:29:29
1
grantporter_genxup
Grant Porter :
Thanks for real life examples and details.
2025-07-20 03:09:11
3
morelogiclessemotions
MoreLogicLessEmotions :
Problem is Ai hallucinates often because of throttling performance. It’s not reliable and a quick way to lose a client by presenting inaccurate data.
2025-07-19 18:59:07
0
brentmvw8vh
brent :
what i’m loving right now is mixing agentmode with workbeaver. agentmode handles the smart steps, and workbeaver handles the actual computer actions. you describe a task, and workbeaver just does it without scripting. agentmode helps with conditions or logic before handing off the physical actions to workbeaver.
2025-07-20 15:07:02
4
theflaviocorrea
Flávio Corrêa :
hint: always use either JSON or XML for multiphase prompts with self-checks and delimitators for each phase. Don't build it, meta-prompt it ;)
2025-07-19 13:42:12
0
sigmagirl3000
SIGMA-GIRL :
Your posts are 🔥 AWESOME!!! 👏 the real world use cases are ON POINT!
2025-07-20 12:51:04
2
varma_umesh
Umesh Varma :
Finally someone gave agents real world examples. So tired of everyone showing agents scheduling meetings on the calendar or searching through emails.. duh. Appreciate the detailed comparison and not just a 30 seconds summary. 🙏🙏🙏
2025-07-20 18:18:52
2
leodragonzero
user2222 :
What about connecting to accounting tools like Sage?
2025-07-19 15:13:23
0
yokevintrue
Kevin True :
Great post. I appreciate the detail.
2025-07-20 02:35:34
1
coryrussell
No Bad Dayz :
Excellent post, very well done.
2025-07-20 03:14:16
2
tyjuanceo
Tyjuan 🌹 :
Great content 🔥
2025-07-19 23:12:27
1
savant_ai
Matt @ Savant-AI™ Studios :
Very helpful exploration. Thank you @Karl Yeh
2025-07-20 07:18:20
1
atomasbranch
Allen Branch :
😳😳😳
2025-07-20 18:55:20
0
bepositive1231
Positive change :
Interesting. Can you do an example with bot protection like missingmoney.com?
2025-07-20 10:59:09
0
vballdude
vballdude :
You sort of glossed over the fact that one had 40 jobs and one had 42….that is an utter fail - hallucinating data in these cases is unacceptable and there is no way to know unless you go and do the job yourself - which defeats the purpose. I would guess if you ran this with any of them over multiple months you would see many occurunces of hallucinations.
2025-07-20 02:35:29
0
user3796735523732
Martha Dixon :
@isaacnelson017's RSI strategy catches perfect reversals every time. Turned my small $3K account into $27K in 8 months.
2025-07-20 15:39:06
0
user10548379954600
Stacy Rice :
The only trading alerts worth your attention - @liamparker0011 's setups have real statistical edges!
2025-07-20 12:00:49
0
To see more videos from user @karl.yeh_ai_explorer, please go to the Tikwm homepage.

Other Videos


About