I asked three AIs the same retirement questions

Zeldeen Müller, CFP®, CEO, inSite Connect and Creator of AgendaWorx.com, put AI to the test with some key retirement investment questions. Here’s what happened…

9
Zeldeen Müller, CEO, AgendaWorx
Zeldeen Müller

My team spends their days helping retirement fund members sort out their queries on helpdesks. Lately, I have noticed a new “expert” sneaking into the chat: artificial intelligence (AI).

 More folks are asking AI for financial advice before they even phone their fund or Google. So, I thought, let’s put these AI whizzes to the test! I grabbed three common retirement fund questions and fired them at ChatGPT 5.1 (OpenAI), Grok 3 (Elon Musk’s brainchild) and Claude Sonnet 4.5 (Anthropic). Here’s what I found. The questions were: 

  1. “I am a 30-year-old South African and have a retirement fund. Should I cash out my withdrawal benefit or preserve it? Explain in ±150 words.” 
  2. “I’m 59, with R2.5-million in my fund, a R50 000 monthly salary and no other savings. Should I retire at 60 instead of 65? Explain in ±150 words.” 
  3. “I’m 45 and in a South African retirement fund. Should I use my annual Two- Pot withdrawal to pay my home loan? Explain in ±150 words.” 
Check out the matrix below for how each AI did (full answers at
https://www.agendaworx.com/agendaworx/i-asked-3-ai-models-the-same-question/)

What I learnt 

Grok 3 was the star pupil. It dished out spot-on tax rates, scarily accurate income projections and clear Two-Pot details. It always nudged members to chat with a financial advisor, and its inflation warnings and preservation tips were practical. 

Claude Sonnet 4.5 impressed me with its tax and Two-Pot breakdowns and even outdid Grok on preservation options. I loved its clever alternative suggestions, like using bonuses or budgeting smarter instead of Two-Pot withdrawals. But, worryingly, it wildly overstated early retirement income by 320% (R35 000 as opposed to R8 333 monthly), which could trick someone into retiring too soon. Skipping advisor recommendations was a miss too. 

ChatGPT 5.1 flopped. Its advice was fuzzy, with no tax rates, preservation options or Two-Pot specifics. It mentioned inflation vaguely but left members in the dark. No advisor nudge either. Weaker than I anticipated it would be. 

Why this matters 

AI can be a nifty starting point, but it’s not perfect. Claude’s big income error could lead to bad calls, like retiring with too little cash. ChatGPT’s wishy-washy answers might confuse members. Grok 3’s solid, but even it’s no match for a human advisor. Financial advisors are essential to verify AI outputs, correct errors and tailor plans to individual circumstances. Advisors should caution clients about AI’s limitations, stressing that it’s a tool, not a substitute for professional guidance. Members must consult experts to navigate South Africa’s complex retirement landscape safely. 

  •  


 

Sponsored