Artificial Intelligence, Banking

GPT-4 faces a challenger: Can Writer’s finance-focused LLM take the lead in banking?

  • We often focus on chatbots built by banks and financial firms, but today, we explore the engines driving chatbot interactions and platform automation.
  • Banks typically turn to GPT-4 for LLM solutions, but a potential rival is emerging. San Francisco’s Writer, a gen AI company, is pushing forward in enterprise AI with domain-specific LLMs like Palmyra Fin.
close

Email a Friend

GPT-4 faces a challenger: Can Writer’s finance-focused LLM take the lead in banking?

Banks are heavily investing in Large Language Models (LLMs) to enhance both internal operations and customer interactions — yet building a model that excels at both is a significant challenge.

A recent study by Writer, a San Francisco-based generative AI company that provides a full-stack AI platform for enterprise use, found that ‘thinking’ LLMs produce false information in up to 41% of tested cases.

The study evaluated advanced reasoning models in real-world financial scenarios, highlighting the risks such inaccuracies pose to regulated industries like financial services. The research also showed that traditional chat LLMs outperform thinking models in accuracy.

LLMs are used in three main ways within financial services:

  1. Platforms for operations & automation – LLMs power internal enterprise platforms to streamline workflows, automate document processing, summarize reports, analyze data, and assist employees. For example, Ally Bank’s proprietary AI platform, Ally.ai, uses LLMs to improve its marketing and business processes.
  2. Task-specific AI assistants – LLMs enhance specific financial tasks such as fraud detection, compliance monitoring, or investment analysis. An example of this is J.P. Morgan’s IndexGPT, which aims to provide AI-driven investment insights.
  3. Chatbots & virtual assistants – LLMs improve customer-facing chatbots by making them more conversational and executing basic tasks. Bank of America’s virtual assistant, Erica, provides banking insights to its customers.

We often focus on chatbots built by banks and financial firms, but today, we explore the underlying technology behind them — the engines driving chatbot interactions and platform automation.

We take a closer look at the LLMs driving these AI systems, their challenges, and how financial firms can train enterprise-grade models to capitalize on their potential while controlling their risks.

Thinking LLMs vs. traditional chat LLMs 

Thinking LLMs, also referred to as CoT (Chain-of-Thought) models, are designed to simulate multi-step reasoning and decision-making processes to provide more nuanced responses beyond only retrieving or summarizing information, says Waseem Alshikh, CTO and co-founder of Writer.

Waseem Alshikh, CTO and co-founder of Writer

Morgan Stanley’s AI Assistant, for example, uses OpenAI’s GPT-4 to scan 100,000+ research reports and provide quick insights to financial advisors. It enhances portfolio strategy recommendations by summarizing complex data beyond retrieving reports.

“These models are not truly ‘thinking’ but are instead trained to generate outputs that resemble reasoning patterns or decompose complex problems into intermediate reasoning steps,” Waseem notes.

Morgan Stanley’s AI tool encountered accuracy issues stemming from hallucinated responses. Shortly after its launch in 2023, sources within the company described the tool as ‘spotty on accuracy,’ with users frequently receiving responses like “I’m unable to answer your question.”

While Morgan Stanley has been proactive in fine-tuning OpenAI’s GPT-4 model to assist its financial advisors, the company acknowledges the challenges posed by AI hallucinations. To reduce inaccuracies, the bank curated training data and limited prompts to business-related topics.

Traditional chat LLMs, however, tend to be more accurate, according to Waseem. These models mainly use pattern matching and next-token prediction, responding in a conversational manner based on pre-trained knowledge and contextual cues. While these models may struggle with complex queries at times, they produce fewer hallucinations, making them more reliable for regulatory compliance, according to Writer’s research.

Bank of America’s virtual assistant, Erica, uses a traditional chat model to assist customers with banking tasks like balance inquiries, bill payments, and credit report updates. By leveraging structured data and predefined algorithms, it provides accurate and reliable responses while reducing the likelihood of misinformation.

But how can financial firms navigate the trade-off between AI sophistication and accuracy?

Best practices for implementing thinking LLMs in financial services

Given the advanced capabilities of thinking LLMs, financial firms can’t simply rule them out, but they can deploy them effectively with the right strategic approaches.

Waseem outlines the key steps:


TS Pro subscription options

0 comments on “GPT-4 faces a challenger: Can Writer’s finance-focused LLM take the lead in banking?”

Banking, Partner

The Harmony Gap: Why banks lose $100M annually when fintech and legacy systems don’t play nice

  • Financial disharmony costs organizations $100 million annually as money struggles to flow seamlessly through trading, banking, and payment systems, with cyberthreats driving a third of losses.
  • FIS CTO Firdaus Bhathena and Oxford Economics' Margaux McLoughlin discuss strategic solutions including AI integration, enhanced training, and industry partnerships to address the growing harmony gap in financial services.
Zack Miller | June 26, 2025
Banking, Partner, SMB Finance

How to get started with embedded lending and drive revenue

  • Always strapped for cash and with their owners juggling multiple responsibilities at once, small businesses increasingly adopt vertical SaaS platforms to run their operations, the demand for integrated banking, payments, and lending solutions continues to accelerate.  
  • Tune in to hear us plot the market shift from embedded payments to lending, underwriting challenges for thin-file customers, and strategic partnership models that create comprehensive solutions for small businesses along with two case studies from Relay and Tilled.
Zack Miller | June 25, 2025
Banking, Embedded Finance, Partner

How Citizens is using Value-Added Services and a command center approach to empower SMBs

  • SMB customers need more than just the traditional checking account from their banks, firms that want to capture and retain SMB customers need to go further and build products that help make running an SMB easier.
  • On the show today, Mark Valentino, President of Business Banking at Citizens, and Taira Hall, EVP and Head of Enterprise Payments at Citizens, join to talk about how the bank aiming to become a command center for SMB owners.
Rabab Ahsan | June 23, 2025
Artificial Intelligence, Banking, Partner

How Temenos is co-creating AI products with banks, not just for them

  • Banks struggle with AI adoption due to outdated data systems and cultural resistance; Temenos' Barb Morgan addresses these foundational challenges.
  • Morgan talks about how Temenos is helping the industry in evolving towards gradual AI deployment, market-centric innovation, and customer-driven product development approaches.
Zack Miller | June 20, 2025
Banking, Partner

“Take a hard look at your current ecosystem. If you were to double the assets under your management today, would your current ecosystem sustain that growth?” Finastra’s Kristen Lista, on what FIs need to do to compete in SME lending

  • Traditional banks are losing market share to non-bank FIs who are proving to be more agile in their SME lending approaches.
  • Finastra’s Principal Product Manager Kristen Lista joins the Tearsheet podcast today to discuss how consolidating technology and decreasing the time between application and access to funding can fuel traditional bank's SME lending approaches.
Rabab Ahsan | June 17, 2025
More Articles