LLMs Struggle with Stock Trading, Visuals Boost Performance

New research reveals how large language models can improve numerical understanding in financial markets.

A new study introduces the Agent Trading Arena, a virtual stock market for LLM-based agents. It highlights that while LLMs struggle with numerical reasoning from plain text, chart-based visualizations significantly enhance their trading performance and understanding, especially under high volatility.

By Sarah Kline

September 16, 2025

4 min read

LLMs Struggle with Stock Trading, Visuals Boost Performance

Key Facts

The Agent Trading Arena is a virtual zero-sum stock market for LLM-based agents.
LLM agents in the arena can directly impact market prices, simulating real-world conditions.
LLMs struggle with numerical reasoning when given plain-text financial data.
Chart-based visualizations significantly improve LLM numerical reasoning and trading performance.
A 'reflection module' further enhances LLM performance, particularly with visual inputs.

Why You Care

Ever wondered if an AI could manage your investments better than you? What if the secret to its success wasn’t just raw data, but how that data is presented? New research dives into how Large Language Models (LLMs) perform in the complex world of stock trading, revealing a surprising truth about their numerical understanding.

What Actually Happened

A recent paper, “Agent Trading Arena: A Study on Numerical Understanding in LLM-Based Agents,” introduces a novel virtual stock market. This “Agent Trading Arena” allows LLM-based agents to compete in a zero-sum environment, according to the announcement. Unlike traditional methods, these agents directly influence market prices, creating a more realistic simulation. The study aims to bridge the gap between training and real-world market evaluation, as detailed in the blog post. Researchers found that LLMs (Large Language Models) often struggle with numerical reasoning when presented with plain-text financial data. They tend to overfit to local patterns, the research shows. However, chart-based visualizations dramatically improve both their numerical reasoning and overall trading performance, the team revealed.

Why This Matters to You

This research has direct implications for anyone interested in AI-driven financial tools or understanding how LLMs process complex information. Imagine you’re a retail investor using an AI assistant. If that AI only reads financial news articles, its performance might be limited. But if it also analyzes stock charts, its decisions could be far more informed. The study highlights that incorporating a ‘reflection module’ further enhances performance, especially with visual inputs. “Experiments reveal that LLMs struggle with numerical reasoning when given plain-text data, often overfitting to local patterns and recent values,” the paper states. This means your AI assistant might be missing crucial insights if it’s not looking at the whole picture. Do you think your current financial tools are leveraging visual data effectively?

Here’s how different data presentations impact LLM trading:

Data Type	Numerical Reasoning	Trading Performance	Overfitting Tendency
Plain-Text Data	Low	Moderate	High
Chart-Based Visual	High	High	Low

This table, derived from the research findings, illustrates the significant advantage visuals provide. For example, if you’re tracking a stock, seeing its price movements on a chart provides context that a simple list of numbers might miss. This visual context helps LLMs avoid focusing too much on recent, isolated data points.

The Surprising Finding

Here’s the twist: despite their language capabilities, LLMs are surprisingly poor at numerical reasoning from plain text. The study found that LLMs often ‘overfit’ to local patterns and recent values when only given text data. This means they might react strongly to short-term fluctuations without understanding the broader trend. In stark contrast, chart-based visualizations significantly enhance both numerical reasoning and trading performance. This challenges the common assumption that LLMs can simply ‘read’ numbers and understand their implications as easily as they process words. The team revealed that incorporating a reflection module yields additional improvements, especially with visual inputs. This suggests that the way data is presented is as crucial as the data itself for AI in finance.

What Happens Next

This research paves the way for more AI trading agents. We can expect to see AI financial tools begin to integrate more visual data analysis within the next 12-18 months. Developers will likely focus on creating reflection modules for LLMs, allowing them to better interpret complex visual information. For example, future AI investment platforms might analyze candlestick charts and technical indicators directly, rather than relying solely on textual reports. The industry implications are vast, suggesting a shift towards multimodal AI in financial applications. If you’re building AI models for finance, consider prioritizing visual data input and reflection mechanisms. This approach could significantly improve your model’s robustness, especially during periods of high market volatility, according to the research.

Ready to start creating?