Author: Denis Avetisyan
A new approach combines financial time series data with real-time news sentiment analysis to deliver more accurate and generalizable stock price forecasts.

This review details a novel framework leveraging attention mechanisms and language models for multi-stock price prediction integrated with news sentiment analysis.
Accurately forecasting stock prices remains a persistent challenge in financial modeling, often hindered by the efficient integration of diverse data sources. This paper, ‘Generalized Stock Price Prediction for Multiple Stocks Combined with News Fusion’, introduces a novel approach that leverages attention mechanisms and large language models to effectively incorporate daily financial news into a generalized prediction framework. Experimental results demonstrate a significant reduction in prediction error (a 7.11% decrease in Mean Absolute Error) achieved by utilizing stock name embeddings for focused news filtering. Could this generalized model, trained across multiple stocks, represent a scalable solution for improved financial forecasting and risk management?
The Illusion of Prediction: Markets as Complex Systems
Conventional time series forecasting, reliant on techniques like ARIMA and exponential smoothing, frequently underperforms when applied to stock market analysis. These methods assume a degree of linearity and stationarity – that past patterns will predictably continue – an assumption consistently violated by the inherent volatility and complex interdependencies within financial markets. Subtle shifts in investor sentiment, unforeseen geopolitical events, and the cascading effects of high-frequency trading create non-linear dynamics that these traditional models struggle to capture. Consequently, forecasts often miss critical turning points or fail to accurately gauge the magnitude of price movements, highlighting the need for more sophisticated approaches capable of modeling the intricate, often chaotic, behavior of stock prices.
The proliferation of digital news and social media has created a vast ocean of unstructured data that holds potential clues to future market movements. While traditional financial analysis relies heavily on structured datasets like historical prices and company reports, this new wave of information – encompassing news articles, blog posts, and social media sentiment – offers the possibility of anticipating shifts before they are reflected in trading volumes. However, extracting meaningful signals from this data presents significant challenges. Natural language processing techniques are required to decipher context, identify relevant events, and gauge market sentiment, all while contending with noise, bias, and the sheer volume of information. Successfully navigating this landscape demands sophisticated algorithms capable of discerning genuine predictive power from spurious correlations, representing a considerable hurdle for accurate stock price prediction.

Language as a Signal: Decoding Market Narratives
Large language models (LLMs) such as BERT, LLaMA, and GPT-2 utilize transformer architectures to process news articles and generate contextualized word embeddings. These embeddings capture semantic relationships between words, going beyond simple keyword analysis to understand nuance and context. Specifically, LLMs are trained on massive text corpora, enabling them to learn representations that reflect the meaning of words based on their surrounding text. This capability allows for the quantification of textual information, converting qualitative news content into numerical vectors suitable for downstream tasks like sentiment analysis, topic modeling, and, ultimately, financial forecasting. The models achieve this by assigning each word or sub-word unit a high-dimensional vector, where similar meanings are represented by vectors that are close to each other in vector space.
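The idea that "similar meanings are represented by vectors that are close to each other" can be made concrete with cosine similarity. The following is a minimal sketch using hand-made toy vectors (hypothetical values, not embeddings learned by BERT, LLaMA, or GPT-2) purely to illustrate the geometry:

```python
import numpy as np

# Toy 4-dimensional "embeddings" (hypothetical values, not learned by any
# real model) illustrating how semantic similarity maps to vector proximity.
embeddings = {
    "stock":  np.array([0.9, 0.1, 0.0, 0.2]),
    "share":  np.array([0.8, 0.2, 0.1, 0.3]),
    "banana": np.array([0.0, 0.9, 0.8, 0.1]),
}

def cosine_similarity(u, v):
    """Cosine of the angle between two vectors; 1.0 means identical direction."""
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))

sim_related = cosine_similarity(embeddings["stock"], embeddings["share"])
sim_unrelated = cosine_similarity(embeddings["stock"], embeddings["banana"])
assert sim_related > sim_unrelated  # related words sit closer in vector space
```

In a real LLM the vectors are high-dimensional and contextual (the same word gets different vectors in different sentences), but the distance intuition is the same.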
Effective news encoding, a core component of utilizing language models for financial forecasting, involves converting unstructured textual data from news articles into numerical vector representations. This transformation is achieved through techniques like word embeddings – such as Word2Vec, GloVe, or those learned directly within models like BERT – which map each word to a high-dimensional vector space where semantic similarity corresponds to proximity. These vectors capture contextual information, allowing the model to understand relationships between words and concepts. The resulting numerical data is then suitable for input into quantitative models, including time series analysis, regression, and machine learning algorithms, facilitating the identification of patterns and potential predictive signals within the news flow. The quality of this encoding directly impacts the performance of downstream analytical tasks, necessitating careful consideration of model selection, training data, and embedding dimensionality.
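One simple way to turn a variable-length article into a single fixed-length vector, as the paragraph above describes, is mean pooling over word vectors. This is a sketch with a hypothetical three-word vocabulary, not the paper's actual encoder:

```python
import numpy as np

# Hypothetical tiny vocabulary of 2-dimensional word vectors.
vocab = {
    "rates":  np.array([0.2, 0.7]),
    "rise":   np.array([0.3, 0.6]),
    "profit": np.array([0.8, 0.1]),
}

def encode_article(tokens, vocab):
    """Mean-pool word vectors into one fixed-length article vector.
    Unknown tokens are skipped; an empty article maps to the zero vector."""
    vecs = [vocab[t] for t in tokens if t in vocab]
    if not vecs:
        return np.zeros_like(next(iter(vocab.values())))
    return np.mean(vecs, axis=0)

doc_vec = encode_article(["rates", "rise", "unknownword"], vocab)
# doc_vec is the element-wise mean of the "rates" and "rise" vectors
```

Contextual models like BERT replace this static lookup-and-average with transformer layers, but the output contract is the same: one numeric vector per article, ready for a downstream quantitative model.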
The Time-LLM framework combines the capabilities of large language models with time-series analysis techniques to improve financial forecasting. This integration allows for the encoding of textual information from news sources – previously inaccessible to quantitative models – as numerical features within a time-series model. Specifically, language model outputs representing sentiment, topic prevalence, or event detection are incorporated as exogenous variables, augmenting traditional forecasting inputs such as historical price and volume data. This approach enables the model to react to and potentially predict market movements based on real-time news flow, capturing nuanced relationships between textual information and financial indicators that would otherwise remain hidden. Empirical results demonstrate that Time-LLM consistently outperforms baseline models relying solely on historical data, particularly in short-term forecasting scenarios.
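The exogenous-variable idea above amounts to concatenating news-derived scores onto the usual lag window of prices before the forecaster sees them. A minimal sketch (the feature names and window length are assumptions, not the Time-LLM implementation):

```python
import numpy as np

def build_features(price_window, news_features):
    """Concatenate a lag window of historical prices with news-derived
    exogenous features (e.g. a sentiment score) into one model input."""
    return np.concatenate([
        np.asarray(price_window, dtype=float),
        np.asarray(news_features, dtype=float),
    ])

# 3 price lags plus 1 hypothetical sentiment score for the same day.
x = build_features([101.2, 100.8, 102.5], [0.35])
assert x.shape == (4,)  # the forecaster now sees prices and news together
```

Any downstream regressor or sequence model then learns jointly from both signal sources, which is what lets news flow move the forecast before the price series itself reacts.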

Attention as a Filter: Distilling Signal from Noise
Attentive Pooling techniques address the challenge of processing variable-length news articles by weighting different segments of text based on their relevance to the forecasting task. Self-Attentive Pooling (SAP) calculates attention weights based on the internal relationships within a single article, effectively identifying salient sentences or phrases. Cross-Attentive Pooling extends this by calculating attention weights between different news articles, allowing the model to identify corroborating or conflicting information. Position-Aware Self-Attentive Pooling further refines the process by incorporating positional information, recognizing that the location of information within an article can be significant. These methods collectively enable the model to prioritize the most impactful information, improving performance compared to approaches that treat all text equally.
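The self-attentive variant can be sketched in a few lines: score each sentence vector against a query vector, softmax the scores into weights, and take the weighted sum. In this sketch the query vector `w` is fixed by hand; in SAP it would be a learned parameter:

```python
import numpy as np

def self_attentive_pooling(H, w):
    """Pool n sentence vectors H (shape n x d) into one d-dimensional
    summary: scores s = H @ w, softmax to weights a, output a @ H."""
    s = H @ w                        # one relevance score per sentence
    a = np.exp(s - s.max())          # numerically stable softmax
    a = a / a.sum()                  # attention weights sum to 1
    return a @ H, a

H = np.array([[1.0, 0.0],            # 3 sentence vectors (hypothetical)
              [0.0, 1.0],
              [0.9, 0.1]])
w = np.array([1.0, 0.0])             # hypothetical, would be learned
pooled, weights = self_attentive_pooling(H, w)
assert np.isclose(weights.sum(), 1.0)
assert weights[0] > weights[1]       # sentences aligned with w dominate
```

Cross-attentive pooling swaps the fixed query for vectors derived from other articles, and the position-aware variant adds a positional term to the scores; the softmax-and-weighted-sum core stays the same.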
News-Price Fusion, when integrated with Graph Convolutional Networks (GCN), improves the modeling of connections between stock movements and relevant news events. The GCN component facilitates the capture of dependencies within the news data itself, identifying relationships between different articles and entities. Simultaneously, the News-Price Fusion mechanism correlates these news-derived representations with historical stock price data. This combined approach allows the model to learn how specific news events, and the broader context within which they occur, influence stock performance, thereby enhancing the accuracy of forecasting models by leveraging both textual information and quantitative data.
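A single graph-convolution layer over a news graph, in the standard normalized-adjacency form, looks like the sketch below. The graph, features, and weights are toy assumptions; the paper's actual GCN architecture is not specified here:

```python
import numpy as np

def gcn_layer(A, X, W):
    """One graph-convolution layer: H = relu(D^-1/2 (A+I) D^-1/2 X W).
    Nodes are news items; edges link related articles, so each node's
    new features mix in its neighbors' features."""
    A_hat = A + np.eye(A.shape[0])           # add self-loops
    d = A_hat.sum(axis=1)
    D_inv_sqrt = np.diag(1.0 / np.sqrt(d))   # symmetric degree normalization
    H = D_inv_sqrt @ A_hat @ D_inv_sqrt @ X @ W
    return np.maximum(H, 0.0)                # ReLU

A = np.array([[0, 1, 0],                     # 3 news nodes; node 1 links 0 and 2
              [1, 0, 1],
              [0, 1, 0]], dtype=float)
X = np.array([[1.0, 0.0],                    # per-node features (hypothetical)
              [0.0, 1.0],
              [1.0, 1.0]])
W = np.eye(2)                                # hypothetical weight matrix
H = gcn_layer(A, X, W)
assert H.shape == (3, 2)                     # one fused vector per news node
```

The fusion step then aligns these graph-smoothed news representations with the price series for each stock, so related articles reinforce one another before they influence the forecast.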
Attentive mechanisms facilitate the conversion of extensive news data into predictive signals for financial forecasting. Specifically, the Self-Attentive Pooling (+SAP) method demonstrates a measurable improvement in forecast accuracy, achieving a 7.11% reduction in Mean Absolute Error (MAE) when compared to baseline models. This reduction indicates a significant enhancement in the model’s ability to identify and prioritize pertinent information within news streams, leading to more precise predictions. The MAE metric quantifies the average magnitude of errors in the forecasts, with a lower value representing improved performance.
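The MAE metric and the reported relative reduction are straightforward to compute. The baseline value below is a hypothetical number chosen only to reproduce the 7.11% arithmetic, not a figure from the paper:

```python
import numpy as np

def mae(y_true, y_pred):
    """Mean Absolute Error: average magnitude of forecast errors."""
    return float(np.mean(np.abs(np.asarray(y_true) - np.asarray(y_pred))))

def relative_reduction(baseline_mae, model_mae):
    """Percentage drop in MAE relative to the baseline model."""
    return 100.0 * (baseline_mae - model_mae) / baseline_mae

# Hypothetical errors chosen to reproduce the paper's reported 7.11% figure:
# an MAE of 0.9289 against a baseline MAE of 1.0.
assert round(relative_reduction(1.0, 0.9289), 2) == 7.11
```

Because MAE weights all errors linearly (unlike MSE, which squares them), a 7.11% drop translates directly into proportionally smaller average price-forecast errors.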

Validation in the Real World: A System Tested
Rigorous evaluation of the predictive model involved comprehensive testing against two distinct real-world datasets: the BigData23 Dataset, representing the U.S. stock market, and the TW21 Dataset, focused on the Taiwan stock market. This dual assessment was crucial to confirm the model’s generalizability and robustness beyond a single market’s specific characteristics. Results consistently indicated strong predictive capabilities across both datasets, demonstrating the model’s ability to accurately forecast stock market trends in varied economic environments and highlighting its potential for practical application in financial analysis and investment strategies.
Rigorous evaluation of the predictive model leveraged established metrics, Mean Squared Error and Mean Absolute Error, to demonstrate its efficacy against existing baseline models. Notably, the implementation of Self-Attentive Pooling yielded a substantial improvement in performance, specifically achieving a 7.11% reduction in Mean Absolute Error when tested on the TW21 dataset, representing the Taiwanese stock market. This quantifiable advancement underscores the model’s ability to generate more accurate predictions and highlights the benefits of the Self-Attentive Pooling mechanism in capturing complex relationships within financial time series data, suggesting a practical advantage for investors and analysts.
Prior to model training, data underwent a crucial preprocessing step known as Standard Scaling. This technique, involving the normalization of feature values to possess a mean of zero and a standard deviation of one, proved instrumental in achieving optimal model convergence and stability. By effectively addressing disparities in feature scales, Standard Scaling prevented features with larger values from unduly influencing the learning process. This normalization facilitated faster and more reliable training, ultimately contributing to the model’s robust performance on both the U.S. and Taiwanese stock market datasets. The consistent improvements observed following this preprocessing stage highlight its importance in preparing financial time-series data for predictive modeling.
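The Standard Scaling step described above is simple to implement; this sketch is equivalent in effect to scikit-learn's `StandardScaler` fit on the training data (the price and volume numbers are illustrative, not from the datasets used in the paper):

```python
import numpy as np

def standard_scale(X):
    """Normalize each column to zero mean and unit standard deviation,
    so large-valued features (e.g. volume) cannot dominate training."""
    mu = X.mean(axis=0)
    sigma = X.std(axis=0)
    return (X - mu) / sigma

# Illustrative rows of (price, volume): volume is ~10,000x larger in scale.
X = np.array([[100.0, 1e6],
              [102.0, 2e6],
              [104.0, 3e6]])
Z = standard_scale(X)
assert np.allclose(Z.mean(axis=0), 0.0)  # each column now has mean 0
assert np.allclose(Z.std(axis=0), 1.0)   # and standard deviation 1
```

In practice the mean and standard deviation must be computed on the training split only and reused on the test split, otherwise information leaks from the future into the model.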
Beyond Prediction: Towards a Learning Financial Ecosystem
Future investigations are set to refine the communication between the language model and its financial forecasting duties through the strategic application of ‘Prompt Engineering’. This involves meticulously crafting the input queries – the ‘prompts’ – to elicit more accurate and nuanced predictions from the model. Rather than simply asking for a forecast, researchers aim to design prompts that guide the model’s reasoning process, encouraging it to consider specific economic indicators, historical trends, or potential risk factors. By iteratively testing and refining these prompts, the model’s performance can be significantly enhanced, moving beyond generalized predictions towards more insightful and actionable financial intelligence. This approach acknowledges that the quality of the output is directly linked to the clarity and precision of the input, transforming the interaction into a collaborative dialogue between analyst and artificial intelligence.
The capacity of financial forecasting models to accurately represent real-world market behavior hinges on their ability to discern intricate relationships between numerous variables. Current research suggests that advancements in model architecture, specifically through the implementation of more sophisticated graph structures and attention mechanisms, hold considerable promise. These techniques allow the model to move beyond simple linear correlations and instead map assets as nodes within a complex network, capturing dependencies and influences that traditional methods often miss. Attention mechanisms, in particular, enable the model to dynamically prioritize the most relevant connections within this network, focusing computational resources on the factors most likely to drive future price movements. By more faithfully mirroring the interconnectedness of financial markets, these enhancements aim to significantly improve the model’s predictive power and its capacity to adapt to rapidly changing conditions.
The envisioned culmination of this research lies in the development of a truly autonomous financial intelligence system. This system wouldn’t simply react to market data, but proactively learn from it, continuously refining its predictive capabilities as conditions evolve. Rather than relying on static algorithms, it would employ sophisticated machine learning techniques to identify emerging patterns and anticipate shifts in market behavior. The ultimate deliverable isn’t just forecasting, but the provision of actionable insights – specific, data-driven recommendations designed to empower investors and optimize portfolio performance in a dynamic and often unpredictable economic landscape. This adaptive capability represents a significant leap towards a future where financial analysis transcends traditional methods and embraces the power of continuous, self-directed learning.
The pursuit of generalized stock price prediction, as detailed in this framework, resembles less a construction project and more the tending of a complex garden. The model’s integration of news sentiment via attention-based pooling isn’t about controlling market behavior, but about acknowledging the inherent unpredictability and responsiveness of the system. As Paul Feyerabend observed, “Anything goes.” This isn’t nihilism, but a recognition that rigid methodologies, seeking to impose order on chaotic systems, inevitably encounter unforeseen consequences. The model’s adaptability, its capacity to evolve with new data and market shifts, is not a bug; it is the feature that allows it to persist, even if that persistence manifests in forms quite different from those originally envisioned. Long stability, in this context, would indeed signal a hidden disaster, a failure to adapt to the ever-shifting landscape.
What’s Next?
This work, like all attempts to predict the market, has merely refined the shape of inevitable failure. The architecture, with its attention mechanisms and language model integrations, doesn’t solve the problem of stock price prediction – it postpones the moment of its unsolvability. The system will, predictably, find new edges to be exploited, and those edges will, equally predictably, erode. A truly robust system wouldn’t aim for accuracy, but for graceful degradation.
The real limitation isn’t the model itself, but the assumption that ‘news’ constitutes a fixed, interpretable signal. Language is a living organism, constantly evolving to obscure as much as it reveals. Future work must acknowledge that information isn’t found in text, but created through the act of reading – a distinctly human process. The attempt to automate this process is not advancement, but a narrowing of perspective.
Therefore, the next iteration shouldn’t focus on more complex algorithms, but on systems that actively invite their own disruption. A model that incorporates calibrated error, that anticipates and embraces its own obsolescence, would be a more honest, and ultimately, a more useful artifact. Perfection, after all, leaves no room for people – or for markets.
Original article: https://arxiv.org/pdf/2603.19286.pdf
Contact the author: https://www.linkedin.com/in/avetisyan/
2026-03-23 10:59