Multi-Source Data Fusion for Short-Term Demand Forecasting of Seasonal Retail Products: An Empirical Study Using Weather and Social Media Signals
Main Article Content
Abstract
Recent advances in machine learning have enabled retailers to leverage diverse data streams for demand prediction, yet most systems continue to rely primarily on historical sales. In this study, we address the challenge of seasonal demand forecasting by integrating meteorological observations, social media activity, and economic indicators with point-of-sale transactions. We propose a hierarchical ensemble that combines gradient boosting machines with bidirectional LSTMs, where component weights adapt according to product category and forecast horizon. Through mutual information screening, we reduce feature dimensionality from 123 to category-specific subsets averaging 38 features, lowering computational costs by 60% while maintaining predictive accuracy. Our approach requires only standard training of the base models along with a lightweight meta-learner, without necessitating end-to-end joint training. Evaluation on 10.3 million transactions from three U.S. retail chains demonstrates that our method achieves a mean absolute percentage error of 19.6%, compared to 37.5% for seasonal naive baselines. Multi-source fusion thus provides a practical pathway to more accurate retail forecasting, without the complexity associated with end-to-end deep learning systems. Crucially, this approach relies on lightweight feature learning and selection techniques combined with meta-learner weighting, rather than on end-to-end joint representation training.
Article Details
Section
How to Cite
References
1. G. P. Zhang, and M. Qi, "Neural network forecasting for seasonal and trend time series," European Journal of Operational Research, vol. 160, no. 2, pp. 501-514, 2005.
2. G. Verstraete, E. H. Aghezzaf, and B. Desmet, "A data-driven framework for predicting weather impact on high-volume low-margin retail products," Journal of Retailing and Consumer Services, vol. 48, pp. 169-177, 2019.
3. M. Sousa, A. M. Tomé, and J. Moreira, "Long-term forecasting of hourly retail customer flow on intermittent time series with multiple seasonality," Data Science and Management, vol. 5, no. 3, pp. 137-148, 2022. doi: 10.1016/j.dsm.2022.07.002
4. M. Li, F. Wang, X. Jia, W. Li, T. Li, and G. Rui, "Multi-source data fusion for economic data analysis," Neural Computing and Applications, vol. 33, no. 10, pp. 4729-4739, 2021. doi: 10.1007/s00521-020-05531-0
5. M. A. Jahin, A. Shahriar, and M. A. Amin, "MCDFN: Supply chain demand forecasting via an explainable multi-channel data fusion network model," Evolutionary Intelligence, vol. 18, no. 3, p. 66, 2025. doi: 10.1007/s12065-025-01053-7
6. P. Ramos, N. Santos, and R. Rebelo, "Performance of state space and ARIMA models for consumer retail sales forecasting," Robotics and Computer-Integrated Manufacturing, vol. 34, pp. 151-163, 2015. doi: 10.1016/j.rcim.2014.12.015
7. H. Chan, and M. I. M. Wahab, "A machine learning framework for predicting weather impact on retail sales," Supply Chain Analytics, vol. 5, p. 100058, 2024.
8. H. Ye, X. Teng, B. Song, K. Zou, M. Zhu, and G. He, "Multi-source data fusion-based grid-level load forecasting," Applied Sciences, vol. 15, no. 9, p. 4820, 2025. doi: 10.3390/app15094820
9. Y. Ensafi, S. H. Amin, G. Zhang, and B. Shah, "Time-series forecasting of seasonal items sales using machine learning-A comparative analysis," International Journal of Information Management Data Insights, vol. 2, no. 1, p. 100058, 2022.
10. L. Tan, R. Kang, J. Xia, and Y. Wang, "Application of multi-source data fusion on intelligent prediction of photovoltaic power," Solar Energy, vol. 277, p. 112706, 2024. doi: 10.1016/j.solener.2024.112706
11. F. Badorf, and K. Hoberg, "The impact of daily weather on retail sales: An empirical study in brick-and-mortar stores," Journal of Retailing and Consumer Services, vol. 52, p. 101921, 2020. doi: 10.1016/j.jretconser.2019.101921
12. Y. Zhao, J. Zhao, and E. Y. Lam, "House price prediction: A multi-source data fusion perspective," Big Data Mining and Analytics, vol. 7, no. 3, pp. 603-620, 2024. doi: 10.26599/bdma.2024.9020019
13. G. Nunnari, and V. Nunnari, "Forecasting monthly sales retail time series: A case study," In 2017 IEEE 19th Conference on Business Informatics (CBI), July, 2017, pp. 1-6. doi: 10.1109/cbi.2017.57
14. G. A. N. Pongdatu, and Y. H. Putra, "Seasonal time series forecasting using SARIMA and Holt Winter's exponential smoothing," In IOP Conference Series: Materials Science and Engineering, August, 2018, p. 012153. doi: 10.1088/1757-899x/407/1/012153
15. Q. Wang, Y. Wang, K. Zhang, Y. Liu, W. Qiang, and Q. Han Wen, "Artificial intelligent power forecasting for wind farm based on multi-source data fusion," Processes, vol. 11, no. 5, p. 1429, 2023. doi: 10.3390/pr11051429
16. A. Salah, M. Bekhit, E. Eldesouky, A. Ali, and A. Fathalla, "Price prediction of seasonal items using time series analysis," Computer Systems Science and Engineering, 2023. doi: 10.32604/csse.2023.035254
17. I. Saadi, B. Farooq, A. Mustafa, J. Teller, and M. Cools, "An efficient hierarchical model for multi-source information fusion," Expert Systems with Applications, vol. 110, pp. 352-362, 2018. doi: 10.1016/j.eswa.2018.06.018