Ensemble Learning for Fake News Detection: Enhancing Classification Accuracy and Explainability with Structural and Statistical Metadata

Sebastián González-Celi, Henry N. Roa, Jorge Cruz-Silva, Edison Loza-Aguirre, Nelson Salgado-Reyes, Javier Guaña-Moya

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Abstract

    The proliferation of fake news on digital platforms presents significant societal challenges, necessitating the development of robust and interpretable detection systems. This study proposes a stacking-based ensemble learning model that integrates XGBoost and Logistic Regression to improve fake news classification accuracy while enhancing model transparency. Unlike traditional Natural Language Processing (NLP) approaches, which rely solely on textual analysis, this model incorporates structural and statistical metadata features, such as publication date and article length, to improve generalizability across misinformation domains. Experimental results on the Spanish Political Fake News dataset demonstrate that the stacking ensemble model outperforms individual classifiers, achieving an F1-score of 95.2% and a ROC-AUC of 0.974. SHAP (Shapley Additive Explanations) analysis enhances interpretability by identifying the most influential features contributing to classification decisions, confirming that metadata plays a critical role in misinformation detection. These findings highlight the effectiveness of hybrid machine-learning approaches that combine textual and structural information for scalable misinformation detection. The study’s contributions include a highly accurate and explainable classification model, positioning ensemble learning as a viable solution for real-world applications in automated fact-checking, journalism, and social media moderation.

    Original languageEnglish
    Title of host publicationIntelligent Systems and Applications - Proceedings of the 2025 Intelligent Systems Conference IntelliSys
    EditorsKohei Arai
    PublisherSpringer Science and Business Media Deutschland GmbH
    Pages485-502
    Number of pages18
    ISBN (Print)9783031999642
    DOIs
    StatePublished - 2025
    Event11th Intelligent Systems Conference, IntelliSys 2025 - Amsterdam, Netherlands
    Duration: Aug 28 2025Aug 29 2025

    Publication series

    NameLecture Notes in Networks and Systems
    Volume1554 LNNS
    ISSN (Print)2367-3370
    ISSN (Electronic)2367-3389

    Conference

    Conference11th Intelligent Systems Conference, IntelliSys 2025
    Country/TerritoryNetherlands
    CityAmsterdam
    Period08/28/2508/29/25

    Keywords

    • Fake news detection
    • Machine learning
    • Metadata-driven classification
    • SHAP analysis
    • Stacking ensemble learning
    • XGBoost

    Fingerprint

    Dive into the research topics of 'Ensemble Learning for Fake News Detection: Enhancing Classification Accuracy and Explainability with Structural and Statistical Metadata'. Together they form a unique fingerprint.

    Cite this