Fake News Detection Using Natural Language Processing And Machine Learning

The rapid spread of misinformation on digital platforms has created a pressing need for effective automated solutions. This project presents a fake news detection framework using Natural Language Processing (NLP) and Machine Learning (ML) techniques. Using a labeled dataset of real and fake news articles, the study applies TF-IDF-based feature extraction and evaluates four classification algorithms—Logistic Regression, Support Vector Machine (SVM), Naïve Bayes, and Random Forest. The Random Forest model, after hyperparameter tuning, achieved the highest accuracy of 99.8% and an F1-score of 0.9979. The research also explores challenges like domain adaptation and model explainability, emphasizing the importance of ethical AI in combating misinformation. Results demonstrate that even lightweight ML models can be highly effective for fake news detection, offering scalable solutions for content moderation and public information integrity.

Files

Metadata

Work Title Fake News Detection Using Natural Language Processing And Machine Learning
Access
Open Access
Creators
  1. Febin Clement
Keyword
  1. Fake News Detection
  2. NLP
  3. Machine Learning
  4. Text Classification
  5. TF-IDF
  6. Random Forest
  7. Logistic Regression
  8. SVM
  9. Naïve Bayes
  10. Misinformation
  11. Content Moderation
  12. Explainable AI
  13. Digital Media
  14. Information Integrity
  15. Hyperparameter Tuning
License In Copyright (Rights Reserved)
Work Type Masters Culminating Experience
Sub Work Type Scholarly Paper/Essay (MA/MS)
Program Information Systems
Degree Master of Science
Acknowledgments
  1. Girish Subramanian
Publisher
  1. ScholarSphere
Publication Date April 2025
DOI doi:10.26207/0he1-3r26
Deposited April 23, 2025

Versions

Analytics

Collections

This resource is currently not in any collection.

Work History

Version 1
published

  • Created
  • Updated
  • Updated Keyword, Degree, Program, and 3 more Show Changes
    Keyword
    • Fake News Detection, NLP, Machine Learning, Text Classification, TF-IDF, Random Forest, Logistic Regression, SVM, Naïve Bayes, Misinformation, Content Moderation, Explainable AI, Digital Media, Information Integrity, Hyperparameter Tuning
    Degree
    • Master of Science
    Program
    • Information Systems
    Description
    • The rapid spread of misinformation on digital platforms has created a pressing need for effective automated solutions. This project presents a fake news detection framework using Natural Language Processing (NLP) and Machine Learning (ML) techniques. Using a labeled dataset of real and fake news articles, the study applies TF-IDF-based feature extraction and evaluates four classification algorithms—Logistic Regression, Support Vector Machine (SVM), Naïve Bayes, and Random Forest. The Random Forest model, after hyperparameter tuning, achieved the highest accuracy of 99.8% and an F1-score of 0.9979. The research also explores challenges like domain adaptation and model explainability, emphasizing the importance of ethical AI in combating misinformation. Results demonstrate that even lightweight ML models can be highly effective for fake news detection, offering scalable solutions for content moderation and public information integrity.
    Sub Work Type
    • Scholarly Paper/Essay (MA/MS)
    Publication Date
    • 2025-04
  • Added Creator Febin Clement
  • Added Creator Emily Mross
  • Updated Acknowledgments Show Changes
    Acknowledgments
    • Girish Subramanian
  • Added Creator Febin Clement
  • Added Creator Emily Mross
  • Added Masters_Project_Final_Draft__3_.pdf
  • Updated License Show Changes
    License
    • https://rightsstatements.org/page/InC/1.0/
  • Deleted Creator Febin Clement
  • Deleted Creator Emily Mross
  • Published Publisher Show Changes
    Publisher
    • ScholarSphere
  • Updated
  • Updated
  • Deleted Creator Emily Mross