AI-Based Prediction Models for Transportation Infrastructure Asset Management Data Hub

Transportation infrastructure system networks are essential to sustain our economy, society, and quality of life. The Federal Highway Administration (FHWA) indicates the data-driven asset management (inspection, maintenance, and emergency response) of urban infrastructure (building, transportation, energy, and cyber) represents the future of work and identifies it as one of their strategic objectives. An accurate prediction of asphalt pavement condition is important to guide pavement maintenance practices. The existing models for pavement condition predictions are predominantly based on linear regressions or simple machine learning technics (e.g., the artificial neural network (ANN) and the random forest models). However, additional work on these models is needed to improve their basic assumptions, training efficiency, and interpretability. To this end, a new modeling approach is proposed in this project, which includes a Light Gradient Boosting Machine (LightGBM)-based ensemble learning model, coupled with the Shapley Additive Explanation (SHAP) method, to predict the International Roughness Index (IRI) of asphalt pavements. The SHAP method was applied to interpret the underlying influencing factors and their interactions. 20 features were initially identified as the model inputs, and 1,706 observations were extracted from the Long-Term Pavement Performance (LTPP) database. Two benchmark models, namely the Mechanistic-Empirical Pavement Design Guide (MEPDG) model and the ANN model, were used for comparison. The results showed that the developed model outperformed both benchmark models in terms of R-square, root mean square error, and the absolute error. Feature interpretation was performed to identify the top influencing factors of IRI, and the interaction effects between two features were also analyzed. The 20-feature model was further simplified based on the analysis result. The simplified model only required five features to efficiently and effectively predict IRI using the proposed LightGBM-based approach, which can reduce the workload in data collection and management for pavement engineers.

Files

Metadata

Work Title AI-Based Prediction Models for Transportation Infrastructure Asset Management Data Hub
Access
Open Access
Creators
  1. Yang Song
  2. Chenxi Chen
  3. Xianbiao Hu
  4. Jenny Liu
  5. David Yizhuan Wang
License MIT License
Work Type Report
Publication Date August 31, 2024
Deposited May 24, 2025

Versions

Analytics

Collections

This resource is currently not in any collection.

Work History

Version 1
published

  • Created
  • Updated
  • Updated
  • Updated Description, Publication Date Show Changes
    Description
    • Transportation infrastructure system networks are essential to sustain our economy, society, and quality of life. The Federal Highway Administration (FHWA) indicates the data-driven asset management (inspection, maintenance, and emergency response) of urban infrastructure (building, transportation, energy, and cyber) represents the future of work and identifies it as one of their strategic objectives. An accurate prediction of asphalt pavement condition is important to guide pavement maintenance practices. The existing models for pavement condition predictions are predominantly based on linear regressions or simple machine learning technics (e.g., the artificial neural network (ANN) and the random forest models). However, additional work on these models is needed to improve their basic assumptions, training efficiency, and interpretability. To this end, a new modeling approach is proposed in this project, which includes a Light Gradient Boosting Machine (LightGBM)-based ensemble learning model, coupled with the Shapley Additive Explanation (SHAP) method, to predict the International Roughness Index (IRI) of asphalt pavements. The SHAP method was applied to interpret the underlying influencing factors and their interactions. 20 features were initially identified as the model inputs, and 1,706 observations were extracted from the Long-Term Pavement Performance (LTPP) database. Two benchmark models, namely the Mechanistic-Empirical Pavement Design Guide (MEPDG) model and the ANN model, were used for comparison. The results showed that the developed model outperformed both benchmark models in terms of R-square, root mean square error, and the absolute error. Feature interpretation was performed to identify the top influencing factors of IRI, and the interaction effects between two features were also analyzed. The 20-feature model was further simplified based on the analysis result. The simplified model only required five features to efficiently and effectively predict IRI using the proposed LightGBM-based approach, which can reduce the workload in data collection and management for pavement engineers.
    Publication Date
    • 2024-08-31
  • Added Creator Yang Song
  • Added Creator Chenxi Chen
  • Added Creator Xianbiao Hu
  • Added Creator Jenny Liu
  • Added Creator David Yizhuan Wang
  • Added TriDurLE_Project_Report_Datahub.pdf
  • Updated License Show Changes
    License
    • https://opensource.org/licenses/MIT
  • Published
  • Updated