A Generalized Framework for Adopting Regression-Based Predictive Modeling in Manufacturing Environments

Akinsolu, Mobayode O. and Ziribi, Khalil (2023) A Generalized Framework for Adopting Regression-Based Predictive Modeling in Manufacturing Environments. Inventions, 8 (1). pp. 1-32. ISSN 2411-5134

GURO_575_inventions-08-00032-v2 (1).pdf - Published Version
Available under License Creative Commons Attribution.


Abstract

This paper illustrates the growing significance of data analysis in manufacturing environments through a review of relevant literature and proposes a generic framework to ease the adoption of regression-based supervised learning in such environments. To validate the practicality of the framework, several regression learning techniques are applied to an open-source multi-stage continuous-flow manufacturing process data set, typifying the inference-driven decision-making that informs the selection of regression learning methods for real-world manufacturing environments. The techniques are evaluated in terms of training time, prediction speed, predictive accuracy (R-squared value), and mean squared error, with all statistics reported over 50 independent runs for each of the two stages of the predictive modeling of the process. In terms of training time, k-NN20 (k-nearest neighbor with 20 neighbors) ranks first, with average and median values of 4.8 ms and 4.9 ms for the first stage and 4.2 ms and 4.3 ms for the second stage. In terms of prediction speed, DTR (decision tree regressor) ranks first, with average and median values of 5.6784×10^6 and 4.8691×10^6 observations per second (ob/s) for the first stage and 4.9929×10^6 and 5.8806×10^6 ob/s for the second stage.
In terms of R-squared value, BR (bagging regressor) ranks first for the first stage, with average and median values of 0.728 and 0.728, and RFR (random forest regressor) ranks first for the second stage, with average and median values of 0.746 and 0.746. In terms of mean squared error, BR again ranks first for the first stage, with average and median values of 2.7 and 2.7, and RFR ranks first for the second stage, with average and median values of 3.5 and 3.5. All methods are further ranked inferentially using the statistics of their performance metrics to identify the best method(s) for each stage. A Wilcoxon rank sum test is then used to statistically verify the inference-based rankings. DTR and k-NN20 are identified as the most suitable regression learning techniques for the multi-stage continuous-flow manufacturing process data used for experimentation.
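
The evaluation described in the abstract (per-run R-squared and mean squared error, summarized by average and median, then compared with a Wilcoxon rank sum test) can be sketched as follows. This is a minimal illustration in standard-library Python, not the authors' implementation. The function names are hypothetical, and the rank sum test uses the large-sample normal approximation without a tie correction to the variance, which is an assumption made here for brevity.

```python
import math
import statistics


def mean_squared_error(y_true, y_pred):
    """Average of the squared residuals."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def summarize(values):
    """Average and median of a metric collected over independent runs."""
    return statistics.mean(values), statistics.median(values)


def rank_sum_test(a, b):
    """Two-sided Wilcoxon rank sum test (normal approximation).

    Returns (z, p). Tied values receive the average of their ranks;
    the variance is NOT tie-corrected (simplifying assumption).
    """
    # Pool both samples, remembering which group each value came from.
    pooled = sorted([(v, 0) for v in a] + [(v, 1) for v in b])
    ranks = [0.0] * len(pooled)
    i = 0
    while i < len(pooled):
        j = i
        # Extend j over the run of values tied with pooled[i].
        while j + 1 < len(pooled) and pooled[j + 1][0] == pooled[i][0]:
            j += 1
        average_rank = (i + j) / 2 + 1  # ranks are 1-based
        for k in range(i, j + 1):
            ranks[k] = average_rank
        i = j + 1
    # Rank sum of the first sample and its null mean and std. deviation.
    w = sum(r for r, (_, group) in zip(ranks, pooled) if group == 0)
    n1, n2 = len(a), len(b)
    mu = n1 * (n1 + n2 + 1) / 2
    sigma = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12)
    z = (w - mu) / sigma
    p = math.erfc(abs(z) / math.sqrt(2))  # two-sided p-value
    return z, p
```

In such a workflow, each method's metric would be collected once per independent run, summarized with `summarize`, and two methods' per-run values passed to `rank_sum_test` to check whether an apparent ranking difference is statistically supported.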

Item Type: Article
Keywords: artificial intelligence, data, data analysis, machine learning, manufacturing, regression, regression learning, supervised learning
Divisions: Applied Science, Computing and Engineering
Depositing User: Hayley Dennis
Date Deposited: 19 Jun 2023 11:10
Last Modified: 21 Jun 2023 15:26
URI: https://glyndwr.repository.guildhe.ac.uk/id/eprint/18022
