dc.contributor.author | Plevris, Vagelis | |
dc.contributor.author | Solorzano, German | |
dc.contributor.author | Bakas, Nikolaos P. | |
dc.contributor.author | Ben Seghier, Mohamed El Amine | |
dc.date.accessioned | 2023-02-17T13:07:47Z | |
dc.date.available | 2023-02-17T13:07:47Z | |
dc.date.created | 2023-02-09T19:12:19Z | |
dc.date.issued | 2022-11-24 | |
dc.identifier.isbn | 9788412322286 | |
dc.identifier.uri | https://hdl.handle.net/11250/3051984 | |
dc.description.abstract | Performance metrics (Evaluation metrics or error metrics) are crucial components of regression analysis and machine learning-based prediction models. A performance metric can be defined as a logical and mathematical construct designed to measure how close the predicted outcome is to the actual result. A variety of performance metrics have been described and proposed in the literature. Knowledge about the metrics’ properties needs to be systematized to simplify their design and use. In this work, we examine various regression related metrics (14 in total) for continuous variables, including the most widely used ones, such as the (root) mean squared error, the mean absolute error, the Pearson correlation coefficient, and the coefficient of determination, among many others. We provide their mathematical formulations, as well as a discussion on their use, their characteristics, advantages, disadvantages, and limitations, through theoretical analysis and a detailed numerical example. The 10 unitless metrics are further investigated through a numerical analysis with Monte Carlo Simulation based on (i) random guessing and (ii) the addition of random noise with various noise ratios to the predicted values. Some of the metrics show a poor or inconsistent performance, while others exhibit good performance as evaluation measures of the “goodness of fit”. We highlight the importance of the usage of the right metrics to obtain good predictions in machine learning and regression models in general. | en_US |
dc.language.iso | eng | en_US |
dc.publisher | European Community on Computational Methods in Applied Sciences | en_US |
dc.relation.ispartof | 8th European Congress on Computational Methods in Applied Sciences and Engineering (ECCOMAS Congress 2022) | |
dc.relation.ispartofseries | ECCOMAS Congress;8th European Congress on Computational Methods in Applied Sciences and Engineering | |
dc.rights | Navngivelse-Ikkekommersiell-DelPåSammeVilkår 4.0 Internasjonal | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/deed.no | * |
dc.title | Investigation of performance metrics in regression analysis and machine learning-based prediction models | en_US |
dc.type | Conference object | en_US |
dc.description.version | publishedVersion | en_US |
cristin.ispublished | true | |
cristin.fulltext | original | |
cristin.qualitycode | 1 | |
dc.identifier.doi | https://doi.org/10.23967/eccomas.2022.155 | |
dc.identifier.cristin | 2124698 | |
dc.source.volume | 8 | en_US |
dc.source.issue | 8 | en_US |
dc.source.pagenumber | 1-25 | en_US |