Machine learning approaches for predicting health risk of cyanobacterial blooms in Northern European Lakes
dc.creator | Mellios N., Moe S.J., Laspidou C. | en |
dc.date.accessioned | 2023-01-31T08:58:46Z | |
dc.date.available | 2023-01-31T08:58:46Z | |
dc.date.issued | 2020 | |
dc.identifier | 10.3390/W12041191 | |
dc.identifier.issn | 20734441 | |
dc.identifier.uri | http://hdl.handle.net/11615/76506 | |
dc.description.abstract | Cyanobacterial blooms are considered a major threat to global water security with documented impacts on lake ecosystems and public health. Given that cyanobacteria possess highly adaptive traits that favor them to prevail under different and often complicated stressor regimes, predicting their abundance is challenging. A dataset from 822 Northern European lakes is used to determine which variables better explain the variation of cyanobacteria biomass (CBB) by means of stepwise multiple linear regression. Chlorophyll-a (Chl-a) and total nitrogen (TN) provided the best modelling structure for the entire dataset, while for subsets of shallow and deep lakes, Chl-a, mean depth, TN and TN/TP explained part of the variance in CBB. Path analysis was performed and corroborated these findings. Finally, CBB was translated to a categorical variable according to risk levels for human health associated with the use of lakes for recreational activities. Several machine learning methods, namely Decision Tree, K-Nearest Neighbors, Support-vector Machine and Random Forest, were applied showing a remarkable ability to predict the risk, while Random Forest parameters were tuned and optimized, achieving a 95.81% accuracy, exceeding the performance of all other machine learning methods tested. A confusion matrix analysis is performed for all machine learning methods, identifying the potential of each method to correctly predict CBB risk levels and assessing the extent of false alarms; random forest clearly outperforms the other methods with very promising results. © 2020 by the authors. | en |
dc.language.iso | en | en |
dc.source | Water (Switzerland) | en |
dc.source.uri | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85084666310&doi=10.3390%2fW12041191&partnerID=40&md5=9f9c78747a244496a7e04c908d158f0b | |
dc.subject | Decision trees | en |
dc.subject | Ecosystems | en |
dc.subject | Forecasting | en |
dc.subject | Health | en |
dc.subject | Health risks | en |
dc.subject | Lakes | en |
dc.subject | Linear regression | en |
dc.subject | Nearest neighbor search | en |
dc.subject | Random forests | en |
dc.subject | Risk assessment | en |
dc.subject | Support vector machines | en |
dc.subject | Categorical variables | en |
dc.subject | Confusion matrices | en |
dc.subject | Cyanobacterial blooms | en |
dc.subject | K-nearest neighbors | en |
dc.subject | Machine learning approaches | en |
dc.subject | Machine learning methods | en |
dc.subject | Recreational activities | en |
dc.subject | Stepwise multiple linear regression | en |
dc.subject | Learning systems | en |
dc.subject | algal bloom | en |
dc.subject | biomass | en |
dc.subject | cyanobacterium | en |
dc.subject | data set | en |
dc.subject | health risk | en |
dc.subject | machine learning | en |
dc.subject | nitrogen | en |
dc.subject | performance assessment | en |
dc.subject | risk assessment | en |
dc.subject | support vector machine | en |
dc.subject | Cyanobacteria | en |
dc.subject | MDPI AG | en |
dc.title | Machine learning approaches for predicting health risk of cyanobacterial blooms in Northern European Lakes | en |
dc.type | journalArticle | en |
Αρχεία σε αυτό το τεκμήριο
Αρχεία | Μέγεθος | Τύπος | Προβολή |
---|---|---|---|
Δεν υπάρχουν αρχεία που να σχετίζονται με αυτό το τεκμήριο. |