Urban growth algorithms | Construpedia

Urban Growth Algorithms

Introduction

In statistics and machine learning, ensemble methods use multiple learning algorithms to obtain better predictive performance than could be obtained with any of the constituent learning algorithms alone.[1][2][3] Unlike a statistical ensemble in statistical mechanics, which is typically infinite, a machine learning ensemble consists of only a particular finite set of alternative models, but typically allows for a much more flexible structure between those alternatives.

Overview

Supervised learning algorithms perform the task of searching through a hypothesis space to find a suitable hypothesis that makes good predictions with a given problem.[4] Even if the hypothesis space contains very suitable hypotheses for a given problem, it can be very difficult to find a good one. Ensembles combine multiple hypotheses to form a (hopefully) better hypothesis. The term "ensemble" is usually reserved for methods that generate multiple hypotheses using the same base learner. The broader term multiple classifier systems also encompasses the hybridization of hypotheses that are not induced by the same base learner.

Evaluating the prediction of an ensemble usually requires more calculations than evaluating the prediction of a single model. In a sense, ensemble learning can be seen as a way to compensate for poor learning algorithms by performing many additional calculations. On the other hand, the alternative is to do a lot more learning in a non-ensemble system. An ensemble system may be more efficient in improving overall accuracy for the same increase in computing, storage, or communication resources using that increase in two or more methods than would be improved by increasing resource use for a single method. Fast algorithms, such as decision trees, are often used in ensemble methods (e.g., random forests), although slower algorithms can also benefit from ensemble techniques.

By analogy, ensemble techniques have also been used in unsupervised learning scenarios, for example in consensus clustering or anomaly detection.

set theory

Empirically, ensembles tend to give better results when there is significant diversity among the models.[5][6] Therefore, many ensemble methods attempt to promote diversity among the models they combine.[7][8] Although perhaps not intuitive, more random algorithms (such as randomized decision trees) can be used to produce a stronger ensemble than very deliberate algorithms (such as entropy-reduced decision trees).[9] However, it has been shown that the use of a variety of powerful learning algorithms is more effective than using techniques that attempt to simplify models to promote diversity.[10] It is possible to increase diversity in the model training phase by using correlation for regression tasks[11] or using information measures such as cross entropy for classification tasks.[12].

Urban Growth Algorithms

Introduction

Overview

By analogy, ensemble techniques have also been used in unsupervised learning scenarios, for example in consensus clustering or anomaly detection.

set theory

Common types of sets

Optimal Bayesian classifier

The optimal Bayesian classifier is a classification technique. This is a set of all the hypotheses in the hypothesis space. On average, no other ensemble can beat it.[16] The Naive Bayes classifier is a version of this that assumes the data is conditionally independent of class and makes the calculation more feasible. Each hypothesis is given a vote proportional to the probability that the training data set would be sampled from a system if that hypothesis were true. To facilitate finite-sized training data, the vote for each hypothesis is also multiplied by the a priori probability of that hypothesis. The optimal Bayes classifier can be expressed with the following equation:.

Where is the predicted class, is the set of all possible classes, is the hypothesis space, refers to a probability, and is the training data. As a set, the optimal Bayes classifier represents a hypothesis that is not necessarily in . However, the hypothesis represented by the optimal Bayes classifier is the optimal hypothesis in the set space (the space of all possible sets formed only by hypotheses in ).

This formula can be reformulated using Bayes' theorem, which says that the posterior probability is proportional to the probability multiplied by the prior probability:.

therefore,.

Bootstrap aggregation (bagging)

Bootstrap aggregation (bagging) consists of training an ensemble from bootstrap data sets. A bootstrap set is created by selecting from the original training data set with replacement. Therefore, a bootstrap set can contain an example given zero, once, or multiple times. Ensemble members can also have limits on features (for example, the "Node (computing)" nodes of a decision tree), to encourage exploration of diverse features.[17] Local variance information in bootstrap ensembles and feature considerations promote diversity in the ensemble, and can strengthen the ensemble.[18] To reduce overfitting, a member can be validated using the out-of-bag ensemble (examples not in the ensemble). bootstrap).[19].

Inference is performed by voting the predictions of the ensemble members, which is called aggregation. This is illustrated below with a set of four decision trees. Each tree classifies the query example. Since three of the four predict the positive class, the overall classification of the ensemble is positive. Random forests like the one shown are a common application of assembly.

Boosting

Boosting consists of training successive models by emphasizing training data misclassified by previously learned models. Initially, all data (D1) have equal weight and are used to learn a base model M1. Examples misclassified by M1 are assigned a higher weight than those correctly classified. This boosted data (D2) is used to train a second base model M2, and so on. The inference is made by voting.

In some cases, boosting has given better results than bagging, but it tends to overfit more. The most common application of boosting is Adaboost"), but some newer algorithms obtain better results.

Bayesian model averaging

Bayesian model averaging (BMA) makes predictions by averaging the predictions of models weighted by their a posteriori probabilities given the data.[20] BMA is known to often give better answers than a single model, obtained, for example, by stepwise regression&action=edit&redlink=1 "Stepwise regression (not yet written)"), especially when very different models have almost identical performance on the training set but otherwise they can perform very differently.

The issue with any use of Bayes' theorem is the prior, that is, the (perhaps subjective) probability that each model is the best for a given purpose. Conceptually, BMA can be used with any prior. The R packages ensembleBMA[21] and BMA[22] use the priority implicit in the Bayesian information criterion (BIC), following Raftery (1995).[23] The R package BAS supports the use of the priorities implicit in the Akaike information criterion (AIC) and other criteria on alternative models, as well as priorities on the coefficients.[24].

The difference between the BIC and the AIC is the strength of the preference for parsimony. BIC's penalty for model complexity is , while AIC's is . Large-sample asymptotic theory states that if a best model exists, then with increasing sample sizes the BIC is strongly consistent, that is, you will almost certainly find it, while the AIC may not, because the AIC may continue to place excessive posterior probability on models that are more complicated than necessary. On the other hand, AIC and AICc are asymptotically "efficient" (i.e., minimum mean square prediction error), while BIC is not.[25].

Haussler et al. (1994) showed that when BMA is used for classification, its expected error is at most twice the expected error of the Bayes optimal classifier.[26] Burnham and Anderson (1998, 2002) contributed greatly to introducing the basic ideas of Bayesian model averaging to a wider audience and popularizing the methodology.[27] The availability of software, including other free open source packages for R in addition to those mentioned above, helped. to make the methods accessible to a broader public.[28].

Bayesian combination of models

Bayesian model combining (BMC) is an algorithmic correction to Bayesian model averaging (BMA). Instead of sampling each model in the ensemble individually, it is sampled from the space of possible ensembles (with model weights drawn randomly from a Dirichlet distribution with uniform parameters). This modification overcomes BMA's tendency to converge and give all weight to a single model. Although BMC is somewhat more computationally expensive than BMA, it tends to produce much better results. It has been shown that BMC is better on average (with statistical significance) than BMA and bagging.[29].

Using Bayes' law to calculate model weights requires calculating the probability of the data based on each model. Typically, none of the models in the ensemble are exactly the distribution from which the training data was generated, so they all correctly receive a value close to zero for this term. This would work well if the ensemble was large enough to sample the entire model space, but it is rarely possible. Consequently, each pattern in the training data will cause the ensemble weight to shift toward the ensemble model that most closely matches the distribution of the training data. In essence, it boils down to an unnecessarily complex method of performing model selection.

The possible weights of a set can be visualized as if they were located in a simplex. At each vertex of the simplex, all weight is assigned to a single model in the ensemble. The BMA converges towards the vertex closest to the distribution of the training data. Instead, BMC converges toward the point where this distribution is projected onto the simplex. In other words, instead of selecting the model closest to the generated distribution, look for the combination of models closest to the generated distribution.

BMA results can often be approximated by using cross-validation to select the best model from a set of models. Similarly, BMC results can be approximated using cross-validation to select the best combination of sets from a random sampling of possible weights.

Bucket of models

A "model cube" is an ensemble technique in which a model selection algorithm is used to choose the best model for each problem. When tested on a single problem, a cube of models may not produce better results than the best model in the ensemble, but when tested across many problems, it will typically produce much better results, on average, than any model in the ensemble.

The most commonly used method for model selection is cross-validation (sometimes called a “baking contest”). It is described with the following pseudocode:

Selection by cross-validation can be summarized as: "try them all against the training set and choose the one that works best".[30].

Gating is a generalization of cross-validation selection. It consists of training another learning model to decide which of the cube models is the most appropriate to solve the problem. Often, a perceptron is used for the gating model. It can be used to choose the "best" model or to give a linear weight to the predictions of each model in the cube.

When using a cube of models with a large set of problems, it may be desirable to avoid training some of the models that take a long time to train. Milestone learning is a meta-learning approach "Meta-learning (computer science)" that tries to solve this problem. It involves training only the fast (but inaccurate) algorithms in the cube, and then using the performance of these algorithms to help determine which slow (but accurate) algorithm is most likely to obtain better results.[31].

Stacking

Stacking (sometimes called stacked generalization) involves training a model to combine the predictions of other learning algorithms. First, all other algorithms are trained using the available data, and then a combinator algorithm (final estimator) is trained to make a final prediction using all predictions from the other algorithms (base estimators) as additional inputs or using cross-validated predictions from the base estimators, which can avoid overfitting.[32] If an arbitrary combinator algorithm is used, stacking can theoretically represent any of the ensemble techniques described in this article, although in practice it often often does. A logistic regression model is used as a combinator.

Stacking typically gives better results than either model trained separately.[33] It has been used successfully in both supervised learning tasks (regression,[34] classification, and distance learning)[35] and unsupervised learning (density estimation).[36] It has also been used to estimate the error rate of bagging.[3][37] It has been reported to outperform Bayesian averaging. models.[38] The top two Netflix contest results used shuffling, which can be considered a form of stacking.[39].

Vote

Voting is another form of assembly. See, for example, the weighted majority algorithm (machine learning).

Applications of ensemble learning

Contenido

En los últimos años, debido a la creciente potencia computacional, que permite el entrenamiento en el aprendizaje de grandes conjuntos en un tiempo razonable, el número de aplicaciones de aprendizaje de conjuntos ha crecido cada vez más.[45] Algunas de las aplicaciones de los clasificadores de conjuntos incluyen:.

Remote sensing

Land cover mapping is one of the main applications of Earth observation satellite sensors, which use remote sensing and geospatial data, to identify materials and objects found on the surface of target areas. Generally, the target material classes include roads, buildings, rivers, lakes and vegetation.[46] To efficiently identify land cover objects, different ensemble learning approaches based on artificial neural networks are proposed,[47] kernel principal component analysis (KPCA),[48] boosted decision trees,[49] random forest[46][50] and automatic design of multiple classifier systems[51] are proposed to efficiently identify land cover objects. the land cover.

Change detection") is an image analysis problem that involves identifying locations where land cover has changed over time. Change detection is widely used in fields such as urban growth, forest and vegetation dynamics, land use, and disaster monitoring.[52] Early applications of ensemble classifiers in change detection were designed with majority voting,[53] Bayesian model averaging,[54] and maximum likelihood. later.[55] Given the growth of satellite data over time, the last decade has seen increased use of time series methods for continuous change detection from image stacks.[56] An example is a Bayesian ensemble change point detection method called BEAST, with the software available as the Rbeast package in R, Python and Matlab.[57].

Computer security

Distributed denial of service is one of the most threatening cyber attacks that an Internet service provider can suffer.[45] By combining the output of individual classifiers, ensemble classifiers reduce the total error of detecting and discriminating these types of attacks from legitimate flashes.[58].

The classification of malicious code such as computer viruses, computer worms, Trojans, ransomware and spyware using machine learning techniques is inspired by the problem of document categorization.[59] Ensemble learning systems have demonstrated adequate effectiveness in this area.[60][61].

An intrusion detection system monitors the computer network or computer systems to identify intrusion codes as an anomaly detection process. Ensemble learning successfully helps these supervisory systems reduce their total error.[62][63].

facial recognition

Facial recognition, which has recently become one of the most popular research areas of pattern recognition, deals with the identification or verification of a person using their digital images.[64].

Hierarchical ensembles based on the Gabor Fisher classifier and independent component analysis preprocessing techniques are some of the first ensembles used in this field.[65][66][67].

Emotion recognition

While speech recognition is mainly based on deep learning because most industry players in this field such as Google, Microsoft and IBM reveal that the core technology of their speech recognition is based on this approach, speech-based emotion recognition can also have satisfactory performance with ensemble learning.[68][69].

It is also being used successfully in facial emotion recognition").[70][71][72].

Fraud detection

Fraud detection deals with the identification of banking frauds such as money laundering, credit card fraud, and telecom fraud, which have vast research domains and machine learning applications. Since ensemble learning improves the robustness of modeling normal behavior, it has been proposed as an effective technique to detect these fraudulent cases and activities in banking and credit card systems.[73][74].

Financial decision making

The accuracy of business failure prediction is a very crucial issue in financial decision making. Therefore, different ensemble classifiers are proposed to predict financial crises and financial distress.[75] Likewise, in the trading-based manipulation problem, in which traders try to manipulate stock prices "Stock (Finance)") through buying and selling activities, ensemble classifiers are required to analyze changes in stock market data and detect suspicious symptoms of stock price manipulation.[75].

Medicine

Ensemble classifiers have been successfully applied in neuroscience, proteomics, and medical diagnostics, such as in the detection of neurocognitive disorders (e.g., Alzheimer's or myotonic dystrophy) from MRI data sets,[76][77][78] and in the classification of cervical cytology.[79][80].

References

[1] ↑ Opitz, D.; Maclin, R. (1 de agosto de 1999). «Popular Ensemble Methods: An Empirical Study». Journal of Artificial Intelligence Research (en inglés) 11: 169-198. ISSN 1076-9757. doi:10.1613/jair.614. Consultado el 5 de marzo de 2024.: https://jair.org/index.php/jair/article/view/10239
[2] ↑ Polikar, R. (2006). «"Ensemble based systems in decision making"». IEEE Circuits and Systems Magazine. doi:10.1109/MCAS.2006.1688199.: https://dx.doi.org/10.1109%2FMCAS.2006.1688199
[3] ↑ a b Rokach, L. (2010). «"Ensemble-based classifiers"». Artificial Intelligence Review. doi:10.1007/s10462-009-9124-7.: https://dx.doi.org/10.1007%2Fs10462-009-9124-7
[4] ↑ Blockeel H. (2011). «"Hypothesis Space"». Encyclopedia of Machine Learning. ISBN 978-0-387-30768-8. doi:10.1007/978-0-387-30164-8_373.: https://lirias.kuleuven.be/handle/123456789/298291
[5] ↑ Kuncheva, L. and Whitaker, C. (2003). «Measures of diversity in classifier ensembles». Machine Learning.: https://link.springer.com/content/pdf/10.1023/A:1022859003006.pdf
[6] ↑ Sollich, P. and Krogh, A. (1996). «Learning with ensembles: How overfitting can be useful». Advances in Neural Information Processing Systems, volume 8.: https://proceedings.neurips.cc/paper/1995/file/1019c8091693ef5c5f55970346633f92-Paper.pdf
[7] ↑ Brown, G. and Wyatt, J. and Harris, R. and Yao, X. (2005). «Diversity creation methods: a survey and categorisation». Information Fusion.: http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.421.349&rep=rep1&type=pdf
[8] ↑ Adeva, Juan Jose Garcıa; Beresi, Ulises Cervino; Calvo, Rafael A. (1 de diciembre de 2005). «Accuracy and Diversity in Ensembles of Text Categorisers». CLEI Electronic Journal (en inglés) 8 (2): 1:1-1:12. ISSN 0717-5000. doi:10.19153/cleiej.8.2.1. Consultado el 5 de marzo de 2024.: https://www.clei.org/cleiej/index.php/cleiej/article/view/319
[9] ↑ Ho, T. (1995). «Random Decision Forests». Proceedings of the Third International Conference on Document Analysis and Recognition.
[10] ↑ Gashler, M.; Giraud-Carrier, C.; Martinez, T. (2008). «Decision Tree Ensemble: Small Heterogeneous is Better Than Large Homogeneous». Seventh International Conference on Machine Learning and Applications. ISBN 978-0-7695-3495-4. doi:10.1109/ICMLA.2008.154.: http://axon.cs.byu.edu/papers/gashler2008icmla.pdf
[11] ↑ Liu, Y.; Yao, X. (1999-12). «Ensemble learning via negative correlation». Neural Networks 12 (10): 1399-1404. ISSN 0893-6080. doi:10.1016/s0893-6080(99)00073-8. Consultado el 5 de marzo de 2024. - [https://doi.org/10.1016/S0893-6080(99)00073-8](https://doi.org/10.1016/S0893-6080(99)00073-8)
[12] ↑ Shoham, Ron; Permuter, Haim (2019). «"Amended Cross-Entropy Cost: An Approach for Encouraging Diversity in Classification Ensemble (Brief Announcement)"». Cyber Security Cryptography and Machine Learning. ISBN 978-3-030-20950-6. doi:10.1007/978-3-030-20951-3_18.: https://dx.doi.org/10.1007%2F978-3-030-20951-3_18
[13] ↑ Morishita, Terufumi; Morio, Gaku; Horiguchi, Shota; Ozaki, Hiroaki; Nukaga, Nobuo (28 de junio de 2022). «Rethinking Fano’s Inequality in Ensemble Learning». Proceedings of the 39th International Conference on Machine Learning (en inglés) (PMLR): 15976-16016. Consultado el 6 de marzo de 2024.: https://proceedings.mlr.press/v162/morishita22a.html
[14] ↑ Bonab, Hamed R.; Can, Fazli (24 de octubre de 2016). «A Theoretical Framework on the Ideal Number of Classifiers for Online Ensembles in Data Streams». Proceedings of the 25th ACM International on Conference on Information and Knowledge Management. CIKM '16 (Association for Computing Machinery): 2053-2056. ISBN 978-1-4503-4073-1. doi:10.1145/2983323.2983907. Consultado el 6 de marzo de 2024.: https://doi.org/10.1145/2983323.2983907
[15] ↑ Bonab, Hamed; Can, Fazli (2017). "Less is More: A Comprehensive Framework for the Number of Components of Ensemble Classifiers".
[16] ↑ Tom M. Mitchell (1997). Machine Learning.
[17] ↑ Salman, R., Alzaatreh, A., Sulieman, H., & Faisal, S. (2021). «A Bootstrap Framework for Aggregating within and between Feature Selection Methods». Entropy (Basel, Switzerland). doi:10.3390/e23020200.: https://dx.doi.org/10.3390%2Fe23020200
[18] ↑ Breiman, L., Bagging Predictors (1996). Machine Learning. doi:10.1007/BF00058655.: https://dx.doi.org/10.1007%2FBF00058655
[19] ↑ Brodeur, Z. P., Herman, J. D., & Steinschneider, S. (2020). «Bootstrap aggregation and cross-validation methods to reduce overfitting in reservoir control policy search». Water Resources Research. doi:10.1029/2020WR027184.: https://dx.doi.org/10.1029%2F2020WR027184
[20] ↑ Hoeting, Jennifer A.; Madigan, David; Raftery, Adrian E.; Volinsky, Chris T. (1999-11). «Bayesian model averaging: a tutorial (with comments by M. Clyde, David Draper and E. I. George, and a rejoinder by the authors». Statistical Science 14 (4): 382-417. ISSN 0883-4237. doi:10.1214/ss/1009212519. Consultado el 6 de marzo de 2024.: https://projecteuclid.org/journals/statistical-science/volume-14/issue-4/Bayesian-model-averaging--a-tutorial-with-comments-by-M/10.1214/ss/1009212519.full
[21] ↑ Chris Fraley; Adrian Raftery; J. McLean Sloughter; Tilmann Gneiting. ensembleBMA: Probabilistic Forecasting using Ensembles and Bayesian Model Averaging. Wikidata Q98972500.
[22] ↑ Sevcikova, Hana (23 de noviembre de 2023), hanase/BMA, consultado el 7 de marzo de 2024 .: https://github.com/hanase/BMA
[23] ↑ Adrian Raftery (1995). «"Bayesian model selection in social research"». Sociological Methodology. ISSN 0081-1750. doi:10.2307/271063.: https://es.wikipedia.org//portal.issn.org/resource/issn/0081-1750
[24] ↑ Merlise A. Clyde; Michael L. Littman; Quanli Wang; Joyee Ghosh; Yingbo Li; Don van den Bergh. BAS: Bayesian Variable Selection and Model Averaging using Bayesian Adaptive Sampling. Wikidata Q98974089.
[25] ↑ Gerda Claeskens; Nils Lid Hjort (2008). «Model selection and model averaging». Cambridge University Press. Wikidata Q62568358.
[26] ↑ Haussler, David; Kearns, Michael; Schapire, Robert E. (1 de enero de 1994). «Bounds on the sample complexity of Bayesian learning using information theory and the VC dimension». Machine Learning (en inglés) 14 (1): 83-113. ISSN 1573-0565. doi:10.1007/BF00993163. Consultado el 7 de marzo de 2024.: https://doi.org/10.1007/BF00993163
[27] ↑ Kenneth P. Burnham; David R. Anderson (2002). «Model Selection and Inference: A practical information-theoretic approach,». Springer Science+Business Media. Wikidata Q76889160.
[28] ↑ El artículo de Wikiversity sobre Searching R Packages menciona varias formas de encontrar paquetes disponibles para algo como esto. Por ejemplo, "sos::findFn('{Bayesian model averaging}')" desde dentro de R buscará archivos de ayuda en paquetes contribuidos que incluyan el término de búsqueda y abrirá dos pestañas en el navegador por defecto. La primera listará todos los archivos de ayuda encontrados ordenados por paquete. La segunda resume los paquetes encontrados, ordenados por la aparente fuerza de la coincidencia.
[29] ↑ Monteith, Kristine; Carroll, James; Seppi, Kevin; Martinez, Tony (2011). «Turning Bayesian Model Averaging into Bayesian Model Combination». Proceedings of the International Joint Conference on Neural Networks IJCNN'11.: http://axon.cs.byu.edu/papers/Kristine.ijcnn2011.pdf
[30] ↑ «CiteSeerX». CiteSeerX (en inglés). Consultado el 7 de marzo de 2024.: https://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.108.6096
[31] ↑ Bensusan, Hilan; Giraud-Carrier, Christophe (2000). «"Discovering Task Neighbourhoods through Landmark Learning Performances». Principles of Data Mining and Knowledge Discovery. Lecture Notes in Computer Science. Vol. 1910. ISBN 978-3-540-41066-9. doi:10.1007/3-540-45372-5_32.: https://link.springer.com/content/pdf/10.1007/3-540-45372-5_32.pdf
[32] ↑ «1.11. Ensembles: Gradient boosting, random forests, bagging, voting, stacking». scikit-learn (en inglés). Consultado el 7 de marzo de 2024.: https://scikit-learn/stable/modules/ensemble.html
[33] ↑ Wolpert (1992). «"Stacked Generalization"». Neural Networks. doi:10.1016/s0893-6080(05)80023-1.: https://dx.doi.org/10.1016%2Fs0893-6080%2805%2980023-1
[34] ↑ Breiman, Leo (1 de julio de 1996). «Stacked regressions». Machine Learning (en inglés) 24 (1): 49-64. ISSN 1573-0565. doi:10.1007/BF00117832. Consultado el 7 de marzo de 2024.: https://doi.org/10.1007/BF00117832
[35] ↑ Ozay, M.; Yarman Vural, F. T. A New Fuzzy Stacked Generalization Technique and Analysis of its Performance.
[36] ↑ Smyth, Padhraic; Wolpert, David (1999). «Linearly Combining Density Estimators via Stacking». Machine Learning. doi:10.1023/A:1007511322260.: https://link.springer.com/content/pdf/10.1023/A:1007511322260.pdf
[37] ↑ Wolpert, David H.; MacReady, William G. (1999). «"An Efficient Method to Estimate Bagging's Generalization Error"». Machine Learning. doi:10.1023/A:1007519102914.: https://link.springer.com/content/pdf/10.1023/A:1007519102914.pdf
[38] ↑ Clarke, B. (2003). «Bayes model averaging and stacking when model approximation error cannot be ignored». Journal of Machine Learning Research.: https://www.jmlr.org/papers/volume4/clarke03a/clarke03a.pdf
[39] ↑ Sill, J.; Takacs, G.; Mackey, L.; Lin, D. (2009). "Feature-Weighted Linear Stacking".
[40] ↑ Amini, Shahram M.; Parmeter, Christopher F. (2011). «Bayesian model averaging in R». Journal of Economic and Social Measurement. doi:10.3233/JEM-2011-0350.: https://core.ac.uk/download/pdf/6494889.pdf
[41] ↑ Hofmarcher, Martin Feldkircher and Stefan Zeugner and Paul (9 de agosto de 2022), BMS: Bayesian Model Averaging Library, consultado el 7 de marzo de 2024 .: https://cran.r-project.org/web/packages/BMS/index.html
[42] ↑ Clyde (ORCID=0000-0002-3595-1872), Merlise; Littman, Michael; Ghosh, Joyee; Li, Yingbo; Bersson, Betsy; Bergh, Don van de; Wang, Quanli (6 de diciembre de 2023), BAS: Bayesian Variable Selection and Model Averaging using Bayesian Adaptive Sampling, consultado el 7 de marzo de 2024 .: https://cran.r-project.org/web/packages/BAS/index.html
[43] ↑ Raftery, Adrian; Hoeting, Jennifer; Volinsky, Chris; Painter, Ian; Yeung, Ka Yee (22 de abril de 2022), BMA: Bayesian Model Averaging, consultado el 7 de marzo de 2024 .: https://cran.r-project.org/web/packages/BMA/index.html
[44] ↑ «Classification Ensembles - MATLAB & Simulink - MathWorks United Kingdom». uk.mathworks.com. Consultado el 7 de marzo de 2024.: https://uk.mathworks.com/help/stats/classification-ensembles.html
[45] ↑ a b Woźniak, Michał; Graña, Manuel; Corchado, Emilio (2014). «A survey of multiple classifier systems as hybrid systems». Information Fusion. doi:10.1016/j.inffus.2013.04.006.: https://dx.doi.org/10.1016%2Fj.inffus.2013.04.006
[46] ↑ a b Rodriguez-Galiano, V.F.; Ghimire, B.; Rogan, J.; Chica-Olmo, M.; Rigol-Sanchez, J.P. (2012). «"An assessment of the effectiveness of a random forest classifier for land-cover classification"». ISPRS Journal of Photogrammetry and Remote Sensing. doi:10.1016/j.isprsjprs.2011.11.002.: https://dx.doi.org/10.1016%2Fj.isprsjprs.2011.11.002
[47] ↑ Giacinto, Giorgio; Roli, Fabio (2001). «Design of effective neural network ensembles for image classification purposes"». Image and Vision Computing. doi:10.1016/S0262-8856(01)00045-2.: https://dx.doi.org/10.1016%2FS0262-8856%2801%2900045-2
[48] ↑ Xia, Junshi; Yokoya, Naoto; Iwasaki, Yakira (2017). «A novel ensemble classifier of hyperspectral and LiDAR data using morphological features"». 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ISBN 978-1-5090-4117-6. doi:10.1109/ICASSP.2017.7953345.: https://dx.doi.org/10.1109%2FICASSP.2017.7953345
[49] ↑ Mochizuki, S.; Murakami, T. (2012). «Accuracy comparison of land cover mapping using the object-oriented image classification with machine learning algorithms"». 33rd Asian Conference on Remote Sensing 2012, ACRS 2012.
[50] ↑ Liu, Dan; Toman, Elizabeth; Fuller, Zane; Chen, Gang; Londo, Alexis; Xuesong, Zhang; Kaiguang, Zhao (2018). «"Integration of historical map and aerial imagery to characterize long-term land-use change and landscape dynamics: An object-based analysis via Random Forests». Ecological Indicators. doi:10.1016/j.ecolind.2018.08.004.: https://pages.charlotte.edu/gang-chen/wp-content/uploads/sites/184/2018/08/Liu_2018_Intigration-historical-map-aerial-imagery-LCLUC.pdf
[51] ↑ Giacinto, G.; Roli, F.; Fumera, G. (2000). «"Design of effective multiple classifier systems by clustering of classifiers".». Proceedings 15th International Conference on Pattern Recognition. ICPR-2000. Vol. 2. ISBN 978-0-7695-0750-7. doi:10.1109/ICPR.2000.906039.: https://dx.doi.org/10.1109%2FICPR.2000.906039
[52] ↑ Du, Peijun; Liu, Sicong; Xia, Junshi; Zhao, Yindi (2013). «Information fusion techniques for change detection from multi-temporal remote sensing images». Information Fusion. doi:10.1016/j.inffus.2012.05.003.: https://dx.doi.org/10.1016%2Fj.inffus.2012.05.003
[53] ↑ Defined by Bruzzone et al. 2002 como "La clase de datos que recibe el mayor número de votos se toma como clase del patrón de entrada", se trata de mayoría simple, más exactamente descrita como votación por pluralidad.
[54] ↑ «1-s2.0-S0034425719301853-main.pdf». Google Docs. Consultado el 7 de marzo de 2024.: https://drive.google.com/file/d/1MFZ0FpK1NwTieVSAf5jicLgl85Lm48uh/view?usp=embed_facebook
[55] ↑ Bruzzone, Lorenzo; Cossu, Roberto; Vernazza, Gianni (2002). «"Combining parametric and non-parametric algorithms for a partially unsupervised classification of multitemporal remote-sensing images». Information Fusion. doi:10.1016/S1566-2535(02)00091-X.: http://eprints.biblio.unitn.it/105/1/24.pdf
[56] ↑ Mugiraneza, Theodomir; Nascetti, Andrea; Ban, Yifang (2020-01). «Continuous Monitoring of Urban Land Cover Change Trajectories with Landsat Time Series and LandTrendr-Google Earth Engine Cloud Computing». Remote Sensing (en inglés) 12 (18): 2883. ISSN 2072-4292. doi:10.3390/rs12182883. Consultado el 7 de marzo de 2024.: https://www.mdpi.com/2072-4292/12/18/2883
[57] ↑ zhaokg (6 de marzo de 2024), zhaokg/Rbeast, consultado el 7 de marzo de 2024 .: https://github.com/zhaokg/Rbeast
[58] ↑ Raj Kumar, P. Arun; Selvakumar, S. (2011). «Distributed denial of service attack detection using an ensemble of neural classifier». Computer Communications. doi:10.1016/j.comcom.2011.01.012.: https://dx.doi.org/10.1016%2Fj.comcom.2011.01.012
[59] ↑ Shabtai, Asaf; Moskovitch, Robert; Elovici, Yuval; Glezer, Chanan (2009). «"Detection of malicious code by applying machine learning classifiers on static features: A state-of-the-art survey"». Information Security Technical Report. doi:10.1016/j.istr.2009.03.003.: https://dx.doi.org/10.1016%2Fj.istr.2009.03.003
[60] ↑ Zhang, Boyun; Yin, Jianping; Hao, Jingbo; Zhang, Dingxing; Wang, Shulin (2007). «Malicious Codes Detection Based on Ensemble Learning». Autonomic and Trusted Computing. Lecture Notes in Computer Science. ISBN 978-3-540-73546-5. doi:10.1007/978-3-540-73547-2_48.: https://dx.doi.org/10.1007%2F978-3-540-73547-2_48
[61] ↑ Menahem, Eitan; Shabtai, Asaf; Rokach, Lior; Elovici, Yuval (2009). «"Improving malware detection by applying multi-inducer ensemble"». Computational Statistics & Data Analysis. doi:10.1016/j.csda.2008.10.015.: https://dx.doi.org/10.1016%2Fj.csda.2008.10.015
[62] ↑ Locasto, Michael E.; Wang, Ke; Keromytis, Angeles D.; Salvatore, J. Stolfo (2005). «"FLIPS: Hybrid Adaptive Intrusion Prevention"». Recent Advances in Intrusion Detection. Lecture Notes in Computer Science. PMID 978-3-540-31778-4 |pmid= incorrecto (ayuda). doi:10.1007/11663812_5.: https://es.wikipedia.org//www.ncbi.nlm.nih.gov/pubmed/978-3-540-31778-4
[63] ↑ Giacinto, Giorgio; Perdisci, Roberto; Del Rio, Mauro; Roli, Fabio (2008). «"Intrusion detection in computer networks by a modular ensemble of one-class classifiers"». Information Fusion. doi:10.1016/j.inffus.2006.10.002.: https://dx.doi.org/10.1016%2Fj.inffus.2006.10.002
[64] ↑ Mu, Xiaoyan; Lu, Jiangfeng; Watta, Paul; Hassoun, Mohamad H. (2009). «"Weighted voting-based ensemble classifiers with application to human face recognition and voice recognition".». 2009 International Joint Conference on Neural Networks. doi:10.1109/IJCNN.2009.5178708.: https://dx.doi.org/10.1109%2FIJCNN.2009.5178708
[65] ↑ Yu, Su; Shan, Shiguang; Chen, Xilin; Gao, Wen (2006). «Hierarchical ensemble of Gabor Fisher classifier for face recognition». 7th International Conference on Automatic Face and Gesture Recognition (FGR06). ISBN 978-0-7695-2503-7. doi:10.1109/FGR.2006.64.: https://dx.doi.org/10.1109%2FFGR.2006.64
[66] ↑ Su, Y.; Shan, S.; Chen, X.; Gao, W. (2006). «"Patch-Based Gabor Fisher Classifier for Face Recognition"». 18th International Conference on Pattern Recognition (ICPR'06). ISBN 978-0-7695-2521-1. doi:10.1109/ICPR.2006.917.: https://dx.doi.org/10.1109%2FICPR.2006.917
[67] ↑ Liu, Yang; Lin, Yongzheng; Chen, Yuehui (2008). «"Ensemble Classification Based on ICA for Face Recognition".». 2008 Congress on Image and Signal Processing. ISBN 978-0-7695-3119-9. doi:10.1109/CISP.2008.581.: https://dx.doi.org/10.1109%2FCISP.2008.581
[68] ↑ Rieger, Steven A.; Muraleedharan, Rajani; Ramachandran, Ravi P. (2014). «"Speech based emotion recognition using spectral feature extraction and an ensemble of KNN classifiers"». The 9th International Symposium on Chinese Spoken Language Processing. ISBN 978-1-4799-4219-0. doi:10.1109/ISCSLP.2014.6936711.: https://dx.doi.org/10.1109%2FISCSLP.2014.6936711
[69] ↑ Krajewski, Jarek; Batliner, Anton; Kessel, Silke (2010). «"Comparing Multiple Classifiers for Speech-Based Detection of Self-Confidence - A Pilot Study». 2010 20th International Conference on Pattern Recognition. ISBN 978-1-4244-7542-1. doi:10.1109/ICPR.2010.905.: https://dx.doi.org/10.1109%2FICPR.2010.905
[70] ↑ Rani, P. Ithaya; Muneeswaran, K. (2016). «"Recognize the facial emotion in video sequences using eye and mouth temporal Gabor features». Multimedia Tools and Applications. doi:10.1007/s11042-016-3592-y.: https://dx.doi.org/10.1007%2Fs11042-016-3592-y
[71] ↑ Rani, P. Ithaya; Muneeswaran, K. (2016). «"Facial Emotion Recognition Based on Eye and Mouth Regions".». International Journal of Pattern Recognition and Artificial Intelligence. doi:10.1142/S021800141655020X.: https://dx.doi.org/10.1142%2FS021800141655020X
[72] ↑ RANI, P. ITHAYA; MUNEESWARAN, K. (28 de marzo de 2018). «Emotion recognition based on facial components». Sādhanā (en inglés) 43 (3): 48. ISSN 0973-7677. doi:10.1007/s12046-018-0801-6. Consultado el 8 de marzo de 2024.: https://doi.org/10.1007/s12046-018-0801-6
[73] ↑ Louzada, Francisco; Ara, Anderson (2012). «Bagging k-dependence probabilistic networks: An alternative powerful fraud detection tool». Expert Systems with Applications. doi:10.1016/j.eswa.2012.04.024.: https://dx.doi.org/10.1016%2Fj.eswa.2012.04.024
[74] ↑ Sundarkumar, G. Ganesh; Ravi, Vadlamani (2015). «"A novel hybrid undersampling method for mining unbalanced datasets in banking and insurance"». Engineering Applications of Artificial Intelligence. doi:10.1016/j.engappai.2014.09.019.: https://dx.doi.org/10.1016%2Fj.engappai.2014.09.019
[75] ↑ a b Kim, Yoonseong; Sohn, So Young (2012). «"Stock fraud detection using peer group analysis"». Expert Systems with Applications. doi:10.1016/j.eswa.2012.02.025.: https://dx.doi.org/10.1016%2Fj.eswa.2012.02.025
[76] ↑ Savio, A.; García-Sebastián, M.T.; Chyzyk, D.; Hernandez, C.; Graña, M.; Sistiaga, A.; López de Munain, A.; Villanúa, J. (2011). «"Neurocognitive disorder detection based on feature vectors extracted from VBM analysis of structural MRI".». Computers in Biology and Medicine. doi:10.1016/j.compbiomed.2011.05.010.: https://dx.doi.org/10.1016%2Fj.compbiomed.2011.05.010
[77] ↑ Ayerdi, B.; Savio, A.; Graña, M. (2013). «Meta-ensembles of Classifiers for Alzheimer's Disease Detection Using Independent ROI Features». Natural and Artificial Computation in Engineering and Medical Applications. Lecture Notes in Computer Science. ISBN 978-3-642-38621-3. doi:10.1007/978-3-642-38622-0_13.: https://dx.doi.org/10.1007%2F978-3-642-38622-0_13
[78] ↑ Gu, Quan; Ding, Yong-Sheng; Zhang, Tong-Liang (2015). «"An ensemble classifier based prediction of G-protein-coupled receptor classes in low homology"». Neurocomputing. doi:10.1016/j.neucom.2014.12.013.: https://dx.doi.org/10.1016%2Fj.neucom.2014.12.013
[79] ↑ Xue, Dan; Zhou, Xiaomin; Li, Chen; Yao, Yudong; Rahaman, Md Mamunur; Zhang, Jinghua; Chen, Hao; Zhang, Jinpeng et al. (2020). «An Application of Transfer Learning and Ensemble Learning Techniques for Cervical Histopathology Image Classification». IEEE Access 8: 104603-104618. ISSN 2169-3536. doi:10.1109/ACCESS.2020.2999816. Consultado el 8 de marzo de 2024. Se sugiere usar |número-autores= (ayuda).: https://ieeexplore.ieee.org/document/9107128/
[80] ↑ Manna, Ankur; Kundu, Rohit; Kaplun, Dmitrii; Sinitca, Aleksandr; Sarkar, Ram (2021). «"A fuzzy rank-based ensemble of CNN models for classification of cervical cytology». Scientific Reports. PMID 34267261. doi:10.1038/s41598-021-93783-8.: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8282795

Common types of sets

Optimal Bayesian classifier

This formula can be reformulated using Bayes' theorem, which says that the posterior probability is proportional to the probability multiplied by the prior probability:.

therefore,.

Bootstrap aggregation (bagging)

Boosting

In some cases, boosting has given better results than bagging, but it tends to overfit more. The most common application of boosting is Adaboost"), but some newer algorithms obtain better results.

Bayesian model averaging

Bayesian combination of models

Bucket of models

The most commonly used method for model selection is cross-validation (sometimes called a “baking contest”). It is described with the following pseudocode:

Selection by cross-validation can be summarized as: "try them all against the training set and choose the one that works best".[30].

Stacking

Vote

Voting is another form of assembly. See, for example, the weighted majority algorithm (machine learning).

Applications of ensemble learning

Contenido

Remote sensing

Computer security

facial recognition

Facial recognition, which has recently become one of the most popular research areas of pattern recognition, deals with the identification or verification of a person using their digital images.[64].

Hierarchical ensembles based on the Gabor Fisher classifier and independent component analysis preprocessing techniques are some of the first ensembles used in this field.[65][66][67].

Emotion recognition

It is also being used successfully in facial emotion recognition").[70][71][72].

Fraud detection

Financial decision making

Medicine

References

[1] ↑ Opitz, D.; Maclin, R. (1 de agosto de 1999). «Popular Ensemble Methods: An Empirical Study». Journal of Artificial Intelligence Research (en inglés) 11: 169-198. ISSN 1076-9757. doi:10.1613/jair.614. Consultado el 5 de marzo de 2024.: https://jair.org/index.php/jair/article/view/10239
[2] ↑ Polikar, R. (2006). «"Ensemble based systems in decision making"». IEEE Circuits and Systems Magazine. doi:10.1109/MCAS.2006.1688199.: https://dx.doi.org/10.1109%2FMCAS.2006.1688199
[3] ↑ a b Rokach, L. (2010). «"Ensemble-based classifiers"». Artificial Intelligence Review. doi:10.1007/s10462-009-9124-7.: https://dx.doi.org/10.1007%2Fs10462-009-9124-7
[4] ↑ Blockeel H. (2011). «"Hypothesis Space"». Encyclopedia of Machine Learning. ISBN 978-0-387-30768-8. doi:10.1007/978-0-387-30164-8_373.: https://lirias.kuleuven.be/handle/123456789/298291
[5] ↑ Kuncheva, L. and Whitaker, C. (2003). «Measures of diversity in classifier ensembles». Machine Learning.: https://link.springer.com/content/pdf/10.1023/A:1022859003006.pdf
[6] ↑ Sollich, P. and Krogh, A. (1996). «Learning with ensembles: How overfitting can be useful». Advances in Neural Information Processing Systems, volume 8.: https://proceedings.neurips.cc/paper/1995/file/1019c8091693ef5c5f55970346633f92-Paper.pdf
[7] ↑ Brown, G. and Wyatt, J. and Harris, R. and Yao, X. (2005). «Diversity creation methods: a survey and categorisation». Information Fusion.: http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.421.349&rep=rep1&type=pdf
[8] ↑ Adeva, Juan Jose Garcıa; Beresi, Ulises Cervino; Calvo, Rafael A. (1 de diciembre de 2005). «Accuracy and Diversity in Ensembles of Text Categorisers». CLEI Electronic Journal (en inglés) 8 (2): 1:1-1:12. ISSN 0717-5000. doi:10.19153/cleiej.8.2.1. Consultado el 5 de marzo de 2024.: https://www.clei.org/cleiej/index.php/cleiej/article/view/319
[9] ↑ Ho, T. (1995). «Random Decision Forests». Proceedings of the Third International Conference on Document Analysis and Recognition.
[10] ↑ Gashler, M.; Giraud-Carrier, C.; Martinez, T. (2008). «Decision Tree Ensemble: Small Heterogeneous is Better Than Large Homogeneous». Seventh International Conference on Machine Learning and Applications. ISBN 978-0-7695-3495-4. doi:10.1109/ICMLA.2008.154.: http://axon.cs.byu.edu/papers/gashler2008icmla.pdf
[11] ↑ Liu, Y.; Yao, X. (1999-12). «Ensemble learning via negative correlation». Neural Networks 12 (10): 1399-1404. ISSN 0893-6080. doi:10.1016/s0893-6080(99)00073-8. Consultado el 5 de marzo de 2024. - [https://doi.org/10.1016/S0893-6080(99)00073-8](https://doi.org/10.1016/S0893-6080(99)00073-8)
[12] ↑ Shoham, Ron; Permuter, Haim (2019). «"Amended Cross-Entropy Cost: An Approach for Encouraging Diversity in Classification Ensemble (Brief Announcement)"». Cyber Security Cryptography and Machine Learning. ISBN 978-3-030-20950-6. doi:10.1007/978-3-030-20951-3_18.: https://dx.doi.org/10.1007%2F978-3-030-20951-3_18
[13] ↑ Morishita, Terufumi; Morio, Gaku; Horiguchi, Shota; Ozaki, Hiroaki; Nukaga, Nobuo (28 de junio de 2022). «Rethinking Fano’s Inequality in Ensemble Learning». Proceedings of the 39th International Conference on Machine Learning (en inglés) (PMLR): 15976-16016. Consultado el 6 de marzo de 2024.: https://proceedings.mlr.press/v162/morishita22a.html
[14] ↑ Bonab, Hamed R.; Can, Fazli (24 de octubre de 2016). «A Theoretical Framework on the Ideal Number of Classifiers for Online Ensembles in Data Streams». Proceedings of the 25th ACM International on Conference on Information and Knowledge Management. CIKM '16 (Association for Computing Machinery): 2053-2056. ISBN 978-1-4503-4073-1. doi:10.1145/2983323.2983907. Consultado el 6 de marzo de 2024.: https://doi.org/10.1145/2983323.2983907
[15] ↑ Bonab, Hamed; Can, Fazli (2017). "Less is More: A Comprehensive Framework for the Number of Components of Ensemble Classifiers".
[16] ↑ Tom M. Mitchell (1997). Machine Learning.
[17] ↑ Salman, R., Alzaatreh, A., Sulieman, H., & Faisal, S. (2021). «A Bootstrap Framework for Aggregating within and between Feature Selection Methods». Entropy (Basel, Switzerland). doi:10.3390/e23020200.: https://dx.doi.org/10.3390%2Fe23020200
[18] ↑ Breiman, L., Bagging Predictors (1996). Machine Learning. doi:10.1007/BF00058655.: https://dx.doi.org/10.1007%2FBF00058655
[19] ↑ Brodeur, Z. P., Herman, J. D., & Steinschneider, S. (2020). «Bootstrap aggregation and cross-validation methods to reduce overfitting in reservoir control policy search». Water Resources Research. doi:10.1029/2020WR027184.: https://dx.doi.org/10.1029%2F2020WR027184
[20] ↑ Hoeting, Jennifer A.; Madigan, David; Raftery, Adrian E.; Volinsky, Chris T. (1999-11). «Bayesian model averaging: a tutorial (with comments by M. Clyde, David Draper and E. I. George, and a rejoinder by the authors». Statistical Science 14 (4): 382-417. ISSN 0883-4237. doi:10.1214/ss/1009212519. Consultado el 6 de marzo de 2024.: https://projecteuclid.org/journals/statistical-science/volume-14/issue-4/Bayesian-model-averaging--a-tutorial-with-comments-by-M/10.1214/ss/1009212519.full
[21] ↑ Chris Fraley; Adrian Raftery; J. McLean Sloughter; Tilmann Gneiting. ensembleBMA: Probabilistic Forecasting using Ensembles and Bayesian Model Averaging. Wikidata Q98972500.
[22] ↑ Sevcikova, Hana (23 de noviembre de 2023), hanase/BMA, consultado el 7 de marzo de 2024 .: https://github.com/hanase/BMA
[23] ↑ Adrian Raftery (1995). «"Bayesian model selection in social research"». Sociological Methodology. ISSN 0081-1750. doi:10.2307/271063.: https://es.wikipedia.org//portal.issn.org/resource/issn/0081-1750
[24] ↑ Merlise A. Clyde; Michael L. Littman; Quanli Wang; Joyee Ghosh; Yingbo Li; Don van den Bergh. BAS: Bayesian Variable Selection and Model Averaging using Bayesian Adaptive Sampling. Wikidata Q98974089.
[25] ↑ Gerda Claeskens; Nils Lid Hjort (2008). «Model selection and model averaging». Cambridge University Press. Wikidata Q62568358.
[26] ↑ Haussler, David; Kearns, Michael; Schapire, Robert E. (1 de enero de 1994). «Bounds on the sample complexity of Bayesian learning using information theory and the VC dimension». Machine Learning (en inglés) 14 (1): 83-113. ISSN 1573-0565. doi:10.1007/BF00993163. Consultado el 7 de marzo de 2024.: https://doi.org/10.1007/BF00993163
[27] ↑ Kenneth P. Burnham; David R. Anderson (2002). «Model Selection and Inference: A practical information-theoretic approach,». Springer Science+Business Media. Wikidata Q76889160.
[28] ↑ El artículo de Wikiversity sobre Searching R Packages menciona varias formas de encontrar paquetes disponibles para algo como esto. Por ejemplo, "sos::findFn('{Bayesian model averaging}')" desde dentro de R buscará archivos de ayuda en paquetes contribuidos que incluyan el término de búsqueda y abrirá dos pestañas en el navegador por defecto. La primera listará todos los archivos de ayuda encontrados ordenados por paquete. La segunda resume los paquetes encontrados, ordenados por la aparente fuerza de la coincidencia.
[29] ↑ Monteith, Kristine; Carroll, James; Seppi, Kevin; Martinez, Tony (2011). «Turning Bayesian Model Averaging into Bayesian Model Combination». Proceedings of the International Joint Conference on Neural Networks IJCNN'11.: http://axon.cs.byu.edu/papers/Kristine.ijcnn2011.pdf
[30] ↑ «CiteSeerX». CiteSeerX (en inglés). Consultado el 7 de marzo de 2024.: https://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.108.6096
[31] ↑ Bensusan, Hilan; Giraud-Carrier, Christophe (2000). «"Discovering Task Neighbourhoods through Landmark Learning Performances». Principles of Data Mining and Knowledge Discovery. Lecture Notes in Computer Science. Vol. 1910. ISBN 978-3-540-41066-9. doi:10.1007/3-540-45372-5_32.: https://link.springer.com/content/pdf/10.1007/3-540-45372-5_32.pdf
[32] ↑ «1.11. Ensembles: Gradient boosting, random forests, bagging, voting, stacking». scikit-learn (en inglés). Consultado el 7 de marzo de 2024.: https://scikit-learn/stable/modules/ensemble.html
[33] ↑ Wolpert (1992). «"Stacked Generalization"». Neural Networks. doi:10.1016/s0893-6080(05)80023-1.: https://dx.doi.org/10.1016%2Fs0893-6080%2805%2980023-1
[34] ↑ Breiman, Leo (1 de julio de 1996). «Stacked regressions». Machine Learning (en inglés) 24 (1): 49-64. ISSN 1573-0565. doi:10.1007/BF00117832. Consultado el 7 de marzo de 2024.: https://doi.org/10.1007/BF00117832
[35] ↑ Ozay, M.; Yarman Vural, F. T. A New Fuzzy Stacked Generalization Technique and Analysis of its Performance.
[36] ↑ Smyth, Padhraic; Wolpert, David (1999). «Linearly Combining Density Estimators via Stacking». Machine Learning. doi:10.1023/A:1007511322260.: https://link.springer.com/content/pdf/10.1023/A:1007511322260.pdf
[37] ↑ Wolpert, David H.; MacReady, William G. (1999). «"An Efficient Method to Estimate Bagging's Generalization Error"». Machine Learning. doi:10.1023/A:1007519102914.: https://link.springer.com/content/pdf/10.1023/A:1007519102914.pdf
[38] ↑ Clarke, B. (2003). «Bayes model averaging and stacking when model approximation error cannot be ignored». Journal of Machine Learning Research.: https://www.jmlr.org/papers/volume4/clarke03a/clarke03a.pdf
[39] ↑ Sill, J.; Takacs, G.; Mackey, L.; Lin, D. (2009). "Feature-Weighted Linear Stacking".
[40] ↑ Amini, Shahram M.; Parmeter, Christopher F. (2011). «Bayesian model averaging in R». Journal of Economic and Social Measurement. doi:10.3233/JEM-2011-0350.: https://core.ac.uk/download/pdf/6494889.pdf
[41] ↑ Hofmarcher, Martin Feldkircher and Stefan Zeugner and Paul (9 de agosto de 2022), BMS: Bayesian Model Averaging Library, consultado el 7 de marzo de 2024 .: https://cran.r-project.org/web/packages/BMS/index.html
[42] ↑ Clyde (ORCID=0000-0002-3595-1872), Merlise; Littman, Michael; Ghosh, Joyee; Li, Yingbo; Bersson, Betsy; Bergh, Don van de; Wang, Quanli (6 de diciembre de 2023), BAS: Bayesian Variable Selection and Model Averaging using Bayesian Adaptive Sampling, consultado el 7 de marzo de 2024 .: https://cran.r-project.org/web/packages/BAS/index.html
[43] ↑ Raftery, Adrian; Hoeting, Jennifer; Volinsky, Chris; Painter, Ian; Yeung, Ka Yee (22 de abril de 2022), BMA: Bayesian Model Averaging, consultado el 7 de marzo de 2024 .: https://cran.r-project.org/web/packages/BMA/index.html
[44] ↑ «Classification Ensembles - MATLAB & Simulink - MathWorks United Kingdom». uk.mathworks.com. Consultado el 7 de marzo de 2024.: https://uk.mathworks.com/help/stats/classification-ensembles.html
[45] ↑ a b Woźniak, Michał; Graña, Manuel; Corchado, Emilio (2014). «A survey of multiple classifier systems as hybrid systems». Information Fusion. doi:10.1016/j.inffus.2013.04.006.: https://dx.doi.org/10.1016%2Fj.inffus.2013.04.006
[46] ↑ a b Rodriguez-Galiano, V.F.; Ghimire, B.; Rogan, J.; Chica-Olmo, M.; Rigol-Sanchez, J.P. (2012). «"An assessment of the effectiveness of a random forest classifier for land-cover classification"». ISPRS Journal of Photogrammetry and Remote Sensing. doi:10.1016/j.isprsjprs.2011.11.002.: https://dx.doi.org/10.1016%2Fj.isprsjprs.2011.11.002
[47] ↑ Giacinto, Giorgio; Roli, Fabio (2001). «Design of effective neural network ensembles for image classification purposes"». Image and Vision Computing. doi:10.1016/S0262-8856(01)00045-2.: https://dx.doi.org/10.1016%2FS0262-8856%2801%2900045-2
[48] ↑ Xia, Junshi; Yokoya, Naoto; Iwasaki, Yakira (2017). «A novel ensemble classifier of hyperspectral and LiDAR data using morphological features"». 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ISBN 978-1-5090-4117-6. doi:10.1109/ICASSP.2017.7953345.: https://dx.doi.org/10.1109%2FICASSP.2017.7953345
[49] ↑ Mochizuki, S.; Murakami, T. (2012). «Accuracy comparison of land cover mapping using the object-oriented image classification with machine learning algorithms"». 33rd Asian Conference on Remote Sensing 2012, ACRS 2012.
[50] ↑ Liu, Dan; Toman, Elizabeth; Fuller, Zane; Chen, Gang; Londo, Alexis; Xuesong, Zhang; Kaiguang, Zhao (2018). «"Integration of historical map and aerial imagery to characterize long-term land-use change and landscape dynamics: An object-based analysis via Random Forests». Ecological Indicators. doi:10.1016/j.ecolind.2018.08.004.: https://pages.charlotte.edu/gang-chen/wp-content/uploads/sites/184/2018/08/Liu_2018_Intigration-historical-map-aerial-imagery-LCLUC.pdf
[51] ↑ Giacinto, G.; Roli, F.; Fumera, G. (2000). «"Design of effective multiple classifier systems by clustering of classifiers".». Proceedings 15th International Conference on Pattern Recognition. ICPR-2000. Vol. 2. ISBN 978-0-7695-0750-7. doi:10.1109/ICPR.2000.906039.: https://dx.doi.org/10.1109%2FICPR.2000.906039
[52] ↑ Du, Peijun; Liu, Sicong; Xia, Junshi; Zhao, Yindi (2013). «Information fusion techniques for change detection from multi-temporal remote sensing images». Information Fusion. doi:10.1016/j.inffus.2012.05.003.: https://dx.doi.org/10.1016%2Fj.inffus.2012.05.003
[53] ↑ Defined by Bruzzone et al. 2002 como "La clase de datos que recibe el mayor número de votos se toma como clase del patrón de entrada", se trata de mayoría simple, más exactamente descrita como votación por pluralidad.
[54] ↑ «1-s2.0-S0034425719301853-main.pdf». Google Docs. Consultado el 7 de marzo de 2024.: https://drive.google.com/file/d/1MFZ0FpK1NwTieVSAf5jicLgl85Lm48uh/view?usp=embed_facebook
[55] ↑ Bruzzone, Lorenzo; Cossu, Roberto; Vernazza, Gianni (2002). «"Combining parametric and non-parametric algorithms for a partially unsupervised classification of multitemporal remote-sensing images». Information Fusion. doi:10.1016/S1566-2535(02)00091-X.: http://eprints.biblio.unitn.it/105/1/24.pdf
[56] ↑ Mugiraneza, Theodomir; Nascetti, Andrea; Ban, Yifang (2020-01). «Continuous Monitoring of Urban Land Cover Change Trajectories with Landsat Time Series and LandTrendr-Google Earth Engine Cloud Computing». Remote Sensing (en inglés) 12 (18): 2883. ISSN 2072-4292. doi:10.3390/rs12182883. Consultado el 7 de marzo de 2024.: https://www.mdpi.com/2072-4292/12/18/2883
[57] ↑ zhaokg (6 de marzo de 2024), zhaokg/Rbeast, consultado el 7 de marzo de 2024 .: https://github.com/zhaokg/Rbeast
[58] ↑ Raj Kumar, P. Arun; Selvakumar, S. (2011). «Distributed denial of service attack detection using an ensemble of neural classifier». Computer Communications. doi:10.1016/j.comcom.2011.01.012.: https://dx.doi.org/10.1016%2Fj.comcom.2011.01.012
[59] ↑ Shabtai, Asaf; Moskovitch, Robert; Elovici, Yuval; Glezer, Chanan (2009). «"Detection of malicious code by applying machine learning classifiers on static features: A state-of-the-art survey"». Information Security Technical Report. doi:10.1016/j.istr.2009.03.003.: https://dx.doi.org/10.1016%2Fj.istr.2009.03.003
[60] ↑ Zhang, Boyun; Yin, Jianping; Hao, Jingbo; Zhang, Dingxing; Wang, Shulin (2007). «Malicious Codes Detection Based on Ensemble Learning». Autonomic and Trusted Computing. Lecture Notes in Computer Science. ISBN 978-3-540-73546-5. doi:10.1007/978-3-540-73547-2_48.: https://dx.doi.org/10.1007%2F978-3-540-73547-2_48
[61] ↑ Menahem, Eitan; Shabtai, Asaf; Rokach, Lior; Elovici, Yuval (2009). «"Improving malware detection by applying multi-inducer ensemble"». Computational Statistics & Data Analysis. doi:10.1016/j.csda.2008.10.015.: https://dx.doi.org/10.1016%2Fj.csda.2008.10.015
[62] ↑ Locasto, Michael E.; Wang, Ke; Keromytis, Angeles D.; Salvatore, J. Stolfo (2005). «"FLIPS: Hybrid Adaptive Intrusion Prevention"». Recent Advances in Intrusion Detection. Lecture Notes in Computer Science. PMID 978-3-540-31778-4 |pmid= incorrecto (ayuda). doi:10.1007/11663812_5.: https://es.wikipedia.org//www.ncbi.nlm.nih.gov/pubmed/978-3-540-31778-4
[63] ↑ Giacinto, Giorgio; Perdisci, Roberto; Del Rio, Mauro; Roli, Fabio (2008). «"Intrusion detection in computer networks by a modular ensemble of one-class classifiers"». Information Fusion. doi:10.1016/j.inffus.2006.10.002.: https://dx.doi.org/10.1016%2Fj.inffus.2006.10.002
[64] ↑ Mu, Xiaoyan; Lu, Jiangfeng; Watta, Paul; Hassoun, Mohamad H. (2009). «"Weighted voting-based ensemble classifiers with application to human face recognition and voice recognition".». 2009 International Joint Conference on Neural Networks. doi:10.1109/IJCNN.2009.5178708.: https://dx.doi.org/10.1109%2FIJCNN.2009.5178708
[65] ↑ Yu, Su; Shan, Shiguang; Chen, Xilin; Gao, Wen (2006). «Hierarchical ensemble of Gabor Fisher classifier for face recognition». 7th International Conference on Automatic Face and Gesture Recognition (FGR06). ISBN 978-0-7695-2503-7. doi:10.1109/FGR.2006.64.: https://dx.doi.org/10.1109%2FFGR.2006.64
[66] ↑ Su, Y.; Shan, S.; Chen, X.; Gao, W. (2006). «"Patch-Based Gabor Fisher Classifier for Face Recognition"». 18th International Conference on Pattern Recognition (ICPR'06). ISBN 978-0-7695-2521-1. doi:10.1109/ICPR.2006.917.: https://dx.doi.org/10.1109%2FICPR.2006.917
[67] ↑ Liu, Yang; Lin, Yongzheng; Chen, Yuehui (2008). «"Ensemble Classification Based on ICA for Face Recognition".». 2008 Congress on Image and Signal Processing. ISBN 978-0-7695-3119-9. doi:10.1109/CISP.2008.581.: https://dx.doi.org/10.1109%2FCISP.2008.581
[68] ↑ Rieger, Steven A.; Muraleedharan, Rajani; Ramachandran, Ravi P. (2014). «"Speech based emotion recognition using spectral feature extraction and an ensemble of KNN classifiers"». The 9th International Symposium on Chinese Spoken Language Processing. ISBN 978-1-4799-4219-0. doi:10.1109/ISCSLP.2014.6936711.: https://dx.doi.org/10.1109%2FISCSLP.2014.6936711
[69] ↑ Krajewski, Jarek; Batliner, Anton; Kessel, Silke (2010). «"Comparing Multiple Classifiers for Speech-Based Detection of Self-Confidence - A Pilot Study». 2010 20th International Conference on Pattern Recognition. ISBN 978-1-4244-7542-1. doi:10.1109/ICPR.2010.905.: https://dx.doi.org/10.1109%2FICPR.2010.905
[70] ↑ Rani, P. Ithaya; Muneeswaran, K. (2016). «"Recognize the facial emotion in video sequences using eye and mouth temporal Gabor features». Multimedia Tools and Applications. doi:10.1007/s11042-016-3592-y.: https://dx.doi.org/10.1007%2Fs11042-016-3592-y
[71] ↑ Rani, P. Ithaya; Muneeswaran, K. (2016). «"Facial Emotion Recognition Based on Eye and Mouth Regions".». International Journal of Pattern Recognition and Artificial Intelligence. doi:10.1142/S021800141655020X.: https://dx.doi.org/10.1142%2FS021800141655020X
[72] ↑ RANI, P. ITHAYA; MUNEESWARAN, K. (28 de marzo de 2018). «Emotion recognition based on facial components». Sādhanā (en inglés) 43 (3): 48. ISSN 0973-7677. doi:10.1007/s12046-018-0801-6. Consultado el 8 de marzo de 2024.: https://doi.org/10.1007/s12046-018-0801-6
[73] ↑ Louzada, Francisco; Ara, Anderson (2012). «Bagging k-dependence probabilistic networks: An alternative powerful fraud detection tool». Expert Systems with Applications. doi:10.1016/j.eswa.2012.04.024.: https://dx.doi.org/10.1016%2Fj.eswa.2012.04.024
[74] ↑ Sundarkumar, G. Ganesh; Ravi, Vadlamani (2015). «"A novel hybrid undersampling method for mining unbalanced datasets in banking and insurance"». Engineering Applications of Artificial Intelligence. doi:10.1016/j.engappai.2014.09.019.: https://dx.doi.org/10.1016%2Fj.engappai.2014.09.019
[75] ↑ a b Kim, Yoonseong; Sohn, So Young (2012). «"Stock fraud detection using peer group analysis"». Expert Systems with Applications. doi:10.1016/j.eswa.2012.02.025.: https://dx.doi.org/10.1016%2Fj.eswa.2012.02.025
[76] ↑ Savio, A.; García-Sebastián, M.T.; Chyzyk, D.; Hernandez, C.; Graña, M.; Sistiaga, A.; López de Munain, A.; Villanúa, J. (2011). «"Neurocognitive disorder detection based on feature vectors extracted from VBM analysis of structural MRI".». Computers in Biology and Medicine. doi:10.1016/j.compbiomed.2011.05.010.: https://dx.doi.org/10.1016%2Fj.compbiomed.2011.05.010
[77] ↑ Ayerdi, B.; Savio, A.; Graña, M. (2013). «Meta-ensembles of Classifiers for Alzheimer's Disease Detection Using Independent ROI Features». Natural and Artificial Computation in Engineering and Medical Applications. Lecture Notes in Computer Science. ISBN 978-3-642-38621-3. doi:10.1007/978-3-642-38622-0_13.: https://dx.doi.org/10.1007%2F978-3-642-38622-0_13
[78] ↑ Gu, Quan; Ding, Yong-Sheng; Zhang, Tong-Liang (2015). «"An ensemble classifier based prediction of G-protein-coupled receptor classes in low homology"». Neurocomputing. doi:10.1016/j.neucom.2014.12.013.: https://dx.doi.org/10.1016%2Fj.neucom.2014.12.013
[79] ↑ Xue, Dan; Zhou, Xiaomin; Li, Chen; Yao, Yudong; Rahaman, Md Mamunur; Zhang, Jinghua; Chen, Hao; Zhang, Jinpeng et al. (2020). «An Application of Transfer Learning and Ensemble Learning Techniques for Cervical Histopathology Image Classification». IEEE Access 8: 104603-104618. ISSN 2169-3536. doi:10.1109/ACCESS.2020.2999816. Consultado el 8 de marzo de 2024. Se sugiere usar |número-autores= (ayuda).: https://ieeexplore.ieee.org/document/9107128/
[80] ↑ Manna, Ankur; Kundu, Rohit; Kaplun, Dmitrii; Sinitca, Aleksandr; Sarkar, Ram (2021). «"A fuzzy rank-based ensemble of CNN models for classification of cervical cytology». Scientific Reports. PMID 34267261. doi:10.1038/s41598-021-93783-8.: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8282795

Navegación

Urban Growth Algorithms

Introduction

Overview

set theory

Urban Growth Algorithms

Introduction

Overview

set theory

Set size

Common types of sets

Optimal Bayesian classifier

Bootstrap aggregation (bagging)

Boosting

Bayesian model averaging

Bayesian combination of models

Bucket of models

Stacking

Vote

Implementation in statistical packages

Applications of ensemble learning

Contenido

Remote sensing

Computer security

facial recognition

Emotion recognition

Fraud detection

Financial decision making

Medicine

References

Set size

Common types of sets

Optimal Bayesian classifier

Bootstrap aggregation (bagging)

Boosting

Bayesian model averaging

Bayesian combination of models

Bucket of models

Stacking

Vote

Implementation in statistical packages

Applications of ensemble learning

Contenido

Remote sensing

Computer security

facial recognition

Emotion recognition

Fraud detection

Financial decision making

Medicine

References