Abstract
Decision trees are comprehensible, but at the cost of a relatively lower prediction accuracy compared to other powerful black-box classifiers such as SVMs. Boosting has been a popular strategy to create an ensemble of decision trees to improve their classification performance, but at the expense of comprehensibility advantage. To this end, alternating decision tree (ADTree) has been proposed to allow boosting within a single decision tree to retain comprehension. However, existing ADTrees are univariate, which limits their applicability. This research proposes a novel algorithm - multivariate ADTree. It presents and discusses its different variations (Fisher's ADTree, Sparse ADTree, and Regularized Logistic ADTree) along with their empirical validation on a set of publicly available datasets. It is shown that multivariate ADTree has high prediction accuracy comparable to that of decision tree ensembles, while retaining good comprehension which is close to comprehension of individual univariate decision trees.
Original language | English |
---|---|
Pages (from-to) | 195-209 |
Number of pages | 15 |
Journal | Pattern Recognition |
Volume | 50 |
Early online date | 28 Aug 2015 |
DOIs | |
Publication status | Published - Feb 2016 |
Keywords
- Alternating decision tree
- Boosting
- LARS
- Lasso
- Multivariate decision tree
ASJC Scopus subject areas
- Software
- Artificial Intelligence
- Computer Vision and Pattern Recognition
- Signal Processing