R-Estimating-Number-Of-Trees-RF


Project maintained by JalalAl-Tamimi Hosted on GitHub Pages — Theme by mattgraham

To cite, use: DOI

This script “estimateDensityAnd_ntree.R” provides a calculation of the Density-Based Metric to estimate the density of a dataframe. A simulated dataframe is provided with a binomial outcome for classification purpose and 10 predictors. Some predictors are informative (X1, X2 and X8). X3 to X7 are non-informative but are correlated with either X1 or X2. This is a normal type of data in phonetics research. An estimation of the optimal number of trees needed to run a Random Forest (using the “party” package) with the highest accuracy is provided.

An .RData is also added that contains the dataframe and all results.

Two methods to check the optimal number of trees for Random Forests are provided:

These methods are used in the following publications: