7 random forest

Machine Learning

👉 ♡ supervised learning
♡ unsupervised learning
♡ reinforcement learning

recap:
🔖 types of supervised learning

✔ classification 📑

✔ regression 📈

✔ mixed ⚗
- tree based
- random forest 🎈
- neural networks
- support vector machines

🌳 overfitting and the problem with trees

trees classify by drawing square boxes around the data, which does not work well where many separations are needed. it overfits the data

🌳 pruning

pruning means to trim (a tree, shrub, or bush) by cutting away dead or overgrown branches or stems (or unwanted parts), especially to encourage growth.

for example, if you had a single data (a leaf) that is making your square a large one, so you just remove it so that the tree still maintains relevence. improves accuracy of the tree.

🌳 random forests

just like a forest is made up of trees, similarly random forest is a machine learning method made up of tree methods. random as it randomises the input of the trees.

the basic form of it acts on tally. a tree gives out an output, this one collects all the outputs of the trees and try to make up an answer. if 10% of trees said A and 90% said B, it will say B (majority). much like you ask questions to 10 people as to what book they think will be most popular this year or what team will win. just here as the inputs are randomised, each trees are not the same. it might be further tweaked to give more or less meanings to the answers in the tree.

🌳 use of random forests

used to identify customer type, whether people will like a product or predict behaviour.

⚽ exercise
1) dig more into the uses of random forests
2) compare it’s implementation across languages ( with libs included ), how elegant you think python is?

next:
🌱 support vector machines#8 support vector machines