Here is a nice summary of traditional machine learning methods, from Mathworks.
I also decided to add the following picture below, as it illustrates a method that was very popular 30 years ago but that seems to have been forgotten recently: mixture of Gaussian. In the example below, it is used to separate the data set into two clusters. Note that you can use a mixture of any distributions, not just Gaussian, for instance, (data-driven) estimated distributions such as those based on kernel density estimation.