Why is dimension reduction important?

palak-virmani-30ac6553 · 29 August 2020 05:53

iftekar-patel-f1e6bf65 · 3 September 2020 01:55

Space required to store the data is reduced as the number of dimensions comes down
Less dimensions lead to less computation/training time
Some algorithms do not perform well when we have a large dimensions. So reducing these dimensions needs to happen for the algorithm to be useful
It takes care of multicollinearity by removing redundant features. For example, you have two variables – ‘time spent on treadmill in minutes’ and ‘calories burnt’. These variables are highly correlated as the more time you spend running on a treadmill, the more calories you will burn. Hence, there is no point in storing both as just one of them does what you require
It helps in visualizing data. As discussed earlier, it is very difficult to visualize data in higher dimensions so reducing our space to 2D or 3D may allow us to plot and observe patterns more clearly

chirag-garg · 7 August 2021 15:52

Projection into two dimensions is often used to facilitate the visualization of high dimensional data sets.
When the dimensions can be given a meaningful interpretation, projection along that dimension can be used to explain certain behaviors (e.g. Big Five in psychology).
In the supervised learning case, dimensionality reduction can be used to reduce the dimension of the features, potentially leading to better performance for the learning algorithm.