Different Feature Selection Methods and their advantages and distadvantages

Method	Description	Advantages	Disadvantages
Filter	- Filters features based on statistical metrics such as correlation, chi-squared, mutual information, etc.	- Fast and computationally efficient.	- Ignores feature dependencies.
	- Independent of the machine learning algorithm.	- Can be applied as a preprocessing step.	- May not consider the model's performance.
	- Features are selected before training the model.	- Helps remove irrelevant or redundant features.
Wrapper	- Evaluates feature subsets by training the model with different combinations of features.	- Considers feature dependencies.	- Computationally expensive.
	- Utilizes a specific machine learning algorithm for evaluation.	- Tends to find the most predictive features.	- Prone to overfitting if not cross-validated.
	- Selects features based on their impact on model performance (e.g., accuracy, F1-score).	- Allows for fine-grained feature selection.	- May not scale well to high-dimensional data.
Embedded	- Features are selected during the model training process.	- Incorporates feature selection into model building.	- Limited to the capabilities of the chosen algorithm.
	- Algorithms like Lasso Regression, Random Forest, and Gradient Boosting perform embedded feature selection.	- Reduces the risk of overfitting.	- May not perform well if the chosen algorithm is not suitable.
	- Features are assigned importance scores or coefficients.	- Automatically adapts to feature dependencies.

An Architect's vision