| Aspect | Feature Selection | Feature Engineering |
| Purpose | Choose relevant features | Create informative features |
| Objective | Improve model performance by eliminating irrelevant or redundant features | Improve model's ability to capture patterns and relationships |
| Techniques | Correlation analysis, mutual information, feature importance scores, recursive feature elimination | Scaling, one-hot encoding, interaction terms, mathematical operations, date-related feature extraction |
| Automation | Can often be automated using statistical methods and algorithms | May require domain knowledge and human expertise |
| Examples | SelectKBest, SelectFromModel, Recursive Feature Elimination (RFE) | One-hot encoding, polynomial feature creation, date-related feature extraction |
Got this during the execution of following command in R > dat Error: could not find function "read.xlsx" Tried following command > install.packages("xlsx", dependencies = TRUE) Installing package into ‘C:/Users/amajumde/Documents/R/win-library/3.2’ (as ‘lib’ is unspecified) also installing the dependencies ‘rJava’, ‘xlsxjars’ trying URL 'https://cran.rstudio.com/bin/windows/contrib/3.2/rJava_0.9-8.zip' Content type 'application/zip' length 766972 bytes (748 KB) downloaded 748 KB trying URL 'https://cran.rstudio.com/bin/windows/contrib/3.2/xlsxjars_0.6.1.zip' Content type 'application/zip' length 9485170 bytes (9.0 MB) downloaded 9.0 MB trying URL 'https://cran.rstudio.com/bin/windows/contrib/3.2/xlsx_0.5.7.zip' Content type 'application/zip' length 400968 bytes (391 KB) downloaded 391 KB package ‘rJava’ successfully unpacked and MD5 sums checked package ‘xlsxjars’ successfully unpacked ...
Comments