Feature Generation¶
Feature generation is the process of creating new features from one or multiple existing features, potentially for using in statistical analysis. This process adds new information to be accessible during the model construction and therefore hopefully result in more accurate model.
| Title | Description |
|---|---|
| DateToAge | Convert Date to Age |
| CaseWhen | Based on the value, convert it to another value |
| Scala | Write Scala code in Spark for generating new Features |
| SQL | Write SQL code for generating new features |
| StopWOrdRemover | Removes Stop Words |
| Tokenizer | Tokenizes a string into Tokens |
| OneHotEncoder | Applies one hot encoding |
| TF/IDF | Finds the TF and IDF |
| IndexString | Converts a column containg String to numeric values |