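VectorIndexer
VectorIndexer indexes the categorical features inside a Vector column. It decides which features are categorical by counting the distinct values of each feature: a feature with at most maxCategories distinct values is treated as categorical and its values are converted to category indices, while a feature with more distinct values is treated as continuous and left unchanged.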
Input
A DataFrame containing a Vector column, named by inputCol.
Output
The input DataFrame with a new column, named by outputCol, that holds the vectors with their categorical features replaced by category indices.
Type
ml-transformer
Class
fire.nodes.ml.NodeVectorIndexer
Fields
| Name | Title | Description |
|---|---|---|
| inputCol | Input Column | Name of the input column that contains the feature vectors |
| outputCol | Output Column | Name of the output column that receives the indexed vectors |
| maxCategories | Maximum Categories | Threshold for the number of distinct values a categorical feature can take. A feature with more than maxCategories distinct values is treated as continuous. Must be >= 2 |
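The effect of maxCategories can be illustrated with a minimal plain-Python sketch of the decision rule described above. It is not the node's implementation: the helper names `fit_category_maps` and `transform` are hypothetical, and assigning category indices in ascending order of the original values is an assumption made for illustration, so the actual node may order indices differently.

```python
from typing import Dict, List, Sequence


def fit_category_maps(
    rows: Sequence[Sequence[float]], max_categories: int
) -> Dict[int, Dict[float, int]]:
    """Return {feature index: {original value: category index}}.

    Only features with at most max_categories distinct values get a map;
    all other features are treated as continuous.
    """
    if max_categories < 2:
        raise ValueError("max_categories must be >= 2")
    if not rows:
        return {}
    category_maps: Dict[int, Dict[float, int]] = {}
    for j in range(len(rows[0])):
        distinct = sorted({row[j] for row in rows})
        if len(distinct) <= max_categories:
            # Assumption of this sketch: indices follow ascending value order.
            category_maps[j] = {value: idx for idx, value in enumerate(distinct)}
    return category_maps


def transform(
    rows: Sequence[Sequence[float]], category_maps: Dict[int, Dict[float, int]]
) -> List[List[float]]:
    """Replace categorical feature values by their index; keep the rest."""
    return [
        [float(category_maps[j][v]) if j in category_maps else v
         for j, v in enumerate(row)]
        for row in rows
    ]


# Toy data: feature 0 has 2 distinct values, feature 1 has 4, feature 2 has 3.
rows = [
    [0.0, 10.5, 3.0],
    [1.0, 11.2, 7.0],
    [0.0, 12.9, 3.0],
    [1.0, 13.4, 5.0],
]
maps = fit_category_maps(rows, max_categories=3)
indexed = transform(rows, maps)
```

With a threshold of 3, features 0 and 2 are indexed and feature 1 stays continuous because it has four distinct values. Lowering the threshold to 2 would leave feature 2 continuous as well.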