VectorIndexer

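VectorIndexer indexes categorical features inside a Vector column. It automatically decides which features are categorical and converts their original values to category indices. The decision is based on the number of distinct values a feature takes: a feature with at most maxCategories distinct values is treated as categorical, and any other feature is treated as continuous and left unchanged.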
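
The decision rule above can be illustrated with a small, dependency-free Python sketch. This is not the node's or Spark's actual implementation: the function names fit_category_maps and transform, the sample data, and the choice to assign indices in ascending order of value are all assumptions made for illustration.

```python
from typing import Dict, List, Sequence


def fit_category_maps(
    vectors: Sequence[Sequence[float]], max_categories: int
) -> Dict[int, Dict[float, int]]:
    """Return {feature position: {original value: category index}}.

    Hypothetical helper: only features judged categorical get an entry.
    """
    if max_categories < 2:
        raise ValueError("max_categories must be >= 2")
    if not vectors:
        return {}
    category_maps: Dict[int, Dict[float, int]] = {}
    for j in range(len(vectors[0])):
        distinct = sorted({v[j] for v in vectors})
        # A feature with more than max_categories distinct values is
        # treated as continuous and gets no category map.
        if len(distinct) <= max_categories:
            # Assumption for this sketch: indices follow ascending value order.
            category_maps[j] = {value: idx for idx, value in enumerate(distinct)}
    return category_maps


def transform(
    vectors: Sequence[Sequence[float]],
    category_maps: Dict[int, Dict[float, int]],
) -> List[List[float]]:
    """Replace categorical values by their index; keep continuous values."""
    return [
        [float(category_maps[j][x]) if j in category_maps else x
         for j, x in enumerate(v)]
        for v in vectors
    ]


# Feature 0 has 3 distinct values, feature 1 has 4.
features = [
    [10.0, 0.5],
    [20.0, 1.7],
    [10.0, 2.9],
    [30.0, 3.1],
]
maps = fit_category_maps(features, max_categories=3)
indexed = transform(features, maps)
```

With max_categories set to 3, only the first feature is indexed (10.0, 20.0, and 30.0 become 0.0, 1.0, and 2.0), while the second feature has four distinct values and passes through untouched.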

Input

It takes in a DataFrame and transforms it into another DataFrame.

Output

It indexes the categorical features in the input Vector column and stores the result in a new column of the DataFrame.

Type

ml-transformer

Class

fire.nodes.ml.NodeVectorIndexer

Fields

Name           Title               Description
inputCol       Input Column        Input column name
outputCol      Output Column       Output column name
maxCategories  Maximum Categories  Threshold for the number of distinct values a categorical feature can take. Must be >= 2. A feature with more than maxCategories distinct values is declared continuous.