KMeans¶

K-means clustering with support for k-means|| initialization proposed by Bahmani et al

Input¶

It takes in a DataFrame as input and performs K-Means clustering

The input DataFrame is passed along to the next Processors

ml-estimator

fire.nodes.ml.NodeKMeans

Name	Title	Description
featuresCol	Features Column	Features column of type vectorUDT for model fitting.
k	K	The number of clusters to create.
maxIter	Max Iterations	The maximum number of iterations.
predictionCol	Prediction Column	The prediction column created during model scoring.
seed	Seed	Random Seed.
tol	Tolerence	The convergence tolerance for iterative algorithms.
initMode	initMode	The initialization algorithm mode.
initSteps	initSteps	The number of steps for the k-means\|\| initialization mode. It will be ignored when other initialization modes are chosen.

K-means clustering with support for k-means|| initialization proposed by Bahmani et al