Bandwidth selection for mean shift clustering
The mean shift algorithm was introduced in Fukunaga and Hostetler (1975). It is an iterative procedure which, at every step, shifts the point obtained in the previous iteration in the direction of the estimated normalized density gradient, producing a convergent sequence that transports the initial value to the closest local maximum of the density estimate along the steepest ascent path. This algorithm induces a partition of the data in a natural way, by assigning the same cluster to all the data points that lead to the same local maximum when the convergence of the iterative procedure is reached. Notice that this methodology does not require the number of clusters to be specified in advance, and that it allows clusters of arbitrary shape to be discovered. In this communication we study the properties of this algorithm when kernel methods are used to estimate the normalized density gradient, and propose several data-driven bandwidth selectors to make this methodology fully automatic.
Palabras clave: bandwidth selection mean shift clustering kernel methods density gradient estimation
Otros trabajos en la misma sesión
Últimas noticias
-
22/04/12
Certificados -
11/03/12
Programa del congreso -
11/03/12
Cuota reducida -
15/01/12
Cuota superreducida