Saeid Amiri - Colloquium Speaker

Visiting Assistant Professor, The University of Iowa, Department of Statistics and Actuarial Science
Date: 
Thursday, October 15, 2015 - 3:30pm
Colloquium Title: 
Clustering categorical data via ensembling methods
Location: 
Reception at 3:00 p.m. in 241 SH / Talk at 3:30 in 61 SH

Abstract: 

We propose an ensemble approach to clustering categorical data. The proposed ensemble method is based on hierarchical clustering under average linkage. We give a rationale for why our procedure does well in low dimensions, supported by extensive computational comparisons with other methods on simulated and real data. Our method for low-dimensional categorical data extends to high-dimensional categorical data by using an extra level of ensembling. This reduces the effect of the curse of dimensionality, which tends to equalize the distances between any two points as the dimension increases. A further extension of our ensembling method permits the vectors of categorical outcomes to have different dimensions.
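For readers unfamiliar with the general idea, the following is a minimal sketch of one common way to ensemble hierarchical clusterings of categorical data. It is not the speaker's procedure, which the abstract does not specify. It assumes Hamming distance on randomly chosen subsets of the variables, average linkage for each ensemble member, and a co-association (co-clustering frequency) matrix to combine the members. The function names `hamming`, `average_linkage`, and `ensemble_cluster`, the parameter `n_members`, and the toy data are all hypothetical.

```python
import random
from itertools import combinations


def hamming(a, b, cols):
    """Fraction of the selected columns on which rows a and b disagree."""
    return sum(a[j] != b[j] for j in cols) / len(cols)


def average_linkage(dist, k):
    """Agglomerative clustering under average linkage, cut at k clusters.

    dist is a full symmetric n x n dissimilarity matrix (list of lists).
    Returns a list of cluster labels, one per point.
    """
    n = len(dist)
    clusters = [[i] for i in range(n)]
    while len(clusters) > k:
        best = None
        for a, b in combinations(range(len(clusters)), 2):
            # Average linkage: mean pairwise distance between the members.
            total = sum(dist[i][j] for i in clusters[a] for j in clusters[b])
            d = total / (len(clusters[a]) * len(clusters[b]))
            if best is None or d < best[0]:
                best = (d, a, b)
        _, a, b = best
        clusters[a] += clusters[b]  # merge the closest pair (a < b)
        del clusters[b]
    labels = [0] * n
    for c, members in enumerate(clusters):
        for i in members:
            labels[i] = c
    return labels


def ensemble_cluster(rows, k, n_members=20, seed=0):
    """Combine several average-linkage clusterings via co-association.

    Assumption of this sketch: each ensemble member sees a random half
    of the categorical variables.
    """
    rng = random.Random(seed)
    n, p = len(rows), len(rows[0])
    together = [[0] * n for _ in range(n)]  # times i and j share a cluster
    for _ in range(n_members):
        cols = rng.sample(range(p), max(1, p // 2))
        dist = [[hamming(rows[i], rows[j], cols) for j in range(n)]
                for i in range(n)]
        labels = average_linkage(dist, k)
        for i in range(n):
            for j in range(n):
                if labels[i] == labels[j]:
                    together[i][j] += 1
    # Consensus dissimilarity: 1 minus the co-clustering frequency.
    consensus = [[1.0 - together[i][j] / n_members for j in range(n)]
                 for i in range(n)]
    return average_linkage(consensus, k)


# Toy data: two groups of four rows with six categorical variables each.
group_a = [["a"] * 6 for _ in range(4)]
group_b = [["b"] * 6 for _ in range(4)]
for i in range(4):
    group_a[i][i] = "c"  # small within-group variation
    group_b[i][i] = "d"
rows = group_a + group_b

labels = ensemble_cluster(rows, k=2)
print(labels)  # [0, 0, 0, 0, 1, 1, 1, 1]
```

On this toy data the two groups disagree on every variable, so each ensemble member and the consensus step recover them exactly. Real categorical data would be far less cleanly separated, and the naive merge loop here is meant for small examples only.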

This presentation is part of joint work with Bertrand Clarke and Jennifer Clarke of the University of Nebraska-Lincoln.
