Clustering on Mixed Data Types in Python

Ryan Kemmer
Published in Analytics Vidhya
5 min read · Jan 25, 2021

Image by the Author

During my first-ever data science internship, I was given a seemingly simple task: find clusters within a dataset. Given my basic knowledge of clustering algorithms like K-Means, DBSCAN, and GMM, I thought I could easily get it done. However, when I took a closer look at the dataset, I realized it contained a mixture of categorical and continuous features, and many common methods of clustering I knew…
