Comparison of K-Means and DBSCAN Algorithms for Customer Segmentation in E-commerce

Adi Suryaputra Paramita; Taqwa Hariguna

doi:10.47738/jdmdc.v1i1.3

Published: May 26, 2024

Keywords:

DBSCAN, K-Means, Customer Segmentation, E-Commerce, Clustering Performance Evaluation

👤 Adi Suryaputra Paramita

🏢 Information Systems Department, School of Information Technology, Universitas Ciputra Surabaya, Indonesia

https://orcid.org/0000-0002-9709-2655

👤 Taqwa Hariguna

🏢 Magister of Computer Science, Universitas Amikom Purwokerto, Jawa Tengah, Indonesia

https://orcid.org/0000-0003-1801-6791

Customer segmentation is crucial for e-commerce businesses that want to target and engage specific customer groups effectively. This study compares two popular clustering algorithms, K-Means and DBSCAN, for segmenting e-commerce customers, with the objective of determining which one yields more meaningful and actionable segments. The analysis uses an e-commerce customer dataset with the following features: customer ID, gender, age, city, membership type, total spend, items purchased, average rating, discount applied, days since last purchase, and satisfaction level. Preprocessing consists of handling missing values, encoding categorical variables, and normalizing numerical features. Both algorithms are then implemented and evaluated with three internal validation metrics: the silhouette score, the Davies-Bouldin index, and the Calinski-Harabasz score. K-Means achieved a silhouette score of 0.546, a Davies-Bouldin index of 0.655, and a Calinski-Harabasz score of 552.9. DBSCAN achieved a higher silhouette score of 0.680, a Davies-Bouldin index of 1.344, and a Calinski-Harabasz score of 1123.9. Higher silhouette and Calinski-Harabasz values indicate better-separated clusters, whereas a lower Davies-Bouldin index is better. These findings therefore suggest that DBSCAN produces more distinctly separated clusters according to the silhouette score, while its higher Davies-Bouldin index points to less compact clusters. The discussion highlights that K-Means suits applications requiring clear, well-defined customer segments, as it produces balanced cluster sizes. DBSCAN, whose strengths are identifying clusters of varying densities and handling noise, is more effective at detecting niche markets and unusual customer behaviors. These findings have practical implications for e-commerce businesses looking to improve their customer segmentation strategies.
In conclusion, both K-Means and DBSCAN demonstrate their respective strengths and weaknesses in clustering the e-commerce customer dataset. The choice of algorithm should be based on the specific requirements of the segmentation task. Future research could explore hybrid methods combining the strengths of both algorithms and incorporate additional data sources for a more comprehensive analysis.
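
The workflow described in the abstract (preprocess, cluster with both algorithms, then score with the three metrics) can be sketched roughly as follows. This is a minimal illustration using scikit-learn, not the authors' implementation. The synthetic data, the column names, the choice to drop rows with missing values, the decision to exclude DBSCAN noise points before scoring, and every hyperparameter (n_clusters, eps, min_samples) are assumptions made only for this example, so the scores it prints are unrelated to the figures reported above.

```python
import numpy as np
import pandas as pd
from sklearn.cluster import DBSCAN, KMeans
from sklearn.compose import ColumnTransformer
from sklearn.metrics import (
    calinski_harabasz_score,
    davies_bouldin_score,
    silhouette_score,
)
from sklearn.preprocessing import OneHotEncoder, StandardScaler

NUMERIC = ["total_spend", "items_purchased", "days_since_last_purchase"]
CATEGORICAL = ["membership_type"]


def make_toy_customers(n_per_group=60, seed=0):
    """Synthetic stand-in for a customer table (NOT the study's dataset)."""
    rng = np.random.default_rng(seed)
    # (membership, mean spend, mean items, mean days since last purchase)
    groups = [
        ("Bronze", 450.0, 8.0, 40.0),
        ("Silver", 750.0, 12.0, 25.0),
        ("Gold", 1300.0, 18.0, 12.0),
    ]
    frames = []
    for membership, spend, items, days in groups:
        frames.append(pd.DataFrame({
            "membership_type": membership,
            "total_spend": rng.normal(spend, 40.0, n_per_group),
            "items_purchased": rng.normal(items, 1.0, n_per_group),
            "days_since_last_purchase": rng.normal(days, 3.0, n_per_group),
        }))
    return pd.concat(frames, ignore_index=True)


def preprocess(df):
    """Drop incomplete rows (an assumption), scale numerics, one-hot encode."""
    df = df.dropna(subset=NUMERIC + CATEGORICAL)
    transformer = ColumnTransformer(
        [
            ("num", StandardScaler(), NUMERIC),
            ("cat", OneHotEncoder(handle_unknown="ignore"), CATEGORICAL),
        ],
        sparse_threshold=0.0,  # always return a dense array
    )
    return transformer.fit_transform(df)


def score(X, labels):
    """Compute the three metrics, ignoring DBSCAN noise (label -1)."""
    mask = labels != -1
    kept = labels[mask]
    if len(set(kept)) < 2:
        return None  # the metrics are undefined for fewer than two clusters
    return {
        "n_clusters": len(set(kept)),
        "n_noise": int((~mask).sum()),
        "silhouette": silhouette_score(X[mask], kept),
        "davies_bouldin": davies_bouldin_score(X[mask], kept),
        "calinski_harabasz": calinski_harabasz_score(X[mask], kept),
    }


X = preprocess(make_toy_customers())
# Hyperparameters below are illustrative guesses, not values from the paper.
kmeans_labels = KMeans(n_clusters=3, n_init=10, random_state=0).fit_predict(X)
dbscan_labels = DBSCAN(eps=0.8, min_samples=5).fit_predict(X)

results = {
    "K-Means": score(X, kmeans_labels),
    "DBSCAN": score(X, dbscan_labels),
}
for name, metrics in results.items():
    print(name, metrics)
```

When reading the output, higher silhouette and Calinski-Harabasz values and a lower Davies-Bouldin index indicate a better partition. Whether noise points are kept or dropped before scoring can change the DBSCAN figures noticeably, so any comparison should state which convention was used.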

[1] A. S. Paramita and T. Hariguna, “Comparison of K-Means and DBSCAN Algorithms for Customer Segmentation in E-commerce”, J. Digit. Mark. Digit. Curr., vol. 1, no. 1, pp. 43–62, May 2024.

Distributed Under Creative Commons CC-BY 4.0

Issue

Vol. 1 No. 1 (2024): Regular Issue June 2024

Section

Articles

3048-0981 (Online)
Organizer / Collaboration	:	Faculty of Economics Universitas Negeri Jakarta, Indonesia
Published by	:	Bright Publisher
Website	:	jdmdc.com
Mailing Address	:	Graha Permata Estate, Jl. HM Bahrun Blok H9, Sokayasa, Berkoh, Kec. Purwokerto Tim., Kabupaten Banyumas, Jawa Tengah 53146
Email	:	dwisugianto@outlook.com (principal contact)
		editor@jdmdc.com (managing editor)
