Demonstrating the Capabilities of the lionfish Software for Interactive Visualization of Market Segmentation Partitions

Authors

DOI:

https://doi.org/10.17713/ajs.v54i3.2058

Abstract

Market segmentation partitions multivariate data using some clustering algorithm, resulting in some number of homogeneous
clusters of consumers for marketing purposes. Often this type of data has no clear cluster structure, that is, no separations or gaps
between clusters of points exist, which is why this is considered partitioning rather than clustering. Understanding the differences
between the clusters is typically done by examining single features. However, this can be inconclusive as multiple clusters might share similar characteristics on individual features and the market segmentation partition actually defines the clusters based on different linear constraints on the features. To understand what uniquely characterizes a cluster of customers, examining linear combinations of features may be helpful. This article introduces the R package lionfish that provides interactive and dynamic tools to facilitate the exploration and refining of market segmentations. The package integrates tour algorithms that use linear combinations of features to view high-dimensional data, from the tourr package, with Python-powered interactivity, allowing manual control, interactive selection, and multiple linked windows, to support revising the cluster memberships based on visual feedback. The focus is on the widely used k-means clustering algorithm, but the tools also support other algorithms. The utility of the software is demonstrated through three example analyses from the domain of market segmentation. The flexible, user-driven approach provided by package lionfish offers deeper insights into complex market behaviors, enabling more effective segmentation and enhancing strategic decision-making.

Author Biographies

Matthias Medl, BOKU University

Matthias Medl is a PhD student at BOKU University, who specialized in the application of machine learning in the field of biotechnology. 

Dianne Cook, Monash University, Econometrics and Business Statistics

Dianne Cook is a Professor of Statistics in Econometrics and Business Statistics at Monash University in Melbourne, Australia. She has a PhD in Statistics from Rutgers University. Her research focuses on statistical graphics, which involves interactive visualisation of high-dimensional data, and statistical inference for data visualisation. In her role at Monash University she regularly teaches courses on machine learning and data analysis, and she has conducted workshops on data visualisation.

Di is a Fellow of the American Statistical Association, past editor of the Journal of Computational and Graphical Statistics, and The R JournalMember of the R Foundation, and elected member of the International Statistical Institute, and author of numerous R packages. She is active in R Ladies Melbourne, the Statistical Computing and Visualisation Section of the Statistical Society of Australia, and the Graphics and Computing Sections of the American Statistical Association.

Ursula Laa, BOKU University

Ursula Laa is an assistant professor and researcher in the Statistics Institute of the University of Natural Resources and Life Sciences (BOKU) in Vienna. She focuses on new methods and applications in data science with emphasis on interdisciplinary work, for example connecting my background in physics and my current interest in statistical data visualisation.

Downloads

Published

2025-04-23

How to Cite

Medl, M., Cook, D., & Laa, U. (2025). Demonstrating the Capabilities of the lionfish Software for Interactive Visualization of Market Segmentation Partitions. Austrian Journal of Statistics, 54(3), 71–99. https://doi.org/10.17713/ajs.v54i3.2058

Issue

Section

Special Issue. In memorial: Fritz Leisch