AutoClustering

A feed-forward neural network based clustering algorithm

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Since a clustering process can be regarded as a map of data to cluster labels, it should be natural to employ a deep learning technique, especially a feed-forward neural network, to realize the clustering method. In this study, we discussed a novel clustering method realized only by a feed-forward neural network. Unlike self-organizing maps and growing neural gas networks, the proposed method is compatible with deep learning neural networks. The proposed method has three parts: A map of records to clusters (encoder), a map of clusters to their exemplars (decoder), and a loss function to measure positional closeness between the records and the exemplars. In order to accelerate clustering performance, we proposed an improved activation function at the encoder, which migrates a soft-max function to a max function continuously. Though most of the clustering methods require the number of clusters in advance, the proposed method naturally provides the number of clusters as the number of unique one-hot vectors obtained as a result. We also discussed the existence of local minima of the loss function and their relationship to clusters.

Original languageEnglish
Title of host publicationProceedings - 18th IEEE International Conference on Data Mining Workshops, ICDMW 2018
EditorsJeffrey Yu, Zhenhui Li, Hanghang Tong, Feida Zhu
PublisherIEEE Computer Society
Pages659-666
Number of pages8
ISBN (Electronic)9781538692882
DOIs
Publication statusPublished - 2019 Feb 7
Event18th IEEE International Conference on Data Mining Workshops, ICDMW 2018 - Singapore, Singapore
Duration: 2018 Nov 172018 Nov 20

Publication series

NameIEEE International Conference on Data Mining Workshops, ICDMW
Volume2018-November
ISSN (Print)2375-9232
ISSN (Electronic)2375-9259

Conference

Conference18th IEEE International Conference on Data Mining Workshops, ICDMW 2018
CountrySingapore
CitySingapore
Period18/11/1718/11/20

Fingerprint

Feedforward neural networks
Clustering algorithms
Self organizing maps
Labels
Chemical activation
Neural networks
Gases
Deep learning

Keywords

  • Autoencoder
  • Clustering
  • Feed forward neural network

ASJC Scopus subject areas

  • Computer Science Applications
  • Software

Cite this

Kimura, M. (2019). AutoClustering: A feed-forward neural network based clustering algorithm. In J. Yu, Z. Li, H. Tong, & F. Zhu (Eds.), Proceedings - 18th IEEE International Conference on Data Mining Workshops, ICDMW 2018 (pp. 659-666). [8637379] (IEEE International Conference on Data Mining Workshops, ICDMW; Vol. 2018-November). IEEE Computer Society. https://doi.org/10.1109/ICDMW.2018.00102

AutoClustering : A feed-forward neural network based clustering algorithm. / Kimura, Masaomi.

Proceedings - 18th IEEE International Conference on Data Mining Workshops, ICDMW 2018. ed. / Jeffrey Yu; Zhenhui Li; Hanghang Tong; Feida Zhu. IEEE Computer Society, 2019. p. 659-666 8637379 (IEEE International Conference on Data Mining Workshops, ICDMW; Vol. 2018-November).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Kimura, M 2019, AutoClustering: A feed-forward neural network based clustering algorithm. in J Yu, Z Li, H Tong & F Zhu (eds), Proceedings - 18th IEEE International Conference on Data Mining Workshops, ICDMW 2018., 8637379, IEEE International Conference on Data Mining Workshops, ICDMW, vol. 2018-November, IEEE Computer Society, pp. 659-666, 18th IEEE International Conference on Data Mining Workshops, ICDMW 2018, Singapore, Singapore, 18/11/17. https://doi.org/10.1109/ICDMW.2018.00102
Kimura M. AutoClustering: A feed-forward neural network based clustering algorithm. In Yu J, Li Z, Tong H, Zhu F, editors, Proceedings - 18th IEEE International Conference on Data Mining Workshops, ICDMW 2018. IEEE Computer Society. 2019. p. 659-666. 8637379. (IEEE International Conference on Data Mining Workshops, ICDMW). https://doi.org/10.1109/ICDMW.2018.00102
Kimura, Masaomi. / AutoClustering : A feed-forward neural network based clustering algorithm. Proceedings - 18th IEEE International Conference on Data Mining Workshops, ICDMW 2018. editor / Jeffrey Yu ; Zhenhui Li ; Hanghang Tong ; Feida Zhu. IEEE Computer Society, 2019. pp. 659-666 (IEEE International Conference on Data Mining Workshops, ICDMW).
@inproceedings{f994127e2b624edababb413f0aea2147,
title = "AutoClustering: A feed-forward neural network based clustering algorithm",
abstract = "Since a clustering process can be regarded as a map of data to cluster labels, it should be natural to employ a deep learning technique, especially a feed-forward neural network, to realize the clustering method. In this study, we discussed a novel clustering method realized only by a feed-forward neural network. Unlike self-organizing maps and growing neural gas networks, the proposed method is compatible with deep learning neural networks. The proposed method has three parts: A map of records to clusters (encoder), a map of clusters to their exemplars (decoder), and a loss function to measure positional closeness between the records and the exemplars. In order to accelerate clustering performance, we proposed an improved activation function at the encoder, which migrates a soft-max function to a max function continuously. Though most of the clustering methods require the number of clusters in advance, the proposed method naturally provides the number of clusters as the number of unique one-hot vectors obtained as a result. We also discussed the existence of local minima of the loss function and their relationship to clusters.",
keywords = "Autoencoder, Clustering, Feed forward neural network",
author = "Masaomi Kimura",
year = "2019",
month = "2",
day = "7",
doi = "10.1109/ICDMW.2018.00102",
language = "English",
series = "IEEE International Conference on Data Mining Workshops, ICDMW",
publisher = "IEEE Computer Society",
pages = "659--666",
editor = "Jeffrey Yu and Zhenhui Li and Hanghang Tong and Feida Zhu",
booktitle = "Proceedings - 18th IEEE International Conference on Data Mining Workshops, ICDMW 2018",

}

TY - GEN

T1 - AutoClustering

T2 - A feed-forward neural network based clustering algorithm

AU - Kimura, Masaomi

PY - 2019/2/7

Y1 - 2019/2/7

N2 - Since a clustering process can be regarded as a map of data to cluster labels, it should be natural to employ a deep learning technique, especially a feed-forward neural network, to realize the clustering method. In this study, we discussed a novel clustering method realized only by a feed-forward neural network. Unlike self-organizing maps and growing neural gas networks, the proposed method is compatible with deep learning neural networks. The proposed method has three parts: A map of records to clusters (encoder), a map of clusters to their exemplars (decoder), and a loss function to measure positional closeness between the records and the exemplars. In order to accelerate clustering performance, we proposed an improved activation function at the encoder, which migrates a soft-max function to a max function continuously. Though most of the clustering methods require the number of clusters in advance, the proposed method naturally provides the number of clusters as the number of unique one-hot vectors obtained as a result. We also discussed the existence of local minima of the loss function and their relationship to clusters.

AB - Since a clustering process can be regarded as a map of data to cluster labels, it should be natural to employ a deep learning technique, especially a feed-forward neural network, to realize the clustering method. In this study, we discussed a novel clustering method realized only by a feed-forward neural network. Unlike self-organizing maps and growing neural gas networks, the proposed method is compatible with deep learning neural networks. The proposed method has three parts: A map of records to clusters (encoder), a map of clusters to their exemplars (decoder), and a loss function to measure positional closeness between the records and the exemplars. In order to accelerate clustering performance, we proposed an improved activation function at the encoder, which migrates a soft-max function to a max function continuously. Though most of the clustering methods require the number of clusters in advance, the proposed method naturally provides the number of clusters as the number of unique one-hot vectors obtained as a result. We also discussed the existence of local minima of the loss function and their relationship to clusters.

KW - Autoencoder

KW - Clustering

KW - Feed forward neural network

UR - http://www.scopus.com/inward/record.url?scp=85062848726&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85062848726&partnerID=8YFLogxK

U2 - 10.1109/ICDMW.2018.00102

DO - 10.1109/ICDMW.2018.00102

M3 - Conference contribution

T3 - IEEE International Conference on Data Mining Workshops, ICDMW

SP - 659

EP - 666

BT - Proceedings - 18th IEEE International Conference on Data Mining Workshops, ICDMW 2018

A2 - Yu, Jeffrey

A2 - Li, Zhenhui

A2 - Tong, Hanghang

A2 - Zhu, Feida

PB - IEEE Computer Society

ER -