DATA MINING TECHNIQUE WITH CLUSTER ANAYSIS USE K-MEANS ALGORITHM FOR LQ45 INDEX ON INDONESIA STOCK EXCHANGE

 

Abstract

This study aims to apply data mining techniques with cluster analysis on stock data registered in LQ45 in Indonesia Stock Exchange. The cluster analysis used in this method is k-means algorithm, the data in this research is taken from Indonesia Stock Exchange. The cluster analysis in this study analyzed the characteristics of data volumes and stock values, while the results in this study were presented in the form of cluster members visually. Therefore, this cluster analysis in this research can be used for quick and efficient identifier for each member of LQ45 index cluster based on share value for each cluster and its volume. The identification results can be used by beginner-level investors that begun to be interested in stock investments to help make informed decisions about stock trading on desired cluster groups.

 

Proposed System

 

Data Exploration is a preliminary examination of the data to determine its main characteristics and determine the best approach for extracting meaningful information. The main purpose is to encourage in deciding the most appropriate preprocessing and data analysis techniques. Mistakes, there are several processes to be taken, such as cleaning, integration, transformation, reduction of news reports. This shows the missing value filling, combines the report by relevance and consolidates the data by replacing the original information using the news aggregator. Once the stored data is processed in pre-processing data stored in the data repository. The data repository contains data that has been cleared. In this paper using four parts of cluster analysis applied in cluster analysis. Then implemented on two attributes, namely volume and transaction value on shares in the Liquid 45 or blue-chip group in Indonesia Stock Exchange. The data used was taken from Indonesia Stock Exchange.

 

CONCLUSION

 

Using cluster analysis in this study result with the ability to provide information quickly and efficiently for potential novice investors on the distribution map of Liquid 45 shares or bluechip stocks in Indonesia Stock Exchange. The cluster analysis of 45 blue chip stocks in the Indonesia Stock Exchange provides useful and quick information visually to see the map of 45 blue chip stocks divided into four parts according to the needs in stock price attributes and share transaction value so as to provide information quickly and accurate to quickly become the target of stock investors’ decisions.

 

REFERENCES

[1] Y. Luo, J. Hu, X. Wei, D. Fang, and H. Shao, “Stock trends prediction based on hypergraph modeling clustering algorithm,” in 2014 IEEE International Conference on Progress in Informatics and Computing, 2014, pp. 27–31.

[2] H. Leung and T. Ton, “The impact of internet stock message boards on cross-sectional returns of smallcapitalization stocks,” J. Bank. Financ., vol. 55, no. December 1997, pp. 37–55, 2015.

[3] X. Zhong and D. Enke, “A comprehensive cluster and classification mining procedure for daily stock market return forecasting,” Neurocomputing, vol. 267, pp. 152–168, Dec. 2017.

[4] E. N. Desokey, A. Badr, and A. F. Hegazy, “Enhancing stock prediction clustering using K-means with genetic algorithm,” in 2017 13th International Computer Engineering Conference (ICENCO), 2017, pp. 256–261.

[5] R. Asif, A. Merceron, S. A. Ali, and N. G. Haider, “Analyzing undergraduate students’ performance using educational data mining,” Comput. Educ., vol. 113, pp. 177–194, Oct. 2017.

[6] M. S. Packianather, A. Davies, S. Harraden, S. Soman, and J. White, “Data Mining Techniques Applied to a Manufacturing SME,” Procedia CIRP, vol. 62, pp. 123–128, 2017.

[7] R. Mythily, A. Banu, and S. Raghunathan, “Clustering Models for Data Stream Mining,” Procedia Comput. Sci., vol. 46, no. Icict 2014, pp. 619–626, 2015.

[8] R. Wang et al., “Review on mining data from multiple data sources,” Pattern Recognit. Lett., vol. 0, pp. 1–9, Jan. 2018.

[9] S. Aghabozorgi and Y. W. Teh, “Stock market comovement assessment using a three-phase clustering method,” Expert Syst. Appl., vol. 41, no. 4 PART 1, pp. 1301–1314, 2014.

[10] V. Vijay, V. P. Raghunath, A. Singh, and S. N. Omkar, “Variance Based Moving K-Means Algorithm,” in 2017 IEEE 7th International Advance Computing Conference (IACC), 2017, no. i, pp. 841– 847.

[11] Han, J., and Kamber, M. (2012). “Data Mining: Concepts and Techniques”. 4th ed. San Francisco, Morgan Kaufmann Publishers. 2018 International Conference on Information and Communications Technology (ICOIACT) 888