Big Data Analytics in Healthcare: COVID-19 Indonesia Clustering

Main Article Content

Johanes Fernandes Andry
Glisina Dwinoor Rembulan
Edwin Leonard Salim
Endang Fatmawati
Hendy Tannady

Keywords

Big Data, Big Data Analytics, Data Mining, COVID-19, Clustering, k-means algorithm

Abstract

The rapid growth of the Internet and Technology produced a massive amount of data that resulted a phenomenon called Big Data. To process such a complex kind of massive amount of data, an advanced approach and tool is needed that is able to quickly produce results. This approach to analyzing massive amount of data is known as Big Data Analytics. Big data analytics is widely used in various sectors, not to mention the health sector. In the healthcare sector, recently there has been a study that is often carried out in dealing with crisis situations, namely research on implementing big data analytics to provide technological solutions to help deal with pandemics. In this article, we analyze and visualize the data collected from Indonesia. The data analyzed starts from the first case of COVID-19 in Indonesia to present. The proposed solution is to classify the regional case data into a group that can represent the situation of the area. As a result, it is determined based on the data that there are three groups consisting of areas with low risk, moderate risk, and high risk. In addition, this article proposes combining big data analytics technology with cloud technology to facilitate the dissemination of information to citizens to increase awareness about the spread of the COVID-19 virus.

Abstract 313 | PDF Downloads 268

References

1. M. Chen, S. Mao, and Y. Liu, “Big data: A survey,” Mob. Networks Appl., vol. 19, no. 2, pp. 171–209, 2014, doi: 10.1007/s11036-013-0489-0.
2. S. Dash, S. K. Shakyawar, M. Sharma, and S. Kaushik, “Big data in healthcare: management, analysis and future prospects,” J. Big Data, vol. 6, no. 1, 2019, doi: 10.1186/s40537-019-0217-0.
3. X. Wu, X. Zhu, G.-Q. Wu, and W. Ding, “Data Mining with Big Data Xindong,” Ieeexplore.Ieee.Org, pp. 1–26, 2014, [Online]. Available:
https://ieeexplore.ieee.org/abstract/document/6547630/.
4. O. Müller, I. Junglas, J. Vom Brocke, and S. Debortoli, “Utilizing big data analytics for information systems research: Challenges,
promises and guidelines,” Eur. J. Inf. Syst., vol. 25, no. 4, pp. 289–302, 2016, doi:10.1057/ejis.2016.2.
5. J. P. John and S. Vasudevan, “Big Data Analytics In Healthcare,” 2018 10th Int. Conf. Adv. Comput. ICoAC 2018, no. February, pp. 212–
215, 2016.
6. W. Raghupathi and V. Raghupathi, “Big Data Analytics in Healthcare: Promise and Potential,” Heal. Inf. Sci. Syst., 2014, doi: 10.1186/2047-
2501-2-3.
7. T. Singhal, “Review on COVID19 disease so far,” Indian J. Pediatr., vol. 87, no. April, pp. 281–286, 2020.
8. R. Madurai Elavarasan and R. Pugazhendhi,“Restructured society and environment: A review on potential technological strategies to control the COVID-19 pandemic,” Sci. Total Environ., vol.725, p. 138858, 2020, doi:10.1016/j.scitotenv.2020.138858.
9. T. J. Post, “Stay home, President says,” 2020.
10. S. Tuli, S. Tuli, R. Tuli, and S. S. Gill, “Predictingthe growth and trend of COVID-19 pandemic using machine learning and cloud computing,”
Internet of Things, vol. 11, p. 100222, 2020, doi:10.1016/j.iot.2020.100222.
11. K. Abdulmajeed, M. Adeleke, and L. Popoola,“Online Forecasting of Covid-19 Cases in Nigeria Using Limited Data,” Data Br., vol. 30, p.
105683, 2020, doi: 10.1016/j.dib.2020.105683.
12. A. Mahmudan, “Clustering of District or City in Central Java Based COVID-19 Case Using KMeans Clustering,” J. Mat. Stat. dan Komputasi,
vol. 17, no. 1, pp. 1–13, 2020, doi:10.20956/jmsk.v17i1.10727.
13. A. Mcafee and E. Brynjolfsson, “Spotlight on Big Data Big Data: The Management Revolution, 2012. Acedido em 15-03-2017,” Harv. Bus. Rev.,no. October, pp. 1–9, 2012.
14. V. Mayer-Schonberger and K. Cukier, Big Data: A Revolution That Will Transform How We Live, Work, and Think, vol. 53, no. 9. Boston:
Houghton Mifflin Harcourt, 2013.
15. C. C. Aggarwal and C. C. Aggarwal, Data Classification. 2015.
16. D. T. Larose and C. D. Larose, Data Mining and Predictive Analytics, 2nd ed. Wiley Publishing, 2015.
17. P.-N. Tan, M. Steinbach, and V. Kumar,Introduction to Data Mining. Pearson, 2013.
18. E. Turban, R. Sharda, and D. Delen, Business Intelligence and Analytics: Systems for Decision Support, Global Edition, 10th editi. Pearson
Education Limited, 2014.
19. V. Kotu and B. Deshpande, Predictive Analytics and Data Mining: Concepts and Practice with RapidMiner. Elsevier Science, 2014.
20. I. H. Witten, E. Frank, and M. a Hall, Data Mining: Practical Machine Learning Tools and Techniques (Google eBook). 2011.
21. EMC Education Services, Data Science and Big Data Analytics: Discovering, Analyzing, Visualizing and Presenting Data. Wiley, 2016.
22. Hendratno, “COVID-19 Indonesia Dataset.” 2020, doi: 10.34740/kaggle/dsv/1608770.
23. J. F. Andry, H. Tannady, G. D. Rembulan & A. Rianto. The importance of big data for healthcare and its usage in clinical statistics of
cardiovascular disease. Journal of Population Therapeutics and Clinical Pharmacology, 29(04), 107-115, 2022.
24. D. Y. Heryadi, H. Tannady, G. D. Rembulan, B. Rofatin, & R. S. Sundari. Changes in behavior and welfare of organic rice farmers during the
COVID-19 pandemic. Caspian Journal of Environmental Sciences, 21(1), 191-197, 2023.
25. J. F. Andry, L. Liliana, H. Tannady, & A. S. Arief. (2022, December). Data Centre Risk Analysis Using ISO 31000: 2009 Framework.
In Journal of Physics: Conference Series (Vol. 2394, No. 1, p. 012032). IOP Publishing.
26. H. Tannady, J. M. Renwarin, A. N. D. Cora, & E. Purwanto. (2021, July). Production Planning and Inventory Control of Atonic Fertilizer Products Using Static Lot Sizing Method. In IOP Conference Series: Earth and Environmental Science (Vol. 819, No. 1, p. 012087). IOP
Publishing.