OPTIMIZED THE MARITIME BIG DATA K-MEANS CLUSTERING BASED ON THE MAPREDUCE MODEL

  • PHAM TUAN ANH
  • DANG XUAN KIEN
  • PHAM TAM THANH
Keywords: MapReduce, K-means, AIS data, data mining.

Abstract

With the development of information technology, the maritime big data is an increasing trend of applications being expected to deal with big data that usually do not fit in the main memory of an analyzing big data is a challenging problem today. For such data intensive application, the maritime big data, the “MapReduce” framework has recently attracted considerable attention and started to be investigated for analysis which can handle petabyte of AIS data for millions of vessels. MapReduce is a programming model that allows easy development of scalable parallel applications to process big data on large clusters of commodity machines. This study, a standard clustering algorithm called K-means is based on the MapReduce model to be processed the marine traffic data in southern region, Viet Nam. According to the main results obtained, we concerned with making inference or prediction the clustering data which were collected and were shown the dashboard of maritime vessels traffic, including the scale, the trend of change and the spatial distribution situation.

Tác giả

PHAM TUAN ANH

1Trường Đại học Giao thông vận tải Thành phố Hồ Chí Minh

DANG XUAN KIEN

1Trường Đại học Giao thông vận tải Thành phố Hồ Chí Minh

PHAM TAM THANH

3Trường Đại học Hàng hải Việt Nam

điểm /   đánh giá
Published
2022-03-17
Section
Khoa học - Kỹ thuật