Scientific Research and Essays

  • Abbreviation: Sci. Res. Essays
  • Language: English
  • ISSN: 1992-2248
  • DOI: 10.5897/SRE
  • Start Year: 2006
  • Published Articles: 2768

Full Length Research Paper

Dynamic bit vectors: An efficient approach for mining frequent itemsets

Bay Vo1*, Tzung-Pei Hong2,3  and  Bac Le4
1Department of Computer Science, Ho Chi Minh City University of Technology, Ho Chi Minh, Vietnam. 2Department of Computer Science and Information Engineering, National University of Kaohsiung, Kaohsiung, Taiwan, R.O.C. 3Department of Computer Science and Engineering, National Sun Yat-sen University, Kaohsiung, Taiwan, R.O.C. 4Department of Computer Science, University of Science, Ho Chi Minh, Vietnam.
Email: [email protected]

  •  Accepted: 20 September 2011
  •  Published: 30 October 2011

Abstract

There are two common kinds of data formats to be adopted in data mining. One ishorizontal, and the other is vertical. Approaches based on vertical data formats havethe advantages of requiring a fewer number of database scans and computingitemset supports fast. One of the vertical data representations, bit vector, has recently been widely used for mining frequent item sets and has caused significant results. The sizes of bit vectors for item sets are, however, always the same, equal to the number of transactions in a database. In this paper, we propose the scheme of dynamic bit vectors to reduce the memory and the computational time for mining frequent item sets from transaction databases. A fast method for computing the intersection of twodynamic bit vectors and an algorithm for mining frequent item sets based on the scheme are presented. The proposed algorithm is also compared with some other approaches and experimental results show that it is quite efficient in both the mining time and the memory usage.

 

Key words: Data mining, frequent item set, dynamic bit vector, vertical data format.