FuzzyCmeans

From HP-SEE Wiki

Revision as of 06:57, 4 July 2011 by Silviu (Talk | contribs)
Jump to: navigation, search

Contents

General Information

  • Application's name: Parallel Fuzzy C Mean for classification/Feature detection category
  • Virtual Research Community: EO-Science
  • Scientific contact: Dana Petcu, petcu@info.uvt.ro
  • Technical contact: Silviu Panica, silviu@info.uvt.ro
  • Developers: Silviu Panica, Daniela Zaharie, West University of Timisoara, Romania ({silviu,dzaharie}@info.uvt.ro)
  • Web site: http://research.info.uvt.ro/

Short Description

Fuzzy clustering algorithms allow the identification of spatially continuous regions of pixels characterized by similar feature values, that’s through considering the fact that a pixel in a satellite image may contain spectral information corresponding to different ground components. Since the satellite images are usually large, designing efficient implementation of fuzzy clustering algorithms attracted the interest of researchers. Currently, there exist parallel variants of the traditional Fuzzy C-Means (FCM) algorithm, but their extension to the case of algorithms involving spatial information has not been investigated yet.

This research work aims to extend the existing parallelization of FCM to include some spatial variants (e.g. FCM with spatial information and Gaussian Kernel based FCM). The proposed parallelization is based on three basic ideas: spatial slicing of images, exploiting the collective computations, as much as possible, and reducing the communication between processors. Several slicing strategies were analyzed with respect to their ability to ensure a balanced load of processors. There were also proposed parallel variants for the computation of cluster validity indices useful in the context of semi-automatic identification of the number of classes.

Problems Solved

Fuzzy c-means solves the problem of object clustering in case of remote sensing images. This algorithm tries to identify spatially continuous regions of pixels characterized by similar feature values which most likely corresponds to similar ground cover types, e.g. generate vegetation maps of an area of interest.

Scientific and Social Impact

...

Collaborations and Beneficiaries

This work was done in collaboration with:

  • IBM Center of Advanced Studies, Egypt
    • Ahmed Sayed, asayed72@yahoo.com
    • Hisham El-Shishiny, shishiny@eg.ibm.com
  • Faculty of Computer and Information Sciences, Ain Shams University, Cairo, Egypt.
    • Ashraf S. Hussein, ashrafh@acm.org

Primary beneficiaries will be our local research group from geography department but also the students that are involved on earth observation trainings or research groups.

Technical Features and HP-SEE Implementation

  • Primary programming language: C
  • Parallel programming paradigm: MPI/MPIX (BlueGene/P)
  • Main parallel code: existing code
  • Pre/post processing code: local development
  • Application tools and libraries: MPICH2, OpenMPI and MPIX
  • Number of cores required: 2048
  • Minimum RAM/core required: 1GB
  • Storage space during a single run: 850MB (average storage; it depends on the input datasets type and size)
  • Long-term data storage: 10TB

Usage Example

Will be added when the tool goes in production

Publications

  • D.Petcu, D. Zaharie, S.Panica, A.S. Hussein, A. Sayed, H. El-Shishiny, Fuzzy Clustering of Large Satellite Images using High Performance Computing, accepted at SPIE Remote Sensing Conference: High-Performance Computing in Remote Sensing, 19-22 September 2011, Prague.
Personal tools