Electronic Theses and Dissertation Database
Library Home  |  ` Library Catalog  |  ETD Home  |  Browse ETDs  |  Search ETDs  |  ETD Resources

Title page for ETD etd-04192006-111652


Type of Document Dissertation
Author Tang, Yuchun
Author's Email Address tyczjs@yahoo.com
URN etd-04192006-111652
Title Granular Support Vector Machines Based on Granular Computing, Soft Computing and Statistical Learning
Degree Ph.D.
Department Computer Science
Advisory Committee
Advisor Name Title
Yan-Qing Zhang Committee Chair
Raj Sunderraman Committee Member
Robert Harrison Committee Member
Yichuan Zhao Committee Member
Keywords
  • Statistical Learning
  • Computational Intelligence
  • Granular Computing
  • Bioinformatics
  • Granular Support Vector Machines
  • Machine Learning
  • Data Mining
Date of Defense 2005-12-02
Availability unrestricted
Abstract
With emergence of biomedical informatics, Web intelligence, and E-business, new challenges are coming for knowledge discovery and data mining modeling problems.

In this dissertation work, a framework named Granular Support Vector Machines (GSVM) is proposed to systematically and formally combine statistical learning theory, granular computing theory and soft computing theory to address challenging predictive data modeling problems effectively and/or efficiently, with specific focus on binary classification problems. In general, GSVM works in 3 steps. Step 1 is granulation to build a sequence of information granules from the original dataset or from the original feature space. Step 2 is modeling Support Vector Machines (SVM) in some of these information granules when necessary. Finally, step 3 is aggregation to consolidate information in these granules at suitable abstract level. A good granulation method to find suitable granules is crucial for modeling a good GSVM.

Under this framework, many different granulation algorithms including the GSVM-CMW (cumulative margin width) algorithm, the GSVM-AR (association rule mining) algorithm, a family of GSVM-RFE (recursive feature elimination) algorithms, the GSVM-DC (data cleaning) algorithm and the GSVM-RU (repetitive undersampling) algorithm are designed for binary classification problems with different characteristics.

The empirical studies in biomedical domain and many other application domains demonstrate that the framework is promising.

As a preliminary step, this dissertation work will be extended in the future to build a Granular Computing based Predictive Data Modeling framework (GrC-PDM) with which we can create hybrid adaptive intelligent data mining systems for high quality prediction.

Files
  Filename       Size       Approximate Download Time (Hours:Minutes:Seconds) 
 
 28.8 Modem   56K Modem   ISDN (64 Kb)   ISDN (128 Kb)   Higher-speed Access 
  tang_yuchun_200605_phd.pdf 1.20 Mb 00:05:34 00:02:51 00:02:30 00:01:15 00:00:06

Browse All Available ETDs by ( Author | Department )

Click here to send a comment to ETD Support