Electronic Theses and Dissertation Database
Library Home  |  ` Library Catalog  |  ETD Home  |  Browse ETDs  |  Search ETDs  |  ETD Resources

Title page for ETD etd-02072006-103620


Type of Document Master's Thesis
Author Clayton, Arnshea
Author's Email Address aclayton1@student.gsu.edu
URN etd-02072006-103620
Title The Relative Importance of Input Encoding and Learning Methodology on Protein Secondary Structure Prediction
Degree Master of Science
Department Computer Science
Advisory Committee
Advisor Name Title
Yanqing Zhang Committee Chair
Rajeshekhar Sunderraman Committee Member
Yi Pan Committee Member
Keywords
  • Neural Networks
  • Learning Vector Quantization
  • Protein Secondary Structure Prediction
  • Resilient Propagation
Date of Defense 2006-01-09
Availability unrestricted
Abstract
In this thesis the relative importance of input encoding and learning algorithm on protein secondary structure prediction is explored. A novel input encoding, based on multidimensional scaling applied to a recently published amino acid substitution matrix, is developed and shown to be superior to an arbitrary input encoding. Both decimal valued and binary input encodings are compared. Two neural network learning algorithms, Resilient Propagation and Learning Vector Quantization, which have not previously been applied to the problem of protein secondary structure prediction, are examined. Input encoding is shown to have a greater impact on prediction accuracy than learning methodology with a binary input encoding providing the highest training and test set prediction accuracy.
Files
  Filename       Size       Approximate Download Time (Hours:Minutes:Seconds) 
 
 28.8 Modem   56K Modem   ISDN (64 Kb)   ISDN (128 Kb)   Higher-speed Access 
  clayton_arnshea_200605_ms.pdf 402.38 Kb 00:01:51 00:00:57 00:00:50 00:00:25 00:00:02

Browse All Available ETDs by ( Author | Department )

Click here to send a comment to ETD Support