Wang, Guangyi (2006) Using motif databases to help improve multiple sequence alignment. Masters thesis, Concordia University.
| PDF - Accepted Version 1551Kb |
Abstract
Current progress in genome research projects has generated huge amount of data. As a result, the analysis of these data is now a bottleneck in bioinformatics. Multiple sequence alignment is an important step in this kind of analysis. It compares unknown sequences with well studied ones, and thus infers functional and structural information of the unknown sequences. However, due to the NP-completeness nature of the multiple sequence alignment, exhaustive searching method is unrealistic. Current algorithms use heuristic approach to get a nearly global optimal result. As a consequence, any specific program may encounter certain cases that it is not good at. In this work, we use protein motif databases to improve the alignment. The basic idea is to detect possible occurrences of motifs on the sequences, and force those parts to be aligned together. Unlike existing programs, this method uses biological information instead of treating it as purely an optimization problem. It also reduces the searching space. Experiments show that using motif databases could generate good results
| Divisions: | Concordia University > Faculty of Engineering and Computer Science > Computer Science and Software Engineering |
|---|---|
| Item Type: | Thesis (Masters) |
| Authors: | Wang, Guangyi |
| Pagination: | vii, 50 leaves : ill. ; 29 cm. |
| Institution: | Concordia University |
| Degree Name: | M. Comp. Sc. |
| Program: | Computer Science and Software Engineering |
| Date: | 2006 |
| Thesis Supervisor(s): | Lam, Clement |
| ID Code: | 9056 |
| Deposited By: | Concordia University Libraries |
| Deposited On: | 18 Aug 2011 14:43 |
| Last Modified: | 18 Aug 2011 14:43 |
| Related URLs: |
Repository Staff Only: item control page

