Sequoya: multiobjective multiple sequence alignment in Python
Benitez-Hidalgo, Antonio; Nebro, Antonio J.; Aldana-Montes, Jose F.
Publicación: BIOINFORMATICS
2020
VL / 36 - BP / 3892 - EP / 3893
abstract
Motivation: Multiple sequence alignment (MSA) consists of finding the optimal alignment of three or more biological sequences to identify highly conserved regions that may be the result of similarities and relationships between the sequences. MSA is an optimization problem with NP-hard complexity (non-deterministic polynomial-time hardness), because the time needed to find optimal alignments raises exponentially along with the number of sequences and their length. Furthermore, the problem becomes multiobjective when more than one score is considered to assess the quality of an alignment, such as maximizing the percentage of totally conserved columns and minimizing the number of gaps. Our motivation is to provide a Python tool for solving MSA problems using evolutionary algorithms, a nonexact stochastic optimization approach that has proven to be effective to solve multiobjective problems. Results: The software tool we have developed, called Sequoya, is written in the Python programming language, which offers a broad set of libraries for data analysis, visualization and parallelism. Thus, Sequoya offers a graphical tool to visualize the progress of the optimization in real time, the ability to guide the search toward a preferred region in run-time, parallel support to distribute the computation among nodes in a distributed computing system, and a graphical component to assist in the analysis of the solutions found at the end of the optimization.
MENTIONS DATA
Mathematics
-
0 Twitter
-
0 Wikipedia
-
0 News
-
9 Policy
Publicaciones similares en Mathematics