Database replication for optimal performance in parallel BLAST
Document Type
Conference Proceeding
Publication Date
3-17-2017
Abstract
Parallelization of BLAST at the software level usually segments either the query or the database but not both. In this paper, we investigate a hybrid segmentation approach that combines database segmentation with query segmentation. By organizing the nodes into groups, splitting the queries among groups, and replicating the entire database at each group, we take advantage of both database segmentation and query segmentation. With an appropriate number of nodes in each group, the portion of the database at each node is small enough to reside in core memory, thus avoiding being paged to disks, and the intercommunication and synchronization between the workers and the leader of a group is contained, thus resulting a better performance.
Recommended Citation
Zhao, Guanghua, "Database replication for optimal performance in parallel BLAST" (2017). College of Health, Science, and Technology. 698.
https://digitalcommons.uncfsu.edu/college_health_science_technology/698