TY - GEN
T1 - Parallelization of large-scale drug-protein binding experiments
AU - Makris, Antonios
AU - Michail, Dimitrios
AU - Varlamis, Iraklis
AU - Dimitropoulos, Chronis
AU - Tserpes, Konstantinos
AU - Tsatsaronis, George
AU - Haupt, Joachim
AU - Sawyer, Mark
N1 - Publisher Copyright:
© 2017 IEEE.
PY - 2017/9/12
Y1 - 2017/9/12
N2 - Drug polypharmacology or 'drug promiscuity' refers to the ability of a drug to bind multiple proteins. Such studies have huge impact to the pharmaceutical industry, but in the same time require large investments on wet-lab experiments. The respective in-silico experiments have a significantly smaller cost and minimize the expenses for the subsequent lab experiments. However, the process of finding similar protein targets for an existing drug, passes through protein structural similarity and is a highly demanding in computational resources task. In this work, we propose several algorithms that port the protein similarity task to a parallel high-performance computing environment. The differences in size and complexity of the examined protein structures raise several issues in a naive parallelization process that significantly affect the overall time and required memory. We describe several optimizations for better memory and CPU balancing which achieve faster execution times. Experimental results, on a high-performance computing environment with 512 cores and 2048GB of memory, demonstrate the effectiveness of our approach which scales well to large amounts of protein pairs.
AB - Drug polypharmacology or 'drug promiscuity' refers to the ability of a drug to bind multiple proteins. Such studies have huge impact to the pharmaceutical industry, but in the same time require large investments on wet-lab experiments. The respective in-silico experiments have a significantly smaller cost and minimize the expenses for the subsequent lab experiments. However, the process of finding similar protein targets for an existing drug, passes through protein structural similarity and is a highly demanding in computational resources task. In this work, we propose several algorithms that port the protein similarity task to a parallel high-performance computing environment. The differences in size and complexity of the examined protein structures raise several issues in a naive parallelization process that significantly affect the overall time and required memory. We describe several optimizations for better memory and CPU balancing which achieve faster execution times. Experimental results, on a high-performance computing environment with 512 cores and 2048GB of memory, demonstrate the effectiveness of our approach which scales well to large amounts of protein pairs.
UR - http://www.scopus.com/inward/record.url?scp=85032372794&partnerID=8YFLogxK
U2 - 10.1109/HPCS.2017.39
DO - 10.1109/HPCS.2017.39
M3 - Contribución a la conferencia
AN - SCOPUS:85032372794
T3 - Proceedings - 2017 International Conference on High Performance Computing and Simulation, HPCS 2017
SP - 201
EP - 208
BT - Proceedings - 2017 International Conference on High Performance Computing and Simulation, HPCS 2017
A2 - Smari, Waleed W.
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 15th International Conference on High Performance Computing and Simulation, HPCS 2017
Y2 - 17 July 2017 through 21 July 2017
ER -