TY - JOUR
T1 - A roadmap for the functional annotation of protein families
T2 - a community perspective
AU - Valery de Crecy-Lagard
AU - de Crécy-Lagard, Valérie
AU - Amorin de Hegedus, Rocio
AU - Arighi, Cecilia
AU - Babor, Jill
AU - Bateman, Alex
AU - Blaby, Ian
AU - Blaby-Haas, Crysten
AU - Bridge, Alan J
AU - Burley, Stephen K
AU - Cleveland, Stacey
AU - Colwell, Lucy J
AU - Conesa, Ana
AU - Dallago, Christian
AU - Danchin, Antoine
AU - de Waard, Anita
AU - Deutschbauer, Adam
AU - Dias, Raquel
AU - Ding, Yousong
AU - Fang, Gang
AU - Friedberg, Iddo
AU - Gerlt, John
AU - Goldford, Joshua
AU - Gorelik, Mark
AU - Gyori, Benjamin M
AU - Henry, Christopher
AU - Hutinet, Geoffrey
AU - Jaroch, Marshall
AU - Karp, Peter D
AU - Kondratova, Liudmyla
AU - Lu, Zhiyong
AU - Marchler-Bauer, Aron
AU - Martin, Maria-Jesus
AU - McWhite, Claire
AU - Moghe, Gaurav D
AU - Monaghan, Paul
AU - Morgat, Anne
AU - Mungall, Christopher J
AU - Natale, Darren A
AU - Nelson, William C
AU - O'Donoghue, Seán
AU - Orengo, Christine
AU - O'Toole, Katherine H
AU - Radivojac, Predrag
AU - Reed, Colbie
AU - Roberts, Richard J
AU - Rodionov, Dmitri
AU - Rodionova, Irina A
AU - Rudolf, Jeffrey D
AU - Saleh, Lana
AU - Sheynkman, Gloria
N1 - © The Author(s) 2022. Published by Oxford University Press.
PY - 2022/8/12
Y1 - 2022/8/12
N2 - Over the last 25 years, biology has entered the genomic era and is becoming a science of 'big data'. Most interpretations of genomic analyses rely on accurate functional annotations of the proteins encoded by more than 500 000 genomes sequenced to date. By different estimates, only half the predicted sequenced proteins carry an accurate functional annotation, and this percentage varies drastically between different organismal lineages. Such a large gap in knowledge hampers all aspects of biological enterprise and, thereby, is standing in the way of genomic biology reaching its full potential. A brainstorming meeting to address this issue funded by the National Science Foundation was held during 3-4 February 2022. Bringing together data scientists, biocurators, computational biologists and experimentalists within the same venue allowed for a comprehensive assessment of the current state of functional annotations of protein families. Further, major issues that were obstructing the field were identified and discussed, which ultimately allowed for the proposal of solutions on how to move forward.
AB - Over the last 25 years, biology has entered the genomic era and is becoming a science of 'big data'. Most interpretations of genomic analyses rely on accurate functional annotations of the proteins encoded by more than 500 000 genomes sequenced to date. By different estimates, only half the predicted sequenced proteins carry an accurate functional annotation, and this percentage varies drastically between different organismal lineages. Such a large gap in knowledge hampers all aspects of biological enterprise and, thereby, is standing in the way of genomic biology reaching its full potential. A brainstorming meeting to address this issue funded by the National Science Foundation was held during 3-4 February 2022. Bringing together data scientists, biocurators, computational biologists and experimentalists within the same venue allowed for a comprehensive assessment of the current state of functional annotations of protein families. Further, major issues that were obstructing the field were identified and discussed, which ultimately allowed for the proposal of solutions on how to move forward.
KW - Base Sequence
KW - Computational Biology
KW - Genome
KW - Genomics
KW - Molecular Sequence Annotation
KW - Proteins
U2 - 10.1093/database/baac062
DO - 10.1093/database/baac062
M3 - Article
C2 - 35961013
SN - 1758-0463
VL - 2022
JO - Database : the journal of biological databases and curation
JF - Database : the journal of biological databases and curation
ER -