Research Collection School Of Computing and Information Systems

MolCA: Molecular graph-language modeling with cross-modal projector and uni-modal adapter

Zhiyuan LIU
Sihang LI
Yanchen LUO
Hao FEI
Yixin CAO, Singapore Management UniversityFollow
Kenji KAWAGUCHI
Xiang WANG
Tat-Seng CHUA

Publication Type

Conference Proceeding Article

Version

acceptedVersion

Publication Date

12-2023

Abstract

Language Models (LMs) have demonstrated impressive molecule understanding ability on various 1D text-related tasks. However, they inherently lack 2D graph perception — a critical ability of human professionals in comprehending molecules’ topological structures. To bridge this gap, we propose MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter. MolCA enables an LM (i.e., Galactica) to understand both text- and graph-based molecular contents via the cross-modal projector. Specifically, the cross-modal projector is implemented as a QFormer to connect a graph encoder’s representation space and an LM’s text space. Further, MolCA employs a uni-modal adapter (i.e., LoRA) for the LM’s efficient adaptation to downstream tasks. Unlike previous studies that couple an LM with a graph encoder via cross-modal contrastive learning, MolCA retains the LM’s ability of open-ended text generation and augments it with 2D graph information. To showcase its effectiveness, we extensively benchmark MolCA on tasks of molecule captioning, IUPAC name prediction, and molecule-text retrieval, on which MolCA significantly outperforms the baselines. Our codes and checkpoints can be found at https: //github.com/acharkq/MolCA.

Discipline

Databases and Information Systems | Programming Languages and Compilers

Research Areas

Data Science and Engineering

Publication

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Singapore, December 6-10

First Page

15623

Last Page

15638

Identifier

10.18653/v1/2023.emnlp-main.966

Publisher

Association for Computational Linguistics

City or Country

Singapore

Citation

LIU, Zhiyuan; LI, Sihang; LUO, Yanchen; FEI, Hao; CAO, Yixin; KAWAGUCHI, Kenji; WANG, Xiang; and CHUA, Tat-Seng. MolCA: Molecular graph-language modeling with cross-modal projector and uni-modal adapter. (2023). Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Singapore, December 6-10. 15623-15638.
Available at: https://ink.library.smu.edu.sg/sis_research/8394

Copyright Owner and License

Authors

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.

Additional URL

https://doi.org/10.18653/v1/2023.emnlp-main.966

Download

Included in

Databases and Information Systems Commons, Programming Languages and Compilers Commons

COinS

Research Collection School Of Computing and Information Systems

MolCA: Molecular graph-language modeling with cross-modal projector and uni-modal adapter

Publication Type

Version

Publication Date

Abstract

Discipline

Research Areas

Publication

First Page

Last Page

Identifier

Publisher

City or Country

Citation

Copyright Owner and License

Creative Commons License

Additional URL

Included in

Search

Links

Browse

Links

Research Collection School Of Computing and Information Systems

MolCA: Molecular graph-language modeling with cross-modal projector and uni-modal adapter

Author

Publication Type

Version

Publication Date

Abstract

Discipline

Research Areas

Publication

First Page

Last Page

Identifier

Publisher

City or Country

Citation

Copyright Owner and License

Creative Commons License

Additional URL

Included in

Share

Search

Links

Browse

Links