Research Collection School Of Computing and Information Systems

FAIR: Flow type-aware pre-training of compiler intermediate representations

Changan NIU
Chuanyi LI
Vincent NG
David LO, Singapore Management UniversityFollow
Bin LUO

Publication Type

Conference Proceeding Article

Publication Date

4-2024

Abstract

While the majority of existing pre-trained models from code learn source code features such as code tokens and abstract syntax trees, there are some other works that focus on learning from compiler intermediate representations (IRs). Existing IR-based models typically utilize IR features such as instructions, control and data flow graphs (CDFGs), call graphs, etc. However, these methods confuse variable nodes and instruction nodes in a CDFG and fail to distinguish different types of flows, and the neural networks they use fail to capture long-distance dependencies and have over-smoothing and over-squashing problems. To address these weaknesses, we propose FAIR, a Flow type-Aware pre-trained model for IR that involves employing (1) a novel input representation of IR programs; (2) Graph Transformer to address over-smoothing, over-squashing and long-dependencies problems; and (3) five pre-training tasks that we specifically propose to enable FAIR to learn the semantics of IR tokens, flow type information, and the overall representation of IR. Experimental results show that FAIR can achieve state-of-the-art results on four code-related downstream tasks.

Discipline

Software Engineering

Research Areas

Software and Cyber-Physical Systems

Publication

Proceedings of the 2024 IEEE/ACM 46th International Conference on Software Engineering (ICSE), Lisbon, Portugal, April 14-20

First Page

378

Last Page

389

ISBN

9798400702174

Publisher

IEEE

City or Country

Los Alamitos, CA

Citation

NIU, Changan; LI, Chuanyi; NG, Vincent; LO, David; and LUO, Bin. FAIR: Flow type-aware pre-training of compiler intermediate representations. (2024). Proceedings of the 2024 IEEE/ACM 46th International Conference on Software Engineering (ICSE), Lisbon, Portugal, April 14-20. 378-389.
Available at: https://ink.library.smu.edu.sg/sis_research/9265

This document is currently not available here.

COinS

Research Collection School Of Computing and Information Systems

FAIR: Flow type-aware pre-training of compiler intermediate representations

Publication Type

Publication Date

Abstract

Discipline

Research Areas

Publication

First Page

Last Page

ISBN

Publisher

City or Country

Citation

Search

Links

Browse

Links

Research Collection School Of Computing and Information Systems

FAIR: Flow type-aware pre-training of compiler intermediate representations

Author

Publication Type

Publication Date

Abstract

Discipline

Research Areas

Publication

First Page

Last Page

ISBN

Publisher

City or Country

Citation

Share

Search

Links

Browse

Links