Publication Type
Conference Proceeding Article
Version
acceptedVersion
Publication Date
1-2023
Abstract
We present a multi-modal human instruction comprehension prototype for object acquisition tasks that involve verbal, visual, and pointing gesture cues. Our prototype comprises an AR smart glass for issuing the instructions and a Jetson TX2 pervasive device for executing the comprehension algorithms. With this setup, we enable on-device, computationally efficient comprehension of object acquisition tasks with an average latency in the range of 150-330 msec.
Keywords
Human-AI Collaboration, Multi-Modal Networks, Pervasive Systems, Referring Expression Comprehension, Visual Grounding
Discipline
Software Engineering
Research Areas
Software and Cyber-Physical Systems
Publication
2023 15th International Conference on COMmunication Systems and NETworkS (COMSNETS): Bangalore, January 3-8: Proceedings
First Page
231
Last Page
233
ISBN
9781665477062
Identifier
10.1109/COMSNETS56262.2023.10041269
Publisher
IEEE
City or Country
Piscataway, NJ
Citation
WEERAKOON, Mudiyanselage Dulanga Kaveesha; SUBBARAJU, Vigneshwaran; TRAN, Tuan; and MISRA, Archan.
Demonstrating multi-modal human instruction comprehension with AR smart glass. (2023). 2023 15th International Conference on COMmunication Systems and NETworkS (COMSNETS): Bangalore, January 3-8: Proceedings. 231-233.
Available at: https://ink.library.smu.edu.sg/sis_research/7797
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Additional URL
https://doi.org/10.1109/COMSNETS56262.2023.10041269