Surgical-VQLA: Transformer with Gated Vision-Language Embedding for Visual Question Localized-Answering in Robotic Surgery (2023)

First Author: Bai L

Abstract

No abstract provided

Bibliographic Information

Digital Object Identifier: http://dx.doi.org/10.48550/arxiv.2305.11692

Publication URI: https://arxiv.org/abs/2305.11692

Type: Preprint