OLViT: Multi-Modal State Tracking via Attention-Based Embeddings for Video-Grounded Dialog
Adnen Abdessaied, Manuel Hochmeister, Andreas Bulling
Proc. 31st Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING), pp. 1–11, 2024.
Abstract
Links
BibTeX
@inproceedings{abdessaied24_coling,
author = {Abdessaied, Adnen and von Hochmeister, Manuel and Bulling, Andreas},
title = {OLViT: Multi-Modal State Tracking via Attention-Based Embeddings for Video-Grounded Dialog},
booktitle = {Proc. 31st Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING)},
year = {2024},
pages = {1--11}
}