Skip to main navigation
Skip to search
Skip to main content
Heriot-Watt Research Portal Home
Help & FAQ
Home
Profiles
Research units
Research output
Datasets
Impacts
Equipment
Prizes
Activities
Press/Media
Courses
Search by expertise, name or affiliation
AlanaVLM: A Multimodal Embodied AI Foundation Model for Egocentric Video Understanding
Alessandro Suglia
, Claudio Greco
, Katie Baker
, Jose L. Part
, Ioannis Papaioannou
,
Arash Eshghi
,
Ioannis Konstas
,
Oliver Lemon
School of Mathematical & Computer Sciences
Computer Science
Research output
:
Working paper
›
Preprint
37
Downloads (Pure)
Overview
Fingerprint
Fingerprint
Dive into the research topics of 'AlanaVLM: A Multimodal Embodied AI Foundation Model for Egocentric Video Understanding'. Together they form a unique fingerprint.
Sort by
Weight
Alphabetically
Computer Science
Video Understanding
100%
Language Modeling
100%
Artificial Intelligence
100%
Foundation Model
100%
Robot
50%
Art Performance
25%
Open Source
25%
Perceptual Experience
25%
Personal Assistant
25%
Generative Pre-Trained Transformer 4
25%
INIS
vision
100%
foundations
100%
humans
40%
robots
40%
datasets
40%
comparative evaluations
20%
performance
20%
benchmarks
20%
boron 7
20%