Computer Science – Artificial Intelligence
Scientific paper
2011-06-30
Journal Of Artificial Intelligence Research, Volume 21, pages 429-470, 2004
Computer Science
Artificial Intelligence
Scientific paper
10.1613/jair.1327
We present a visually-grounded language understanding model based on a study of how people verbally describe objects in scenes. The emphasis of the model is on the combination of individual word meanings to produce meanings for complex referring expressions. The model has been implemented, and it is able to understand a broad range of spatial referring expressions. We describe our implementation of word level visually-grounded semantics and their embedding in a compositional parsing framework. The implemented system selects the correct referents in response to natural language expressions for a large percentage of test cases. In an analysis of the system's successes and failures we reveal how visual context influences the semantics of utterances and propose future extensions to the model that take such context into account.
Gorniak P.
Roy Damien
No associations
LandOfFree
Grounded Semantic Composition for Visual Scenes does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.
If you have personal experience with Grounded Semantic Composition for Visual Scenes, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Grounded Semantic Composition for Visual Scenes will most certainly appreciate the feedback.
Profile ID: LFWR-SCP-O-479966