Large-Scale Automatic Labeling of Video Events with Verbs Based on Event-Participant Interaction

Computer Science – Computer Vision and Pattern Recognition

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Scientific paper

We present an approach to labeling short video clips with English verbs as event descriptions. A key distinguishing aspect of this work is that it labels videos with verbs that describe the spatiotemporal interaction between event participants, humans and objects interacting with each other, abstracting away all object-class information and fine-grained image characteristics, and relying solely on the coarse-grained motion of the event participants. We apply our approach to a large set of 22 distinct verb classes and a corpus of 2,584 videos, yielding two surprising outcomes. First, a classification accuracy of greater than 70% on a 1-out-of-22 labeling task and greater than 85% on a variety of 1-out-of-10 subsets of this labeling task is independent of the choice of which of two different time-series classifiers we employ. Second, we achieve this level of accuracy using a highly impoverished intermediate representation consisting solely of the bounding boxes of one or two event participants as a function of time. This indicates that successful event recognition depends more on the choice of appropriate features that characterize the linguistic invariants of the event classes than on the particular classifier algorithms.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Large-Scale Automatic Labeling of Video Events with Verbs Based on Event-Participant Interaction does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Large-Scale Automatic Labeling of Video Events with Verbs Based on Event-Participant Interaction, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Large-Scale Automatic Labeling of Video Events with Verbs Based on Event-Participant Interaction will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-7499

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.