Video Retrieval Using High-level features: Exploiting Query-matching and Confidence-based Weighting

Motivation

nProblems in news video retrieval

nPrimary source of semantics come from ASR, but ASR tends to be erroneous & non-grammatical

nText does not necessarily relate well to visual information

nLow-level features are unreliable and unpredictable

nWhat we have:

nAnnotation of relevant high-level features (HLFs) with varying accuracy

nQuestion: How to capitalize these HLF’s to support retrieval?