Video Google: A Text Retrieval Approach to Object Matching in Videos

We will demonstrate an approach to object and scene retrieval which searches for and localizes all the occurrences of a user outlined object in a video. The object is represented by a set of viewpoint invariant region descriptors so that recognition can proceed successfully despite changes in viewpoint, illumination and partial occlusion. The temporal continuity of the video within a shot is used to track the regions in order to reject unstable regions and reduce the effects of noise in the descriptors. The analogy with text retrieval is in the implementation where matches on descriptors are pre-computed (using vector quantization), and inverted file systems and document rankings are used. The result is that retrieval is immediate, returning a ranked list of key frames/shots in the manner of Google (link)

Post to Twitter

This entry was posted in art, digital humanities, humanities computing, internet, video, web2.0 and tagged , . Bookmark the permalink. Post a comment or leave a trackback: Trackback URL.

Post a Comment

Your email is never published nor shared. Required fields are marked *

*
*

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>

  • ...this blog is obsessively directed at profiling digital humanities developments in a cultural, social, and technical sense and in terms of books and applications...it is an aggregation or 'meta' style blog with the occasional commentary

    Hi, my name is Dr Craig Bellamy and I am a digital humanities analyst for the Victorian eResearch Strategic Initiative, a consortium based at the University of Melbourne, however, the views expressed in this blog are the responsibility of the author alone.

    Subscribe

    Follow me on Twitter

  • Pages

  • Categories

  • Archives