Hypervideo
Hypervideo, or hyperlinked video, is a displayed video stream that contains embedded, user-clickable anchors, allowing navigation between video and other hypermedia elements. Hypervideo is thus analogous to hypertext, which allows a reader to click on a word in one document and retrieve information from another document, or from another place in the same document. That is, hypervideo combines video with a non-linear information structure, allowing a user to make choices based on the content of the video and the user's interests.

A crucial difference between hypervideo and hypertext is the element of time. Text is normally static, while a video is necessarily dynamic; the content of the video changes with time. Consequently, hypervideo has different technical, aesthetic, and rhetorical requirements than a static hypertext page. For example, hypervideo might involve the creation of a link from an object in a video that is visible for only a certain duration. It is therefore necessary to segment the video appropriately and add the metadata required to link from frames, or even objects, in a video to the pertinent information in other media forms.
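The idea of a link that exists only while an object is on screen can be made concrete with a small sketch. The following Python is illustrative only: the `Anchor` record, its field names, and the target string are assumptions made for this example, not the format of any particular hypervideo system. It models an anchor as a bounding box that is live during a time interval, and resolves a click against the current playback time.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Anchor:
    """A clickable region that is live only during [start, end) seconds."""
    start: float   # time the object becomes visible, in seconds
    end: float     # time the object disappears, in seconds
    x: int         # left edge of the bounding box, in pixels
    y: int         # top edge of the bounding box, in pixels
    width: int
    height: int
    target: str    # hypothetical link target (URL, document ID, another clip)

    def contains(self, t: float, px: int, py: int) -> bool:
        in_time = self.start <= t < self.end
        in_box = (self.x <= px < self.x + self.width
                  and self.y <= py < self.y + self.height)
        return in_time and in_box


def resolve_click(anchors: List[Anchor], t: float, px: int, py: int) -> Optional[str]:
    """Return the target of the first anchor hit by a click at time t, if any."""
    for anchor in anchors:
        if anchor.contains(t, px, py):
            return anchor.target
    return None


anchors = [Anchor(start=12.0, end=18.5, x=100, y=50, width=80, height=120,
                  target="info/lamp.html")]
print(resolve_click(anchors, 15.0, 130, 90))   # inside the box while it is live
print(resolve_click(anchors, 20.0, 130, 90))   # same spot, but the link has expired
```

The second call returns nothing even though the click lands on the same pixels, which is exactly the time dependence that a static hypertext page does not have.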

History of Hypervideo

Illustrating the natural progression to hypervideo from hypertext, the software Storyspace, a hypertext writing environment, employs a spatial metaphor for displaying links. Storyspace utilizes 'writing spaces', generic containers for content, which link to other writing spaces. HyperCafe, a popular experimental prototype of hypervideo, made use of this tool to create "narrative video spaces". HyperCafe was developed as an early model of a hypervideo system, placing users in a virtual cafe where the user dynamically interacts with the video to follow different conversations.

Video-to-video linking was demonstrated by the Interactive Cinema Group at the MIT Media Lab. Elastic Charles was a hypermedia journal developed between 1988 and 1989, in which "micons" were placed inside a video, indicating links to other content. When implementing the Interactive Kon-Tiki Museum, Listol used micons to represent video footnotes. Video footnotes were a deliberate extension of the literary footnote applied to annotating video, thereby providing continuity between traditional text and early hypervideo. In 1993, Hirata et al. considered media-based navigation for hypermedia systems, in which the query is of the same media type as the media to be retrieved. For example, a part of an image (defined by shape or color, for example) could link to a related image. In this approach, the content of the video becomes the basis for forming the links to other related content.

HotVideo was an implementation of this kind of hypervideo, developed at IBM's China Research Laboratory in 1996. Navigation to associated resources was accomplished by clicking on a dynamic object in a video. In 1997, a project of the MIT Media Lab's Object-Based Media Group called HyperSoap further developed this concept. HyperSoap was a short soap opera program in which a viewer could click with an enhanced remote control on objects in the video to find information on how they could be purchased. The company Watchpoint Media was formed in order to commercialize the technology involved, resulting in a product called Storyteller, oriented towards interactive television. Watchpoint Media was acquired by Goldpocket in 2003, which was in turn acquired by Tandberg Television in late 2005.

eline Technologies, founded in 1999, developed VideoClix, the first viable hypervideo solution. Today VideoClix is the most widely used SaaS (software as a service) solution for distributing and monetizing clickable video on the web and mobile devices. Its videos can play back in popular video players such as QuickTime and Flash, as well as on multiple online video platforms (OVPs) such as Brightcove, ThePlatform and Ooyala. VideoClix also offers technology that can be integrated into any third-party player based on QuickTime, Flash, MPEG-4 and HTML5. The product has proven to be a commercial success. In 2006, eline Technologies was acquired by VideoClix Inc. VideoClix's client base includes Disney, ESPN, MTV Networks, Dailymotion and Revision3, as well as brands such as Apple, Kraft, Fruit of the Loom and many others.

In 1997, the Israeli software firm Ephyx Technologies released a product called v-active, one of the first commercial object-based authoring systems for hypervideo. This technology was not a success, however; Ephyx changed its name to Veon in 1999, at which time it shifted focus away from hypervideo to the provision of development tools for web and broadband content.

wireWAX also offers a hypervideo authoring tool. With this tool, the user selects a person or object to 'tag', and it is then automatically tracked in real time by a series of algorithms. The accuracy of the tracking may vary depending on the video, object movement, color spectrum, and so on. The hotspot can then be customized, and further information can be attached by the user.

Concepts and Technical Challenges

Hypervideo is challenging, compared to hyperlinked text, due to the unique difficulty video presents in node segmentation; that is, separating a video into algorithmically identifiable, linkable content.

Video, at its most basic, is a time sequence of images, which are in turn simply two-dimensional arrays of color information. In order to segment a video into meaningful pieces (objects in images, or scenes within videos), it is necessary to provide a context, both in space and time, to extract meaningful elements from this image sequence. Humans are naturally able to perform this task; however, developing a method to achieve this automatically (that is, by algorithm) is a complex problem.

It is nonetheless desirable to do this algorithmically. At the nominal NTSC frame rate of 30 frames per second, even a short video of 30 seconds comprises 900 frames. The identification of distinct video elements would be a tedious task if human intervention were required for every frame. Clearly, even for moderate amounts of video material, manual segmentation is unrealistic.

From the standpoint of time, the smallest unit of a video is the frame (the finest time granularity). Node segmentation could be performed at the frame level—a straightforward task as a frame is easily identifiable. However, a single frame cannot contain video information, since videos are necessarily dynamic. Analogously, a single word separated from a text does not convey meaning. Thus it is necessary to consider the scene, which is the next level of temporal organization. A scene can be defined as the minimum sequential set of frames that conveys meaning. This is an important concept for hypervideo, as one might wish a hypervideo link to be active throughout one scene, though not in the next. Scene granularity is therefore natural in the creation of hypervideo. Consequently, hypervideo requires algorithms capable of detecting scene transitions.
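One simple family of scene-transition detectors compares the intensity histograms of consecutive frames and declares a cut when they differ sharply. The sketch below illustrates that idea only; it is not the method of any system named in this article. Frames are assumed to be small grids of 8-bit grayscale values, and the bin count and threshold are arbitrary, tunable assumptions.

```python
from typing import List, Sequence

Frame = Sequence[Sequence[int]]  # rows of 8-bit grayscale pixel values


def histogram(frame: Frame, bins: int = 8) -> List[float]:
    """Normalized intensity histogram of one frame."""
    counts = [0] * bins
    total = 0
    for row in frame:
        for value in row:
            counts[value * bins // 256] += 1
            total += 1
    return [c / total for c in counts]


def scene_boundaries(frames: Sequence[Frame], threshold: float = 0.5) -> List[int]:
    """Indices i at which frame i is judged to start a new scene.

    A cut is declared when the L1 distance between the histograms of
    consecutive frames exceeds `threshold` (an assumed, tunable value).
    """
    if not frames:
        return []
    cuts = []
    prev = histogram(frames[0])
    for i in range(1, len(frames)):
        cur = histogram(frames[i])
        distance = sum(abs(a - b) for a, b in zip(prev, cur))
        if distance > threshold:
            cuts.append(i)
        prev = cur
    return cuts


dark = [[10, 20], [15, 25]]
bright = [[200, 220], [210, 230]]
video = [dark, dark, dark, bright, bright]
print(scene_boundaries(video))  # [3]
```

A detector of this kind catches hard cuts but tends to miss gradual transitions such as fades and dissolves, which is one reason scene detection remains harder than frame-level segmentation.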

Of course, one can imagine coarser levels of temporal organization. Scenes can be grouped together to form a narrative sequence, which in turn are grouped to form a video; from the point of view of node segmentation, these concepts are not as critical. Issues of time in hypervideo were considered extensively in the creation of the HyperCafe.

Even if the frame is the smallest time unit, one can still spatially segment a video at a sub-frame level, separating the frame image into its constituent objects; this is necessary when performing node segmentation at the object level. Time introduces complexity in this case also, for even after an object is differentiated in one frame, it is usually necessary to follow the same object through a sequence of frames. This process, known as object tracking, is essential to the creation of links from objects in videos. Spatial segmentation of objects can be achieved, for example, through the use of intensity gradients to detect edges, color histograms to match regions, motion detection, or a combination of these and other methods.
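Of the techniques listed, edge detection from intensity gradients is the easiest to show in a few lines. The following is a minimal sketch under simplifying assumptions: a single grayscale frame stored as nested lists, central-difference gradients, and an arbitrary threshold. Production systems typically use more robust operators and combine several cues.

```python
from typing import List, Sequence


def edge_map(image: Sequence[Sequence[int]], threshold: float = 50.0) -> List[List[bool]]:
    """Mark pixels whose intensity gradient magnitude exceeds `threshold`.

    Gradients are central differences; border pixels are left unmarked.
    """
    height, width = len(image), len(image[0])
    edges = [[False] * width for _ in range(height)]
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            gx = (image[y][x + 1] - image[y][x - 1]) / 2.0
            gy = (image[y + 1][x] - image[y - 1][x]) / 2.0
            edges[y][x] = (gx * gx + gy * gy) ** 0.5 > threshold
    return edges


# A dark region on the left and a bright region on the right.
image = [[0, 0, 0, 255, 255, 255] for _ in range(4)]
for row in edge_map(image):
    print("".join("#" if e else "." for e in row))
```

The marked pixels trace the vertical boundary between the two regions. Turning such boundaries into a closed object outline, and then following that outline from frame to frame, is the harder part of the problem.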

Once the required nodes have been segmented and combined with the associated linking information, this metadata must be incorporated with the original video for playback. The metadata is placed conceptually in layers, or tracks, on top of the video; this layered structure is then presented to the user for viewing and interaction. Thus the display technology, the hypervideo player, should not be neglected when creating hypervideo content. For example, efficiency can be gained by storing the geometry of areas associated with tracked objects only in certain keyframes, and allowing the player to interpolate between these keyframes, as developed for HotVideo by IBM. Furthermore, the creators of VideoClix emphasize the fact that its content plays back on standard players, such as QuickTime and Flash. When one considers that the Flash player alone is installed on over 98% of internet-enabled desktops in mature markets, this is perhaps a reason for the success of this product in the current arena.
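Keyframe interpolation can be sketched as follows. This is an illustration of the general technique, not HotVideo's actual data format or algorithm: the `Keyframe` layout, the rectangular hotspot, and the choice of linear interpolation are all assumptions made for the example.

```python
from bisect import bisect_right
from typing import List, Optional, Tuple

Box = Tuple[float, float, float, float]      # x, y, width, height
Keyframe = Tuple[float, Box]                 # (time in seconds, box)


def interpolate_box(keyframes: List[Keyframe], t: float) -> Optional[Box]:
    """Linearly interpolate a hotspot box at time t.

    `keyframes` must be sorted by time. Returns None outside the
    tracked interval, when the hotspot is not active.
    """
    if not keyframes or t < keyframes[0][0] or t > keyframes[-1][0]:
        return None
    times = [k[0] for k in keyframes]
    i = bisect_right(times, t)
    if i == len(keyframes):          # t is exactly the last keyframe time
        return keyframes[-1][1]
    (t0, b0), (t1, b1) = keyframes[i - 1], keyframes[i]
    alpha = (t - t0) / (t1 - t0)
    return tuple(a + alpha * (b - a) for a, b in zip(b0, b1))


keyframes = [
    (0.0, (100, 50, 40, 40)),
    (2.0, (200, 50, 40, 40)),    # object has moved right
    (4.0, (200, 150, 40, 40)),   # object has moved down
]
print(interpolate_box(keyframes, 1.0))  # (150.0, 50.0, 40.0, 40.0)
print(interpolate_box(keyframes, 5.0))  # None
```

Here three stored boxes stand in for the roughly 120 per-frame boxes that four seconds of video at 30 frames per second would otherwise require, at the cost of some positional error when the object does not move in a straight line between keyframes.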

Hypervideo authoring tools

The process of creating hypervideo content is known as authoring. For a variety of reasons, many early attempts at creating distributed authoring tools were not successful, VideoClix being the exception. This field is currently enjoying a resurgence of interest, perhaps due to the greater availability of broadband internet and more robust media players.

VideoClix is prominent in the rapidly growing domain of internet video. Its end-to-end solution is a de facto industry standard for creating, managing, distributing, monetizing, syndicating and measuring hypervideo. It is one of the most battle-hardened tools in the industry and is used by many tier-1 companies such as Viacom, Disney, Kraft and Apple.

Tandberg Television, specializing in interactive television solutions, has a hypervideo system called AdPoint for video-on-demand. It also sells Storyteller, a product derived from the MIT project HyperSoap. This tool, although impressive as a university lab project, has proven to be very cumbersome to use in industry.

ADIVI (Add Digital Information to VIdeo) is a hypervideo and rich media application that lets users attach videos, pictures, text, links and files in formats such as .pdf, .doc and .xls as annotations to any FLV video. The application is mainly focused on training, service, assembly and production issues, and includes tools for collaborative work and documentation.

Klickable provides a simple web-based tool to annotate videos. Klickable technology allows content publishers to create hotspots within the video, add a photo and link to wherever they choose. According to the company, Klickable videos create a more engaged user, a comprehensive publishing experience and targeted contextual advertising. Klickable's founders, Emily Gannett and Roger Wu, created the company in 2007.

Overlay.TV provides a set of interactive tools, available in an online toolset, for creating hypervideo using text, images, transparent links, video-in-video and clipart. Its free-form creative approach lets designers or video artists create freely on the interactive canvas from a design perspective, but also ties product placement from retailers into the workflow as a mechanism for directing clicks from specific products in a video to the retailers that carry those products.

Adobe Flash, a popular multimedia authoring program widely used to create animated web content, can also be used to create hypervideo content. As Flash was not designed as a hypervideo authoring tool, creating such content can be difficult using Flash alone. Such added functionality has been provided through outside software in the past—for example, MoVideo and Digital Lava. However, these products are no longer sold.

wireWAX is a video tool that allows the user to upload and add hyperlinks or 'tags' to their own videos. The developers claim that it is the world's first 'taggable' video platform and has an integrated object tracking engine that can track the movements of people or objects automatically.

Riva Producer is software designed to reduce the production costs of non-linear video navigation. It is therefore suited to replacing industrial documentation with utility films.

In the past, there have been a number of attempts to market hypervideo authoring software that is no longer available. MediaLoom, a product based on a Master of Science project at the Georgia Institute of Technology, was an early hypervideo authoring tool. It used the Storyspace hypertext authoring environment to generate script files for the hypervideo engine of the HyperCafe. This product reached prototype stage, but was not commercially successful. Ephyx Technologies created v-active, the first authoring software using dynamically tracked objects in video. The company moved away from hypervideo, however, when it became Veon in 1999.

Hypervideo can also be created using services provided by firms with proprietary methods, such as those provided by Vimation. However, this company does not licence its authoring software.

The rise of hypervideo

As the first steps in hypervideo were taken in the late 1980s, it would appear that hypervideo is taking unexpectedly long to realize its potential. Many interesting experiments (HyperCafe, HyperSoap) have not been extensively followed up on, and authoring tools are at the moment available from only a small number of providers.

However, perhaps with the wider availability of broadband internet, this situation is rapidly changing. Interest in hypervideo is increasing, as reflected in popular blogs on the subject, as well as the extraordinary rise of the internet phenomenon YouTube. Furthermore, by 2010, some estimates have internet downloads claiming over one third of the market for on-demand video.

As the amount of video content increases and becomes available on the internet, the possibilities for linking video increase even faster. Digital libraries, of which video is an important part, are constantly growing. News outlets have amassed vast video archives, which could be useful in education and historical research. Direct searching of pictures or videos, a much harder task than indexing and searching text, could be greatly facilitated by hypervideo methods.

Commentary

User replies to video content have traditionally taken the form of text or image links that are not embedded in the playback sequence of the video. Video hosting services such as Viddler have allowed such replies to be embedded both within the imagery of the video and within portions of the playback (via selected time ranges on the progress slider); this feature has become known as "video comments" or "audio comments".

Commercial exploitation

Perhaps the most significant consequence of hypervideo will result from commercial advertising. Devising a business model to monetize video has proven notoriously difficult. The application of traditional advertising methods (for example, introducing ads into video) is likely to be rejected by the online community, while revenue from selling advertising on video sharing sites has so far not been promising.

Hypervideo offers an alternate way to monetize video, allowing for the possibility of creating video clips where objects link to advertising or e-commerce sites, or provide more information about particular products. This new model of advertising is less intrusive, only displaying advertising information when the user makes the choice by clicking on an object in a video. And since it is the user who has requested the product information, this type of advertising is better targeted and likely to be more effective.

Ultimately, as hypervideo content proliferates on the Internet, particularly content targeted for delivery via the television set, one can imagine an interlinked web of hypervideo forming in much the same way as the hypertext-based World Wide Web has formed. This hypervideo-based "Web of Televisions" or "TeleWeb" would offer the same browsing and information-mining power as the Web, but would be better suited than the Web to the viewing experience of sitting 10 feet from the screen on the living room couch. Such an environment may support not only interactive ads, but also interactive and nonlinear news, information, and even storytelling.

The future of hypervideo

The "Web of Televisions" or "TeleWeb" concepts mentioned above are likely to become widely adopted as they are implemented by future advanced set-top box and game-box units able to provide both the 10-foot TV experience and the 2-foot Web experience. The addition of a wireless display and remote control ties together Web and TV; in this scenario, clicking objects does not disrupt movies and TV shows. The full-screen video display provides the 10-foot video experience, while supplemental content, commerce and advertising related to clicked video objects are placed on the additional display and remote control unit, which provides the 2-foot PC experience.

The source of this article is wikipedia, the free encyclopedia.  The text of this article is licensed under the GFDL.
 