Timed tracks: Difference between revisions

Revision as of 00:53, 21 April 2010

This page contains notes for the development of the first version of timed track features in HTML.

Requirements

Subtitle/Caption/Karaoke File Format

Structure

multiple voices
per-segment time in/out cues
inline time cues for karaoke
bidi, newlines, ruby, italics
position

Positioning

vertical: top/middle/bottom/% (default bottom)
horizontal: left/center/right/% (default center)
display modes: replace previous text, scroll previous text up and add to bottom
multiple voices placed in adjacent places would need to automatically stack so they don't overlap
multiple segments with overlapping times would need to be stacked so they don't overlap

(Percentage positions would work like background-position in CSS.)

Formatting

Inline

text should be bidi-aware
some cases use ruby
some cases use italics

Global

color of background/text/outline is needed for readability on different types of video.
webfonts is needed to provide high quality subtitles in some non-Latin languages (e.g. Chinese where a suitable font is unlikely to be available even on Chinese computer systems).
providing a pseudo-element to style each voice would likely be sufficient for authors who want overall formatting control (this would also allow user overrides conveniently)

Audio Descriptions File Format

Just timing and text?

HTML

an API and UI for exposing what timed tracks exist and selectively enabling/disabling them

format for external subtitles/captions
format for external audio descriptions
some mechanism for text in the page to be used instead of external files, for subtitles/captions or audio descriptions
an API to allow a segment to be dynamically inserted into the rendering on the fly

an API for exposing what the currently relevant segments of each timed track are
a way to hook into this mechanism to advance slides

native rendering of subtitles
native rendering of audio descriptions
native rendering of multiple audio or video tracks, to allow pre-recorded audio descriptions to be mixed in and sign language video to be overlaid
a way to hook into this to manually render timed tracks

Architecture

Revision as of 00:53, 21 April 2010 (view source) Hixie (talk \| contribs) (→‎Architecture) ← Older edit		Revision as of 00:53, 21 April 2010 (view source) Hixie (talk \| contribs) (→‎Architecture) Newer edit →
Line 61:		Line 61:
	== Architecture ==		== Architecture ==

	<img src="http://docs.google.com/drawings/pub?id=1GR6Pzq0GY2n1sx_ZjDfuICM2LnXxLVxzvyl4kuQy-48&w=~~550~~&h=~~413~~">		<img src="http://docs.google.com/drawings/pub?id=1GR6Pzq0GY2n1sx_ZjDfuICM2LnXxLVxzvyl4kuQy-48&w=640&h=480">

Timed tracks: Difference between revisions

Revision as of 00:53, 21 April 2010

Contents

Requirements

Subtitle/Caption/Karaoke File Format

Structure

Positioning

Formatting

Inline

Global

Audio Descriptions File Format

HTML

Architecture

Navigation menu

Timed tracks: Difference between revisions

Revision as of 00:53, 21 April 2010

Requirements

Subtitle/Caption/Karaoke File Format

Structure

Positioning

Formatting

Inline

Global

Audio Descriptions File Format

HTML

Architecture

Navigation menu

Search