Showing posts with label usac. Show all posts
Showing posts with label usac. Show all posts

Tuesday, October 8, 2013

Computing Now Theme: Recent MPEG Standards for Future Media Ecosystems

The October 2013 theme of Computing Now — a new online front end to all IEEE CS magazine which transitions the printed media to virtual adopting and applying Web 2.0 principles in practice -- focuses on recent MPEG standards for future media ecosystems.


Computing Now's October 2013 monthly theme highlights some recent developments within MPEG — namely, High-Efficiency Video Coding (HEVC), Unified Speech and Audio Coding (USAC), and Dynamic Adaptive Streaming over HTTP (DASH), for which we've pulled together relevant scientific publications as well as video tutorials by the actual chairs and editors of the MPEG subgroups and standards, respectively.

Please note that the referenced articles are offered for free during October 2013, unless noted otherwise.

How to cite this theme?
C. Timmerer and A. Vetro, "Recent MPEG Standards for Future Media Ecosystems," Computing Now, vol. 6, no. 10, Oct. 2013, IEEE Computer Society [online]; http://www.computer.org/portal/web/computingnow/archive/october2013.

Wednesday, July 27, 2011

MPEG news: a report from the 97th meeting, Torino, Italy

The 97th MPEG meeting in Torino brought a few interesting news which I'd like to report here briefly. Of course, as usual, there is the official press release, however, I'd like to report on some interesting topics as follows:
  • MPEG Unified Speech and Audio Coding (USAC) reached FDIS status
  • Call for Proposals: Compact Descriptors for Visual Search (CDVS)
  • Call for Proposals: Internet Video Coding (IVC)
  • DIS on MPEG Dynamic Adaptive Streaming over HTTP (DASH)
MPEG Unified Speech and Audio Coding (USAC) reached FDIS status

ISO/IEC 23003-3 aka Unified Speech and Audio Coding (USAC) reached FDIS status and soon will be an International Standard. The FDIS itself won't be publicly available but the Unified Speech and Audio Coding Verification Test Report in September 2011 (most likely here). 

Call for Proposals: Compact Descriptors for Visual Search (CDVS)

I reported previously about that and here comes the final CfP including the evaluation framework.

MPEG is planning standardizing technologies that will enable efficient and interoperable design of visual search applications. In particular we are seeking technologies for visual content matching in images or video. Visual content matching includes matching of views of objects, landmarks, and printed documents that is robust to partial occlusions as well as changes in vantage point, camera parameters, and lighting conditions.

There are a number of component technologies that are useful for visual search, including format of visual descriptors, descriptor extraction process, as well as indexing, and matching algorithms. As a minimum, the format of descriptors as well as parts of their extraction process should be defined to ensure interoperability.

It is envisioned that a standard for compact descriptors will:
  • ensure interoperability of visual search applications and databases, 
  • enable high level of performance of implementations conformant to the standard,
  • simplify design of descriptor extraction and matching for visual search applications, 
  • enable hardware support for descriptor extraction and matching in mobile devices,
  • reduce load on wireless networks carrying visual search-related information.
It is envisioned that such standard will provide a complementary tool to the suite of existing MPEG standards, such as MPEG-7 Visual Descriptors. To build full visual search application this standard may be used jointly with other existing standards, such as MPEG Query Format, HTTP, XML, JPEG, JPSec, and JPSearch.

The Call for Proposals and the Evaluation Framework is publicly available. From a research perspective, it would be interesting to see how technologies submitted as an answer to the CfP compete with existing approaches and applications/services.

In this context, it is probably worth looking at IEEE Multimedia Jul.-Sep. 2011 issue which is dedicated to visual content: identification and search including an overview about this new MPEG standard.

Call for Proposals: Internet Video Coding (IVC)

I reported previously about that and the final CfP for Internet Video Coding Technologies is available here. The requirements reveal some interesting issues the call is about:
  • Real-time communications, video chat, video conferencing,
  • Mobile streaming, broadcast and communications,
  • Mobile devices and Internet connected embedded devices 
  • Internet broadcast streaming, downloads
  • Content sharing.
Requirements fall into the following major categories:
  • IPR requirements
  • Technical requirements
  • Implementation complexity requirements 
Clearly, this work item has an optimization towards IPR but others are not excluded. In particular,
It is anticipated that any patent declaration associated with the Baseline Profile of this standard will indicate that the patent owner is prepared to grant a free of charge license to an unrestricted number of applicants on a worldwide, non-discriminatory basis and under other reasonable terms and conditions to make, use, and sell implementations of the Baseline Profile of this standard in accordance with the ITU-T/ITU-R/ISO/IEC Common Patent Policy. 

MPEG Dynamic Adaptive Streaming over HTTP (MPEG-DASH)

For all DASH enthusiast, the latest - and probably almost final - version of DASH-related standards can be found here. Please note that DASH has been reorganized into MPEG-DASH referred to as ISO/IEC DIS 23009-1.2, Part 1: Media presentation description and segment formats. Additionally, you might be interested in the following draft:
  • ISO/IEC 14496-12:2008/DAM 3, Part 12: ISO base media file format, AMENDMENT 3: DASH support and RTP reception hint track processing
  • ISO/IEC FDIS 23001-7, Part 7: Common encryption format for ISO base media file format
All these DASH-related documents are publicly available here. In terms of implementation, the interested reader might check out the ITEC-DASH VLC-based implementation and GPAC (which provides basic support for DASH) respectively.

Further information you may find at the MPEG Web site, specifically under the hot news section and the press release. Working documents of any MPEG standard so far can be found here. If you want to join any of these activities, the list of Ad-hoc Groups (AhG) is available here (soon also here) including the information how to join their reflectors.

Wednesday, August 25, 2010

MPEG Press Release: Unified Speech and Audio Coding Achieves Major Milestone

--full press release available here.

Geneva, CH – The 93rd MPEG meeting was held in Geneva, Switzerland from the 26th to the 30th of July 2010.

Highlights of the 93rd Meeting

  • USAC progresses to Committee Draft
  • MPEG-V collection of six new parts for information exchange with virtual worlds is completed
  • AIT reaches first major standard’s development milestone
  • MPEG launches standard for the carriage of multimedia data over HTTP
  • New amendment enables efficient representation of 3D meshes with multiple attributes
  • Calls for Proposals for MMT and Composite Font Standard issued

Details for each highlight can be found in the full press release.

Digging Deeper – How to Contact MPEG

Communicating the large and sometimes complex array of technology that the MPEG Committee has developed is not a simple task. Experts, past and present, have contributed a series of white-papers and vision documents that explain each of these standards individually. The repository is growing with each meeting, so if something you are interested is not there yet, it may appear there shortly – but you should also not hesitate to request it. You can start your MPEG adventure at: http://mpeg.chiariglione.org/technologies.htm.