Monday, January 16, 2012

Top 10 Blog Posts


  1. HTTP Streaming of MPEG Media: My first article in this series which I've started after the MPEG CfP has been issued that lead to the standardization of DASH.
  2. MMSys'11 Special Session on MMT/DASH: the CfP for a special session I've organized.
  3. MPEG news: a report from the 93rd meeting in Geneva, right after the responses to the HTTP streaming CfP has been evaluated.
  4. MPEG advances DASH towards completion which is the MPEG press release after the 94th meeting in Guangzhou.
  5. Open Source Scalable Video Coding (SVC) Software where I have received quite a few comments ;-)
  6. MPEG Media Transport: Basically the same as #1 but a different scope. However, it seems the readers are more interested in HTTP streaming than media transport in general.
  7. Vision and Requirements for High-Performance Video Coding which has been renamed now to High-Efficiency Video Coding.
  8. DASH provides an overview about the Draft International Standard which is publicly available.
  9. MPEG DASH vs. W3C WebTV which is still a hop topic and worth following on both sides...
  10. Immersive Future Media Technologies: From 3D Video to Sensory Experience: I'm happy having this one in my top ten. It's the summary of a tutorial I had at ACM Multimedia 2010 together with Karsten Müller.
In general, most of the readers are very much interested in HTTP streaming / DASH / MMT followed by video coding (SVC/HEVC/3DVC) and the Sensory Experience stuff I've started some time ago.

Thanks again for visiting my blog and don't hesitate to leave a comment here and there. I'd love to read your thoughts and feedback.

Saturday, January 7, 2012

That was Multimedia Modeling 2012...

Thursday, December 8, 2011

MPEG news: a report from the 98th meeting, Geneva, Switzerland

#MPEG98 report: DASH=IS ✔ CDVS=CfP eval ✔ {MMT, HEVC, 3DAudio}=MPEG-H ✔ IVC={IVC, WebVC} ✔ 3DVC=CfP eval ✔

... MPEG news from its 98th meeting in Geneva, Switzerland with less than 140 characters and a lot of acronyms. The official press release is, as usual, here. As you can see from the press release, MPEG produced significant results, namely:
  • MPEG Dynamic Adaptive Streaming over HTTP (DASH) ratified
  • 3D Video Coding: Evaluation of responses to Call for Proposals
  • MPEG royalty free video coding: Internet Video Coding (IVC) + Web Video Coding (WebVC)
  • High Efficiency Coding and Media Delivery in Heterogeneous Environments: MPEG-H comprising MMT, HEVC, 3DAC
  • Compact Descriptors for Visual Search (CDVS): Evaluation of responses to the Call for Proposals
  • Call for requirements: Multimedia Preservation Description Information (MPDI)
  • MPEG Augmented Reality (AR)
As you can see, a long list of achievements within a single meeting but let's dig inside. For each topic I've also tried to provide some research issues which I think are worth to investigate both inside and outside MPEG. 

MPEG Dynamic Adaptive Streaming over HTTP (DASH): DASH=IS ✔

As the official press release states, the MPEG ratifies its draft standard for DASH and it comes better, the standard should become publicly available which I expect to happen somewhat early next year, approx. March 2012, or maybe earlier. I say "should" because there is no guarantee that this will actually happen but signs are good. In the meantime, feel free using our software to play around and we expect to update it to the latest version of the standard as soon as possible. Finally, IEEE Computer Society Computing Now has put together a theme on Video for the Universal Web featuring DASH.

Research issues: performance, bandwidth estimation, request scheduling (aka adaptation logic), and Quality of Service/Experience.

3D Video Coding: 3DVC=CfP eval ✔

MPEG evaluated more than 20 proposals submitted as a response to the call issued back in April 2011. The evaluation of the proposal comprised subjective quality assessments conducted by 13 highly qualified test laboratories distributed around the world and coordinated by the COST Action IC1003 QUALINET. The report of the subjective test results from the call for proposals on 3D video coding will be available by end of this week. MPEG documented the standardization tracks considered in 3DVC (i.e., compatible with MVC, AVC base-view, HEVC, ...) and agreed on a common software based on the best-performing proposals.

Research issues: encoding efficiency of 3D depth maps and compatibility for the various target formats (AVC, MVC, HEVC) as well as depth map estimation at the client side.

MPEG royalty free video coding: IVC vs. WebVC

In addition to the evaluation of the responses to the call for 3DVC, MPEG also evaluated the responses to the Internet Video Coding call. Based on the responses, MPEG decided to follow up with two approaches namely Internet Video Coding (IVC) and Web Video Coding (WebVC). The former - IVC - is based on MPEG-1 technology which is assumed to be royalty-free. However, it requires some performance boosts in order to make it ready for the Internet. MPEG's approach is a common platform called Internet video coding Test Model (ITM) which serves as the basis for further improvements. The latter - WebVC - is based on the AVC constrained baseline profile which performance is well-known and satisfactory but, unfortunately, it is not clear which patents of the AVC patent pool apply to this profile. Hence, a working draft (WD) of WebVC will be provided (also publicly available) in order to get patent statements from companies. The WD will be publicly available by December 19th.

Further information:
Research issues: coding efficiency with using only royalty free coding tools whereby the optimization is first towards royalties and then efficiency.

MPEG-H

A new star is born which is called MPEG-H referred to as "High Efficiency Coding and Media Delivery in Heterogeneous Environments" comprising three parts: Pt. 1 MMT, Pt. 2 HEVC, Pt. 3 3D Audio. There's a document called context and objective of MPEG-H but I can't find out whether it's public (I come back later on this).

Part 1: MMT (MPEG Media Transport) is progressing (slowly) but a next step should be definitely to check the relationship of MMT and DASH for which an Ad-hoc Group has been established (N12395), subscribe here, if interested.
Research issues: very general at the moment, what is the best delivery method (incl. formats) for future multimedia applications? Answer: It depends, ... ;-)

Part 2: HEVC (High-Efficiency Video Coding) made significant progress at the last meeting, in particular: only one entropy coder (note: AVC has two, CABAC and CAVLC which are supported in different profiles), 8 bit decoding (could be also 10 bit, probably done in some profiles), specific integer transform, stabilized and more complete high-level syntax and HRD description (i.e., reference picture buffering, tiles, slices, and wavefronts enabling parallel decoding process). Finally, a prototype has been demonstrated decoding HEVC in software on an iPad 2 at WVGA resolution and the 10min Big Buck Bunny sequence at SD resolution with avg. 800 kbit/s which clearly outperformed the corresponding AVC versions.
Research issues: well, coding efficiency, what else? The ultimative goal to have a performance gain of more than 50% compared to the predecessor which is AVC.

Part 3: 3D Audio Coding (3DAC) is in its early stages but there will be an event during San Jose meeting which will be announced here. As of now, use cases are provided (home theatre, personal TV, smartphone TV, multichannel TV) as well as candidate requirements and evaluation methods. One important aspect seems to be user experience for highly immersive audio (i.e., 22.2, 10.2, 5.1) including bitstream adaptation for low-bandwidth and low-complexity.
Research issues: sorry, I'm not really an audio guy but I assume it's coding efficiency, specifically for 22.2 channels ;-)

Compact Descriptors for Visual Search (CDVS)

For CDVS, responses to the call for proposals (from 10 companies/institutions) have been evaluated and a test model has been established based on the best performing proposals. The next steps include the improvement of the test model towards for inclusion in the MPEG-7 standard.
Research issues: descriptor efficiency for the intended application as well as precision on the information retrieval results.

Multimedia Preservation Description Information (MPDI)

The aim of this new work item is to provide "standard technology helping users to preserve digital multimedia that is used in many different domains, including cultural heritage, scientific research, engineering, education and training, entertainment, and fine arts for long-term across system, organizational, administrative and generational boundaries". It comes along with two public documents, the current requirements and a call for requirements which are due at the 100th MPEG meeting in April 2002.
Research issues: What and how to preserve digital multimedia information?

Augmented Reality (AR)

MPEG's newest project is on Augmented Reality (AR), starting with an application format for which a working draft exists. Furthermore, draft requirements and use cases are available. These three documents will be available on Dec 31st.
Research issues: N/A

Finally, I hope now you can better understand what I've put at the beginning with all these acronyms ...

#MPEG98 report: DASH=IS ✔ CDVS=CfP eval ✔ {MMT, HEVC, 3DAudio}=MPEG-H ✔ IVC={IVC, WebVC} ✔ 3DVC=CfP eval ✔

Friday, November 4, 2011

CfP: QoMEX 2012

Call for Papers: QoMEX 2012 
5th - 7th July 2012
Yarra Valley, Victoria, Australia
http://www.qomex2012.org/

The fourth International Workshop on Quality of Multimedia Experience (QoMEX) will bring together leading professionals and scientists interested in evaluating multimedia quality and user experience. QoMEX is the flagship workshop of the European COST action Qualinet which serves as its technical and financial sponsor. The 2012 workshop is the first of the series to be held in the Asia-Pacific and provides new opportunities for QoMEX to explore the rich technology interests of the region. The workshop venue is in the Yarra Valley, Australia, just north of Melbourne where ICME will be held from 9th July.

QoMEX is the flagship workshop of the European COST action Qualinet which serves as its technical and financial sponsor. The 2012 workshop is the first of the series to be held in the Asia- Pacific and provides new opportunities for QoMEX to explore the rich technology interests of the region.

Multimedia is a ubiquitous part of modern life; content is delivered on a wide variety of devices both fixed and mobile, the consumption environment can vary dramatically and increasingly users demand fulfilling interaction with their media. Evaluation, both objectively and subjectively of separate audio, video or image content is well known, but there are substantial challenges in evaluating users' complete experiences of mixed content. This is exacerbated by the diversity of devices, interfaces, consumption environments and content prevalent in today's market. Increasingly, evaluation approaches must include psychological, perceptual and contextual dimensions to truly represent and measure users experiences. Understanding how users react and perceive content and applications will inform and improve new applications and lead to improved user experiences.

QoMEX 2012 features oral presentations, exhibits, panels and poster sessions, providing attendees with various opportunities to discuss the latest progress and trends in multimedia user experience. The workshop venue and format has been specifically chosen to increase opportunities for expert interaction and discussion. Authors are invited to submit full- length, six- page papers according to the guidelines available on the conference website: http://www.qomex2012.org .

Topics of interest include but are not limited to:
  • Visual User Experience (Image/Video/Graphics): Objective and subjective quality evaluation, psychovisual modelling, quality- centered processing, compression and transmission, QoE evaluation in HD and beyond.
  • Auditory User Experience (Speech/Audio): Psychoacoustic modelling, quality assessment of interactive speech and audio applications/ interfaces, models and metrics for evaluation of spatial audio.
  • Sensory User Experiences: Novel sensory interfaces, methods for sensory user feedback, quality metrics and assessment methods for evaluation of ambient and sensory experiences. 
  • QoE for Mobile Devices: device- dependent, adaptive user interfaces, evaluating interactive experiences on mobile devices, objective and subjective evaluation on iPhone/Android/mobile multimedia Applications.
  • QoE for 3D Multimedia: QoE metrics and evaluation methods for 3D audio and 3D video, virtual, augmented and mixed realities, haptic interaction, other immersive modalities and interactive interfaces.
  • Standardization Activities in Multimedia Quality Evaluation: Benchmarking efforts, multimedia databases/ datasets of various modalities (speech, audio, video, sensory, etc.) and fidelities (quality, bitrate, etc.), testing conditions and methods, new objective metrics/models for upcoming standards. 
  • Applications and services: Evaluating QoE in multimedia applications e.g., games/gamification, biomedical/ telemedicine, distance education, teleconferencing/telepresence, multimedia web search/ browsing, cloud- based multimedia applications, etc. 
Important Dates:
  • Submission deadline: February 14, 2012 
  • Notification of acceptance: April 30, 2012 
  • Camera- ready submission: May 14, 2012 
Web sitehttp://www.qomex2012.org/
Organizing Committee

General Chair

  • Ian Burnett, RMIT University
General Co-Chair

  • Henry Wu, RMIT University

Technical Program Co-Chairs

  • Christian Timmerer, Alpen-Adria-Universität
  • Alexander Raake, TU Berlin
  • Christian Ritz, University of Wollongong
Steering Committee

  • Loretta Anania, EC
  • Alan Bovik, University of Texas
  • Touradj Ebrahimi, EPFL/NTNU
  • Khaled El-Maleh, Qualcomm USA
  • David Geerts, IBBT ‐ KULeuven
  • Lina Karam, Arizona State University
  • Bastiaan Kleijn, Victoria University of Wellington
  • Sebastian Möler, TU Berlin
  • Fernando Pereira, Instituto de Telecomunicações
  • Andrew Perkis, NTNU-Q2S
  • Amy Reibman, AT&T Labs
  • Peter Schelkens, IBBT ‐ VUB
Publicity Chairs

  • Christian Timmerer, Alpen-Adria-Universität (Europe)
  • Homer Chen, National Taiwan University (Asia and Pacific)
Social Program and Local Arrangements Chair

  • Lorraine Valladares, RMIT University
Webmaster

  • Stephen Davis, University of Wollongong

Tuesday, October 25, 2011

Alpen-Adria-Universität Klagenfurt participates in New FP7 Project on Social Networks

Social media applications have become a modern reality affecting a growing part of the population, as well as companies and public organisations. For example, twitter messages played an important role in the recent developments in Arab countries, while Flickr and YouTube are rich image and video collections based exclusively on user contributions and around 30 billion content items (links, photo albums, status updates) are posted on Facebook every month. It becomes clear that, much of what happens in the real-world is documented in real time by the millions of social network users that upload content, interact with each other, and give feedback (rate, comment) on already published online content. It is equally important that social networks and applications can be used for a plethora of diverse applications, beyond sharing, networking, news and entertainment. For example, they have been used for emergency planning and response, in travel and tourism, in e-government, and in product monitoring.

SocialSensor is a 3-year FP7 European Integrated Project, which aims at exploiting information in social networks and developing applications that enhance user experience. In the project framework, new techniques for analysis, aggregation and real-time search of user-generated content will be developed, in order to extract useful information and make it available for use in different applications. Innovative solutions from the fields of information extraction and retrieval, social network analysis, user modelling, semantic web services, and media adaptation, delivery and presentation, will compose a software platform that crawls and analyses multimedia User Generated Content from the social web, combines it with professional content, and makes it searchable for professional users, but also recommends, delivers and presents it to media consumers depending on their context and their personal profile. To achieve this, crucial issues have to be tackled, such as the sheer data volume, its heterogeneity and low quality.


The platform will be showcased and evaluated in two use cases: (a) news, involving professional news editors, journalists and casual readers, benefiting from the improved capabilities of SocialSensor for discovering new interesting social content and integrating it in the news creation and delivery lifecycle, and (b) infotainment, providing new multimedia search tools and unique media consumption experiences to attendants of large events (e.g. festivals). Providing real-time social indexing capabilities for both of these use cases is expected to have a transformational impact on both sectors.

In the project, whose total budget is €9.64 million, Alpen-Adria-Universität (AAU) Klagenfurt, Multimedia Communications Group participates, with research activities on media streaming and sharing within social networks. The consortium comprises 11 participants in total, coming from different universities and research organizations (CERTH-Informatics and Telematics Institute (co-ordinator), University of Koblenz-Landau, City University London, German Research Center for Artificial Intelligence, Alpen-Adria-Universität Klagenfurt), as well as from the IT industry and news domains (IBM Israel, Yahoo! Spain, Alcatel-Lucent Bell Labs, Deutsche Welle, Athens Technology Center, JCP-Consult).

For more information please contact: Dr. Christian Timmerer, AAU/TEWI/ITEC/MMC, christian.timmerer@itec.aau.at or the co-ordinators Dr. Ioannis Kompatsiaris, CERTH-ITI, ikom@iti.gr

Tuesday, October 11, 2011

CfP: ACM International Conference on Multimedia Retrieval (ICMR) 2012


CALL FOR PAPERS

ACM International Conference on Multimedia Retrieval (ICMR) 2012
June 5-8, 2012, Hong Kong

Submission Deadline: January 15, 2012

Multimedia computing, indexing and retrieval continue to be one of the most exciting and fastest-growing research areas in the field of multimedia technology. The Second ACM International Conference on Multimedia Retrieval (ICMR) offers opportunities for the exchange of ideas between researchers, practitioners and potential users of multimedia retrieval systems. This conference, puts together the long-lasting experience of former ACM CIVR and ACM MIR series, was set up to illuminate the state of the art in multimedia (including image, video and audio) retrieval.

ICMR 2012 is seeking original high quality submissions addressing innovative research in the broad field of multimedia retrieval. Contributions addressing the challenges of large-scale search and user behavior analysis are especially welcome.

Topics of Interest (not limited to)

  • Content/semantic/affective based indexing and retrieval
  • Large-scale and web-scale multimedia processing
  • Integration of content, meta data and social network
  • Scalable and distributed search
  • User behavior and HCI issues in multimedia retrieval
  • Advanced descriptors and similarity metrics
  • Multimedia fusion
  • High performance indexing algorithms
  • Machine learning for multimedia retrieval
  • Ontology for annotation and search
  • 3D video and model processing
  • Large-scale summarization and visualization
  • Performance evaluation
  • Very large scale multimedia corpus
  • Navigation and browsing on the Web
  • Retrieval from multimodal lifelogs
  • Database architectures for storage and retrieval
  • Novel multimedia data management systems and applications
  • Applications in forensic, biomedical image and video collections

Important Dates

  • Paper Submission: January 15, 2012
  • Notification of Acceptance: March 15, 2012
  • Camera-Ready Papers Due: April 5, 2012
  • Conference Date: June 5-8, 2012

Questions?



Monday, October 10, 2011

HOW TO: Connect With Computing Now on the Social Web


As Computing Now (CN) is growing in terms of readers, also the number of topics we cover is growing and the ways readers access our content. Most of our readers signed up for our newsletter but there are various other ways accessing our content which I'd like to summarize and highlight within this blog post.

Follow via RSS feeds: Various feeds are available featuring CN Monthly Theme Articles, CN What's New Articles, CN Newsfeed, and CN News Briefs. Additionally, various feeds for podcasts and blogs related to CN are available (see also below).

Computing Now Content and News

Podcasts

Blogs

Follow via Email, i.e., subscribe to our free newsletters and alerts, select "Computing Now" from the check-box but feel free selecting also others! Furthermore, you may subscribe to our issue alerts for all CS magazines and transactions.

Follow @computingnow via Twitter maintained by @timse7 as well as CS staff. Realize the "engage" part of Computing Now by joining the conversation in and around vital topics related to computer science.

Follow on Facebook, discuss topics there, interact with the staff and other volunteers and/or connect on LinkedIn. Finally, you may join our YouTube channel for latest videos and interviews.

Finally, a couple of Applications have been developed recently and are continuously updated: CN iPhone AppCN Android AppCN iGoogle App.

Stay tuned and if you think something is missing, please let me know and/or approach me with your ideas!