In this section, we discuss the topic of how to derive a multi-view 3D content from a stereo image pair, as the future glass-free displays require multi-view content. So far, no automatic multi-view generation algorithm can claim to reach a satisfactory visual quality, so some human editing or tuning is necessary during the processing. A few fundamental automatic modules are required in the process:
In this process, the major challenges can be summarized as follows:
So far there are very few solutions in the market on this topic. In almost all these efforts, the processing is conducted in a view-based mode, thus objects evolving in the time domain across the video frames are seldom considered in the framework, which undermines its capability to reduce flickering artifacts. However, very few efforts consider user interaction as an effective assistance to improve the content quality. It may be worth considering a new concept which is aligned with the human intuition of object-based access and manipulation in the 3D environment. With a patented 3D data format and the associated data derivation and generation scheme, the stereo analysis, object manipulation, and final 3D view rendering are fully decoupled in the process. The data format needs to have a good extensibility that can be converted to and from other multi-view 3DTV formats, such as MVD (multi-view video plus depth). A set of computer vision algorithms need to be deployed to reduce human intervention during the process to the minimum, but this approach needs to have the extensibility to be optimized for various autostereoscopic displays.