1
[email protected] www.ient.rwth-aachen.de Institut für Nachrichtentechnik, Melatener Str. 23, 52074 Aachen ICIP 2017 ANALYSIS/SYNTHESIS CODING OF DYNAMIC TEXTURES BASED ON MOTION DISTRIBUTION STATISTICS Olena Chubach, Patrick Garus, Mathias Wien, Jens-Rainer Ohm Institut für Nachrichtentechnik, RWTH Aachen University ANALYSIS/SYNTHESIS CODING OF DYNAMIC TEXTURES BASED ON MOTION DISTRIBUTION STATISTICS Olena Chubach, Patrick Garus, Mathias Wien, Jens-Rainer Ohm Institut für Nachrichtentechnik, RWTH Aachen University Motivation Highly textured parts: challenging for conventional codec but perceptually irrelevant for humans. Fig. 1: Examples of dynamic textures content Viewer rather perceives semantic equivalence of displayed content than specific details Exact positions of the texture patterns are irrelevant for humans Therefore textures may be displayed without a pixel fidelity, instead of conventional coding This allows to omit encoding prediction residuals and motion vector coding of dynamic textures, leading to substantial reduction of bits to be coded Motion-based Characterization of Dynamic Textures Dynamic textures are represented by a set of first order motion features computed along the space and time dimensions Motion vectors from 3 spatial and 2 temporal neighboring positions are considered for motion co-occurrence matrix (MCM), providing efficient representation of motion distribution in dynamic textures MCM Estimate dense OF Estimate dense OF ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... Fig. 2: Procedure of MCM computation Modified Coding Structure 0 1 8 9 16 6 7 2 3 4 5 14 15 10 11 12 13 Fig. 3: Modified coding structure in case of sGOP size 16 Frames 0, 1, 8, 9 and 16 are reference frames and reconstructed first Remaining 12 frames are skipped during encoding/decoding and will be synthesized Proposed Scheme sGOP: T original frames MCM MVF Synthesized MVF Compressed MCM Estimate dense OF between adjacent frames Discard non-probable MV combinations Analyse motion distribution Derive motion vectors Discard intermediate frames Apply warping and blending Analysis Synthesis T-2 synthesised frames Fig. 4: Proposed scheme of DT analysis/synthesis based on motion distribution statistics Initial MVF is estimated between adjacent reference frames Compressed MCM is signaled to the decoder side for synthesis Synthetic MVFs are predicted based on initial MVF and compressed MCM and utilized for generating intermediate frames Synthesis procedure is performed twice: in forward direction using frames from the past and in backward direction using frames from the future Corresponding synthesized frames ˆ I f t and ˆ I b t are then blended: ˆ I t = (1 - λ(t)) ˆ I f t + λ(t) ˆ I b t Results Proposed method tested on sequences from HomTex database, containing water, leaves and smoke Test sequences: 256x256 pixels in width and height; 250 frames; 25 or 60 fps Encoded with HEVC Test Model (HM-16.6) with modified RA config., QP=22 sGOP size considered - 16 frames 50% of the most probable MV combinations are kept for every sGOP MCMs and and MV interval ranges are compressed by arithmetic coding Sequence HEVC rate, Synth, rate, Rate CTC RA, [kb] modif. RA, [kb] reduction, % BricksBushes 1744.5 766.1+ 5.7 -55.7 Static-Bushes1 BricksBushes 1579.1 717.3+ 2.9 -55.4 Static-Bushes2 LampLeaves 1578.9 728.3+ 11.9 -53.1 Bushes1 LampLeaves 1146.4 507.4+ 54.1 -51.0 Bushes2 LampLeaves 1294.6 545.8+ 135.5 -47.4 Bushes3 LampLeaves 552.4 281.2+ 7.5 -47.8 BushesBackground PetibatoCropped 735.1 391.5+ 153.3 -25.9 TreeWills-Cropped 970.6 584.3+ 0.44 -39.8 Table 1: Rate comparison: 2 nd column - HEVC CTC RA configuration; 3 rd column - reference frames encoded with modified RA configuration (1 st term); parameters required for synthesis, compressed by arithmetic coding (2 nd term); 4 th column - rate reduction, under the assumption of acceptable quality drop

ANALYSIS/SYNTHESIS CODING OF DYNAMIC TEXTURES BASED … · Compressed MCM Estimate dense OF between adjacent frames Discard non-probable MV combinations Analyse motion distribution

  • Upload
    others

  • View
    5

  • Download
    0

Embed Size (px)

Citation preview

Page 1: ANALYSIS/SYNTHESIS CODING OF DYNAMIC TEXTURES BASED … · Compressed MCM Estimate dense OF between adjacent frames Discard non-probable MV combinations Analyse motion distribution

[email protected]

www.ient.rwth-aachen.de

Institut für Nachrichtentechnik, Melatener Str. 23, 52074 Aachen

ICIP2017

ANALYSIS/SYNTHESIS CODING OF DYNAMICTEXTURES BASED ON MOTIONDISTRIBUTION STATISTICS

Olena Chubach, Patrick Garus, Mathias Wien, Jens-Rainer OhmInstitut für Nachrichtentechnik, RWTH Aachen University

ANALYSIS/SYNTHESIS CODING OF DYNAMICTEXTURES BASED ON MOTIONDISTRIBUTION STATISTICS

Olena Chubach, Patrick Garus, Mathias Wien, Jens-Rainer OhmInstitut für Nachrichtentechnik, RWTH Aachen University

Motivation

Highly textured parts: challenging for conventional codec but perceptuallyirrelevant for humans.

Fig. 1: Examples of dynamic textures content

•Viewer rather perceives semantic equivalence of displayed content thanspecific details

•Exact positions of the texture patterns are irrelevant for humans

•Therefore textures may be displayed without a pixel fidelity, instead ofconventional coding

•This allows to omit encoding prediction residuals and motion vector coding ofdynamic textures, leading to substantial reduction of bits to be coded

Motion-based Characterization of Dynamic Textures

•Dynamic textures are represented by a set of first order motion featurescomputed along the space and time dimensions

•Motion vectors from 3 spatial and 2 temporal neighboring positions areconsidered for motion co-occurrence matrix (MCM), providing efficientrepresentation of motion distribution in dynamic textures

MCM

Estimate

dense OF

Estimate

dense OF

...

...

...

...

... ......

... ......

...

...

...

...

... ......

... ......

...

...

...

...

... ......

... ......

...... ...

...

...... ...

...

...

...

...

...

... ......

... ......

...

...

...

...

... ......

... ......

...... ...

...

...... ...

...

Fig. 2: Procedure of MCM computation

Modified Coding Structure

0 1 8 9 16

6 72 3 4 5 14 1510 11 12 13

Fig. 3: Modified coding structure in case of sGOP size 16

•Frames 0, 1, 8, 9 and 16 are reference frames and reconstructed first

•Remaining 12 frames are skipped during encoding/decoding and will besynthesized

Proposed Scheme

sGOP:

T original frames

MCM

MVF

Synthesized

MVF

Compressed

MCM

Estimate dense OF between

adjacent frames

Discard non-probable

MV combinations

Analyse motion

distribution

Derive motion vectors

Discard intermediate

frames

Apply warping

and blending

Analysis

Synthesis

T-2 synthesised frames

Fig. 4: Proposed scheme of DT analysis/synthesis based on motion distribution statistics

• Initial MVF is estimated between adjacent reference frames

•Compressed MCM is signaled to the decoder side for synthesis

•Synthetic MVFs are predicted based on initial MVF and compressed MCMand utilized for generating intermediate frames

•Synthesis procedure is performed twice: in forward direction using framesfrom the past and in backward direction using frames from the future

•Corresponding synthesized frames Ift and Ibt are then blended:

It = (1− λ(t))Ift + λ(t)Ibt

Results

•Proposed method tested on sequences from HomTex database, containingwater, leaves and smoke

•Test sequences: 256x256 pixels in width and height; 250 frames; 25 or 60 fps

•Encoded with HEVC Test Model (HM-16.6) with modified RA config., QP=22

• sGOP size considered - 16 frames

• 50% of the most probable MV combinations are kept for every sGOP

•MCMs and and MV interval ranges are compressed by arithmetic coding

SequenceHEVC rate, Synth, rate, Rate

CTC RA, [kb] modif. RA, [kb] reduction, %

BricksBushes1744.5 766.1 + 5.7 −55.7

Static-Bushes1BricksBushes

1579.1 717.3 + 2.9 −55.4Static-Bushes2LampLeaves

1578.9 728.3 + 11.9 −53.1Bushes1LampLeaves

1146.4 507.4 + 54.1 −51.0Bushes2LampLeaves

1294.6 545.8 + 135.5 −47.4Bushes3LampLeaves

552.4 281.2 + 7.5 −47.8BushesBackgroundPetibatoCropped 735.1 391.5 + 153.3 −25.9TreeWills-Cropped 970.6 584.3 + 0.44 −39.8

Table 1: Rate comparison: 2nd column - HEVC CTC RA configuration; 3rd column - reference frames encoded with

modified RA configuration (1st term); parameters required for synthesis, compressed by arithmetic coding (2nd

term); 4th column - rate reduction, under the assumption of acceptable quality drop