22 3. CHANNEL CODING OF FEATURES
where the offset has been computed according to (3.6) and w
.i;j /
denotes the spatial weights, a
2D Gaussian kernel h
.i; j / with standard deviation .
e original DFT approach is a multi-scale algorithm typically using three different stan-
dard deviations, but for simplifying the subsequent arguments, we ignore this detail and stick to
the single-scale case.
e next step of DFT after accumulating histogram values locally is to smooth the coeffi-
cients c
n
along the channel index n using a 1D Gaussian kernel and results in the DF d.i; j; n/.
is is the post-smoothing step previously mentioned.
During the tracking, the DF of the internal model, d
model
, is compared to the DF of a
bounding-box in the current frame, d
f
, within a local search window. e distance measure used
is the sum of absolute differences, i.e.,
L
1
.d
model
; d
f
/ D
X
i;j;n
jd
model
.i; j; n/ d
f
.i; j; n/j : (3.19)
e displacement is estimated by local search of the minimum L
1
error within a window
of maximum displacement, a further parameter of the method chosen as 30 pixels according
to Sevilla-Lara and Learned-Miller [2012].
When the best-fitting position has been found, the current template d
model;t
is updated
with the current DF d
f
using linear weights D 0:95 for the previous template and .1 / D
0:05 for the novel patch
d
model;tC1
.i; j; n/ D d
model;t
.i; j; n/ C .1 /d
f
.i; j; ci/: (3.20)
Due to the density-based comparison, the method is robust against outliers, and due to the
template-update, the method can also deal with continuous changes of object aspects and the
lighting [Sevilla-Lara and Learned-Miller, 2012].
EDFT enhances DFT in various ways [Felsberg, 2013], but most importantly, the post-
smoothing of histograms (rectangular kernels) is replaced with a quadratic B-spline channel rep-
resentation, i.e., pre-smoothing before binning, and results in significant improvements. In later
versions of EDFT, cos
2
-kernels have been used instead, giving further improvements [Öäll and
Felsberg, 2014b], also based on a modified model update (3.20) and distance measure (3.19).
e model update in (3.20) has been replaced with a power-update rule
c
.i;j /
model;tC1;n
D
c
.i;j /
model;t;n
q
C .1 /
c
.i;j /
f;n
q
1=q
(3.21)
and a coherence weight has been introduced to the distance measure (3.19)
L
w
1
.c
model
; c
f
/ D
X
i;j;n
coh.c
i;j
/
ˇ
ˇ
ˇ
c
.i;j /
model;n
c
.i;j /
f;n
ˇ
ˇ
ˇ
: (3.22)
e coherence measure will be formally introduced in Chapter 5, but it can easily be explained
using Figure 3.3. Coherent regions imply that the information in the distribution is highly dis-
criminative. Incoherent regions imply that the distribution is uninformative. e discriminative