Problem Description

DropPath (Stochastic Depth) [medium]

Regularization · medium

The v1 column dropout drops individual features. DropPath (Stochastic Depth, Huang et al. 2016) instead skips an entire ResNet residual block, independently for each sample:

$y^{(n)} = x^{(n)} + \text{mask}^{(n)} \cdot \text{residual}(x^{(n)}) / (1-p)$

$\text{mask}^{(n)} \in \{0, 1\}$ is drawn from $\text{Bernoulli}(1-p)$, independently for each sample in the batch.
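
The division by $(1-p)$ is what keeps the output unbiased. A short check, assuming the mask is independent of $x^{(n)}$ and $p < 1$:

$\mathbb{E}[\text{mask}^{(n)}] = 1 \cdot (1-p) + 0 \cdot p = 1-p$

$\mathbb{E}[y^{(n)} \mid x^{(n)}] = x^{(n)} + \frac{\mathbb{E}[\text{mask}^{(n)}]}{1-p} \cdot \text{residual}(x^{(n)}) = x^{(n)} + \text{residual}(x^{(n)})$

So, in expectation, the training-time output matches the plain block evaluated without any dropping.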

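In a single ResNet block $y = x + f(x)$, this means only the residual branch $f(x)$ (written $\text{residual}(x)$ above) is switched on or off; the identity path $x$ always passes through.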
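
As a small illustration of this on/off behaviour, the sketch below applies a hand-picked mask to a batch of two samples. The branch function `f`, the helper name `residual_block_with_mask`, and the mask values are hypothetical, chosen only to make the result easy to check by hand; a real mask would be sampled from Bernoulli(1-p).

```python
import numpy as np

def f(x):
    # Hypothetical residual branch, used only for illustration.
    return 2.0 * x

def residual_block_with_mask(x, mask, p):
    # y = x + mask * f(x) / (1 - p), with one mask entry per sample.
    # mask arrives with shape (N,); reshape it to (N, 1, ..., 1) so that
    # it broadcasts over all non-batch dimensions of x.
    mask_shape = (x.shape[0],) + (1,) * (x.ndim - 1)
    mask = np.asarray(mask, dtype=x.dtype).reshape(mask_shape)
    return x + mask * f(x) / (1.0 - p)

x = np.array([[1.0, 2.0], [3.0, 4.0]])
y = residual_block_with_mask(x, mask=[1, 0], p=0.5)
# Sample 0 keeps its branch, scaled by 1 / (1 - p) = 2: y[0] = x[0] + 4 * x[0].
# Sample 1 has its branch dropped, so y[1] equals x[1] (pure identity).
```

The second row comes back unchanged, which is the "skip the whole block for this sample" case.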

In this problem the input is the residual tensor $x$ (the already-computed branch output, not the block input), and only drop_path is applied to it:

if not training or p == 0:
    return x
# mask: shape (N, 1, 1, ...), one independent Bernoulli(1-p) draw per sample,
# broadcast over the remaining dimensions of x
return x * mask / (1 - p)

(This assumes DropPath is applied to the residual output rather than to the whole NN block, the same as timm.models.layers.DropPath in PyTorch.)

Complete the function drop_path(x, p, seed, training).
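
One possible shape for the function is sketched below in NumPy. It is not a reference solution: how `seed` is turned into random numbers is an assumption (here `np.random.default_rng(seed)`), so the exact mask may differ from what a grader expects, and the `p >= 1` branch is an added guard against division by zero that the problem statement does not mention.

```python
import numpy as np

def drop_path(x, p, seed, training):
    """Sketch of DropPath on an already-computed residual tensor x of shape (N, ...)."""
    if not training or p == 0:
        return x
    if p >= 1:
        # Assumption: with p == 1 every sample is dropped; skip the division.
        return np.zeros_like(x)
    # Assumption: the seed feeds NumPy's default_rng; a grader may use another RNG.
    rng = np.random.default_rng(seed)
    # One Bernoulli(1 - p) draw per sample, shaped (N, 1, ..., 1) for broadcasting.
    mask_shape = (x.shape[0],) + (1,) * (x.ndim - 1)
    mask = (rng.random(mask_shape) < (1.0 - p)).astype(x.dtype)
    # Rescale kept samples so the expected output equals x.
    return x * mask / (1.0 - p)

x = np.ones((4, 3, 2))
out = drop_path(x, p=0.5, seed=0, training=True)
# Each out[n] is either all zeros (dropped) or all 2.0 (kept and rescaled).
```

Because the mask has a single entry per sample, every element within one sample shares the same fate, which is what distinguishes this from element-wise dropout.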
