浅谈LabelSmooth两种实现及推导开发者社区

link之家

链接快照平台

输入网页链接，自动生成快照
标签化管理网页链接

浅谈LabelSmooth两种实现及推导

# labelsmooth 
import torch 
import torch.nn as nn 
import torch.nn.functional as F 
class LabelSmoothingCrossEntropy(nn.Module):
    NLL loss with label smoothing.
    def __init__(self, smoothing=0.1):
        Constructor for the LabelSmoothing module.
        :param smoothing: label smoothing factor
        super(LabelSmoothingCrossEntropy, self).__init__()
        assert smoothing < 1.0
        self.smoothing = smoothing
        self.confidence = 1. - smoothing
    def forward(self, x, target):
        logprobs = F.log_softmax(x, dim=-1)
        nll_loss = -logprobs.gather(dim=-1, index=target.unsqueeze(1))
        nll_loss = nll_loss.squeeze(1)
        smooth_loss = -logprobs.mean(dim=-1)
        loss = self.confidence * nll_loss + self.smoothing * smooth_loss
        return loss.mean()

def one_hot(x, num_classes, on_value=1., off_value=0., device='cuda'):
    x = x.long().view(-1, 1)
    return torch.full((x.size()[0], num_classes), off_value, device=device).scatter_(1, x, on_value)
def mixup_target(target, num_classes, lam=1., smoothing=0.0, device='cuda'):
    off_value = smoothing / num_classes

浅谈LabelSmooth两种实现及推导

浅谈LabelSmooth两种实现及推导

一、交叉熵损失(CrossEntropyLoss)

二、LabelSmooth

三、公式推导