Loss Functions Summary (Part 8): MultiMarginLoss and MultiLabelMarginLoss

news2025/7/4 1:41:41

1 Introduction
2 Loss functions
- 2.1 MultiMarginLoss
- 2.2 MultiLabelMarginLoss
3 Summary

1 Introduction

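Earlier articles in this series covered a range of loss functions (L1Loss, MSELoss, BCELoss, CrossEntropyLoss, NLLLoss, CTCLoss, PoissonNLLLoss, GaussianNLLLoss, KLDivLoss, BCEWithLogitsLoss, MarginRankingLoss, HingeEmbeddingLoss). This article continues from there and introduces two more of the less common loss functions.
[Figure: diagram of how a loss function works (image not included)]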

2 Loss functions

2.1 MultiMarginLoss

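MultiMarginLoss is a loss function for multi-class classification, where each sample belongs to exactly one class. Its goal is to make the score of the correct class exceed the score of every incorrect class by at least a given margin; any incorrect class that comes within the margin is penalized. It is typically used when training neural networks so that the correct class receives a high score and the incorrect classes receive low scores. For a single sample with score vector $x$ and target class $y$, MultiMarginLoss is defined as: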
$\text{loss}(x, y)=\frac{\sum_{i \neq y} w[y] \cdot \max(0,\ \text{margin}-x[y]+x[i])^p}{x.\text{size}(0)}$

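where:

$p$ : defaults to 1; only 1 and 2 are supported.
$\text{margin}$ : defaults to 1.
$w[y]$ : the weight of the target class $y$. If given, weight must be a float tensor whose length equals the number of classes C, i.e. every class must have a weight.
$x.\text{size}(0)$ : the number of classes (the length of the score vector $x$).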

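Code (PyTorch):

import torch
import torch.nn as nn

loss = nn.MultiMarginLoss()  # defaults: p=1, margin=1.0, no class weights
x = torch.tensor([[0.1, 0.2, 0.4, 0.8]])  # scores for 4 classes, batch of 1
y = torch.tensor([3])  # index of the correct class
# 0.25 * ((1-(0.8-0.1)) + (1-(0.8-0.2)) + (1-(0.8-0.4)))
print(loss(x, y))  # approximately 0.325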
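
To make the MultiMarginLoss formula concrete, here is a minimal pure-Python sketch that evaluates it for a single sample. The function name `multi_margin_loss_ref` and its argument layout are made up for illustration and are not part of PyTorch; the sketch assumes the weight multiplies the hinge term after it is raised to the power $p$, as the formula is written.

```python
def multi_margin_loss_ref(x, y, p=1, margin=1.0, weight=None):
    """Hypothetical reference implementation for ONE sample.

    x      : list of class scores
    y      : index of the correct class
    weight : optional list with one weight per class
    """
    if p not in (1, 2):
        raise ValueError("p must be 1 or 2")
    w = 1.0 if weight is None else weight[y]
    total = 0.0
    for i, score in enumerate(x):
        if i == y:
            continue  # the correct class is not compared with itself
        # hinge term: positive only if class i is within `margin` of class y
        total += w * max(0.0, margin - x[y] + score) ** p
    return total / len(x)  # divide by the number of classes


x = [0.1, 0.2, 0.4, 0.8]
print(multi_margin_loss_ref(x, 3))       # about 0.325, same as the example above
print(multi_margin_loss_ref(x, 3, p=2))  # about 0.1525, squared hinge terms
```

With the default arguments this reproduces the hand calculation in the comment above: (0.3 + 0.4 + 0.6) / 4.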

Margin-based losses of this kind are widely used in tasks such as Siamese networks and triplet networks.

2.2 MultiLabelMarginLoss

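MultiLabelMarginLoss is a loss function for multi-label classification, where each sample can belong to several classes. It trains the model to score every class the sample belongs to above every class it does not belong to by a margin of at least 1, and penalizes each pair that violates this. For a single sample with score vector $x$ and target indices $y$, MultiLabelMarginLoss is defined as: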
$\text{loss}(x, y)=\sum_{ij}\frac{\max(0,\ 1-(x[y[j]]-x[i]))}{x.\text{size}(0)}$

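where:

$x[y[j]]$ : the output value for a class that the sample belongs to.
$x[i]$ : the output value for a class that the sample does not belong to; that is, $i\neq y[j]$ for all $i$ and $j$.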

Code (PyTorch):

import torch
import torch.nn as nn

loss = nn.MultiLabelMarginLoss()
x = torch.tensor([[0.1, 0.2, 0.4, 0.8]])  # scores for 4 classes, batch of 1
# for target y, only consider labels 3 and 0, not after label -1
y = torch.tensor([[3, 0, -1, 1]])  # integer (long) tensor, same shape as x
# 0.25 * ((1-(0.1-0.2)) + (1-(0.1-0.4)) + (1-(0.8-0.2)) + (1-(0.8-0.4)))
print(loss(x, y))  # approximately 0.85
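
The MultiLabelMarginLoss formula and the role of the -1 terminator can also be checked with a minimal pure-Python sketch for a single sample. The function name `multilabel_margin_loss_ref` is made up for illustration and is not part of PyTorch; the sketch assumes the target list is read only up to the first negative entry, as in the example above.

```python
def multilabel_margin_loss_ref(x, y):
    """Hypothetical reference implementation for ONE sample.

    x : list of class scores
    y : list of target class indices, terminated by the first -1
    """
    targets = []
    for label in y:
        if label < 0:
            break  # everything after the first -1 is ignored
        targets.append(label)
    target_set = set(targets)

    total = 0.0
    for j in targets:                 # every class the sample belongs to
        for i in range(len(x)):       # every class it does not belong to
            if i in target_set:
                continue
            total += max(0.0, 1.0 - (x[j] - x[i]))
    return total / len(x)  # divide by the number of classes


x = [0.1, 0.2, 0.4, 0.8]
y = [3, 0, -1, 1]  # targets are classes 3 and 0; the trailing 1 is ignored
print(multilabel_margin_loss_ref(x, y))  # about 0.85, same as the example above
```

The four hinge terms are the pairs (3, 1), (3, 2), (0, 1) and (0, 2), which gives (0.4 + 0.6 + 1.1 + 1.3) / 4.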