常见任务与损失函数对照表

| 任务类型 | 典型场景 | 推荐损失函数 | 数据格式 |
|---------|---------|------------|---------|
| **语义检索** | 问答系统、文档检索 | `MultipleNegativesRankingLoss` | `(query, passage)` 对 |
| **语义相似度** | STS、相关性评分 | `CoSENTLoss` | `(sent1, sent2)` + `score` |
| **文本分类** | NLI、关系分类 | `SoftmaxLoss` | `(sent1, sent2)` + `class` |
| **重复检测** | 去重、相似度二分类 | `ContrastiveLoss` | `(sent1, sent2)` + `0/1` |
| **三元组学习** | 有明确负例的检索 | `TripletLoss` | `(anchor, pos, neg)` |
| **无监督学习** | 领域适应、预训练 | `DenoisingAutoEncoderLoss` | `(damaged, original)` |
| **知识蒸馏** | 模型压缩 | `MSELoss` / `MarginMSELoss` | 教师模型嵌入 |


### 1. 根据数据格式选择
```python
# 检查你的数据格式
print(dataset.column_names)
# 如果只有两列文本，无标签 → MultipleNegativesRankingLoss
# 如果有两列文本 + score列 → CoSENTLoss
# 如果有两列文本 + label列（类别）→ SoftmaxLoss
```

### 2. 根据任务目标选择
- 目标是检索：优先 `MultipleNegativesRankingLoss`
- 目标是相似度评分：优先 `CoSENTLoss`
- 目标是分类：使用 `SoftmaxLoss`

### 3. 性能优化建议
- 检索任务：使用 `CachedMultipleNegativesRankingLoss` 增大批次
- 相似度任务：优先 `CoSENTLoss` 而非 `CosineSimilarityLoss`
- 有硬负例：使用 `OnlineContrastiveLoss` 而非 `ContrastiveLoss`

sentence-transformers Embedding Models 损失函数

首页

分类

时间线

友链

动态

工具

联系我

Qwen/Qwen3-235B-A22B-Instruct-2507-FP8 部署

微调owl32b，让mobile agent v3更聪明

任务类型	典型场景	推荐损失函数	数据格式
语义检索	问答系统、文档检索	`MultipleNegativesRankingLoss`	`(query, passage)` 对
语义相似度	STS、相关性评分	`CoSENTLoss`	`(sent1, sent2)` + `score`
文本分类	NLI、关系分类	`SoftmaxLoss`	`(sent1, sent2)` + `class`
重复检测	去重、相似度二分类	`ContrastiveLoss`	`(sent1, sent2)` + `0/1`
三元组学习	有明确负例的检索	`TripletLoss`	`(anchor, pos, neg)`
无监督学习	领域适应、预训练	`DenoisingAutoEncoderLoss`	`(damaged, original)`
知识蒸馏	模型压缩	`MSELoss` / `MarginMSELoss`	教师模型嵌入

目录

1. 根据数据格式选择

2. 根据任务目标选择

3. 性能优化建议