模型能否通过梯度黑客手段规避 SFT 能力诱导？ — LessWrong | BestBlogs.dev

F

加载中...

模型能否通过梯度黑客手段规避 SFT 能力诱导？ — LessWrong | BestBlogs.dev