Exploring large language model fine-tuning and its applications, and tracking cutting-edge techniques
torch.save({"model_state_dict": model.state_dict(), "optimizer_state_dict": optimizer.state_dict()}, "model_opt.pth")
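A checkpoint saved this way is usually read back with `torch.load` and `load_state_dict`. The following is a minimal round-trip sketch, not the author's training setup: the `nn.Linear` model, the SGD optimizer, and the temporary directory are hypothetical stand-ins chosen only to keep the example self-contained.

```python
import os
import tempfile

import torch
from torch import nn

# Hypothetical stand-ins: the source does not show the real model or optimizer.
model = nn.Linear(4, 2)
optimizer = torch.optim.SGD(model.parameters(), lr=0.01, momentum=0.9)

# Take one optimization step so the optimizer holds state (momentum buffers).
loss = model(torch.randn(8, 4)).sum()
loss.backward()
optimizer.step()

# Save both state dicts in one file, as in the line above.
# A temporary directory is used here (an assumption) to avoid cluttering the working directory.
path = os.path.join(tempfile.mkdtemp(), "model_opt.pth")
torch.save(
    {
        "model_state_dict": model.state_dict(),
        "optimizer_state_dict": optimizer.state_dict(),
    },
    path,
)

# Restore into freshly constructed objects with the same architecture and settings.
restored_model = nn.Linear(4, 2)
restored_optimizer = torch.optim.SGD(restored_model.parameters(), lr=0.01, momentum=0.9)

checkpoint = torch.load(path, map_location="cpu")
restored_model.load_state_dict(checkpoint["model_state_dict"])
restored_optimizer.load_state_dict(checkpoint["optimizer_state_dict"])
```

Saving the optimizer state alongside the weights matters mainly when training is to be resumed; for inference only, the model state dict alone is typically enough.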