CogVideoX-Fun-V1.1-Reward-LoRAs:通过奖励反向传播技术训练Lora,以优化CogVideoX-Fun-V1.1生成的视频

视频模型2个月前更新 小马良
190 0

CogVideoX-Fun-V1.1-Reward-LoRAs是通过奖励反向传播技术训练Lora,以优化CogVideoX-Fun-V1.1生成的视频,使其更好地与人类偏好保持一致。

CogVideoX-Fun-V1.1-Reward-LoRAs:通过奖励反向传播技术训练Lora,以优化CogVideoX-Fun-V1.1生成的视频
模型名基础模型奖励模式下载地址描述
CogVideoX-Fun-V1.1-5b-InP-HPS2.1.safetensorsCogVideoX-Fun-V1.1-5bHPS v2.1LinkOfficial HPS v2.1 reward LoRA (rank=128 and network_alpha=64) for CogVideoX-Fun-V1.1-5b-InP. It is trained with a batch size of 8 for 1,500 steps.
CogVideoX-Fun-V1.1-2b-InP-HPS2.1.safetensorsCogVideoX-Fun-V1.1-2bHPS v2.1LinkOfficial HPS v2.1 reward LoRA (rank=128 and network_alpha=64) for CogVideoX-Fun-V1.1-2b-InP. It is trained with a batch size of 8 for 3,000 steps.
CogVideoX-Fun-V1.1-5b-InP-MPS.safetensorsCogVideoX-Fun-V1.1-5bMPSLinkOfficial MPS reward LoRA (rank=128 and network_alpha=64) for CogVideoX-Fun-V1.1-5b-InP. It is trained with a batch size of 8 for 5,500 steps.
CogVideoX-Fun-V1.1-2b-InP-MPS.safetensorsCogVideoX-Fun-V1.1-2bMPSLinkOfficial MPS reward LoRA (rank=128 and network_alpha=64) for CogVideoX-Fun-V1.1-2b-InP. It is trained with a batch size of 8 for 16,000 steps.
CogVideoX-Fun-V1.1-Reward-LoRAs:通过奖励反向传播技术训练Lora,以优化CogVideoX-Fun-V1.1生成的视频
© 版权声明

相关文章

暂无评论

none
暂无评论...