tf预训练模型转换为torch预训练模型

程序员文章站 2022-06-13 16:03:22

...

在将albert的tensorflow预训练模型转换为 torch类型预训练模型，踩了很多坑。终于解决，希望对大家有用

前期准备
创建一个环境带有torch和tf的环境，步骤如下：
首先创建环境
python conda create -n torchtf_env python=3.7
然后，安装torch（根据自己电脑的cuda安装）
python conda install pytorch torchvision torchaudio cudatoolkit=11.1 -c pytorch -c conda-forge
之后，继续安装tensorflow-gpu版本
python conda install tensorflow-gpu==1.15
最后安装transformers
pip install transformers

2 .从github上下载tensorflow预训练的albert版本

#! usr/bin/env python3
# -*- coding:utf-8 -*-
"""
Created on 19/03/2021 20:22 
@Author: lixj
"""

from __future__ import absolute_import
from __future__ import division
from __future__ import print_function

import argparse
import torch
from transformers import AlbertConfig, AlbertForPreTraining, load_tf_weights_in_albert
import logging
logging.basicConfig(level=logging.INFO)

def convert_tf_checkpoint_to_pytorch(tf_checkpoint_path, bert_config_file, pytorch_dump_path):
    # Initialise PyTorch model
    config = AlbertConfig.from_pretrained(bert_config_file)
    # print("Building PyTorch model from configuration: {}".format(str(config)))
    model = AlbertForPreTraining(config)
    # Load weights from tf checkpoint
    load_tf_weights_in_albert(model, config, tf_checkpoint_path)

    # Save pytorch-model
    print("Save PyTorch model to {}".format(pytorch_dump_path))
    torch.save(model.state_dict(), pytorch_dump_path)



if __name__ == "__main__":
    parser = argparse.ArgumentParser()
    # Required parameters
    parser.add_argument(
        "--tf_checkpoint_path", default='albert_base_en/model.ckpt-best', type=str,  help="Path to the TensorFlow checkpoint path."
    )
    parser.add_argument(
        "--bert_config_file",
        default='albert_base_en/albert_config.json',
        type=str,
        help="The config json file corresponding to the pre-trained BERT model. \n"
        "This specifies the model architecture.",
    )
    parser.add_argument(
        "--pytorch_dump_path", default='albert_base_en/pytorch_model.bin', type=str,help="Path to the output PyTorch model."
    )
    args = parser.parse_args()
    convert_tf_checkpoint_to_pytorch(args.tf_checkpoint_path, args.bert_config_file, args.pytorch_dump_path)

上一篇：健康的保护神——漫淡视保屏

下一篇：映众GTX 1080Ti冰龙超级版全面图解评测和显卡天梯图

tf预训练模型转换为torch预训练模型

PyTorch加载预训练模型实例(pretrained)

Tensorflow加载预训练模型和保存模型的实例

解决Pytorch修改预训练模型时遇到key不匹配的情况

使用pytorch搭建AlexNet操作(微调预训练模型及手动搭建)

Pytorch 预训练模型下载和加载

pytorch加载预训练模型与自己模型不匹配的解决方案

pytorch 预训练模型读取修改相关参数的填坑问题

Pytorch中使用Bert预训练模型，并给定句子得到对应的向量

pytorch学习之加载预训练模型

PyTorch预训练Bert模型的示例