AttributeError: 'ORTTrainingArguments' object has no attribute 'deepspeed_plugin'

Yanis · August 1, 2023, 12:49pm

I’m getting the following error when switching my trainer to the optimum[onnxruntime] one in my TokenClassification app:

AttributeError                            Traceback (most recent call last)
<ipython-input-4-3f32f8ae7125> in <cell line: 184>()
    182 compute_metrics = compute_metrics_wrapper(id2label)
    183 
--> 184 trainer = ORTTrainer(
    185         model=model,
    186         args=training_args,

2 frames
/usr/local/lib/python3.10/dist-packages/optimum/onnxruntime/trainer.py in __init__(self, model, tokenizer, feature, args, data_collator, train_dataset, eval_dataset, model_init, compute_metrics, callbacks, optimizers, preprocess_logits_for_metrics, onnx_model_path)
    302         onnx_model_path: Union[str, os.PathLike] = None,
    303     ):
--> 304         super().__init__(
    305             model=model,
    306             args=args,

/usr/local/lib/python3.10/dist-packages/transformers/trainer.py in __init__(self, model, args, data_collator, train_dataset, eval_dataset, tokenizer, model_init, compute_metrics, callbacks, optimizers, preprocess_logits_for_metrics)
    334         self.is_in_train = False
    335 
--> 336         self.create_accelerator_and_postprocess()
    337 
    338         # memory metrics - must set up as early as possible

/usr/local/lib/python3.10/dist-packages/transformers/trainer.py in create_accelerator_and_postprocess(self)
   3805         # create accelerator object
   3806         self.accelerator = Accelerator(
-> 3807             deepspeed_plugin=self.args.deepspeed_plugin, gradient_accumulation_plugin=gradient_accumulation_plugin
   3808         )
   3809 

AttributeError: 'ORTTrainingArguments' object has no attribute 'deepspeed_plugin'

Below you can see the code for the part I changed to integrate the ONXX functionalities:

!pip install optimum[onnxruntime]

from optimum.onnxruntime import ORTTrainer, ORTTrainingArguments

model = AutoModelForTokenClassification.from_pretrained(
    checkpoint,
    id2label=id2label,
    label2id=label2id,
    ignore_mismatched_sizes=True
)

training_args = ORTTrainingArguments(
        output_dir=output_path,
        learning_rate=lr,
        per_device_train_batch_size=batch_size,
        per_device_eval_batch_size=batch_size,
        num_train_epochs=num_epochs,
        weight_decay=weight_decay,
        logging_steps=logging_steps,
        report_to="wandb",
        run_name = "ml-training-v1",
        evaluation_strategy="epoch",
        save_strategy=save_strategy,
        save_total_limit=1,
        load_best_model_at_end=load_best_model_at_end,
        metric_for_best_model="overall_f1",
        optim = "adamw_ort_fused"
    )

device = torch.device("cuda") if torch.cuda.is_available() else torch.device("cpu")
model.to(device)

compute_metrics = compute_metrics_wrapper(id2label)

trainer = ORTTrainer(
        model=model,
        args=training_args,
        train_dataset=tokenized_datasets["train"],
        eval_dataset=tokenized_datasets["test"],
        data_collator=data_collator,
        tokenizer=tokenizer,
        compute_metrics=compute_metrics,
        callbacks=callbacks,
        feature="token-classification"
    )

trainer.train()

metrics = trainer.evaluate(inference_with_ort=True)

I haven’t found this error anywhere… Any help is appreciated

crumbz · August 2, 2025, 8:40am

Encountering the same error as well. Are there any fixes for this?

John6666 · August 2, 2025, 11:31am

According to GitHub issues, this error often occurs due to library version incompatibility.

!pip install -U optimum[onnxruntime] accelerate transformers

github.com/huggingface/transformers

transformers Trainer has no attribute 'deepspeed_plugin'

opened 02:07PM - 23 Oct 23 UTC

closed 09:15AM - 25 Oct 23 UTC

shibing624

### System Info ``` Traceback (most recent call last): File "/apdcephfs_teg…_2/share_1367250/flemingxu/MedicalGPT/supervised_finetuning.py", line 1307, in <module> main() File "/apdcephfs_teg_2/share_1367250/flemingxu/MedicalGPT/supervised_finetuning.py", line 1248, in main trainer = SavePeftModelTrainer( File "/apdcephfs_teg_2/share_1367250/flemingxu/miniconda3/envs/py3.10/lib/python3.10/site-packages/transformers/trainer.py", line 335, in __init__ self.create_accelerator_and_postprocess() File "/apdcephfs_teg_2/share_1367250/flemingxu/miniconda3/envs/py3.10/lib/python3.10/site-packages/transformers/trainer.py", line 3853, in create_accelerator_and_postprocess deepspeed_plugin=self.args.deepspeed_plugin, AttributeError: 'PeftArguments' object has no attribute 'deepspeed_plugin' (py3.10) ``` ### Who can help? _No response_ ### Information - [ ] The official example scripts - [ ] My own modified scripts ### Tasks - [ ] An officially supported task in the `examples` folder (such as GLUE/SQuAD, ...) - [ ] My own task or dataset (give details below) ### Reproduction transformers==4.35.0.dev0 , run https://github.com/huggingface/transformers/blob/main/examples/pytorch/language-modeling/run_clm.py with llama2 model ### Expected behavior success

github.com/huggingface/optimum

Running optimum/examples/onnxruntime/training/language-modeling fails

opened 02:51PM - 12 Jul 23 UTC

closed 04:48PM - 24 Aug 23 UTC

pks

bug

### System Info ```shell > pip freeze | grep -P "(optimum|transformers|onnxr…untime|deepspeed)" deepspeed==0.9.5 onnxruntime==1.15.1 onnxruntime-gpu==1.15.1 onnxruntime-training==1.15.1 optimum==1.9.1 transformers==4.30.2 > python --version Python 3.9.16 ``` I also tried the latest version from github for `transformers` and `optimum`, resulting in the exact same error. ### Who can help? @JingyaHuang @echarlaix ### Information - [X] The official example scripts - [ ] My own modified scripts ### Tasks - [X] An officially supported task in the `examples` folder (such as GLUE/SQuAD, ...) - [ ] My own task or dataset (give details below) ### Reproduction Steps to reproduce the behavior: 1. Install optimum and other dependencies (I also noted that `pytest` is missing from the `requirements.txt` file. 2. Run: ``` torchrun --nproc_per_node=4 run_clm.py \ --model_name_or_path gpt2 \ --dataset_name wikitext \ --dataset_config_name wikitext-2-raw-v1 \ --do_train \ --output_dir test-clm \ --fp16 ``` 3. Observe stacktrace: ``` Traceback (most recent call last): File "[redacted]/optimum/examples/onnxruntime/training/language-modeling/run_clm.py", line 593, in <module> main() File "[redacted]/optimum/examples/onnxruntime/training/language-modeling/run_clm.py", line 519, in main trainer = ORTTrainer( File "[redacted]/venv/lib/python3.9/site-packages/optimum/onnxruntime/trainer.py", line 304, in __init__ super().__init__( File "[redacted]/venv/lib/python3.9/site-packages/transformers/trainer.py", line 336, in __init__ self.create_accelerator_and_postprocess() File "[redacted]/venv/lib/python3.9/site-packages/transformers/trainer.py", line 3842, in create_accelerator_and_postprocess deepspeed_plugin=self.args.deepspeed_plugin, gradient_accumulation_plugin=gradient_accumulation_plugin AttributeError: 'ORTTrainingArguments' object has no attribute 'deepspeed_plugin' ``` ### Expected behavior I'd expect to train a language model from scratch using `onnxruntime-training`.

Topic		Replies	Views
`run_translation.py` example is erroring out with the recommended settings DeepSpeed	1	6407	April 4, 2022
No module named 'deepspeed.checkpoint.utils' DeepSpeed	6	2239	June 28, 2023
AttributeError: 'NoneType' object has no attribute 'pad_token' 🤗Optimum	1	3516	May 3, 2023
Optimizer got an empty parameter list when using deepspeed Beginners	0	918	October 29, 2021
DeBERTa V2 ONNX with pipeline does not work Beginners	5	830	February 27, 2024

AttributeError: 'ORTTrainingArguments' object has no attribute 'deepspeed_plugin'

Related topics