Model is not properly moved to GPU memory with torch.no_grad()

ekazakos · August 24, 2022, 9:04am

Hi everyone,

I’m using OWLViTForObjectDetection model and I want to perform inference on the GPU. So what I’m doing is something like:

model = model.to(device='cuda')
with torch.no_grad():
    model.eval()
    data = data.to(device='cuda')
    # inference code

It seems that the inclusion of torch.no_grad() is probably causing some of the model’s parameters to not be copied in the GPU memory because I’m getting an error that all tensors should be on the same device but at least two different devices were found (cuda and cpu). If I remove torch.no_grad() the error does not happen but then I get an out of memory error because all the model’s activation are kept in GPU memory for gradient calculation.

This has not happened to me ever in the past with various models that I’ve been using, so I’m wondering whether it is particularly related to HuggingFace models. Have this occurred to anyone else? Are there any known workarounds for this?

Thank you!

adirik · August 24, 2022, 9:55am

Hi @ekazakos,

Your are probably getting a GPU error unrelated to torch.no_grad() if you installed the PyPI release of transformers with pip install transformers. Sorry about that! This issue was fixed a few weeks ago and you should be able to run the model without any problems if you install the development branch instead:
pip install -q git+https://github.com/huggingface/transformers.git

In general, there is no need to call the eval() method within torch.no_grad(). If your issue persists, could you copy paste the minimal code to reproduce the error and the full error log?

model = model.to(device='cuda')
model.eval()
with torch.no_grad():
    data = data.to(device='cuda')

Hope this helps!

ekazakos · August 24, 2022, 9:58am

Thank you @adirik !! Will shortly try and let you know!

nielsr · August 24, 2022, 11:00am

Note that PyTorch moves a model in-place, so it’s sufficient to do:

model.to(device:"cuda")

ekazakos · August 24, 2022, 11:04am

Thanks! I’m using PyTorch a few years now and I didn’t know about this

ekazakos · August 24, 2022, 2:26pm

Hi @adirik,

It works! Thank you!

Topic		Replies	Views
Inference without gradient computation? 🤗Transformers	2	7548	December 26, 2024
CUDA OOM on model(inputs) but not on model.generate(inputs), but doesn't generate use model(inputs)? Intermediate	4	322	May 4, 2024
Expected all tensors to be on the same device Beginners	3	9700	April 30, 2022
Cannot pin 'torch.cuda.LongTensor' only dense CPU tensors can be pinned 🤗Transformers	1	1245	September 26, 2024
Move model with device_map="balanced" to CPU 🤗Transformers	1	6660	February 5, 2024

Model is not properly moved to GPU memory with torch.no_grad()

Related topics