I'm here to tell about an error I got while trying to run the TempoGAN on a brand new server. The error happens when "session.run" is called, and it gives:
Internal: Blas SGEMM launch failed : m=216000, n=32, k=4
If you search on the internet, everyone will tell this is due to lack of memory in the GPU. Ok, this net is quite large but... I have 48GB available to me! This net can't be that greedy.
So, what did I found? Yes, it needs more memory in the graphics card... BECAUSE IT IS NOT ALLOCATING ANY! The code is from 2018, the GPU architecture is modern, and cuBLAS from CUDA 10 does not know how to do its job.
I'm sorry, but you and I will have to upgrade the code to Tensorflow 2.X.
Good luck for us.
No comments:
Post a Comment