k8s pod使用gpu前提
- k8s节点有gpu显卡
- k8s节点安装gpu显卡驱动
- k8s节点docker或containerd运行时使用nvidia-container-runtime
- k8s部署gpu device plugin daemonset
1.安装gpu显卡驱动
查看节点显卡类型
nvidia-smi -L
GPU 0: Tesla V100-SXM2-32GB (UUID: GPU-f2b15a66-0630-5f77-1f17-28abb3854f1c)
# 忘记没安装驱动,用不了上面命令,使用
lspci | grep -i nvidia
00:03.0 3D controller: NVIDIA Corporation Device 1eb8 (rev a1)
00:04.0 3D controller: NVIDIA Corporation Device 1eb8 (rev a1)
根据 1eb8到这个网站查
http://pci-ids.ucw.cz/mods/PC/10de?action=help?help=pci


根据型号到这个网站查找驱动安装程序
https://www.nvidia.com/Download/Find.aspx#
# 下载
wget https://us.download.nvidia.com/tesla/515.65.01/NVIDIA-Linux-x86_64-515.65.01.run
chmod +x NVIDIA-Linux-x86_64-515.65.01.run
# 上述安装程序依赖这些包,安装
apt install gcc linux-kernel-headers dkms
sh NVIDIA-Linux-x86_64-515.65.01.run --ui=none --disable-nouveau --no-install-libglvnd --dkms -s
# 使用下面命令验证是否安装成功。
nvidia-smi
Thu Nov 3 19:17:50 2022
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 515.65.01 Driver Version: 515.65.01 CUDA Version: 11.7 |
|-------------------------------+----------------------+----------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|===============================+======================+======================|
|


1万+

被折叠的 条评论
为什么被折叠?



