# 实践:快速搭建一个可运行的docker gpu环境 ## 安装Docker和NVIDIA Docker 安装 ``` sudo apt-get update sudo apt-get install docker.io ``` 验证 ``` - docker info ``` ## 检查GPU驱动程序 nvidia-smi ``` 2.1. Verify You Have a CUDA-Capable GPU lspci | grep -i nvidia 00:09.0 3D controller: NVIDIA Corporation GP104GL [Tesla P4] (rev a1) 2.2. Verify You Have a Supported Version of Linux ``` uname -m && cat /etc/*release ``` 2.3. Verify the System Has gcc Installed gcc --version ``` ## 安装NVIDIA Container Toolkit 检查是否安装 nvidia-container-toolkit 软件包: ``` $ dpkg -l | grep nvidia-container-toolkit ii nvidia-container-toolkit 1.13.5-1 amd64 NVIDIA Container toolkit ii nvidia-container-toolkit-base 1.13.5-1 amd64 NVIDIA Container Toolkit Base ``` 添加 NVIDIA Container Toolkit APT 仓库 ``` distribution=$(. /etc/os-release;echo $ID$VERSION_ID) curl -s -L https://nvidia.github.io/nvidia-docker/gpgkey | sudo apt-key add - curl -s -L https://nvidia.github.io/nvidia-docker/$distribution/nvidia-docker.list | sudo tee /etc/apt/sources.list.d/nvidia-docker-$distribution.list curl -fsSL https://nvidia.github.io/libnvidia-container/gpgkey | sudo gpg --dearmor -o /usr/share/keyrings/nvidia-container-toolkit-keyring.gpg \ && curl -s -L https://nvidia.github.io/libnvidia-container/stable/deb/nvidia-container-toolkit.list | \ sed 's#deb https://#deb [signed-by=/usr/share/keyrings/nvidia-container-toolkit-keyring.gpg] https://#g' | \ sudo tee /etc/apt/sources.list.d/nvidia-container-toolkit.list ``` ### 备选安装 Configure the production repository: ``` curl -fsSL https://nvidia.github.io/libnvidia-container/gpgkey | sudo gpg --dearmor -o /usr/share/keyrings/nvidia-container-toolkit-keyring.gpg \ && curl -s -L https://nvidia.github.io/libnvidia-container/stable/deb/nvidia-container-toolkit.list | \ sed 's#deb https://#deb [signed-by=/usr/share/keyrings/nvidia-container-toolkit-keyring.gpg] https://#g' | \ sudo tee /etc/apt/sources.list.d/nvidia-container-toolkit.list Optionally, configure the repository to use experimental packages: sed -i -e '/experimental/ s/^#//g' /etc/apt/sources.list.d/nvidia-container-toolkit.list ``` ### 安装 更新 APT 软件包索引: ``` sudo apt-get update ``` 安装 NVIDIA Container Toolkit 软件包 ``` sudo apt-get install -y nvidia-container-toolkit ``` 重新启动 Docker 服务: ``` sudo systemctl restart docker ``` 验证安装: ``` sudo docker run --rm --gpus all nvidia/cuda:11.4-base nvidia-smi ```