环境准备¶
Basic Commands¶
$ lspci | grep VGA # identify and list the GPUs in your system
nvidia-smi # NVIDIA System Management Interface
sudo lshw -C display # provide detailed information about the display controllers including graphics cards.
inxi -G # information about the graphics subsystem, including details about the GPU and the display.
sudo hwinfo --gfxcard # obtain detailed information about the graphics cards
示例1¶
列出所有连接到系统上的PCI设备,并通过grep过滤出那些描述为与VGA(Video Graphics Array)兼容的控制器:
$ lspci | grep 3D
23:00.0 3D controller: NVIDIA Corporation Device 26ba (rev a1)
# 说明
# 注意:这个才是 Nvidia 的GPU芯片
$ lspci | grep VGA
00:02.0 VGA compatible controller: Cirrus Logic GD 5446
# 说明:
# 注意:这个不是nvidia的GPU
# 02:01.0:
# 这是设备的总线地址,它指出了该设备在PCI总线上的位置。
# 这里的“02”表示的是主总线编号,
# “01”是子总线上的设备编号,
# “0”是指设备上的功能号(多数情况下为0,除非设备支持多个功能)。
# VGA compatible controller:
# 这表明这是一个与VGA标准兼容的控制器,即它是一个能够处理图形输出的设备。
# Cirrus Logic GD 5446:
# 这是具体的设备型号。
# Cirrus Logic是一家知名的半导体公司,这里提到的GD 5446是该公司生产的一个老式图形芯片。
# 这款芯片主要应用于较旧或特定需求的系统中,比如一些虚拟机环境或者是嵌入式系统里,因为它可以提供基本的2D图形加速功能。
显示系统中与显示相关的硬件设备的详细信息:
$ sudo lshw -C display
*-display:0
description: VGA compatible controller
product: GD 5446
vendor: Cirrus Logic
physical id: 2
bus info: pci@0000:00:02.0
logical name: /dev/fb0
version: 00
width: 32 bits
clock: 33MHz
capabilities: vga_controller rom fb
configuration: depth=16 driver=cirrus latency=0 resolution=1024,768
resources: irq:0 memory:fa000000-fbffffff memory:fe424000-fe424fff memory:c0000-dffff
*-display:1
description: 3D controller
product: GV100GL [Tesla V100 SXM2 32GB]
vendor: NVIDIA Corporation
physical id: 9
bus info: pci@0000:00:09.0
version: a1
width: 64 bits
clock: 33MHz
capabilities: pm msi pciexpress bus_master cap_list
configuration: driver=nvidia latency=0
resources: iomemory:100-ff iomemory:180-17f irq:34 memory:fd000000-fdffffff memory:1000000000-17ffffffff memory:1800000000-1801ffffff
说明:
第一个:
# 基于Cirrus Logic GD 5446芯片的显卡。
# 这种类型的显卡通常出现在虚拟机或者一些旧的系统环境中,提供基本的图形处理能力
第二个:
# NVIDIA公司的显卡,它提供了更强大的图形和3D渲染能力。
# 这个显卡使用了NVIDIA的驱动程序,并且支持更高的分辨率和颜色深度
nvidia-smi:
Thu Feb 29 19:54:59 2024
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 520.61.05 Driver Version: 520.61.05 CUDA Version: 11.8 |
|-------------------------------+----------------------+----------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|===============================+======================+======================|
| 0 Tesla V100-SXM2... On | 00000000:00:09.0 Off | 0 |
| N/A 41C P0 37W / 300W | 4792MiB / 32768MiB | 0% Default |
| | | N/A |
+-------------------------------+----------------------+----------------------+
+-----------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=============================================================================|
| 0 N/A N/A 7562 C python 884MiB |
| 0 N/A N/A 1425768 C ./main 836MiB |
| 0 N/A N/A 2854007 C python 1092MiB |
| 0 N/A N/A 3937597 C ...3/envs/btc_env/bin/python 1164MiB |
| 0 N/A N/A 4027652 C ...3/envs/btc_env/bin/python 808MiB |
+-----------------------------------------------------------------------------+
说明:
名称: NVIDIA V100
inxi -G:
Graphics:
Device-1: Cirrus Logic GD 5446 driver: cirrus v: kernel
Device-2: NVIDIA GV100GL [Tesla V100 SXM2 32GB] driver: nvidia v: 520.61.05
Display: server: X.org v: 1.21.1.4 driver: gpu: cirrus note: X driver n/a tty: 231x64
Message: GL data unavailable in console. Try -G --display
注:
需要安装:
sudo apt install inxi
sudo hwinfo –gfxcard:
18: PCI 09.0: 0302 3D controller
[Created at pci.386]
Unique ID: WL76.hcMmklvUUx9
SysFS ID: /devices/pci0000:00/0000:00:09.0
SysFS BusID: 0000:00:09.0
Hardware Class: graphics card
Model: "nVidia GV100GL [Tesla V100 SXM2 32GB]"
Vendor: pci 0x10de "nVidia Corporation"
Device: pci 0x1db5 "GV100GL [Tesla V100 SXM2 32GB]"
SubVendor: pci 0x10de "nVidia Corporation"
SubDevice: pci 0x1249
Revision: 0xa1
Driver: "nvidia"
Driver Modules: "nvidia"
Memory Range: 0xfd000000-0xfdffffff (rw,non-prefetchable)
Memory Range: 0x1000000000-0x17ffffffff (ro,non-prefetchable)
Memory Range: 0x1800000000-0x1801ffffff (ro,non-prefetchable)
IRQ: 34 (2828758 events)
Module Alias: "pci:v000010DEd00001DB5sv000010DEsd00001249bc03sc02i00"
Driver Info #0:
Driver Status: nvidiafb is not active
Driver Activation Cmd: "modprobe nvidiafb"
Driver Info #1:
Driver Status: nouveau is not active
Driver Activation Cmd: "modprobe nouveau"
Driver Info #2:
Driver Status: nvidia_drm is active
Driver Activation Cmd: "modprobe nvidia_drm"
Driver Info #3:
Driver Status: nvidia is active
Driver Activation Cmd: "modprobe nvidia"
Config Status: cfg=new, avail=yes, need=no, active=unknown
19: PCI 02.0: 0300 VGA compatible controller (VGA)
[Created at pci.386]
Unique ID: _Znp.v67VofT6db4
SysFS ID: /devices/pci0000:00/0000:00:02.0
SysFS BusID: 0000:00:02.0
Hardware Class: graphics card
Model: "Red Hat QEMU Virtual Machine"
Vendor: pci 0x1013 "Cirrus Logic"
Device: pci 0x00b8 "GD 5446"
SubVendor: pci 0x1af4 "Red Hat, Inc."
SubDevice: pci 0x1100 "QEMU Virtual Machine"
Driver: "cirrus"
Driver Modules: "cirrus"
Memory Range: 0xfa000000-0xfbffffff (ro,non-prefetchable)
Memory Range: 0xfe424000-0xfe424fff (rw,non-prefetchable)
Memory Range: 0x000c0000-0x000dffff (rw,non-prefetchable,disabled)
Module Alias: "pci:v00001013d000000B8sv00001AF4sd00001100bc03sc00i00"
Driver Info #0:
XFree86 v4 Server Module: cirrus
Config Status: cfg=new, avail=yes, need=no, active=unknown
Primary display adapter: #19
注:需要安装
sudo apt install hwinfo
禁用Nouveau的驱动¶
在/etc/modprobe.d/blacklist.conf最后添加如下代码:(用来禁用nouveau第三方驱动):
blacklist nouveau
options nouveau modeset=0
执行:
$ update-initramfs -u
重启后,执行以下代码,若没有显示则禁用成功:
$ lsmod | grep nouveau
检测NVIDIA显卡型号¶
方法一:使用ubuntu-drivers-common这个软件`:
$ apt install ubuntu-drivers-common
$ ubuntu-drivers devices
== /sys/devices/pci0000:00/0000:00:01.1/0000:01:00.0 ==
modalias : pci:v000010DEd00002560sv000017AAsd00003A80bc03sc00i00
vendor : NVIDIA Corporation
model : GA106M [GeForce RTX 3060 Mobile / Max-Q]
driver : nvidia-driver-530 - distro non-free
driver : nvidia-driver-525-server - distro non-free
driver : nvidia-driver-515-server - distro non-free
driver : nvidia-driver-525-open - distro non-free
driver : nvidia-driver-530-open - distro non-free recommended
driver : nvidia-driver-515 - distro non-free
driver : nvidia-driver-515-open - distro non-free
driver : nvidia-driver-525 - distro non-free
driver : nvidia-driver-510 - distro non-free
driver : nvidia-driver-470-server - distro non-free
driver : nvidia-driver-470 - distro non-free
driver : xserver-xorg-video-nouveau - distro free builtin
方法二:
$ sudo lshw -numeric -C display
*-display UNCLAIMED
description: VGA compatible controller
product: GA106M [GeForce RTX 3060 Mobile / Max-Q] [10DE:2560]
vendor: NVIDIA Corporation [10DE]
physical id: 0
bus info: pci@0000:01:00.0
version: a1
width: 64 bits
clock: 33MHz
capabilities: pm msi pciexpress vga_controller cap_list
configuration: latency=0
resources: iomemory:fa0-f9f iomemory:fc0-fbf memory:d0000000-d0ffffff memory:fa00000000-fbffffffff memory:fc00000000-fc01ffffff ioport:2000(size=128) memory:d1080000-d10fffff
*-graphics
product: EFI VGA
physical id: 2
logical name: /dev/fb0
capabilities: fb
configuration: depth=32 resolution=2560,1600
方法三:
$ lspci -vnn | grep VGA
01:00.0 VGA compatible controller [0300]: NVIDIA Corporation GA106M [GeForce RTX 3060 Mobile / Max-Q] [10de:2560] (rev a1) (prog-if 00 [VGA controller])
驱动安装方法¶
卸载驱动:
$ sudo apt-get remove --purge nvidia*
更新:
$ sudo apt update
查看并安全驱动:
$ sudo ubuntu-drivers devices
== /sys/devices/pci0000:00/0000:00:01.1/0000:01:00.0 ==
modalias : pci:v000010DEd00002560sv000017AAsd00003A80bc03sc00i00
vendor : NVIDIA Corporation
model : GA106M [GeForce RTX 3060 Mobile / Max-Q]
driver : nvidia-driver-525 - distro non-free
driver : nvidia-driver-470-server - distro non-free
driver : nvidia-driver-530-open - distro non-free recommended
driver : nvidia-driver-525-open - distro non-free
driver : nvidia-driver-510 - distro non-free
driver : nvidia-driver-525-server - distro non-free
driver : nvidia-driver-515 - distro non-free
driver : nvidia-driver-470 - distro non-free
driver : nvidia-driver-530 - distro non-free
driver : nvidia-driver-515-open - distro non-free
driver : nvidia-driver-515-server - distro non-free
driver : xserver-xorg-video-nouveau - distro free builtin
安装某个驱动:
$ sudo apt install nvidia-driver-530
备注
要记得重启
重启后看看你们的驱动是否装好:
$> nvidia-smi
Mon Apr 24 23:50:19 2023
+---------------------------------------------------------------------------------------+
| NVIDIA-SMI 530.41.03 Driver Version: 530.41.03 CUDA Version: 12.1 |
|-----------------------------------------+----------------------+----------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|=========================================+======================+======================|
| 0 NVIDIA GeForce RTX 3060 L... Off| 00000000:01:00.0 On | N/A |
| N/A 43C P5 12W / N/A| 264MiB / 6144MiB | 2% Default |
| | | N/A |
+-----------------------------------------+----------------------+----------------------+
+---------------------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=======================================================================================|
| 0 N/A N/A 1887 G /usr/lib/xorg/Xorg 136MiB |
| 0 N/A N/A 2121 G /usr/bin/gnome-shell 125MiB |
+---------------------------------------------------------------------------------------+