Description
1. Problem description
I created a pod with nvidia.com/gpucores=20. When I started the training task in the pod, the HAMi-WEB asset overview showed 100% compute utilization, but it should be capped at 20%.
2. Environment Configuration
I configured gpuCorePolicy=force
3. Here is the YAML file I used to create the pod:
cat test.yml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: gpu-pod
spec:
  replicas: 1
  selector:
    matchLabels:
      app: gpu-app
  template:
    metadata:
      labels:
        app: gpu-app
    spec:
      containers:
        - name: simple-container
          image: pytorch/pytorch:2.9.0-cuda13.0-cudnn9-runtime
          command: ["python", "test_gpu.py"]
          env:
            - name: LIBCUDA_LOG_LEVEL
              value: "4"
          resources:
            requests:
              cpu: "1"
              memory: "1Gi"
              nvidia.com/gpu: "1"
              nvidia.com/gpucores: 20
              #nvidia.com/gpumem: "4000"
            limits:
              cpu: "1"
              memory: "1Gi"
              nvidia.com/gpu: "1"
              nvidia.com/gpucores: 20
              #nvidia.com/gpumem: "4000"
          volumeMounts:
            - name: data-volume
              mountPath: /workspace
            - name: shm-volume
              mountPath: /dev/shm
      volumes:
        - name: data-volume
          hostPath:
            path: /root/vgpu
            type: Directory
        - name: shm-volume
          emptyDir:
            medium: Memory
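The Deployment runs `test_gpu.py`, which is not included above. For reference, a minimal stand-in that produces sustained GPU load looks like the sketch below (the workload is my assumption, not the original script); with the core limiter enforcing `nvidia.com/gpucores: 20`, utilization observed via `nvidia-smi` inside the pod should settle near 20%, which can then be compared against the HAMi-WEB figure.

```python
# Hypothetical stand-in for test_gpu.py (the original script is not shown
# in this issue). It spins dense matmuls so the HAMi core limiter has a
# steady compute load to throttle.
import time

def run_for(seconds: float) -> int:
    """Run GPU matmuls for `seconds`; return how many iterations completed."""
    import torch  # available in the pytorch/pytorch:2.9.0-cuda13.0 image
    a = torch.randn(4096, 4096, device="cuda")
    b = torch.randn(4096, 4096, device="cuda")
    iters = 0
    deadline = time.monotonic() + seconds
    while time.monotonic() < deadline:
        a = a @ b  # sustained compute for the core limiter to throttle
        iters += 1
    torch.cuda.synchronize()  # make sure all queued work finished
    return iters

if __name__ == "__main__":
    try:
        # Run long enough to watch utilization settle in nvidia-smi / HAMi-WEB.
        print("iterations completed:", run_for(600.0))
    except Exception as exc:  # no torch / no GPU outside the pod
        print("skipped:", exc)
```

While this runs, `kubectl exec` into the pod and check `nvidia-smi` to see which utilization value the limiter actually reports.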