homelab/k8s/apps/llamacpp/configmap-cuda.yaml
Ultradesu fb7dfbee57
Changed uk-desktop llama model
2026-06-17 12:58:09 +01:00


apiVersion: v1
kind: ConfigMap
metadata:
  name: llamacpp-cuda-config
data:
  LLAMA_CACHE: /models
  LLAMA_ARG_HOST: 0.0.0.0
  LLAMA_ARG_PORT: "8080"
  LLAMA_ARG_HF_REPO: "unsloth/diffusiongemma-26B-A4B-it-GGUF:Q4_K_M"
  LLAMA_ARG_CTX_SIZE: "32768"
  LLAMA_ARG_FLASH_ATTN: auto
  LLAMA_ARG_FIT: "on"
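
The ConfigMap above only defines environment variables; a workload still has to load them. The following is a minimal sketch of how a Deployment might consume it through `envFrom`, not the manifest used in this repository. The Deployment name, labels, container image, GPU limit, and PVC name are all assumptions made for illustration; only the ConfigMap name, port 8080, and the `/models` path come from the file itself.

```yaml
# Hypothetical Deployment sketch. Everything except the ConfigMap name,
# the port, and the /models path is an assumption.
apiVersion: apps/v1
kind: Deployment
metadata:
  name: llamacpp-cuda                # assumed name
spec:
  replicas: 1
  selector:
    matchLabels:
      app: llamacpp-cuda             # assumed label
  template:
    metadata:
      labels:
        app: llamacpp-cuda
    spec:
      containers:
        - name: llama-server
          image: ghcr.io/ggml-org/llama.cpp:server-cuda   # assumed image
          envFrom:
            # Injects every key in the ConfigMap as an environment variable.
            - configMapRef:
                name: llamacpp-cuda-config
          ports:
            - containerPort: 8080    # matches LLAMA_ARG_PORT
          volumeMounts:
            - name: models
              mountPath: /models     # matches LLAMA_CACHE
          resources:
            limits:
              nvidia.com/gpu: 1      # assumes the NVIDIA device plugin is installed
      volumes:
        - name: models
          persistentVolumeClaim:
            claimName: llamacpp-models   # hypothetical PVC
```

Mounting a persistent volume at the `LLAMA_CACHE` path should let a model fetched via `LLAMA_ARG_HF_REPO` survive pod restarts rather than being downloaded again. Because ConfigMap values must be strings, numeric and boolean-looking values such as `"8080"` and `"on"` stay quoted.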