5 docs tagged with "labs"

Kubernetes LLM Labs

Hands-on Kubernetes LLM labs for vLLM inference, RAG retrieval, observability, and production readiness.

Challenge-style observability lab for Kubernetes LLM workloads covering latency, queueing, GPU saturation, traces, logs, and alerts.

Challenge-style Kubernetes LLM production readiness lab covering security, rollback, quota, cost, observability, and launch review.

Challenge-style RAG on Kubernetes lab for ingestion, vector retrieval, metadata filters, answer quality, and failure drills.

Challenge-style vLLM Kubernetes lab for GPU scheduling, model cache, OpenAI-compatible serving, probes, metrics, and failure drills.