Skip to main content

5 docs tagged with "labs"

View all tags

Kubernetes LLM Labs

Hands-on Kubernetes LLM labs for vLLM inference, RAG retrieval, observability, and production readiness.

LLM Observability Lab

Hands-on observability lab for Kubernetes LLM workloads covering latency, queueing, GPU saturation, traces, logs, and alerts.

Production Readiness Lab

Kubernetes LLM production readiness lab covering security, rollback, quota, cost, observability, and launch review.

RAG Retrieval Lab

Hands-on RAG on Kubernetes lab for ingestion, vector retrieval, metadata filters, answer quality, and failure drills.

vLLM Inference Lab

Hands-on vLLM Kubernetes lab for GPU scheduling, model cache, OpenAI-compatible serving, probes, metrics, and failure drills.