Once Pods exist, two questions follow: where should each Pod run, and how many should there be? L06 covers both: the scheduling primitives (where Pods land) and the scaling family (how many Pods run and how much each one gets).
Taints and tolerations — keeping Pods off specific nodes, or letting them onto tainted ones
Node affinity / pod anti-affinity — schedule based on labels
Topology spread constraints — spread replicas across zones/nodes
PriorityClass and preemption — the only signal the scheduler uses to evict a lower-priority Pod
Scheduling gates — hold a Pod back from scheduling until an external signal
Resource requests vs limits — what each does, QoS classes, cgroups, the limits debate
HPA (scales replicas horizontally), VPA (resizes requests vertically), Cluster Autoscaler + Karpenter (add nodes), KEDA (event-driven scaling) — what each does and how they fit together
PodDisruptionBudgets — keep services available during voluntary disruption
Restart policies — Always, OnFailure, Never and when each applies
Extended resources — GPUs, FPGAs, and the device plugin model
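
As a small preview of the requests vs limits item, the QoS class of a Pod follows from how its containers set CPU and memory requests and limits. The sketch below is an illustration only, not the kubelet's code: `qos_class` is a hypothetical helper, containers are modelled as plain dicts, and quantities are compared as raw strings (so `"500m"` and `"0.5"` would wrongly count as different).

```python
def qos_class(containers):
    """Return the QoS class for a Pod, given its containers.

    Each container is a dict with optional 'requests' and 'limits' maps,
    e.g. {"requests": {"cpu": "250m"}, "limits": {"memory": "256Mi"}}.
    Simplification (assumption): quantities are compared as strings.
    """
    resources = ("cpu", "memory")
    any_set = False      # does any container set any request or limit?
    guaranteed = True    # stays True only if every container qualifies

    for container in containers:
        requests = container.get("requests", {})
        limits = container.get("limits", {})
        for resource in resources:
            req = requests.get(resource)
            lim = limits.get(resource)
            if req is not None or lim is not None:
                any_set = True
            # Guaranteed needs a limit for both resources, with the request
            # equal to it (an omitted request defaults to the limit).
            if lim is None or (req is not None and req != lim):
                guaranteed = False

    if not any_set:
        return "BestEffort"
    return "Guaranteed" if guaranteed else "Burstable"


if __name__ == "__main__":
    pod = [{"requests": {"cpu": "250m", "memory": "256Mi"},
            "limits": {"memory": "256Mi"}}]
    print(qos_class(pod))
```

The example Pod sets a memory limit but no CPU limit, so it is classified as Burstable; setting a matching limit for both resources on every container is what moves a Pod to Guaranteed.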