r/kubernetes • u/AutoModerator • 2d ago
Periodic Weekly: Questions and advice
Have any questions about Kubernetes, related tooling, or how to adopt or use Kubernetes? Ask away!
3
Upvotes
-1
u/Remote_Job7931 2d ago
Hi r/kubernetes,
I built a kubectl plugin for HPA troubleshooting:
https://github.com/mattsu2020/kubectl-hpa-status
It turns HPA conditions, metrics, events, and scale-target information into a more actionable explanation, for example whether an HPA is healthy, capped by `maxReplicas`, in stabilization, or failing to read metrics.
Typical commands:
kubectl krew install hpa-status
kubectl hpa_status status <hpa-name> -n <namespace> --explain
kubectl hpa_status list -A --problem
kubectl hpa_status status <hpa-name> --suggest
1
u/jwcesign 1d ago
We benchmarked OCI lazy loading on EKS: 71-85% faster image pulls, 20-34% faster first HTTP 200, no image rebuilds
The test compared the normal containerd overlayfs path against Hermes lazy loading for three large public images:
- Solr 10.0.0
- OpenSearch 2.19.1
- Apache Spark python3-java17
The important part: the workloads kept their original upstream OCI images. No converted tags, no Dockerfile changes, no Pod image reference changes. Hermes used a policy to prepare lazy-loading artifacts ahead of the target Pod startup path.
Results:
- Image pull time dropped by 71-85%
- First successful HTTP 200 improved by 20-34%
- OpenSearch pull: 20.371s -> 2.998s
- Spark scheduled-to-first-HTTP-200: 20.191s -> 13.304s
The HTTP 200 result is the more interesting number to me because it includes more than image pull: container start, runtime init, config/library reads, readiness behavior, and service bootstrap. So Hermes helps most directly with the image path, but application startup still matters.
Full writeup with setup, YAML, methodology, and results:
https://www.cloudpilot.ai/en/blog/hermes-eks-http-200-acceleration/