<rss xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title>GPU - Tag - vo.rs</title><link>https://vo.rs/tags/gpu/</link><description>GPU - Tag - vo.rs</description><generator>Hugo -- gohugo.io</generator><language>en</language><copyright>This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.</copyright><lastBuildDate>Thu, 09 Jan 2025 09:00:00 +0000</lastBuildDate><atom:link href="https://vo.rs/tags/gpu/" rel="self" type="application/rss+xml"/><item><title>Running AI Inference on Kubernetes: GPU Scheduling, Ollama, and Resource Sharing</title><link>https://vo.rs/story/running-ai-inference-on-kubernetes-gpu-scheduling-ollama-and-resource-sharing/</link><description>&lt;p&gt;Kubernetes was designed for a world of stateless web services you could scale by adding more identical replicas. GPUs are the opposite of that: scarce, expensive, and absolutely not interchangeable with CPU. So the moment you decide to run model inference on your cluster, you discover that Kubernetes treats your graphics card as a curious unknown — it doesn&amp;rsquo;t schedule on it, it can&amp;rsquo;t see it, and your pods come up GPU-less and confused.&lt;/p&gt;</description><pubDate>Thu, 09 Jan 2025 09:00:00 +0000</pubDate></item><item><title>Running Stable Diffusion on a Budget GPU: What Actually Works Below 8GB VRAM</title><link>https://vo.rs/story/running-stable-diffusion-on-a-budget-gpu-what-actually-works-below-8gb-vram/</link><description>&lt;p&gt;Every thread about running Stable Diffusion locally eventually arrives at the same smug conclusion: just buy a 4090. This is wonderful advice if you have a spare grand and a power supply that doesn&amp;rsquo;t sound like a hairdryer. The rest of us are sitting on a 6GB laptop card, an old GTX 1060, or a 4GB GPU that the internet has decided is e-waste. Good news: the internet is wrong, and I have spent enough late nights proving it to write this down.&lt;/p&gt;</description><pubDate>Tue, 27 Feb 2024 09:00:00 +0000</pubDate></item></channel></rss>