I am keeping this post very short and mostly photo-based. I tested the cooling performance with different games. The GPU’s rated maximum power is 72W, though during my tests it exceeded 75W; it can also be limited to 30W. I tested the GPU by running games like Black Myth: Wukong, Cyberpunk 2077, Uncharted 4: A…
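Capping the board power can also be scripted; below is a minimal sketch, assuming nvidia-smi is on the PATH and the script runs with root privileges. The 30 W figure simply mirrors the limit mentioned above and must fall within the GPU’s supported power-limit range.

```python
import subprocess

# Show the current draw and the configured limit for each GPU (CSV output).
print(subprocess.run(
    ["nvidia-smi", "--query-gpu=name,power.draw,power.limit", "--format=csv"],
    capture_output=True, text=True, check=True,
).stdout)

# Cap the board power at 30 W (requires root and a value within the supported range).
subprocess.run(["nvidia-smi", "--power-limit=30"], check=True)
```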
Category: AI/ML & vGPU
Nvidia L4: Powerful Low-Power GPU for Nvidia AI Enterprise and Virtual GPU
I’ve been searching the internet for a long time to find a versatile GPU for AI and video graphics workloads that also supports vGPU and Nvidia AI Enterprise. Some of the GPUs I considered were the RTX 6000 Ada, A2, A10, L4, T4, A40, and A16. I was most drawn to the RTX 6000 Ada…
Deploying and Configuring Nvidia DLS for AI Enterprise and vGPU: Step-by-Step Guide
NB! At the end of the blog post, there is a YouTube video and an eBook – a photo-based step-by-step guide:
Download Nvidia vGPU Drivers for ESXi
Download Nvidia vGPU License Server
Installing Nvidia vGPU Drivers on ESXi
Deploying the Nvidia DLS OVA to vSphere
Configuring Nvidia DLS (License) Server
Installing Nvidia Drivers on a Windows…
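As a rough illustration of the host-driver installation step, here is a minimal sketch run from the ESXi shell (which ships with a Python interpreter). The depot path is hypothetical and the exact bundle name depends on the vGPU release you download, so treat this as a sketch rather than the exact procedure from the guide.

```python
import subprocess

# Hypothetical datastore path for the NVIDIA vGPU host-driver depot you downloaded.
DEPOT = "/vmfs/volumes/datastore1/NVD-VGPU-depot.zip"

# Put the host into maintenance mode before installing the driver.
subprocess.run(["esxcli", "system", "maintenanceMode", "set", "--enable=true"], check=True)

# Install the vGPU host driver from the offline depot, then reboot the host.
subprocess.run(["esxcli", "software", "vib", "install", "-d", DEPOT], check=True)
subprocess.run(["reboot"], check=True)
```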
Overcoming PCIe Slot Compatibility Challenges for Nvidia Tesla P4 GPU Installation
I bought an Nvidia Tesla P4. It was an unused GPU and came with a 3D-printed cooler and fan. I played around with this GPU on my AI/ML server, and it worked fine. Then I decided to move it to my other server, which runs 24/7. The reason is simple: I have jump hosts and…
Quick and Easy Guide to Installing Meta Llama 3.1 405B, 70B, 8B Language Models with Ollama, Docker, and OpenWebUI
I will show how easy and quick it is to install Llama 3.1 405B, 70B, 8B, or another language model on your computer or VM using Ollama, Docker, and OpenWebUI. It is so simple to install that even a grandmother or grandfather could do it. This is private AI, not cloud-based. All data is on…
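To give a feel for what the finished setup exposes, here is a minimal sketch against Ollama’s local HTTP API. It assumes the ollama/ollama container is already running with its default port 11434 published on localhost and that the llama3.1:8b model has been pulled; the prompt is just an example.

```python
import json
import urllib.request

# Ollama's local HTTP API (default port published by the container).
URL = "http://localhost:11434/api/generate"

payload = {
    "model": "llama3.1:8b",   # any model you have pulled with `ollama pull`
    "prompt": "Explain what a vGPU is in one sentence.",
    "stream": False,          # return a single JSON object instead of a stream
}

req = urllib.request.Request(
    URL,
    data=json.dumps(payload).encode("utf-8"),
    headers={"Content-Type": "application/json"},
)

with urllib.request.urlopen(req) as resp:
    answer = json.loads(resp.read())

print(answer["response"])
```

OpenWebUI talks to this same local API and adds the browser chat interface on top of it.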
Meta Llama 3.1 405B: GPU vs. CPU Performance Evaluation and RAM Considerations
It’s time to start testing various Private AI models, and fortunately, the timing is just right. Meta has just released six new AI language models. These models run on-premises and do not interact with the cloud or OpenAI’s ChatGPT. Llama 3.1 405B competes with leading models like GPT-4, GPT-4o, and Claude 3.5 Sonnet, while smaller…
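As a rough back-of-envelope illustration of the RAM question (my own arithmetic, not figures from the post), the memory needed for the weights alone is roughly the parameter count times the bytes per parameter, before any KV cache or runtime overhead:

```python
# Rough weight-memory estimate: parameters x bytes per parameter.
# Real usage is higher once the KV cache and runtime overhead are added.
def weights_gb(params_billions: float, bytes_per_param: float) -> float:
    return params_billions * 1e9 * bytes_per_param / 1024**3

for model, params in [("Llama 3.1 8B", 8), ("Llama 3.1 70B", 70), ("Llama 3.1 405B", 405)]:
    fp16 = weights_gb(params, 2.0)   # 16-bit weights
    q4 = weights_gb(params, 0.5)     # ~4-bit quantization
    print(f"{model}: ~{fp16:.0f} GB at FP16, ~{q4:.0f} GB at ~4-bit")
```

At FP16 the 405B weights alone land in the high hundreds of gigabytes, which is why quantized builds are the realistic option outside a multi-GPU server.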
Solving Real Business Problems with Private AI: Unlocking Efficiency and Productivity
I’m on a mission to find a company facing a real problem that can be solved using Private AI—AI that operates entirely within your own data center, without relying on cloud services. I need your help to identify the most challenging, time-consuming, or tedious issues within your company that could greatly benefit from an AI…
Struggling with Crucial T705 NVMe and VMware ESXi 8 U3 Compatibility
I was so confident that the Crucial T705 4TB PCIe Gen5 NVMe M.2 SSD would work with VMware ESXi without any issues that I didn’t even bother to Google or check. I wanted to get it and test what it feels like and how much of a performance boost it gives when using the fastest (R:…
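For anyone hitting the same wall, a quick first check is whether ESXi detects the drive at all. Below is a minimal sketch using standard esxcli listings from the ESXi shell; nothing in it is specific to the T705.

```python
import subprocess

# List NVMe devices the host has detected, then all storage devices;
# if the SSD shows up in neither, ESXi has no working driver for it.
for cmd in (
    ["esxcli", "nvme", "device", "list"],
    ["esxcli", "storage", "core", "device", "list"],
):
    out = subprocess.run(cmd, capture_output=True, text=True, check=True)
    print(" ".join(cmd), "\n", out.stdout)
```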
AI, ML, and vGPU Server Build – Part 3: The Server Case as a Double-Decker Bus
Those two mining frames in the photo illustrate my idea well, but I plan to use a PC case and build a frame on top of it where I can easily attach different GPUs, essentially connecting 2 PC cases together. The server motherboard is secured inside the PC case, and the GPUs are openly mounted…
AI, ML, and vGPU Server Build – Part 2: Server Noise, Temperature, and Power
Noise (server not operating): I used the Tadeto SL720 Sound Level Meter for measurement. When only the ESX-Core Server, router, and switch are turned on in my room, I measured the sound and obtained the following results. My computer desk, where I sit: 34.7 dB. Cabinet on the other side of the wall: 34.7 dB. Right…