Cookie Consent by Free Privacy Policy Generator Update cookies preferences 📌 Setting Up an Ollama + Open-WebUI Cluster


📚 Setting Up an Ollama + Open-WebUI Cluster


💡 Newskategorie: Programmierung
🔗 Quelle: dev.to

Why?

Having set up an Ollama + Open-WebUI machine in a previous post I started digging into all the customizations Open-WebUI could do, and amongst those was the ability to add multiple Ollama server nodes. This got me thinking about setting up multiple Ollama, and eventually Open-WebUI, nodes to load and share the work and make an internal cloud or cluster of sorts.

Before we build a cluster we first need a stable node(server/instance). We'll start by creating a BoM(Bill of Materials) to test. Here's my starter list(NOTE: This is not a shopping list, you'll see why in a moment).

Experimenting

The first thing I did was install Ubuntu 22.04 LTS on an external drive for testing, in the future this should be an onboard drive(SATA/NVME). This was fairly straightforward. After installing all updates I installed Ollama and OpenWebUI, see my post on setting that up here. After installing and testing we now had a base to start from. Getting the GPU to work was a little less straightforward.

Powering this card was not going to work with the provided SATA to 6 Pin adapter, this was because the laptop drives these Tiny's use only outputs 3v and 5v out, this is where that USB C PD supply comes in. This part is relatively straightforward, cut the SATA port end off, tie the yellow to positive and black to negative, NOTE: before plugging anything into the PCIe adapter plug in the USB C PD supply and use the button to select 12v, it will remember this going forward.

Fitting the Mini PCIe end into the Lenovo Tiny required cutting back the plastic strain relief on the cable, and snapping off the extension tabs on the PCB(perforation dots are on the bottom indicating where to break/cut)

Now that the card is connected, it was time to install the drivers. You can do this through the driver download page from Nvidia or using Ubuntu's method, be sure to use the 550 drivers. After I would recommend re-installing Ollama to ensure it sees the Nvidia Card. If you run into issues with it seeing the card check cables and connectors, make sure the power for the card is on, and run the following to see if the card is listed.

sudo lspci | grep nvidia

Results?

While I was able to see the Nvidia card in lspci and Ollama, and it was working through the GPU, it was as slow, if not slightly slower, than CPU-only mode. This is likely because there aren't enough PCIe lanes for this to make a meaningful benefit. That said, I still learned that we can adapt the card, power it externally, and see it in Ubuntu. I also learned that putting this card in an HP ProDesk 600 G2 SFF would not boot, this is likely due to not enough power from the built-in power supply.

Coming up in this series I'll be testing more hardware and looking for used hardware that gets us full use of our GPU and still be lite, power efficient, and still be scalable. Till next time, fair winds and following seas.

...



📌 Supercharge Your Productivity with Ollama + Open Web UI and Large Language Models


📈 25.16 Punkte

📌 Meet Verba 1.0: Run State-of-the-Art RAG Locally with Ollama Integration and Open Source Models


📈 25.16 Punkte

📌 Preisgabe von Informationen in open-cluster-management (Red Hat)


📈 15.86 Punkte

📌 New Relic open sources Pixie, its Kubernetes-native in-cluster observability platform


📈 15.86 Punkte

📌 Pause für Cluster, Open Source und Optimieren der Auslastung bei Crate.io - DataCenter-Insider


📈 15.86 Punkte

📌 Im Test: Thomas-Krenn Open-E RA1112 Metro Cluster


📈 15.86 Punkte

📌 Im Test: Thomas-Krenn Open-E RA1112 Metro Cluster


📈 15.86 Punkte

📌 Im Test: Thomas-Krenn Open-E RA1112 Metro Cluster


📈 15.86 Punkte

📌 Comparative Analysis of Open Source Cluster Management Systems: Kubernetes vs. Apache Mesos


📈 15.86 Punkte

📌 k0smotron: Open-source Kubernetes cluster management


📈 15.86 Punkte

📌 Im Test: Thomas-Krenn Open-E RA1112 Metro Cluster


📈 15.86 Punkte

📌 Bioshock 4: Neueste Stellenausschreibungen deuten auf Open World-Setting hin


📈 13.77 Punkte

📌 Open World mit Sci-Fi-Setting angeblich bei PUBG-Entwickler in Arbeit


📈 13.77 Punkte

📌 BioShock 4: Hinweise auf ein Open-World-Setting verdichten sich


📈 13.77 Punkte

📌 Villa: Setting new expectations for open source maintainers


📈 13.77 Punkte

📌 CVE-2015-7385 | Open-Xchange OX Guard up to 2.0.0-rev10 Guard PGP Setting cross site scripting (ID 134415 / ID 1034166)


📈 13.77 Punkte

📌 CVE-2022-3139 | We're Open Plugin up to 1.41 on WordPress Setting cross site scripting


📈 13.77 Punkte

📌 Setting up your GitHub Repository for Open Source Development


📈 13.77 Punkte

📌 Microsoft reveals the quickest way to open the Sound output quick setting


📈 13.77 Punkte

📌 New video as part of the Open Science MOOC: An introduction to Open Research Software and Open Source. Surprisingly human.


📈 12.82 Punkte

📌 Internet der Dinge: Das Open Internet Consortium wird zur Open Connectivity Foundation


📈 8.54 Punkte

📌 Open Translators to Things: An Open Approach for Accessing Similar Things


📈 8.54 Punkte

📌 Open-Xchange OX AppSuite 7.8.0 XSS / Open Redirect


📈 8.54 Punkte

📌 Internet der Dinge: Das Open Internet Consortium wird zur Open Connectivity Foundation


📈 8.54 Punkte

📌 Open Translators to Things: An Open Approach for Accessing Similar Things


📈 8.54 Punkte

📌 Open-Xchange OX AppSuite 7.8.0 XSS / Open Redirect


📈 8.54 Punkte

📌 Open Government: Ruf nach Open-Data-Ökosystem bis 2030


📈 8.54 Punkte











matomo