13

Conducting deep web searches and gathering sources is one of the main things I've been using LLMs for. How far away are we from being able to self-host something like Claude's web search capabilities? Or even just a service where I'd pay with my money instead of my data?

you are viewing a single comment's thread
view the rest of the comments
[-] avidamoeba@lemmy.ca 3 points 2 days ago

Open WebUI + SearXNG + llama.cpp + Qwen 3.6 35B + 16-32GB GPU. Gives you 256K context and runs with 80-100tps on 3090. If you have less VRAM like 16GB it'll be slower but still probably tens of tps on anything recent. I run it on AMD Pro 9700 which is about as fast as 3090.

this post was submitted on 21 Jun 2026
13 points (100.0% liked)

Self Hosted - Self-hosting your services.

20016 readers
4 users here now

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don't control.

Rules

Important

Cross-posting

If you see a rule-breaker please DM the mods!

founded 5 years ago
MODERATORS