88
submitted 2 months ago* (last edited 2 months ago) by nikodindon@lemmy.world to c/selfhosted@lemmy.world

I just pushed v22 of my project : a local AI companion for Radarr, that goes beyond generic genre or TMDb lists.

This isn't "yet another recommender". It's your personal taste explorer that actually gets the vibe you want in natural language and builds recommendations starting from your existing library.

Key highlights from a real recent run:

  • Command: --mood "dystopian films like Idiocracy, Gattaca or In Time"
  • Output: Metropolis (1927), V for Vendetta, Children of Men, Brazil (1985), Minority Report, Dark City, Equilibrium, Upgrade, The Road... → oppressive/surveillance/inequality/societal critique atmosphere, not just "dark sci-fi".

How it works :

  • Starts by sampling random movies from your Radarr collection (or uses your mood/like/saga input).
  • Asks a local Ollama LLM (e.g. mistral-small:22b) for 25 thematic suggestions based on atmosphere/vibe.
  • Validates each via OMDb (IMDb rating, genres, plot, director, cast...).
  • Scores intelligently: IMDb rating + genre match + director/actor bonus + plot embedding similarity (cosine on Ollama embeddings).
  • Adds the top ones directly to Radarr (with confirmation: all / one-by-one / no).
  • Persistent blacklist to avoid repeats.

Different modes :

  • --mood "dark psychological thrillers with unreliable narrators" , any vibe you describe
  • --like "Parasite" --mood "mind-bending class warfare" (or just --like "Whiplash")
  • --saga (auto-detects incomplete sagas in your library and suggests missing entries) or --saga "Star Wars"
  • --director "Kubrick" / --actor "De Niro" / --cast "Pacino De Niro" (movies where they co-star)
  • --analyze → full library audit + gaps (e.g. "You're missing Kurosawa classics and French New Wave")
  • --watchlist → import from Letterboxd/IMDb
  • --auto → perfect for daily cron / Task Scheduler (wake up to 10 fresh additions)

Standout features:

  • 100% local + privacy-first (Ollama + free OMDb API only)
  • No cloud AI, no tracking
  • colored console output, logs, stats, HTML/CSV exports
  • Synopsis preview before adding
  • Configurable quality profile, min IMDb, availability filters
  • Works on Windows, Linux, Mac

GitHub (clean single-file Python script + detailed README):
https://github.com/nikodindon/radarr-movie-recommender

If you're tired of generic Discover lists, Netflix-style randomness, or manual hunting give it a spin. The vibe/mood mode + auto saga completion really change how you expand your collection.

Let me know what you think, any weird mood examples you'd like to test, or features you'd want added!

top 50 comments
sorted by: hot top controversial new old
[-] circuscritic@lemmy.ca 54 points 2 months ago* (last edited 2 months ago)

Since no one is leaving critical comments that might explain all the downvotes, I'm going to assume they're reflexively anti-AI, which frankly, is a position that I'm sympathetic to.

But one of the benign useful things I already use AI for, is giving it criterias for shows and asking it to generate lists.

So I think your project is pretty neat and well within the scope of actually useful things that AI models, especially local ones, can provide the users.

[-] webkitten@piefed.social 22 points 2 months ago

Seriously; local AI use is what everyone should strive for not only for privacy but because it's better than using a large data centre and the power use for Ollama is negligible.

[-] eksb@programming.dev 10 points 2 months ago* (last edited 2 months ago)

No LLM use is benign. The effects on the environment, the internet, and society are real, and that cannot be ignored.

You can make the argument that in some cases it is justified, e.g.: for scientific research.

[-] eager_eagle@lemmy.world 16 points 2 months ago* (last edited 2 months ago)

chill, this is extracting text embeddings from a local model, not generating feature-length films

that's like saying "no jet use is benign" meant for comparing a private jet to a jet-ski

the generative aspect is not even used here

[-] Mordikan@kbin.earth 13 points 2 months ago

Saw it was already commented about CO2, so I thought I'd counter-point your environment claim regarding water usage (since that is something I've seen a lot of too).

The ISSA had a call to action due to the AI water use "crisis": https://www.issa.com/industry-news/ai-data-center-water-consumption-is-creating-an-unprecedented-crisis-in-the-united-states/

68 billion gallons of water by 2028! That's a lot....right? Well, what I found is that this is somewhat of a bad faith argument. 68 billion gallons annually is a lot for one town, but those are numbers from a national level and it isn't compared to usage from anything else. So, lets look at US agriculture (that's something that's tracked very well by the USDA): https://www.nass.usda.gov/Publications/Highlights/2024/Census22_HL_Irrigation_4.pdf

That's 26.4 trillion gallons of water annually. So, AI datacenter represents 0.26% of agriculture consumption. If AI datacenter consumption is a crisis, why is agriculture consumption not a crisis? You could argue that agriculture produces "something useful", but usefulness doesn't factor into the scarcity of a resource. So, either its not a crisis, or you are cherry picking something that has no meaningful outcome to solving the problem.

[-] eager_eagle@lemmy.world 7 points 2 months ago* (last edited 2 months ago)

yeah, I think the whole "water" argument really dilutes the case against data centers.

On a serious note, the argument works for areas that already struggle to supply enough water for consumers. Otherwise, we should be focusing more on the power stress to the grid, and the domino effect on supply chain of hardware cost increases that it's happening across many industries. It started with GPUs, now it's CPU, storage, networking equipment, and other components.

If these prices are too high for a couple of years, we'll start seeing generalized price increases as companies need to pass along the costs to consumers.

[-] Mordikan@kbin.earth 3 points 2 months ago

I think the supply chain issue is probably the most pressing out of all of them. The other points people have are either non-issues or a result of dropping usage hogs into existing electrical infrastructure. Infrastructure can be updated, though.

Supply chain is different. There isn't a supply shortage of chips, its that profitability dictates you should sell them to datacenters or adjacent industry. Unlike infrastructure where you can just build out more, adding more supply for chips just means you have more to sell to datacenters. Since the demand is there, end of day profits will always win.

[-] irmadlad@lemmy.world 13 points 2 months ago

The effects on the environment

Didn't down vote you. I hear this line of complaint in conjunction with AI, especially if the person saying it is anti-AI. Without even calculating in AI, some 25 million metric tons of CO2 emissions annually from streaming and content consumption. Computers, smartphones, and tablets can emit around 200 million metric tons CO2 per year in electrical consumption. Take data centers for instance. If they are powered by fossil fuels, this can add about 100 million metric tons of CO2 emissions. Infrastructure contributes around 50 million metric tons of CO2 per year.

Now.....who wants to turn off their servers and computers? Volunteers? While it is true that AI does contribute, we're already pumping out some significant CO2 without it. Until we start switching to renewable energy globally, this will continue to climb with or without AI. It seems tho, that we will have to deplete the fossil fuel supply globally before renewables become the de facto standard.

[-] circuscritic@lemmy.ca 10 points 2 months ago

So running a local model is unforgivable, but "scientific research" running on hyperscalers, can be justified?

[-] prenatal_confusion@feddit.org 5 points 2 months ago

Do You drive a car? Eat meat? Fly for holidays?

Nothing is black and white.

[-] u_tamtam@programming.dev 8 points 2 months ago

LLMs are not the tool for a recommender job

[-] FerCR@kbin.earth 7 points 2 months ago

The local LLM here is, if I'm not mistaken @nikodindon@lemmy.world , just used as a feature extraction tool. It's not like asking ChatGPT what to watch next but rather asking it to sumarise the movie as an excel file, that you then process to compute which movie(s) is(are) similar.

[-] illusionist@lemmy.zip 3 points 2 months ago

Huh? There are other ways to link similarities of movies without the use of a llm. You may use ai to find similar movies but it's nonsense that everyone has to ask a llm to link movies.

[-] eager_eagle@lemmy.world 10 points 2 months ago

no one is saying everyone has to ask an LLM for movie recommendations

[-] illusionist@lemmy.zip 3 points 2 months ago

OP wrote a python script that call a llm to ask for a recommendation.

But you are right, op doesn't say that everyone shall do it

[-] eager_eagle@lemmy.world 8 points 2 months ago* (last edited 2 months ago)

No, it also doesn't do that. It gets embeddings from an LLM and uses that to rank candidates.

load more comments (4 replies)
[-] eager_eagle@lemmy.world 23 points 2 months ago

that's pretty cool, this is just the wrong crowd, don't worry about the downvotes

load more comments (1 replies)
[-] meldrik@lemmy.wtf 14 points 2 months ago

This is a cool tool. Thanks for sharing. Don’t worry about the downvotes. The Fediverse has a few anti-AI zealots who love to brigade.

load more comments (1 replies)
[-] pfr@piefed.social 11 points 2 months ago

Anti-AI evangelism is at its peak rn.

load more comments (5 replies)
[-] timestatic@feddit.org 10 points 2 months ago

Sorry OP that you're getting downvote bombed. This is actually really neat. People go nuts when they hear AI but this is fully local so I think this reaction is unjust. This has nothing to do with ram prices since that stems from data centers or corpos pushing AI on you. Thank you for sharing

[-] nikodindon@lemmy.world 2 points 2 months ago

thanks for your comment :) I edited my original post to explain the project better

[-] Scrath@lemmy.dbzer0.com 10 points 2 months ago

I remember building something vaguely related in a university course on AI before ChatGPT was released and the whole LLM thing hadn't taken off.

The user had the option to enter a couple movies (so long as they were present in the weird semantic database thing our professor told us to use) and we calculated a similarity matrix between them and all other movies in the database based on their tags and by putting the description through a natural language processing pipeline.

The result was the user getting a couple surprisingly accurate recommendations.

Considering we had to calculate this similarity score for every movie in the database it was obviously not very efficient but I wonder how it would scale up against current LLM models, both in terms of accuracy and energy efficiency.

One issue, if you want to call it that, is that our approach was deterministic. Enter the same movies, get the same results. I don't think an LLM is as predictable for that

[-] LiveLM@lemmy.zip 5 points 2 months ago

One issue, if you want to call it that, is that our approach was deterministic. Enter the same movies, get the same results. I don't think an LLM is as predictable for that

Maybe lowering the temperature will help with this?
Besides, a tinge of randomness could even be considered a fun feature.

[-] four@lemmy.zip 3 points 2 months ago

I'm not an expert, but LLMs should still be deterministic. If you run the model with 0 creativity (or whatever the randomness setting is called) and provide exactly the same input, it should provide the same output. That's not how it's usually configured, but it should be possible. Now, if you change the input at all (change order of movies, misspell a title, etc) then the output can change in an unpredictable way

[-] hendrik@palaver.p3x.de 3 points 2 months ago* (last edited 2 months ago)

Yes. I think determinism a misunderstood concept. In computing, it means exact same input leads to always the same output. Could be a correct result or entirely wrong, though. As long as it stays the same, it's deterministic. There's some benefit in introducing randomness to AI. But it can be run in an entirely deterministic way as well. Just depends on the settings. (It's called "temperature".)

[-] fubarx@lemmy.world 9 points 2 months ago

The more local inference, the better. Nice work!

load more comments (1 replies)
[-] Overspark@piefed.social 7 points 2 months ago

A recommendation for Moonrise Kingdom based on Mickey 17? The genres might match, but those are totally different movies.

[-] Janx@piefed.social 5 points 2 months ago

Also, A Bug's Life from Mickey 17!?

load more comments (1 replies)
[-] november@piefed.blahaj.zone 7 points 2 months ago

Y'all need LLMs to tell you what to watch, now?

[-] webkitten@piefed.social 4 points 2 months ago

Is it any different that getting movies based on recommendations from employees at video stores?

[-] november@piefed.blahaj.zone 2 points 2 months ago

Yes, LLMs are different from human beings, what kind of question is this?

load more comments (2 replies)
[-] Somecall_metim@lemmy.dbzer0.com 5 points 2 months ago
[-] RIotingPacifist@lemmy.world 3 points 2 months ago

How does this compare to an ML approach?

are you training or just using an LLM for this?

[-] eager_eagle@lemmy.world 8 points 2 months ago* (last edited 2 months ago)

There's no training, the LLM embeddings are used to compare the plots via a cosine similarity, then a simple weighted score with other data sources is used to rank the candidates. There's no training, evaluation, or ground-truth, it's just a simple tool to start using.

load more comments (1 replies)
[-] hendrik@palaver.p3x.de 3 points 2 months ago

Did you build it, though, or did Claude code it?

[-] prettygorgeous@aussie.zone 12 points 2 months ago

Built with Claude by the looks of things. Not sure if Claude was used to generate the boilerplate and whether the dev reviewed it after or whether Claude did all of it, but definitely Claude was used for some of it. I recognise the coding style that Claude outputs and the bugs that it implements that will cause TypeErrors if not handled.

FWIW, I'm not against using AI as an assistant for coding (I do it too, using Claude and Vercel as assistants) just as long as the code is reviewed and understood in full by the dev before publishing.

[-] irmadlad@lemmy.world 4 points 2 months ago

FWIW, I’m not against using AI as an assistant for coding (I do it too, using Claude and Vercel as assistants) just as long as the code is reviewed and understood in full* by the dev before publishing. *my emphasis

A very sane take. I do wish devs would fully disclose this on their github or other. That way, if the project is seasoned, well starred, et al, and the dev used AI as an assistant, then the user gets to decide. Given all the criteria are met, I would deploy it.

I will say that I have observed what seems like a pretty decent up tick in selfhosted apps, and I would be willing to bet a goodly amount of them have at the very least, used AI in some capacity, if not most/all code. I don't have any solid evidence to back that up but it just seems that way to me.

[-] prettygorgeous@aussie.zone 8 points 2 months ago* (last edited 2 months ago)

I think the problem is a cyclical one. Some devs are afraid to admit that they used AI to help them code because there's so much hatred towards using AI to code. But the hatred only grows because some devs are not disclosing that they've had help from AI to code and it seems like they're hiding something which then builds distrust. And of course, that's not helped by the influx of slop too where an AI has been used and the code has not been reviewed and understood before its released.

I don't mind more foss projects, even if they're vibe coded, but please PLEASE understand your code IN FULL before releasing it, if at least so you can help troubleshoot the bugs people experience when they happen!

[-] irmadlad@lemmy.world 5 points 2 months ago

Some devs are afraid to admit that they used AI to help them code because there’s so much hatred towards using AI to code.

I would say there is a lot of truth to that statement. The backlash is immediate and punishing. I've said before, I think there are a lot of young devs who would like to contribute to the opensource/selfhosting community, but lack the experience.

load more comments (2 replies)
[-] november@piefed.blahaj.zone 4 points 2 months ago

Some devs are afraid to admit that they used AI to help them code because there’s so much hatred towards using AI to code.

Cowards. "Some devs" would not survive five minutes in the real world as a queer person.

[-] hendrik@palaver.p3x.de 4 points 2 months ago* (last edited 2 months ago)

Yeah. Maybe it's time to adopt some new rule in the selfhosted community. Mandating disclosure. Because we got several AI coded projects in the last few days or weeks.

I just want some say in what I install on my computer. And not be fooled by someone into using their software.

I mean I know why people deliberately hide it, and say "I built ..." when they didn't. Because otherwise there's an immediate shitstorm coming in. But deceiving people about the nature of the projects isn't a proper solution either. And it doesn't align well with the traditional core values of Free Software. I think a lot of value is lost if honesty (and transparency) isn't held up anymore within our community.

[-] timwa@lemmy.snowgoons.ro 3 points 2 months ago

Warning, anecdote:

I was unexpectedly stuck in Asia for the last month (because of the impact of the war), turning an in-person dev conference I was organising into an "in-person except for me" one at a few days notice.

I needed a simple countdown timer/agenda display I could mix into the video with OBS; a simple requirement, so I tried a few from the standard package repos (apt, snap store, that kind of thing.)

None of them worked the way I wanted or at all - one of them written in Python installed about 100 goddamned dependencies (because, Python,) and then crashed because, well, Python.

So I gave up and asked my local hosted LLM model to write it for me in Rust. In less than 10 minutes I had exactly what I wanted, in a few hundred lines of Rust. And yeah, I did tidy it up and publish it to the snap store as well, because it's neat and it might help someone else.

Which is more secure? The couple of hundred lines of Rust written by my LLM, or the Python or node.js app that the developer pinky-promises was written entirely by human hand, and which downloads half the Internet as dependencies that I absolutely am not going to spend time auditing just to display a goddamned countdown clock in a terminal window?

The solution to managing untrusted code isn't asking developers for self-declared purity test results. It's sandboxing, containers, static analysis... All the stuff that you are doing already with all the code/apps you download if you're actually concerned. You are doing those things, right?

load more comments (1 replies)
load more comments (1 replies)
[-] timwa@lemmy.snowgoons.ro 3 points 2 months ago

Honestly, any developer that isn't using an LLM as an assistant these days is an idiot and should be fired/shunned as such; it's got all the rational sense of "I refuse to use compilers and I hand-write my assembly code in vi."

(And I speak as someone who has a .emacs file that's older than most programmers alive today and finally admitted I should stop using csh as my default shell this year.)

Here's the disclosure you need: all projects you see have involved AI somewhere, whether the developers like to admit it or not. End of. The genie is out of the bottle, and it's not going back in. Railing against it really isn't going to change anything.

[-] irmadlad@lemmy.world 4 points 2 months ago

Here’s the disclosure you need: all projects you see have involved AI somewhere, whether the developers like to admit it or not. End of. The genie is out of the bottle, and it’s not going back in. Railing against it really isn’t going to change anything.

I've said it before, AI is here to stay. It's not a fad. Kind of like when the internet first started to become publicly available. Lots of people deemed it a fad. It's now a global phenom and it is the basis by which we do business on the daily, minute by minute, globally. I do think that AI needs some heavy governmental regulation. It would be great if we could all play nicely together without involving the government(s). Alas, we don't seem to be able to do that, and so, government(s) has to step in, unfortunately. The problem with that is, imho, surveillance capitalism has worked so well that governments also want to take a peek at that data too. I have nothing to back up that conspiracy theory, it's just a feeling I get.

load more comments (4 replies)
load more comments
view more: next ›
this post was submitted on 20 Mar 2026
88 points (100.0% liked)

Selfhosted

59481 readers
1265 users here now

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don't control.

Rules:

  1. Be civil: we're here to support and learn from one another. Insults won't be tolerated. Flame wars are frowned upon.

  2. No spam posting.

  3. Posts have to be centered around self-hosting. There are other communities for discussing hardware or home computing. If it's not obvious why your post topic revolves around selfhosting, please include details to make it clear.

  4. Don't duplicate the full text of your blog or github here. Just post the link for folks to click.

  5. Submission headline should match the article title (don’t cherry-pick information from the title to fit your agenda).

  6. No trolling.

  7. No low-effort posts. This is subjective and will largely be determined by the community member reports.

Resources:

Any issues on the community? Report it using the report flag.

Questions? DM the mods!

founded 3 years ago
MODERATORS