submitted 10 months ago by jeffw@lemmy.world to c/technology@lemmy.world
[-] AceSLS@ani.social 17 points 10 months ago

A private local LLM

Running on a phone? No way, not without being horribly slow or churning through your battery.

Good LLMs are already slow on a GTX 1080, which is miles faster than any phone out there.
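
For a rough sense of what "slow" means here, a minimal tokens-per-second check with llama-cpp-python; the GGUF file name is a placeholder for whatever quantized model you have locally, not a specific model from this thread:

```python
# Rough tokens/sec check with llama-cpp-python.
# The model path is a placeholder, not a specific model.
import time
from llama_cpp import Llama

llm = Llama(model_path="some-7b.Q4_K_M.gguf", n_ctx=2048, verbose=False)

start = time.perf_counter()
out = llm("Explain VRAM in one sentence.", max_tokens=128)
elapsed = time.perf_counter() - start

n = out["usage"]["completion_tokens"]
print(f"{n} tokens in {elapsed:.1f}s -> {n / elapsed:.1f} tok/s")
```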

[-] subtext@lemmy.world 24 points 10 months ago

I hear you, but I would also be shocked if Apple rolled this out and it turned out to be an absolutely terrible experience. Their MO is “luxury” products with “premium” experiences; shipping a piece-of-crap experience in their flagship announcement would not fit the brand.

I’m willing to give them the benefit of the doubt on this one.

[-] Eldritch@lemmy.world 11 points 10 months ago

You might wanna check with Siri on that. Apple regularly failed at this even under Jobs' leadership, and Tim Cook is no Steve Jobs. It's already looking like it's going to be just standard remote ChatGPT, hallucinations and all.

[-] habanhero@lemmy.ca 17 points 10 months ago

It's not an LLM; it's a much smaller model (~3B parameters), closer to what Microsoft labels an SLM (Small Language Model, e.g. MS Phi-3 Mini).

https://machinelearning.apple.com/research/introducing-apple-foundation-models
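
For anyone curious what that size class feels like, a minimal sketch loading the public Phi-3 Mini checkpoint with Hugging Face transformers. Note this is Microsoft's model; Apple's own ~3B model is not publicly downloadable:

```python
# Minimal sketch: run a ~3B "SLM" (Microsoft's public Phi-3 Mini checkpoint)
# with Hugging Face transformers. Apple's model is not on the Hub.
from transformers import pipeline

generate = pipeline(
    "text-generation",
    model="microsoft/Phi-3-mini-4k-instruct",
    trust_remote_code=True,  # Phi-3 ships custom model code
)

result = generate("What is a small language model?", max_new_tokens=64)
print(result[0]["generated_text"])
```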

[-] Womble@lemmy.world 5 points 10 months ago* (last edited 10 months ago)

Microsoft's penchant for making up names for things that already have names is neither here nor there. It is an LLM; in fact, it's already twice as large as GPT-2 (1.5B params).

[-] habanhero@lemmy.ca 3 points 10 months ago

I do think it's a useful distinction, considering open models can run past 100B parameters nowadays and GPT-4 is rumored to be 1.7T params. Plus, this class of model is far more likely to end up on-device.
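
The back-of-envelope math behind that: weight memory is roughly parameter count times bytes per parameter, which is why a ~3B model fits on a phone and a rumored 1.7T one never will. A quick sketch, where the 4-bit quantization width is an illustrative assumption:

```python
# Approximate weight-only memory: params x bits / 8. The 4-bit width is an
# illustrative assumption; activations and KV cache add more on top.
def weight_gb(params: float, bits: int = 4) -> float:
    return params * bits / 8 / 1e9

for name, params in [
    ("~3B on-device model", 3e9),
    ("100B open model", 100e9),
    ("GPT-4 (rumored 1.7T)", 1.7e12),
]:
    print(f"{name}: ~{weight_gb(params):.1f} GB of weights at 4-bit")
```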

[-] kill_dash_nine@lemm.ee 7 points 10 months ago

You would be surprised. If you haven't tried running an LLM on Apple silicon, it's pretty snappy; but like everywhere else, RAM can be a significant limiting factor unless the model is trimmed down to do very specific things that shrink its size.
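
To make the "snappy but RAM-bound" point concrete, a sketch of llama-cpp-python built with the Metal backend on Apple silicon. The GGUF file name is hypothetical, and a 4-bit ~3B model wants roughly 1.5–2 GB of unified memory for weights alone:

```python
# Sketch: llama-cpp-python built with the Metal backend on Apple silicon.
# The GGUF file name is hypothetical; n_gpu_layers=-1 offloads all layers.
from llama_cpp import Llama

llm = Llama(
    model_path="phi-3-mini-4k-instruct.Q4_K_M.gguf",  # hypothetical local file
    n_gpu_layers=-1,  # offload every layer to the GPU (Metal)
    n_ctx=4096,
    verbose=False,
)

out = llm("Why does RAM limit local model size?", max_tokens=64)
print(out["choices"][0]["text"])
```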

[-] felixwhynot@lemmy.world 2 points 10 months ago

I think it’s running on their “Private Cloud Compute” platform, not locally (I’m not sure, though).

[-] acosmichippo@lemmy.world 9 points 10 months ago

Some things are run locally.
