Any news about the release of the new AI Chat? (lemmy.world)

submitted 1 day ago by DemifiendQueen@lemmy.world to c/perchance@lemmy.world

There are rumors about a new AI Chat coming soon based on Llama 3. Is that true? Don't get me wrong, the current AI Chat is great, but its goldfish memory ruins any chance to roleplay with an elaborate lore or world-building, not to mention the constant repetition of expressions like "maybe, just maybe" and so on.

[-] DemifiendQueen@lemmy.world 1 points 18 hours ago

Thanks! I can't wait. Just imagine going from 4k tokens to 128k tokens. This is gonna revolutionize free chatbots.

[-] Mattias@lemmy.world 2 points 17 hours ago

I mean, if it's just Llama 3, it will have a context window of 8k tokens, which is not a big uplift over Llama 2 and its minuscule 4k window. I heard Llama 3 is very impressive but held back by its small context window. If it's Llama 3.3, then yes, its 128k context window would be a game changer.

[-] DemifiendQueen@lemmy.world 2 points 17 hours ago

Well, I hope it's 3.3, after two years of waiting, and with the image generators already using Chroma and Flux. I can't complain about a free service anyway. Plain Llama 3 would be a bittersweet victory: having 2x the context will surely improve the roleplay experience, but it will still be impossible to create lore and world-building unless we have at least 50-60k context tokens. And yes, the current model is amazing, but it's being held back by the context window; you cannot create deep lore or world-building, and worst of all, the chat forgets key events after around 3-4 pages of convo, so you have to rely on summaries that miss A LOT of important elements.

[-] GrumblePuss@lemmy.world 2 points 12 hours ago

Context tokens are not directly based on the model unless talking about very large token counts. Context tokens come from the amount of memory it is given to work with, which costs more to host. Some models (not any of the Llama models) are better at managing this memory, but it still uses a lot of resources to host a large token context.
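To put rough numbers on the hosting cost, here is a back-of-the-envelope sketch of the key/value cache a server has to hold per conversation. The default shape values (32 layers, 8 KV heads, head size 128, 16-bit values) are the published Llama 3 8B configuration and are only an assumption here; nothing in this thread says which model or precision Perchance actually runs, and `kv_cache_bytes` is a made-up helper name.

```python
def kv_cache_bytes(context_tokens, n_layers=32, n_kv_heads=8,
                   head_dim=128, bytes_per_value=2):
    """Estimate KV-cache size for one conversation.

    Defaults are ASSUMED (Llama 3 8B shape, fp16 values); the model
    actually hosted by Perchance is not stated in the thread.
    """
    # Factor 2: each layer stores one key and one value vector per token.
    per_token = 2 * n_layers * n_kv_heads * head_dim * bytes_per_value
    return context_tokens * per_token


for tokens in (4096, 8192, 131072):
    gib = kv_cache_bytes(tokens) / 2**30
    print(f"{tokens:>7} tokens -> {gib:.2f} GiB of KV cache")
```

Under those assumptions the cache grows linearly with context length, on top of the memory for the model weights themselves, and it is needed for every active conversation at once. That is why a longer window is more expensive to host even when the model supports it.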

[-] Mattias@lemmy.world 2 points 14 hours ago

I agree. People have been waiting for over a year now since one of Perchance's Reddit mods said it would be updated; I think the mod's comment goes back to early/mid 2024. Even if it's Llama 3, it will be a welcome change, but I feel that Llama 3.3 would provide future-proofing until the dev updates to Llama 4 or 5 two or three years from now, because of its superior context window size. The rapid progress of AI is terrifying; it's like going from 8 GB of storage all the way to 1 TB in a flash. Don't be surprised if upcoming models (not just Llama, all LLMs in general) have context windows of billions and billions of tokens, which will probably be enough to write someone's entire life story.

[-] DemifiendQueen@lemmy.world 1 points 10 hours ago

Yeah, there are already models with millions of tokens like Gemini. I've been thinking about that a lot lately, and I think that will be fucking amazing. Imagine losing yourself in the world of your favourite game or novel, interacting with all the characters, creatures, and places. I think I will become a crazy old cat lady lol

this post was submitted on 07 Aug 2025

6 points (100.0% liked)
