New AI model can hallucinate a game of 1993’s Doom in real time (arstechnica.com)

submitted 2 months ago by Gaywallet@beehaw.org to c/gaming@beehaw.org

29 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[-] MentalEdge@sopuli.xyz 60 points 2 months ago* (last edited 2 months ago)

"The potential here is absurd," wrote app developer Nick Dobos in reaction to the news. "Why write complex rules for software by hand when the AI can just think every pixel for you?"

"Can it run Doom?"

"Sure, do you have a spare datacenter or two full of GPUs, and perhaps a nuclear powerplant for a PSU?"

What the fuck are these people smoking. Apparently it can manage 20 fps on one "TPU" but to get there it was trained on shitload of footage of Doom. So just play Doom?!

The researchers speculate that with the technique, new video games might be created "via textual descriptions or examples images" rather than programming, and people may be able to convert a set of still images into a new playable level or character for an existing game based solely on examples rather than relying on coding skill.

It keeps coming back to this, the assumption that these models, if you just feed them enough stuff will somehow become able to "create" something completely new, as if they don't fall apart the second you ask for something that wasn't somewhere in the training data. Not to mention that this type of "gaming engine" will never be as efficient as an actual one.

[-] Telorand@reddthat.com 17 points 2 months ago

Apparently it can manage 20 fps on one "TPU" but to get there it was trained on shitload of footage of Doom. So just play Doom?!

Shhhh! Are you nuts? People are going to start realizing this is another tech bubble, like Blockchain...

[-] BadlyDrawnRhino@aussie.zone 15 points 2 months ago

To be fair, half of the AAA gaming industry is all about trying to clone the latest successful game with a new coat of paint. Maybe using AI to make these clones will mean that the talented people behind the scenes are free to explore other ideas instead.

Of course in reality, it just means that the largest publishers will lay off a whole lot of people and keep churning out these uninspired games in the name of corporate profits, but it's nice to dream sometimes.

[-] Gaywallet@beehaw.org 10 points 2 months ago

big weird flex but okay vibes except actually not okay

[-] frogman@beehaw.org 2 points 2 months ago

left me stunlocked with this one icl

[-] Even_Adder@lemmy.dbzer0.com 7 points 2 months ago

I mean, you've never seen a purple elephant with a tennis racket. None of that exists in the data set since elephants are neither purple nor tennis players. Exposure to all the individual elements allows for generation of concepts outside the existing data, even though they don't exit in reality or in the data set.

[-] MentalEdge@sopuli.xyz 14 points 2 months ago* (last edited 2 months ago)

Ok.

Try to get an image generator to create an image of a tennis racket, with all racket-like objects or relevant sport data removed from the training data.

Explain the concept to it with words alone, accurately enough to get something that looks exactly like the real thing. Maybe you can give it pictures, but one won't really be enough, you'll basically have to give it that chunk of training data you removed.

That's the problem you'll run into the second you want to realize a new game genre.

[-] Even_Adder@lemmy.dbzer0.com 6 points 2 months ago

There are more forms of guidance than just raw words. Just off the top of my head, there's inpainting, outpainting, controlnets, prompt editing, and embeddings. The researchers who pulled this off definitely didn't do it with text prompts.

[-] MentalEdge@sopuli.xyz 13 points 2 months ago* (last edited 2 months ago)

Obviously.

But at what point does that guidance just become the dataset you removed from the training data?

To get it to run Doom, they used Doom.

To realize a new genre, you'll "just" have to make that game the old fashion way, first.

[-] Even_Adder@lemmy.dbzer0.com 6 points 2 months ago

But at what point does that guidance just become the dataset you removed from the training data?

The whole point is that it didn't know the concepts beforehand, and no it doesn't become the dataset. Observations made of the training data are added to the model's weights after training, the dataset is never relevant again as the model's weights are locked in.

To get it to run Doom, they used Doom.

To realize a new genre, you’ll “just” have to make that game the old fashion way, first.

Or you could train a more general model. These things happen in steps, research is a process.

[-] MentalEdge@sopuli.xyz 7 points 2 months ago* (last edited 2 months ago)

You are completely missing what I'm saying.

I know the input doesn't alter the model, that's not what I mean.

And "general" models are only "general" in the sense that they are massively bloated and still crap at dealing with shit that they weren't trained on.

And no, "comprehending" new concepts by palette swapping something and smashing two existing things together isn't the kind of creativity I'm saying these systems are incapable of.

[-] Even_Adder@lemmy.dbzer0.com 4 points 2 months ago

What kind of creativity are you talking about then? I've also never heard of a bloated model. Which models are bloated?

[-] MentalEdge@sopuli.xyz 2 points 2 months ago* (last edited 2 months ago)

Bloated, as in large and heavy. More expensive, more power hungry, less efficient.

I already brought it up. They can't deal with something completely new.

When you discuss what you want with a human artist or programmer or whatever, there is a back and forth process where both parties explain and ask until comprehension is achieved, and this improves the result. The creativity on display is the kind that can unfold and realize a complex idea based on simple explanations even when it is completely novel.

It doesn't matter if the programmer has played games with regenerating health before, one can comprehend and implement the concept based on just a couple sentences.

Now how would you do the same with a "general" model that didn't have any games that work like that in the training data?

My point is that "general" models aren't a thing. Not really. We can make models that are really, really big, but they remain very bad at filling in gaps in reality that weren't in the training data. They don't start magically putting two and two together and comprehending all the rest.

[-] Even_Adder@lemmy.dbzer0.com 1 points 2 months ago

Do you have any examples of how they fail? There are plenty of ways to explain new concepts to models.

https://arxiv.org/abs/2404.19427 https://arxiv.org/abs/2406.11643 https://arxiv.org/abs/2403.12962 https://arxiv.org/abs/2404.06425 https://arxiv.org/abs/2403.18922 https://arxiv.org/abs/2406.01300

[-] MentalEdge@sopuli.xyz 2 points 2 months ago* (last edited 2 months ago)

In a couple sentences? In a way that doesn't approach, equal or exceed the effort of training the model with that data to begin with?

You insist these models can do new things out of nothing, and you keep saying "all you have to do, is give them something".

[-] Even_Adder@lemmy.dbzer0.com 2 points 2 months ago

You keep moving the goal posts and putting words in my mouth. I never said you can do new things out of nothing. Nothing I mentioned is approaching, equaling, or exceeding the effort of training a model.

You haven't answered a single one of my questions, and you are not arguing in good faith. We're done here. I can't say it's been a pleasure.

[-] MentalEdge@sopuli.xyz 1 points 2 months ago* (last edited 2 months ago)

My argument was and is that neural models don't produce anything truly new. That they can't handle things outside what is outlined by the data they were trained on.

Are you not claiming otherwise?

You say it's possible to guide models into doing new things, and I can see how that's the case, especially if the model is a very big one, meaning it is more likely that it has relevant structures to apply to the task.

But I'm also pretty damn sure they have insurmountable limits. You can't "guide" and LLM into doing image generation, except by having it interact with an image generation model.

this post was submitted on 28 Aug 2024

48 points (100.0% liked)

Gaming

30566 readers

105 users here now

From video gaming to card games and stuff in between, if it's gaming you can probably discuss it here!

Please Note: Gaming memes are permitted to be posted on Meme Mondays, but will otherwise be removed in an effort to allow other discussions to take place.

See also Gaming's sister community Tabletop Gaming.

This community's icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

founded 2 years ago

MODERATORS

alyaza@beehaw.org

TheRtRevKaiser@beehaw.org

gyrfalcon@beehaw.org

Whom@beehaw.org

The_Hunted_One@beehaw.org

chloyster@beehaw.org

remington@beehaw.org