AI General Thread

Legend · Mar 14, 2026, 05:52 PM

Quote from: the-Pi-guy on Mar 14, 2026, 05:35 PMFor some reason, running local models feels goofier than it should be. And I'm not sure why. Like for some reason, prompt adherence is worse when I'm using different Android app.

Not sure if they're formatting the requests differently, or passing different default values for the model.

Different sampling method/temperature?

the-pi-guy · Mar 14, 2026, 06:46 PM

Quote from: Legend on Mar 14, 2026, 05:52 PMDifferent sampling method/temperature?

Most of the apps let you change the temperature, top K, top P.

I'm leaning towards formatting being different, which would be harder to fix as an end user.

the-pi-guy · Mar 14, 2026, 08:45 PM

LM Studio has so many options you can mess with.

You can even set how the context window is managed (truncate middle, rolling).

Legend · Mar 14, 2026, 09:16 PM

Quote from: the-Pi-guy on Mar 14, 2026, 08:45 PMLM Studio has so many options you can mess with.

You can even set how the context window is managed (truncate middle, rolling).

Is there a way to run two models in parallel so that the next token can be sampled using both their outputs? Like averaged or multiplied or whatever?

the-pi-guy · Mar 15, 2026, 02:23 AM

Quote from: Legend on Mar 14, 2026, 09:16 PMIs there a way to run two models in parallel so that the next token can be sampled using both their outputs? Like averaged or multiplied or whatever?

Not that I know of.

the-pi-guy · Mar 17, 2026, 02:41 AM

Something that I've really wanted to do is combine two bots.

I don't think the bots are actually smart enough to do this.
I just tried something with copilot, and it kind of can do it as a single instance. But it felt sloppy.

But it would be cool to have two bots - for example have one of them be a Dungeon Master and have one be a player. Where the latter one has no knowledge of the other's thoughts.

You could probably do it with a single instance, but I think chances are good that the model would get confused.

The way I'm thinking that I would try it, would still technically be a single instance, but just having a program to carefully cull different parts of the context to simulate each bot.

the-pi-guy · Mar 20, 2026, 08:03 PM

I have a dumb question. And maybe it's just dumb because Copilot is telling me that some of these systems are actually just inferring and they're not generative. And maybe that's not actually a difference that makes any sense. Basically because upscaling a 480p image to a 1080p image doesn't feel generative. It's just inferring what would be there.

What would be the line between generative and nongenerative/inference AI in different instances?

I feel like if I ask a model to create a completely new image, that is very obviously generative AI.

What if I give an AI half an image and ask it to fill in the other half? That still feels pretty generative.

But what if I give an AI half an image in a checkerboard pattern. That feels kind of different.

But what if I give the AI 1/16th of an image in a kind of checkerboard pattern.

The image generator obviously would have more degrees of freedom the further away from the filled in image it has. But to some extent that feels like the only difference.

Legend · Mar 20, 2026, 09:07 PM

Good question Pi.

I'd say I'd agree with your point of degrees of freedom. But we could refine it slightly and say it is not about degrees of freedom themselves, it's about being able to make something unexpected.

If you can see the result and think "that is not the result I was expecting" but it did everything technically correct, then that is generative imo. Maybe it puts a chess piece in the corner of the image and surprises you.

the-pi-guy · Mar 24, 2026, 09:18 PM

Apparently Sora is done?

Legend · Mar 24, 2026, 10:20 PM

Quote from: the-Pi-guy on Mar 24, 2026, 09:18 PMApparently Sora is done?

Wow. Strange. I mean they had fallen pretty far behind on video gen I think. Maybe they are needing to focus on text.

kitler53 · Mar 24, 2026, 10:47 PM

Quote from: Legend on Mar 24, 2026, 10:20 PMWow. Strange. I mean they had fallen pretty far behind on video gen I think. Maybe they are needing to focus on text.

from what i read it's too expensive. openAI is running out of money and needs to cut costs. also disney bailed on a billion dollar investment related to sora.

Legend · Mar 27, 2026, 05:29 PM

Pi have you been using LTX 2.3? IDK if it got an update but I'm seeing vids from it today.

the-pi-guy · Mar 27, 2026, 06:14 PM

Quote from: Legend on Mar 27, 2026, 05:29 PMPi have you been using LTX 2.3? IDK if it got an update but I'm seeing vids from it today.

Was working on downloading it on Sunday, but I haven't used it.

Was also hearing about a promising new one: daVinci‑MagiHuman

But it sounded like it required some hefty specs. Not really for consumer hardware.

the-pi-guy · Mar 27, 2026, 06:22 PM

The biggest thing I'd really like to see is a way to create longer videos. 20 seconds isn't amazing.

If we could create like even 10 minute videos that'd be cool.

Legend · Mar 29, 2026, 09:30 PM

Quote from: the-Pi-guy on Mar 27, 2026, 06:22 PMThe biggest thing I'd really like to see is a way to create longer videos. 20 seconds isn't amazing.

If we could create like even 10 minute videos that'd be cool.

It is odd that that hasn't been solved yet. Like yes it is a ton of data but like maybe just make a 32x32 latent video that can be 10 minutes long and then use current tech to upscale it. IDK