• 0 Posts
  • 18 Comments
Joined 1 year ago
Cake day: July 8th, 2023



  • Did you read the article, or the actual research paper? They present a mathematical proof that any hypothetical method of training an AI that produces an algorithm performing better than random chance could also be used to solve a known intractable problem, something no known method can do efficiently. This means that any algorithm we can produce by training an AI would run in exponential time or worse.

    The paper’s authors point out that this has severe implications for current AI, too: since the learning problem underlying the AI-by-learning approach that all LLMs rely on is fundamentally NP-hard and can’t be solved in polynomial time, “the sample-and-time requirements grow non-polynomially (e.g. exponentially or worse) in n.” They present a thought experiment of an AI that handles a 15-minute conversation, assuming 60 words are spoken per minute (keep in mind the average is roughly 160). The input size n for that conversation works out to 60 × 15 = 900 words. The authors then conclude:

    “Now the AI needs to learn to respond appropriately to conversations of this size (and not just to short prompts). Since resource requirements for AI-by-Learning grow exponentially or worse, let us take a simple exponential function O(2^n) as our proxy of the order of magnitude of resources needed as a function of n. 2^900 ∼ 10^270 is already unimaginably larger than the number of atoms in the universe (∼10^81). Imagine us sampling this super-astronomical space of possible situations using so-called ‘Big Data’. Even if we grant that billions of trillions (10^21) of relevant data samples could be generated (or scraped) and stored, then this is still but a miniscule proportion of the order of magnitude of samples needed to solve the learning problem for even moderate size n.”
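    To put those numbers side by side, here’s a quick back-of-the-envelope check in Python of the arithmetic quoted above (nothing in it beyond the figures in the excerpt):

    ```python
    import math

    # Figures from the thought experiment quoted above.
    n = 60 * 15                       # 15-minute conversation at 60 words/minute -> n = 900
    log10_space = n * math.log10(2)   # order of magnitude of the 2^n resource proxy
    log10_samples = 21                # "billions of trillions" (10^21) of data samples

    print(f"n = {n}")                                            # n = 900
    print(f"2^{n} is on the order of 10^{int(log10_space)}")     # ~10^270
    print(f"10^{log10_samples} samples miss by ~{int(log10_space) - log10_samples} orders of magnitude")
    ```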

    That’s why LLMs are a dead end.


  • When IT folks say devs don’t know about hardware, they’re usually talking about the forest-level overview in my experience. Stuff like how the software being developed integrates into an existing environment and how to optimize code to fit within the bounds of reality–it may be practical to dump a database directly into memory when it’s a 500 MB testing dataset on your local workstation, but it’s insane to do that with a 500+ GB database in a production environment. Similarly, a program may run fine when it’s using an NVMe SSD, but lots of environments even today still depend on arrays of traditional electromechanical hard drives because they offer the most capacity per dollar and aren’t as prone to suddenly tombstoning the way flash media is when it dies. Suddenly, once the program is in production, it turns out that same program is making a bunch of random I/O calls that could be optimized into more sequential requests or batched together into a single transaction, and now it runs like dogshit and drags down every other VM, container, or service sharing that array with it. And that’s not even accounting for the real dumb shit I’ve read about, like “dev hard-coded their local IP address and it breaks in production because of NAT” or “program crashes because it doesn’t account for network latency.”
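    To make the I/O point concrete, here’s a minimal sketch in Python with sqlite3 (a stand-in for whatever database is actually involved, not any particular production setup) contrasting a commit per row with batching the same writes into one transaction:

    ```python
    import sqlite3

    rows = [(i, f"value-{i}") for i in range(10_000)]

    conn = sqlite3.connect("example.db")
    conn.execute("CREATE TABLE IF NOT EXISTS kv (id INTEGER PRIMARY KEY, val TEXT)")

    # Anti-pattern: one statement and one commit per row. Every commit forces a
    # flush, so a spinning-disk array eats thousands of tiny random writes.
    for row in rows:
        conn.execute("INSERT OR REPLACE INTO kv VALUES (?, ?)", row)
        conn.commit()

    # Better: batch the same work into a single transaction. One commit, one
    # (mostly sequential) flush, far less contention for whatever shares the array.
    with conn:
        conn.executemany("INSERT OR REPLACE INTO kv VALUES (?, ?)", rows)

    conn.close()
    ```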

    Game dev is unique because you’re either explicitly targeting a single known platform (for consoles) or targeting an extremely wide range of performance specs (for PC), and hitting an acceptable level of performance pre-release is (somewhat) mandatory, so this kind of mindfulness is drilled into devs much more heavily than it is in business software dev, especially in-house dev. Business development is almost entirely focused on “does it run without failing catastrophically,” and almost everything else–performance, security, cleanliness, resource optimization–gets bare lip service at best.




  • Basically, X11/Xorg doesn’t isolate programs from one another. This is horrible for security since malicious software can read every window, as well as all the input from mice and keyboards, just by querying the X server, but it’s also handy for screen-reading software, streaming, etc. Meanwhile, Wayland isolates each program in its own sandbox, which prevents, say, a malicious browser tab from reading all of your keyboard inputs and logging your root password, but also breaks those things we like to use. To make matters worse, it looks like everyone’s answer for this and similar dilemmas wasn’t “let’s fix Wayland” but “let’s develop an extension to fix Wayland,” and we wound up with that one fucking xkcd standards comic that I won’t bother linking because everyone has seen it a zillion times.
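    As a small illustration of that first point, here’s a sketch using the third-party python-xlib package (assuming it’s installed): any ordinary, unprivileged X11 client can walk the whole window tree and read other applications’ window titles just by asking the X server, and input snooping works much the same way through extensions like RECORD.

    ```python
    from Xlib import display

    d = display.Display()        # connect to the X server named by $DISPLAY

    def walk(window, depth=0):
        """Recursively print the title of every window on the display."""
        try:
            name = window.get_wm_name()
        except Exception:
            name = None          # some windows have no readable title
        if name:
            print("  " * depth + str(name))
        for child in window.query_tree().children:
            walk(child, depth + 1)

    walk(d.screen().root)        # no special permissions needed for any of this
    ```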

    ETA: Basically, my (layman’s) understanding is that fixing this and making screen readers work in Wayland is hard because the core Wayland developers seem to have little appetite for fixing it themselves. Meanwhile, there are 3-4 Wayland compositor implementations that do things differently, so fixing it via extensions means either writing multiple backends in your program to do the same damn thing (aka a giant pain in the ass) or getting everyone to agree on the same standard implementation (good fucking luck).


  • The problem is that there’s no incentive for employees to stay beyond a few years. Why spend months or years training someone if they leave after the second year?

    But then you have to ask why employees aren’t loyal any longer, and the answer is that pensions and benefits have eroded and pay doesn’t keep up the longer you stay at one company. Why stay at a company for 20, 30, or 40 years when you can come out way ahead financially by hopping jobs every 2-4 years?


  • It makes sense to judge how closely LLMs mimic human learning when people use that comparison as a defense of AI companies scraping copyrighted content, claiming that banning AI scraping is as nonsensical as banning human learning.

    But when it’s pointed out that LLMs don’t learn very similarly to humans, and require scraping far more material than a human does, suddenly AIs shouldn’t be judged by human standards? I don’t know if it’s intentional on your part, but that’s a pretty classic example of a motte-and-bailey fallacy. You can’t have it both ways.


  • Who even knows? For whatever reason the board decided to keep quiet, didn’t elaborate on its reasoning, let Altman and his allies control the narrative, and rolled over when the employees inevitably revolted. All we have is speculation and unnamed “sources close to the matter,” which you may or may not find credible.

    Even if the actual reasoning was entirely justified–and knowing what a techbro Altman is (especially with his insanely creepy project to combine cryptocurrency with retina scans), I absolutely believe the speculation that the board felt Altman wasn’t trustworthy–they never bothered to share that reasoning with anyone, and clearly thought they could just weather the firestorm, right up until they realized it was too late and they’d already shot themselves in the foot.






  • First, it’s important to find an instance that caters to your interests, especially if you have more niche hobbies. Once you’re set up, search for and follow hashtags related to your personal interests, and use those to find accounts you like. Use hashtags in your own posts so that people can discover you more easily, and browse the users that follow you to see if they’d be worth following back to expand your network. Keep an eye on the local timeline (all public posts from accounts on your instance) and the federated timeline (public posts from every instance yours federates with) for interesting posts. Eventually, as you build up a follow list (and especially as you follow highly active accounts), the accounts you follow will start introducing you to new ones by boosting posts.

    It’s more work since you’re building the network yourself instead of having it spoon-fed to you by an algorithm, but it’s overall much more rewarding, and lets you tailor your experience to your own personal preferences.
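    If you ever want to poke at those timelines outside the web UI, Mastodon also exposes them through its REST API; here’s a rough sketch using Python’s requests library (the instance URL and hashtag are placeholders, and some instances require an access token for these endpoints):

    ```python
    import requests

    INSTANCE = "https://mastodon.social"   # placeholder: use your own instance
    HASHTAG = "linux"                      # placeholder: any hashtag you follow

    def show(title, statuses):
        print(f"\n== {title} ==")
        for status in statuses[:5]:
            print(f"{status['account']['acct']}: {status['url']}")

    # Local timeline: public posts from accounts on this instance only.
    local = requests.get(f"{INSTANCE}/api/v1/timelines/public",
                         params={"local": "true", "limit": 20}, timeout=10).json()
    show("Local timeline", local)

    # Federated timeline: public posts from every instance this one federates with.
    federated = requests.get(f"{INSTANCE}/api/v1/timelines/public",
                             params={"limit": 20}, timeout=10).json()
    show("Federated timeline", federated)

    # Hashtag timeline: public posts using a hashtag you care about.
    tagged = requests.get(f"{INSTANCE}/api/v1/timelines/tag/{HASHTAG}",
                          params={"limit": 20}, timeout=10).json()
    show(f"#{HASHTAG} timeline", tagged)
    ```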