@lagrangeinterpolator

lagrangeinterpolator@awful.systems · 2 days ago

Unfortunately, I don’t think anyone is ever going to go through all 19,797 submissions and 75,800 reviews (to one conference, in one year) and manually review them all. Then again, using the ultra-advanced cutting-edge innovative statistical technique of randomly sampling a few papers/reviews, one can still get useful conclusions.

lagrangeinterpolator@awful.systems · 2 days ago

After the bubble collapses, I believe there is going to be a rule of thumb for whatever tiny niche use cases LLMs might have: “Never let an LLM have any decision-making power.” At most, LLMs will serve as a heuristic function for an algorithm that actually works.

Unlike the railroads of the First Gilded Age, I don’t think GenAI will have many long term viable use cases. The problem is that it has two characteristics that do not go well together: unreliability and expense. Generally, it’s not worth spending lots of money on a task where you don’t need reliability.

The sheer expense of GenAI has been subsidized by the massive amounts of money thrown at it by tech CEOs and venture capital. People do not realize how much hundreds of billions of dollars is. On a more concrete scale, people only see the fun little chat box when they open ChatGPT, and they do not see the millions of dollars worth of hardware needed to even run a single instance of ChatGPT. The unreliability of GenAI is much harder to hide completely, but it has been masked by some of the most aggressive marketing in history towards an audience that has already drunk the tech hype Kool-Aid. Who else would look at a tool that deletes their entire hard drive and still ever consider using it again?

The unreliability is not really solvable (after hundreds of billions of dollars of trying), but the expense can be reduced at the cost of making the model even less reliable. I expect the true “use cases” to be mainly spam, and perhaps students cheating on homework.

lagrangeinterpolator@awful.systems · 2 days ago

Now I’m even more skeptical of the programmers (and managers) who endorse LLMs.

lagrangeinterpolator@awful.systems · edit-2 2 days ago

The basilisk now eats its own tail.

lagrangeinterpolator@awful.systems · edit-2 2 days ago

Promptfans still can’t get over the Erdős problems. Thankfully, even r/singularity has somehow become resistant to the most overhyped claims. I don’t think I need to comment on this one.

Link: https://www.reddit.com/r/singularity/comments/1pag5mp/aristotle_from_harmonicmath_just_proved_erdos/

alt text (original claim)

We are on the cusp of a profound change in the field of mathematics. Vibe proving is here.

Aristotle from @HarmonicMath just proved Erdos Problem #124 in @leanprover, all by itself. This problem has been open for nearly 30 years since conjectured in the paper “Complete sequences of sets of integer powers” in the journal Acta Arithmetica.

Boris Alexeev ran this problem using a beta version of Aristotle, recently updated to have stronger reasoning ability and a natural language interface.

Mathematical superintelligence is getting closer by the minute, and I’m confident it will change and dramatically accelerate progress in mathematics and all dependent fields.

alt text (comments)

Gcd conditions removed, still great, but really hate the way people shill their stuff without any rigor to explaining the process. A lot of things become very easy when you remove a simple condition. Heck reimann hypothesis is technically solved for function fields over finite fields. But nowadays in the age of hype, a tweet post would probably say “Reimann hypothesis oneshotted by AI” even though that’s not true.

Gcd conditions removed

So they didn’t solve the actual problem?

lagrangeinterpolator@awful.systems · edit-2 7 days ago

True, it is possible to achieve 100,000x speedups if you dispose of the silly restriction of being correct.

lagrangeinterpolator@awful.systems · edit-2 7 days ago

We will secure energy dominance by dumping even more money and resources into a technology that is already straining our power grid. But don’t worry. The LLM will figure it all out by reciting the Wikipedia page for Fusion Power.

AI is expected to make cutting-edge simulations run “10,000 to 100,000 times faster.”

Turns out it’s not good to assume that literally every word that comes out of a tech billionaire’s mouth is true. Now everyone else thinks they can get away with just rattling off numbers where their source is they made it the fuck up. I still remember Elon Musk saying a decade ago that he could make rockets 1,000 times cheaper, and so many people just thought it was going to happen.

We need scientists and engineers. We do not need Silicon Valley billionaire visionary innovator genius whizzes with big ideas who are pushing the frontiers of physics with ChatGPT.

lagrangeinterpolator@awful.systems · edit-2 8 days ago

You’d think peer review would make things better here, but big ML conferences have to deal with an absurd amount of submissions these days. NeurIPS this year got over 21000. The system they use for reviews is that anyone who submits a paper is required to review a certain number of other papers. So yeah, your ML paper is getting reviewed by other people who happen to submit their own papers. Who are competing with you to get their own papers accepted. Yeah, no problems there.

lagrangeinterpolator@awful.systems · 13 days ago

Just make sure you have a few missile turrets protecting the area if you’re playing against zerg. You don’t want your SCV that is building the SMR to get sniped by a flock of mutalisks.

lagrangeinterpolator@awful.systems · 15 days ago

In my experience most people just suck at learning new things, and vastly overestimate the depth of expertise. It doesn’t take that long to learn how to do a thing. I have never written a song (without AI assistance) in my life, but I am sure I could learn within a week. I don’t know how to draw, but I know I could become adequate for any specific task I am trying to achieve within a week. I have never made a 3D prototype in CAD and then used a 3D printer to print it, but I am sure I could learn within a few days.

This reminds me of another tech bro many years ago who also thought that expertise is overrated, and things really aren’t that hard, you know? That belief eventually led him to make a public challenge that he could beat Magnus Carlsen in chess after a month of practice. The WSJ picked up on this, and decided to sponsor an actual match with him and Carlsen. They wrote a fawning article about it, but it did little to stop his enormous public humiliation in the chess community. Here’s a reddit thread discussing that incident: https://www.reddit.com/r/HobbyDrama/comments/nb5b1k/chess_one_month_to_beat_magnus_how_an_obsessive/

As a sidenote, I found it really funny that he thought his best strategy was literally to train a neural network and … memorize all the weights and run inference with mental calculations during the game. Of course, on the day of the match, the strategy was not successful because his algorithm “ran out of time calculating”. How are so many techbros not even good at tech? Come on, that’s the one thing you’re supposed to know!

lagrangeinterpolator@awful.systems · 15 days ago

Just had a conversation about AI where I sent a link to Eddy Burback’s ChatGPT Made Me Delusional video. They clarified that no, it’s only smart people who are more productive with AI since they can filter out all the bad outputs, and only dumb people would suffer all the negative effects. I don’t know what to fucking say.

lagrangeinterpolator@awful.systems · 16 days ago

It is important to update your beliefs with new information and listen to criticism from people who may disagree with you. But never listen to those SneerClub guys! Their non-Rational sneering will corrupt your bodily fluids!

lagrangeinterpolator@awful.systems · 28 days ago

One of the core beliefs of rationalism is that Intelligence™ is the sole determinant of outcomes, overriding resource imbalances, structural factors, or even just plain old luck. For example, since Elon Musk is so rich, that must be because he is very Intelligent™, despite all of the demonstrably idiotic things he has said over the years. So, even in an artificial scenario like chess, they cannot accept the fact that no amount of Intelligence™ can make up for a large material imbalance between the players.

There was a sneer two years ago about this exact question. I can’t blame the rationalists though. The concept of using external sources outside of their bubble is quite unfamiliar to them.

lagrangeinterpolator@awful.systems · 29 days ago

The dumb strawman protagonist is called “Mr. Humman” and the ASI villain is called “Mr. Assi”. I don’t think any parody writer trying to make fun of rationalist writing could come up with something this bad.

The funniest comment is the one pointing out how Eliezer screws up so many basic facts about chess that even an amateur player can see all the problems. Now, if only the commenter looked around a little further and realized that Eliezer is bullshitting about everything else as well.

lagrangeinterpolator@awful.systems · 1 month ago

At the same time, they constantly complain about OpenAI screwing them over with rerouting to GPT5. I don’t know how to tell them this, but OpenAI is starting to realize that maybe lighting mountains of cash on fire is actually bad.

lagrangeinterpolator@awful.systems · 1 month ago

The saddest part is that they are extremely defensive about all this. The entire subreddit is restricted so nobody can post without moderator approval, and so many posts there constantly reference haters and trolls (like this one). Yeah sure, anything like this will attract a lot of trolls, but this is a perfect pretense for censoring legitimate concerns. Many of these people encourage others to fall deeper into the hole with reasonable-sounding arguments, and they never see any pushback because all of it has been censored.

lagrangeinterpolator@awful.systems · 1 month ago

Oh boy, another AI doom video popped up on my feed. Time for more morbid curiosity. The topic is about Big Yud and Nate Soares’s new book (“If You Build It, Everyone Dies”) about how AI is gonna kill us all. I have better things to waste 30 minutes on, so I’m not watching the full video, but the thumbnail (“The 7 Minute War”) kinda suggests what the contents are gonna be.

Thankfully, the description of the video has a Google doc with their sources! I’m sure it’s full of hard evidence from careful experiments that logically demonstrate why their doomsday scenario is something to worry about, not just a random assortment of Anthropic blog posts and completely unrelated events.

Somehow, there are a bunch of sources for the first 2 minutes of the video.

“In the New York Times’ best-selling book, which was endorsed by Nobel laureates and the godfathers of AI” Geoffrey Hinton — Personal estimate >50% existential risk.

Geoffrey “All radiologists will be replaced in 5 years” Hinton, Nobel laureate in physics, famous for his work in … physics.

“researchers from the Machine Intelligence Research Institute describe in detail one potential example future” Machine Intelligence Research Institute — The Sable scenario from If Anyone Builds It, Everyone Dies by Yudkowsky & Soares. Fictional narrative illustrating risks, not prediction.

This is not the first we’ve seen from MIRI, and I have a feeling it will not be the last. The monster under my bed is a fictional narrative illustrating risks, not prediction.

“AI researchers have known this has been potentially a very bad idea since at least 2024” Anthropic/Apollo Research — Multiple 2024 papers document deceptive/self-preserving behaviors in controlled evaluations.

They are still trying to flog the Anthropic/Apollo Research claims that chatbots will lie to you if you tell them to lie to you.

“They spin up 200,000 GPUs and let Sable think for 16 hours straight” xAI/NVIDIA — Colossus supercomputer in Memphis scaling toward ~200,000 GPUs for Grok training.

What does this even demonstrate? Some people can do some stuff with some GPUs? I ate some oatmeal today. Now everyone should be thoroughly convinced of my oatmeal-eating abilities.

I watched for a few seconds around the timestamp, and it seems to be the beginning of their scifi story, I mean, AGI scenario. Yes, if you want to convince people that your scenario is plausible, I’m sure this is the part that you need serious amounts of evidence for. Remember, almost half the sources have timestamps for the first two minutes of the video.

“a stunt to see if Sable can crack famous math problems like the Riemann hypothesis” Clay Mathematics Institute — Riemann Hypothesis remains unsolved after 160+ years, considered most famous unsolved problem in pure mathematics.

Again, what does this demonstrate? I tried solving P vs NP with a cheeseburger. That didn’t work either. The only purpose of mentioning this is for narrative window dressing, because Math Is For Smart People.

These are the sources for just the first two minutes. After that, they get a bit sparse.

“Back in 2024, smaller models showed flashes of the same behavior” Multiple Papers — Documented deception/scheming findings in frontier models.

“Claude 3.7 was caught repeatedly cheating on coding tasks even when told to stop”

More Anthropic blog posts and system cards? Come on, I can’t sneer the same thing twice in one post!

“Steal cryptocurrency from weak exchanges just like hackers did to Mt. Gox in 2011” U.S. Department of Justice — Russian nationals charged for 2011 Mt. Gox hack. 647,000-850,000 BTC stolen.

I don’t know what this has to do with supporting the validity of their AI doomsday scenario, but kudos to them for showing why cryptocurrency is also stupid, I guess.

“or Bybit in 2025” Reuters/FBI — Largest cryptocurrency theft to date. FBI attributed to North Korean Lazarus Group.

More? I guess this is hard evidence for showing why cryptocurrency is stupid. I still don’t understand how this demonstrates that AI is scary.

“Reminder, this scenario is based on years of technical research by the Machine Intelligence Research Institute, laid out in the book If Anyone Builds It Everyone Dies” MIRI — Meta-commentary explaining the scenario is illustrative, not predictive.

I knew MIRI would be back. It’s illustrative, not predictive! Please don’t blame us if none of this even remotely happens! But it’s based on years of technical research. An entire graduate student’s worth of output in a decade.

“In 2023, a human gave an LLM access to the internet and created an X account, Terminal of Truths, which gained hundreds of thousands of followers and launched its own crypto meme coin that reached a literal billion dollar market cap” Terminal of Truths — Real-world example of AI agent gaining social media following and wealth.

The link they give references … another one of their own videos. You really are not beating the circular reference allegations here. Even if the purported story is somehow accurate, this again demonstrates how cryptocurrency is stupid. At least they use an LLM as a prop this time.

“Gain of function research. Any one of them could be hijacked to unleash catastrophe.” Science/CIDRAP — Fouchier and Kawaoka created ferret-transmissible H5N1. Controversial GOF research began 2011.

I think Yud is obsessed with this topic in particular. Better than diamondoid bacteria, I guess. Again, the AI just magically comes in and uses this stuff somehow.

“The number one and number two most cited living scientists across all fields think scenarios like this are not only possible but likely to happen. And the average AI researcher thinks there is a 16% chance of AI causing human extinction.”

Okay, let me be completely serious for this one. What would someone do if they truly believed that their work would lead to a horrible disaster, such as the extinction of humanity? Would they continue to work in the field, let alone make enough contributions to rise to the top? Alright I’m done.

lagrangeinterpolator@awful.systems · edit-2 1 month ago

Every time I hear a moderate AI argument (e.g. AI will be an aid for searching literature or writing code), it’s like, “Look, it’s impressive that the AI managed to do this. Sure, it took about three dozen prompts over five hours, made me waste another five hours because it generated some completely incorrect nonsense that I had to verify, produced an answer that was much lower quality than if I had just searched it up myself, and boiled two lakes in the process. You should acknowledge that there is something there, even if it did take a trillion dollars of hardware and power to grind the entire internet and all books and scientific papers into a viscous paste. Your objections are invalid because I’m sure things are gonna improve because Progress.”

I am doubly annoyed when I turn my back and they switch back to spouting nonsense about exponential curves and how AI is gonna be smarter than humans at literally everything.

lagrangeinterpolator@awful.systems · 1 month ago

I did not think anything could make me sympathetic to the authors who put 0.1pt white text in their papers so that any reviewer lazy enough to use an LLM would get prompt injected, but here we are.

lagrangeinterpolator@awful.systems · edit-2 2 months ago

More AI bullshit hype in math. I only saw this just now so this is my hot take. So far, I’m trusting this r/math thread the most as there are some opinions from actual mathematicians: https://www.reddit.com/r/math/comments/1o8xz7t/terence_tao_literature_review_is_the_most/

Context: Paul Erdős was a prolific mathematician who had more of a problem-solving style of math (as opposed to a theory-building style). As you would expect, he proposed over a thousand problems for the math community that he couldn’t solve himself, and several hundred of them remain unsolved. With the rise of the internet, someone had the idea to compile and maintain the status of all known Erdős problems in a single website (https://www.erdosproblems.com/). This site is still maintained by this one person, which will be an important fact later.

Terence Tao is a present-day prolific mathematician, and in the past few years, he has really tried to take AI with as much good faith as possible. Recently, some people used AI to search up papers with solutions to some problems listed as unsolved on the Erdős problems website, and Tao points this out as one possible use of AI. (I personally think there should be better algorithms for searching literature. I also think conflating this with general LLM claims and the marketing term of AI is bad-faith argumentation.)

You can see what the reasonable explanation is. Math is such a large field now that no one can keep tabs on all the progress happening at once. The single person maintaining the website missed a few problems that got solved (he didn’t see the solutions, and/or the authors never bothered to inform him). But of course, the AI hype machine got going real quick. GPT5 managed to solve 10 unsolved problems in mathematics! (https://xcancel.com/Yuchenj_UW/status/1979422127905476778#m, original is now deleted due to public embarrassment) Turns out GPT5 just searched the web/training data for solutions that have already been found by humans. The math community gets a discussion about how to make literature more accessible, and the rest of the world gets a scary story about how AI is going to be smarter than all of us.

There are a few promising signs that this is getting shut down quickly (even Demis Hassabis, CEO of DeepMind, thought that this hype was blatantly obvious). I hope this is a bigger sign for the AI bubble in general.

EDIT: Turns out it was not some rando spreading the hype, but an employee of OpenAI. He has taken his original claim back, but not without trying to defend what he can by saying AI is still great at literature review. At this point, I am skeptical that this even proves AI is great at that. After all, the issue was that a website maintained by a single person had not updated the status of 10 problems inside a list of over 1000 problems. Do we have any control experiments showing that a conventional literature review would have been much worse?