Linus Torvalds has come out strong against proposed support for RISC-V big endian capabilities within the Linux kernel.

In response to a mailing list comment whether RISC-V big endian “BE” patches being worked on would be able to make it for this current Linux kernel cycle. Linus Torvalds initially wrote:

"Oh Christ. Is somebody seriously working on BE support in 2025?

WHY?

Seriously, that sounds like just stupid. Is there some actual real reason for this, or is it more of the “RISC-V is used in academic design classes and so people just want to do endianness for academic reasons”?

Because I’d be more than happy to just draw a line in the sand and say “New endianness problems are somebody ELSES problem”, and tell people to stop being silly.

Let’s not complicate things for no good reason. And there is NO reason to add new endianness.

RISC-V is enough of a mess with the millions of silly configuration issues already. Don’t make it even worse.

Tell people to just talk to their therapists instead. That’s much more productive."

  • Eldritch@piefed.world
    link
    fedilink
    English
    arrow-up
    26
    ·
    1 day ago

    Big endian is a byte order scheme. For registers and memory. If I’m not mistaken, pretty much all modern systems these days are little endian. The differences can largely be mitigated at the operating system level. But it definitely invites new chaos back into the equation. If programs don’t get their bytes in the right order, it will be garbage in garbage out everywhere. Try playing an old PCM wave file from a 68k based Macintosh or Amiga on a PC.

    • calliope@retrolemmy.com
      link
      fedilink
      arrow-up
      9
      ·
      edit-2
      1 day ago

      Try playing an old PCM wave file from a 68k based Macintosh or Amiga on a PC.

      I really appreciate how specific this is. It sounds like it’s from experience.

      • Eldritch@piefed.world
        link
        fedilink
        English
        arrow-up
        14
        ·
        edit-2
        1 day ago

        It 100% is. I have distinct memories back in the 80s of getting a hold of files off of FidoNet, etc. and trying to play them only to find out they were big endian files. It sounded like a mildly uncomfortable shoggoth in a blender. It had nothing on the dulcet tones of dial up modems.

    • ulterno@programming.dev
      link
      fedilink
      English
      arrow-up
      2
      ·
      1 day ago

      The differences can largely be mitigated at the operating system level.

      Exactly. And that also means that they have to be mitigated at the OS level and in this case, the kernel level.
      It would usually be fine if the system had nothing other than RAM and fixed storage, but anything else (as simple as keyboard drivers and stuff) and you will need to make the program accordingly. Maybe even SATA would require handling endianness.

      Honestly, I thought that they already handled endianness, due to how many different things Linux works on.
      But if it doesn’t then I don’t see there being much of a requirement.

      For networking devices and such, I’d rather favour more customised solutions either way.

      • elmicha@feddit.org
        link
        fedilink
        arrow-up
        1
        ·
        23 hours ago

        Some big-endian architectures are supported, but not RISC-V:

        A related problem is big-endian support, which is also 32-bit only, and also obsolete. Its removal is blocked because IBM is still supporting big-endian mainframe and PowerPC systems; as long as that support continues, big-endian support will stay in the kernel.

      • teawrecks@sopuli.xyz
        link
        fedilink
        arrow-up
        3
        ·
        14 hours ago

        If you have a file containing 4 bytes, where the first byte in the file is 0x01, second is 0x02, third is 0x03, and 4th is 0x04, and you keep that same order when you load them into the first 4 bytes of memory on an LE machine vs a BE machine, what value does the uint32 located at address 0 contain? On LE it’s 0x04030201, on BE it’s 0x01020304. So yes, it absolutely matters.

      • Eldritch@piefed.world
        link
        fedilink
        English
        arrow-up
        3
        ·
        16 hours ago

        Some formats are designed around a particular endian-ness. But not all. Some only specify where a byte is to be stored and what it should represent. But not how it should be stored. PCM WAV files are ancient and uncompressed. with a few bytes in the headder telling you bit depth and sample rate. Followed by a stream of raw bytes representing the samples. Whether you store those bytes in LSB or MSB order. They’re both valid. But passing such a file to a system expecting the opposite. Your file will still play. It will just be squealing noise.

        Bitmap images can be similar as well. So arbitrary is the data in RAW Bitmap files. That you can open it in audio software like Audacity as a raw PCM. Perform Fourier transforms or other audio options on it. Save the raw byte stream back out as an image again. Bytes are bytes, and a lot of formats don’t care what order they’re in. Even if the user might.