Why my company is stuck rn

Jason Voorhees · Mar 19, 2026

I'll keep it as simply as possible so anyone can understand. You see all computers store things Os and 1s. Everything is long sequences of Os and 1s but the problem is all systems since the dawn of computers have rounding errors. 0.1+0.2 =/= 0.3 because decimal numbers cannot reliably be represented in Os and 1s.It would be something like 0.30000000000000004 or something equally cursed.

It's a fundamental law of how binary floating point (IEEE 754) works. For 50+ years we have just ignored this because who tf cares but now with AI in the picture. You can't simply ignore it. Millions of matrix multiplications per second,millisecond inferences and perfect consistency across training runs. It means even than tiny errors get magnified into something catastrophic across 175 billion parameters.

This isn't a huge problem generally neural networks don't need mathematical perfection infact gradient descent actually loves a bit of noise and garbage for generalization but the problem is here we are dealing with algorithms. There's ofc workarounds like quantization, tensor cores in GPUs specifically designed to handle this with FP32 but there's none that specifically catering to our needs because we require a deterministic bit exact reproducibility.

Jason Voorhees · Mar 19, 2026

@Glorious King @topology @SharpOrange @LXR @User28823

NinjaRG9 · Mar 19, 2026

Always some bullshit with this old nigga

Jason Voorhees · Mar 19, 2026

@mcmentalonthemic @jeoyw9192 @Jatt @childishkillah

tansel · Mar 19, 2026

unknownincel · Mar 19, 2026

sounds complicated

dropped comp science

Jason Voorhees · Mar 19, 2026

@imontheloose @dhusc @unstable @Jager

ER1887 · Mar 19, 2026

Jason Voorhees said:
I'll keep it as simply as possible so anyone can understand. You see all computers store things Os and 1s. Everything is long sequences of Os and 1s but the problem is all systems since the dawn of computers have rounding errors. 0.1+0.2 =/= 0.3 because decimal numbers cannot reliably be represented in Os and 1s.It would be something like 0.30000000000000004 or something equally cursed.

It's a fundamental law of how binary floating point (IEEE 754) works. For 50+ years we have just ignored this because who tf cares but now with AI in the picture. You can't simply ignore it. Millions of matrix multiplications per second,millisecond inferences and perfect consistency across training runs. It means even than tiny errors get magnified into something catastrophic across 175 billion parameters.

This isn't a huge problem generally neural networks don't need mathematical perfection infact gradient descent actually loves a bit of noise and garbage for generalization but the problem is here we are dealing with algorithms. There's ofc workarounds like quantization, tensor cores in GPUs specifically designed to handle this with FP32 but there's none that specifically catering to our needs because we require a deterministic bit exact reproducibility.

I hate working with computers

unstable · Mar 19, 2026

what do you think about non transistor processors for ai?
i think they are all hype.

mcmentalonthemic · Mar 19, 2026

Will read later im playing video games

Jason Voorhees · Mar 19, 2026

mcmentalonthemic said:
Will read later im playing video games

Which one?

Jason Voorhees · Mar 19, 2026

unstable said:
what do you think about non transistor processors for ai?
i think they are all hype.

That's the hardware side of things that I'm not sure about tbh

Jason Voorhees · Mar 19, 2026

@Incelforeever

unstable · Mar 19, 2026

which languages would you advice someone to learn for best job opportunities.

SharpOrange · Mar 19, 2026

unstable said:
what do you think about non transistor processors for ai?
i think they are all hype.

What course is this? Electronics?

SharpOrange · Mar 19, 2026

unstable said:
which languages would you advice someone to learn for best job opportunities.

Hebrew

Jason Voorhees · Mar 19, 2026

unstable said:
which languages would you advice someone to learn for best job opportunities.

Learn python and FASTAPI

mcmentalonthemic · Mar 19, 2026

Jason Voorhees said:
Which one?

ready or not

Jason Voorhees · Mar 19, 2026

SharpOrange said:
What course is this? Electronics?

ECE but no course will teach that specific thing. You'll have to learn that yourself

elliottttt · Mar 19, 2026

Jason Voorhees said:
I'll keep it as simply as possible so anyone can understand. You see all computers store things Os and 1s. Everything is long sequences of Os and 1s but the problem is all systems since the dawn of computers have rounding errors. 0.1+0.2 =/= 0.3 because decimal numbers cannot reliably be represented in Os and 1s.It would be something like 0.30000000000000004 or something equally cursed.

It's a fundamental law of how binary floating point (IEEE 754) works. For 50+ years we have just ignored this because who tf cares but now with AI in the picture. You can't simply ignore it. Millions of matrix multiplications per second,millisecond inferences and perfect consistency across training runs. It means even than tiny errors get magnified into something catastrophic across 175 billion parameters.

This isn't a huge problem generally neural networks don't need mathematical perfection infact gradient descent actually loves a bit of noise and garbage for generalization but the problem is here we are dealing with algorithms. There's ofc workarounds like quantization, tensor cores in GPUs specifically designed to handle this with FP32 but there's none that specifically catering to our needs because we require a deterministic bit exact reproducibility.

dnr

unstable · Mar 19, 2026

SharpOrange said:
What course is this? Electronics?

nah youtube autism

hyperbeast · Mar 19, 2026

Hire me bhai

birthdefect · Mar 19, 2026

Jason Voorhees said:
I'll keep it as simply as possible so anyone can understand. You see all computers store things Os and 1s. Everything is long sequences of Os and 1s but the problem is all systems since the dawn of computers have rounding errors. 0.1+0.2 =/= 0.3 because decimal numbers cannot reliably be represented in Os and 1s.It would be something like 0.30000000000000004 or something equally cursed.

It's a fundamental law of how binary floating point (IEEE 754) works. For 50+ years we have just ignored this because who tf cares but now with AI in the picture. You can't simply ignore it. Millions of matrix multiplications per second,millisecond inferences and perfect consistency across training runs. It means even than tiny errors get magnified into something catastrophic across 175 billion parameters.

This isn't a huge problem generally neural networks don't need mathematical perfection infact gradient descent actually loves a bit of noise and garbage for generalization but the problem is here we are dealing with algorithms. There's ofc workarounds like quantization, tensor cores in GPUs specifically designed to handle this with FP32 but there's none that specifically catering to our needs because we require a deterministic bit exact reproducibility.

idk anything about computers so ignore any following idiocy
this is floating point precision errors correct?
this is just unfixable isnt it?
is quantum computing a fad? if not would it fix this? same with tertiary computers, although unrealistic to inplement

Sayori · Mar 19, 2026

Jason Voorhees said:
I'll keep it as simply as possible so anyone can understand. You see all computers store things Os and 1s. Everything is long sequences of Os and 1s but the problem is all systems since the dawn of computers have rounding errors. 0.1+0.2 =/= 0.3 because decimal numbers cannot reliably be represented in Os and 1s.It would be something like 0.30000000000000004 or something equally cursed.

It's a fundamental law of how binary floating point (IEEE 754) works. For 50+ years we have just ignored this because who tf cares but now with AI in the picture. You can't simply ignore it. Millions of matrix multiplications per second,millisecond inferences and perfect consistency across training runs. It means even than tiny errors get magnified into something catastrophic across 175 billion parameters.

This isn't a huge problem generally neural networks don't need mathematical perfection infact gradient descent actually loves a bit of noise and garbage for generalization but the problem is here we are dealing with algorithms. There's ofc workarounds like quantization, tensor cores in GPUs specifically designed to handle this with FP32 but there's none that specifically catering to our needs because we require a deterministic bit exact reproducibility.

so, since your company works with ai its caused it to stuck rn ?

Wish you luck and hope that you are a crucial member of your company as to not get fired

jeoyw9192 · Mar 19, 2026

NinjaRG9 said:
Always some bullshit with this old nigga

If you have nothing meaningful to say (including questions) stfu.

Jason Voorhees said:
I'll keep it as simply as possible so anyone can understand. You see all computers store things Os and 1s. Everything is long sequences of Os and 1s but the problem is all systems since the dawn of computers have rounding errors. 0.1+0.2 =/= 0.3 because decimal numbers cannot reliably be represented in Os and 1s.It would be something like 0.30000000000000004 or something equally cursed.

It's a fundamental law of how binary floating point (IEEE 754) works. For 50+ years we have just ignored this because who tf cares but now with AI in the picture. You can't simply ignore it. Millions of matrix multiplications per second,millisecond inferences and perfect consistency across training runs. It means even than tiny errors get magnified into something catastrophic across 175 billion parameters.

This isn't a huge problem generally neural networks don't need mathematical perfection infact gradient descent actually loves a bit of noise and garbage for generalization but the problem is here we are dealing with algorithms. There's ofc workarounds like quantization, tensor cores in GPUs specifically designed to handle this with FP32 but there's none that specifically catering to our needs because we require a deterministic bit exact reproducibility.

Thing is for instance are you considering decentralized AI networks as well? Practically ud have multiple nodes that might require verification of a certain computation. In what you mention of you don't bit exact outcomes then the nodes could end up disagreeing on the valid model state regardless of the same inputs/outputs.

Another point is the issue of being able to debug at a much larger scale, like when a model training run costs a shit ton of money like in the millions , the bugs here are annoying asf to deal with. I'm talking finding the precise MS where a gradient exploded.

childishkillah · Mar 19, 2026

I didn't know this, what kind of IA are you working with?

Jason Voorhees · Mar 19, 2026

birthdefect said:
idk anything about computers so ignore any following idiocy
this is floating point precision errors correct?
this is just unfixable isnt it?
is quantum computing a fad? if not would it fix this? same with tertiary computers, although unrealistic to inplement

Quantum computing not a fad. Is revolutionary and progressing steadily for certain hard problems but won't fix this for today's Al training. Quantum bits have their own massive noise/error issues. It's a seperate rabbit hole entirely and still mostly lab experiments that are insanely hard to scale reliably

Jason Voorhees · Mar 19, 2026

Sayori said:
so, since your company works with ai its caused it to stuck rn ?

Wish you luck and hope that you are a crucial member of your company as to not get fired

No one is getting fired brah. This issue isn't something specific to our company it's just end game for AI researchers. Not something actively harming us but could be huge bottle neck later.

Jason Voorhees · Mar 19, 2026

jeoyw9192 said:
If you have nothing meaningful to say (including questions) stfu.

Thing is for instance are you considering decentralized AI networks as well? Practically ud have multiple nodes that might require verification of a certain computation. In what you mention of you don't bit exact outcomes then the nodes could end up disagreeing on the valid model state regardless of the same inputs/outputs.

Another point is the issue of being able to debug at a much larger scale, like when a model training run costs a shit ton of money like in the millions , the bugs here are annoying asf to deal with. I'm talking finding the precise MS where a gradient exploded.

This is actually called the determinism gap in AI.

In decentralized Al, if two nodes run the exact same input but use different GPU architectures or sometimes even different driver versions their floating point rounding will diverge. This is why the bit exact reproducible is the holy grail for researchers rn without it you can't easily verify if a node is cheating or just experiencing standard drift you u can't reliably trace when or why something like a gradient explosion happened. It makes all these million dollars AI tuning into a game of blind trial and error.

birthdefect · Mar 19, 2026

Jason Voorhees said:
Quantum computing not a fad. Is revolutionary and progressing steadily for certain hard problems but won't fix this for today's Al training. Quantum bits have their own massive noise/error issues. It's a seperate rabbit hole entirely and still mostly lab experiments that are insanely hard to scale reliably

what about tertiary computers? would floating point precision errors still be an issue? hell what about analogue computers

NinjaRG9 · Mar 19, 2026

jeoyw9192 said:
If you have nothing meaningful to say (including questions)

I know

why u bulleh meh

jeoyw9192 · Mar 19, 2026

Jason Voorhees said:
This is actually called the determinism gap in AI.

In decentralized Al, if two nodes run the exact same input but use different GPU architectures or sometimes even different driver versions their floating point rounding will diverge. This is why the bit exact reproducible is the holy grail for researchers rn without it you can't easily verify if a node is cheating or just experiencing standard drift you u can't reliably trace when or why something like a gradient explosion happened. It makes all these million dollars AI tuning into a game of blind trial and error.

Very interesting I'll take a look; but yeah ur def right Abt it being a game of blind trial and error

can't even fathom dealing with that shit

NinjaRG9 said:
I knowwhy u bulleh meh

Wasn't trying to it's just annoying when comments like urs are made on such threads, u can always save time and choose not to send a message

johnnyapple · Mar 19, 2026

Jason Voorhees said:
I'll keep it as simply as possible so anyone can understand. You see all computers store things Os and 1s. Everything is long sequences of Os and 1s but the problem is all systems since the dawn of computers have rounding errors. 0.1+0.2 =/= 0.3 because decimal numbers cannot reliably be represented in Os and 1s.It would be something like 0.30000000000000004 or something equally cursed.

It's a fundamental law of how binary floating point (IEEE 754) works. For 50+ years we have just ignored this because who tf cares but now with AI in the picture. You can't simply ignore it. Millions of matrix multiplications per second,millisecond inferences and perfect consistency across training runs. It means even than tiny errors get magnified into something catastrophic across 175 billion parameters.

This isn't a huge problem generally neural networks don't need mathematical perfection infact gradient descent actually loves a bit of noise and garbage for generalization but the problem is here we are dealing with algorithms. There's ofc workarounds like quantization, tensor cores in GPUs specifically designed to handle this with FP32 but there's none that specifically catering to our needs because we require a deterministic bit exact reproducibility.

healthcare or law? Probably already read through this but I remember thinking machines talked about deterministic inference on a single server

Defeating Nondeterminism in LLM Inference

Reproducibility is a bedrock of scientific progress. However, it’s remarkably difficult to get reproducible results out of large language models. For example, you might observe that asking ChatGPT the same question multiple times provides different results. This by itself is not surprising...

thinkingmachines.ai

Jason Voorhees · Mar 19, 2026

johnnyapple said:
healthcare or law? Probably already read through this but I remember thinking machines talked about deterministic inference on a single server

Defeating Nondeterminism in LLM Inference

Reproducibility is a bedrock of scientific progress. However, it’s remarkably difficult to get reproducible results out of large language models. For example, you might observe that asking ChatGPT the same question multiple times provides different results. This by itself is not surprising...

thinkingmachines.ai

I've read this before. Thinking Machines approach shows that while a single GPU operation can be made deterministic, scaling that to a production is a nightmare.

Jason Voorhees · Mar 19, 2026

@Gomez

Gomez · Mar 19, 2026

Jason Voorhees said:
@Gomez

Time to pick up that job as a carpenter

Glorious King · Mar 19, 2026

the real question is

how do you find time to do this and research more on topics like this?

istg i cant find time to read my own code base brah

@Swarthy Knight

Jason Voorhees · Mar 19, 2026

Glorious King said:
the real question is

how do you find time to do this and research more on topics like this?

istg i cant find time read my own code base brah

@Swarthy Knight

Subscribe to YouTube channels and tech newsletters and learn. I literally read about these things at breakfast or while running listening to podcasts

Glorious King · Mar 19, 2026

Jason Voorhees said:
Subscribe to YouTube channels and tech newsletters and learn. I literally read about these things at breakfast or while running listening to podcasts

drop their names in dms pls

my feed is filled with japan slop and jdm shi

Jason Voorhees · Mar 19, 2026

Glorious King said:
drop their names in dms pls

my feed is filled with japan slop and jdm shi

Will dm you when I'm done shit posting in too lazy to go click share a dozen times now.

johnnyapple · Mar 19, 2026

Jason Voorhees said:
I've read this before. Thinking Machines approach shows that while a single GPU operation can be made deterministic, scaling that to a production is a nightmare.

you sound AI but yeah

Jason Voorhees · Mar 19, 2026

johnnyapple said:
you sound AI but yeah

Because it is I'm on phone. Oneplus Ai

LXR · Mar 19, 2026

Jason Voorhees said:
I'll keep it as simply as possible so anyone can understand. You see all computers store things Os and 1s. Everything is long sequences of Os and 1s but the problem is all systems since the dawn of computers have rounding errors. 0.1+0.2 =/= 0.3 because decimal numbers cannot reliably be represented in Os and 1s.It would be something like 0.30000000000000004 or something equally cursed.

It's a fundamental law of how binary floating point (IEEE 754) works. For 50+ years we have just ignored this because who tf cares but now with AI in the picture. You can't simply ignore it. Millions of matrix multiplications per second,millisecond inferences and perfect consistency across training runs. It means even than tiny errors get magnified into something catastrophic across 175 billion parameters.

This isn't a huge problem generally neural networks don't need mathematical perfection infact gradient descent actually loves a bit of noise and garbage for generalization but the problem is here we are dealing with algorithms. There's ofc workarounds like quantization, tensor cores in GPUs specifically designed to handle this with FP32 but there's none that specifically catering to our needs because we require a deterministic bit exact reproducibility.

Time to prove your college tag, invent a new paradigm

Jason Voorhees · Mar 19, 2026

johnnyapple said:
you sound AI but yeah

GReycel indian very good

johnnyapple · Mar 19, 2026

Jason Voorhees said:
GReycel indian very good

I dont understand but yes

topology · Mar 19, 2026

Jason Voorhees said:
I'll keep it as simply as possible so anyone can understand. You see all computers store things Os and 1s. Everything is long sequences of Os and 1s but the problem is all systems since the dawn of computers have rounding errors. 0.1+0.2 =/= 0.3 because decimal numbers cannot reliably be represented in Os and 1s.It would be something like 0.30000000000000004 or something equally cursed.

It's a fundamental law of how binary floating point (IEEE 754) works. For 50+ years we have just ignored this because who tf cares but now with AI in the picture. You can't simply ignore it. Millions of matrix multiplications per second,millisecond inferences and perfect consistency across training runs. It means even than tiny errors get magnified into something catastrophic across 175 billion parameters.

This isn't a huge problem generally neural networks don't need mathematical perfection infact gradient descent actually loves a bit of noise and garbage for generalization but the problem is here we are dealing with algorithms. There's ofc workarounds like quantization, tensor cores in GPUs specifically designed to handle this with FP32 but there's none that specifically catering to our needs because we require a deterministic bit exact reproducibility.

Do you work for a massive company? Didn't know your company specialized in AI.

jeoyw9192 · Mar 19, 2026

Jason Voorhees said:
Subscribe to YouTube channels and tech newsletters and learn. I literally read about these things at breakfast or while running listening to podcasts

DM me too should be helpful

Why my company is stuck rn

Jason Voorhees

𝕸𝖊𝖗𝖈𝖊𝖓𝖆𝖗𝖞 𝕮𝖔𝖗𝖕 • 𝟐𝟎𝟐𝟒🥇

Jason Voorhees

𝕸𝖊𝖗𝖈𝖊𝖓𝖆𝖗𝖞 𝕮𝖔𝖗𝖕 • 𝟐𝟎𝟐𝟒🥇

NinjaRG9

.

Jason Voorhees

𝕸𝖊𝖗𝖈𝖊𝖓𝖆𝖗𝖞 𝕮𝖔𝖗𝖕 • 𝟐𝟎𝟐𝟒🥇

tansel

i love vasiliy stepanov

unknownincel

"Do not go gentle into that good night"

Jason Voorhees

𝕸𝖊𝖗𝖈𝖊𝖓𝖆𝖗𝖞 𝕮𝖔𝖗𝖕 • 𝟐𝟎𝟐𝟒🥇

ER1887

Certified ND autist

unstable

Saintmaxxer loves all forumers

mcmentalonthemic

WIKID

Jason Voorhees

𝕸𝖊𝖗𝖈𝖊𝖓𝖆𝖗𝖞 𝕮𝖔𝖗𝖕 • 𝟐𝟎𝟐𝟒🥇

Jason Voorhees

𝕸𝖊𝖗𝖈𝖊𝖓𝖆𝖗𝖞 𝕮𝖔𝖗𝖕 • 𝟐𝟎𝟐𝟒🥇

Jason Voorhees

𝕸𝖊𝖗𝖈𝖊𝖓𝖆𝖗𝖞 𝕮𝖔𝖗𝖕 • 𝟐𝟎𝟐𝟒🥇

unstable

Saintmaxxer loves all forumers

SharpOrange

lifelong KHHV oldcel

SharpOrange

lifelong KHHV oldcel

Jason Voorhees

𝕸𝖊𝖗𝖈𝖊𝖓𝖆𝖗𝖞 𝕮𝖔𝖗𝖕 • 𝟐𝟎𝟐𝟒🥇

mcmentalonthemic

WIKID

Jason Voorhees

𝕸𝖊𝖗𝖈𝖊𝖓𝖆𝖗𝖞 𝕮𝖔𝖗𝖕 • 𝟐𝟎𝟐𝟒🥇

elliottttt

Bronze

unstable

Saintmaxxer loves all forumers

hyperbeast

6'2, HMtn chasing MHtn. Known IOI farmer.

birthdefect

Kraken

Sayori

ascend or die

jeoyw9192

Kraken

childishkillah

Superior specimen

Jason Voorhees

𝕸𝖊𝖗𝖈𝖊𝖓𝖆𝖗𝖞 𝕮𝖔𝖗𝖕 • 𝟐𝟎𝟐𝟒🥇

Jason Voorhees

𝕸𝖊𝖗𝖈𝖊𝖓𝖆𝖗𝖞 𝕮𝖔𝖗𝖕 • 𝟐𝟎𝟐𝟒🥇

Jason Voorhees

𝕸𝖊𝖗𝖈𝖊𝖓𝖆𝖗𝖞 𝕮𝖔𝖗𝖕 • 𝟐𝟎𝟐𝟒🥇

birthdefect

Kraken

NinjaRG9

.

jeoyw9192

Kraken

johnnyapple

Iron

Defeating Nondeterminism in LLM Inference

Jason Voorhees

𝕸𝖊𝖗𝖈𝖊𝖓𝖆𝖗𝖞 𝕮𝖔𝖗𝖕 • 𝟐𝟎𝟐𝟒🥇

Defeating Nondeterminism in LLM Inference

Jason Voorhees

𝕸𝖊𝖗𝖈𝖊𝖓𝖆𝖗𝖞 𝕮𝖔𝖗𝖕 • 𝟐𝟎𝟐𝟒🥇

Gomez

under 6'2 = manlet

Glorious King

Trading my sanity for ascension

Jason Voorhees

𝕸𝖊𝖗𝖈𝖊𝖓𝖆𝖗𝖞 𝕮𝖔𝖗𝖕 • 𝟐𝟎𝟐𝟒🥇

Glorious King