I did advanced mathematics and chose physics as one of my elective subjects in school. Nowadays, I do a lot of work based around analytics and forecasting.
“We need to find the average of this.”
“That’s easy. I’ll do some more advanced stuff to really dial in the accuracy.”
“Awesome. What’s the timeframe?”
looks at million row dataset “To find the average? Like a month. Some of these numbers are mispelled words… Why are all these blank?”
“Oh, you’ll have to read this 45 page document that outlines the default values.”
And that’s how roffice maths works. Lots and lots of if conditions, query merges, and meetings with other teams trying to understand why they entered in the thing they entered. By the time the data wrangling phase is complete, you give zero fucks about doing more than supplying the average.
Oh, sorry the 45 page document is for something else. The only person who understands this dataset is Dave and he was made redundant 5 years ago. Anyway, can you get this done today?
Yup this is every job now. Wrangling numbers. The actual job or calculation could be done in days if less. But dealing with dirty information and playing detective which isnt even part of it is the sink hole of every job right now.
If Timmy has 45 pages to read on a bus traveling an average speed of 35 mph with an mean stop distance being 0.7 kms how many stops will Timmy pass before this fucking meeting ends ?
That’s software development for you. Why is that weird value there? Because some guy, at some point, had checked for that and somehow it’s still relevant.
I know of a system that churns through literally millions of transactions representing millions of Euros every day, and their interface has load bearing typos (because Germans in the 90s were really bad at the Englishs).
Geez, that reminds me of a former colleague that, when asked for “the numbers,” would just send screenshots of tables in the ERP system instead of exporting them to a spreadsheet. What’s even worse, usually a lot of values were plain wrong, on one occasion more than half of them.
I did advanced mathematics and chose physics as one of my elective subjects in school. Nowadays, I do a lot of work based around analytics and forecasting.
“We need to find the average of this.”
“That’s easy. I’ll do some more advanced stuff to really dial in the accuracy.”
“Awesome. What’s the timeframe?”
looks at million row dataset “To find the average? Like a month. Some of these numbers are mispelled words… Why are all these blank?”
“Oh, you’ll have to read this 45 page document that outlines the default values.”
And that’s how roffice maths works. Lots and lots of if conditions, query merges, and meetings with other teams trying to understand why they entered in the thing they entered. By the time the data wrangling phase is complete, you give zero fucks about doing more than supplying the average.
Oh, sorry the 45 page document is for something else. The only person who understands this dataset is Dave and he was made redundant 5 years ago. Anyway, can you get this done today?
Dang, I was really hoping this would be one of those stories that goes like:
“How long will that take?”
“It’s a lot of data…like a month?” (But I actually wrote a Python script that compiles and formats it perfectly in like 5 minutes.)
“You’re such a hard worker!”
Shhhh
Yup this is every job now. Wrangling numbers. The actual job or calculation could be done in days if less. But dealing with dirty information and playing detective which isnt even part of it is the sink hole of every job right now.
Is this why chatgpt has a chance at optimizing work? Because it will filter out boring mistakes for you
Lol, I have still written python scripts to deal with them all.
I miss those days…
If Timmy has 45 pages to read on a bus traveling an average speed of 35 mph with an mean stop distance being 0.7 kms how many stops will Timmy pass before this fucking meeting ends ?
Why do I need to know what Timmy is up to, and how much he is reading?
That’s software development for you. Why is that weird value there? Because some guy, at some point, had checked for that and somehow it’s still relevant.
I know of a system that churns through literally millions of transactions representing millions of Euros every day, and their interface has load bearing typos (because Germans in the 90s were really bad at the Englishs).
Geez, that reminds me of a former colleague that, when asked for “the numbers,” would just send screenshots of tables in the ERP system instead of exporting them to a spreadsheet. What’s even worse, usually a lot of values were plain wrong, on one occasion more than half of them.
I think the solution is 42
What is the advanced stuff you can do if you don’t have garbage data?