AI Models from Google, OpenAI, Anthropic Solve 0% of ‘Hard’ Coding Problems

cm0002@lemmy.world · 3 months ago

AI Models from Google, OpenAI, Anthropic Solve 0% of ‘Hard’ Coding Problems

Endmaker@ani.social · edit-2 3 months ago

In the ‘Medium’ difficulty category, OpenAI’s o4-mini-high model scored the highest at 53.5%.

This fits my observation of such models. o4-mini-high is able to help me with 80-90% of the problems at work. For the remaining problems, it would come up with a nonsensical solution and no matter how much I prompt it, it would tunnel-vision on that specific approach. It could never second guess itself and realise that its initial solution is completely off the mark, and try an entirely differently approach. That’s where I usually step in and do the work myself.

It still saves me time with the trivial stuff though.

I can’t say the same for the rest of the LLMs. They are simply no good at coding and just waste my time.

atzanteol@sh.itjust.works · 3 months ago

The claims that AI will be surpassing humans in programming are pretty ridiculous. But let’s be honest - most programming is rather mundane.

Ledivin@lemmy.world · 3 months ago

My productivity has at least tripled since I started using Cursor. People are actually underestimating the effects that AI will have in the industry

PushButton@lemmy.world · 3 months ago

It means the AI is very helpful to you. This also means you are as good as 1/3 of an AI in coding skills…

Which is not a great news for you mate.

atzanteol@sh.itjust.works · 3 months ago

Ah knock it off. Jesus you sound like people in the '90s mocking “intellisense” in the IDE as somehow making programmers “less real programmers”.

It’s all needless gatekeeping and purity test BS. Use tools that are useful. Don’t worry if it makes you less of a man.

Feyd@programming.dev · 3 months ago

It’s not gate keeping it is true. I know devs that say ai tools are useful but all the ones that say it makes them multiples more productive are actually doing negative work because I have to deal with their terrible code they don’t even understand.

Outsider9042@lemmynsfw.com · 3 months ago

About all they are good for is generating boilerplate code. Just far less efficiently than a snippet library.

atzanteol@sh.itjust.works · 3 months ago

Just far less efficiently than a snippet library.

Your snippet library can convert a large JSON file to a Java class using Java property naming conventions and including annotations for Jackson where the names differ from the JSON?

Glitchvid@lemmy.world · 3 months ago

https://transform.tools/json-to-java

technocrit@lemmy.dbzer0.com · edit-2 3 months ago

Thanks for posting. People really out here believing that the most basic scripts are Johnny 5.

atzanteol@sh.itjust.works · 3 months ago

I mean - the AI models do a lot more than just that one task you know. Or are you a “real programmer” who doesn’t need AI tools because “real men” don’t need help?

Outsider9042@lemmynsfw.com · 3 months ago

The keyword is efficiently.

I’m not anti ai. I’m anti burning down the down planet to solve problems that were already solved years ago with a few scripts.

django@discuss.tchncs.de · edit-2 2 months ago

Also: correctly, without inventing or garbling data.

Using a llm for this task will require intensive proof-reading, as it might have changed the data somewhere.

daniskarma@lemmy.dbzer0.com · 3 months ago

They have their uses. For instance the other day I needed to read some assembly and decompiled C, you know how fun that can be. LLM proved quite good at translating it to english. And really speed up the process.

Writing it back wasn’t that good though, just good enough to point in a direction but I still ended up writing the patcher mostly by myself.

Lemminary@lemmy.world · 3 months ago

the other day I needed to read some assembly and decompiled C

As one casually does lol Jokes aside, that’s pretty cool. I wish I had the technical know-how and, most importantly, the patience for it.

FizzyOrange@programming.dev · edit-2 3 months ago

Assembly is very simple (at least RISC-V assembly is which I mostly work with) but also very tedious to read. It doesn’t help that the people who choose the instruction mnemonics have extremely poor taste - e.g. lb, lh, lw, ld instead of load8, load16, load32, load64. Or j instead of jump. Who needs to save characters that much?

The over-abbreviation is some kind of weird flaw that hardware guys all have. I wondered if it comes from labelling pins on PCB silkscreens (MISO, CLK etc)… Or maybe they just have bad taste.

I once worked on a chip that had nested acronyms.

amorpheus@lemmy.world · 3 months ago

Who needs to save characters that much?

Do you realize how old assembly language is?

It predates hard disks by ten years and coincided with the invention of the transistor.

AI Models from Google, OpenAI, Anthropic Solve 0% of ‘Hard’ Coding Problems

AI Models from Google, OpenAI, Anthropic Solve 0% of ‘Hard’ Coding Problems

AI Models from Google, OpenAI, Anthropic Solve 0% of ‘Hard’ Coding Problems | AIM