pull down to refresh
94 sats \ 7 replies \ @noknees 16 Jun \ on: My bad experiences using AI as a physicist science
A LOT!
So I was making this somewhat basic java springboot app and I told ChatGPT to review it.
This app is just supposed to send a Hello message as you can see the GreetingService class.
and when ChatGPT gave the output, it was a 200+ lines of code, wrapped in error handling blocks, 20+ variables (whereas mine has only 1) and things I didn't even learn, just to make it "ERROR FREE"
Like wtf man! I asked you check whether it works or not and you just gave me a whole thesis on it.
Then again one day, I asked him another science question, and it was pretty simple, how to prepare fresh ferrous sulphate because apparantly I could find how to make ferrous sulphate but not "fresh" on the internet.
And he replied with a Flowchart of how to create ferrous sulphate through pyrites. So I straightaway asked my chem teacher and he said that it's a reaction of iron and sulphuric acid filtered later and rebuked for not studying basics thoroughly.
But it wasnt MY fault! I WAS MISLEAD!
Can you brief about it? I might understand something :)
Or is it top-secret?
Can you brief about it? I might understand something :) Or is it top-secret?
Will submit it to the editor this week or next week, hopefully.
I can send you the paper once it's published. For now, I prefer not doxxing myself too much by being detailed about the field that I work in. Even though some people here already more than what is good about me to remain anon~~
Currently I'm pivoting my setup
from:
Synchronous: letting the LLM run unstructured with different models in a pipeline
to:
Asynchronous:
- LLM generates code or human writes it - doesn't matter - and uploads to repo
- Issue detection:
- linting logs one issue per error found
- If none found, LLM can analyze and create issue for the most significant issue - I specifically make the prompt with instruct repeatedly to only report the most significant issue. Works ok with LRM
- Users can of course add issues too, LLM analyzes if its a one-shot or if it needs breakdown
 
- Coding LLM can ingest issue and fix it with a pull req
- Pull req can get reviewed by LRM or human
- Human merges
Everything that can be done with code, like linting, does not use LLMs.
reply
Damn, I just wanted the AI to check if my plant was alive and it built a greenhouse with a self-watering system and an AI-powered scarecrow.
I swear, sometimes ChatGPT doesn’t review code — it rewrites it like it's auditioning for a job at NASA. Like bro, I’m still trying to survive public static void main, not orchestrate microservices across a Kubernetes cluster.
Same thing with chemistry — I asked for a fresh ferrous sulphate recipe and got a mining operation flowchart straight outta a metallurgy PhD thesis. Asked my chem teacher and he just said “use Fe + H₂SO₄ and move on.”
It's like these LLMs read Thus Spoke Zarathustra and thought every answer must ascend the mountain of abstraction before descending to meet us mortals.
“He who climbs upon the highest mountains laughs at all tragedies, real or imagined." — Nietzsche (Clearly what GPT thinks before it answers a 4-mark question.)
But fr tho, loving that async pivot you're on @optimism. Turning LLMs from noisy sidekicks into focused bug-hunters with issue-detection filtering? That’s pretty GOOD
Don't worry I won't steal your repo, I'm building a Human Behaviour Prediction Engine too, https://github.com/axelvyrn/TiresiasIQ (and it's quite good, believe me - i'd like your input)
Also, curious: How are you ranking issue significance without it hallucinating a crisis over a missing semicolon?
reply
How are you ranking issue significance
It doesn't matter. Every task should be small, or otherwise needs breakdown.
without it hallucinating a crisis over a missing semicolon?
It's harder to make it "just fix a semicolon", so in that case using non-llm tools is better, or at least expose the tools needed to the LLM through MCP. Syntax fixing can be done with existing tools, so in this case you just expose an MCP tool, ie: 
code_fixing::correct_semicolons(files[]) that implements the syntactical logic in code, without needing the LLM to actually write correct code.