Mine tries to lie whenever it doesn’t know something. When I call it out and say that’s a lie, it just says “you are absolutely correct.” tf.

I was reading up on sleeper agents placed inside local LLMs, and this is increasing the chance I’ll delete it forever. Which is a shame, because it is the new search engine, seeing how they ruined search engines.

  • rozodru@piefed.social
    19 hours ago

    Thank you. You’re 100% spot on.

    In my day-to-day consulting job I deal directly with LLMs, and more specifically Claude, since most of my clients ended up going with Claude/Claude Code. You pretty much described Claude to a T.

    What companies that leveraged Claude Code (CC) for end-to-end builds found is that it would constantly claim something was complete or functioning when it simply hadn’t done it. Or, more commonly, it would simply leave a “#TODO” for whatever feature/function and then claim it was complete. Naturally, a vibe coder or anyone else didn’t know any better, and when it came time to push said project to production…womp womp, it was actually nowhere near done.

    So I wouldn’t say Claude lies. Sure, it gives off the impression that it lies…a lot…but I’d just say it’s “lazy,” or more accurately, it consistently looks for “shortcuts” to reach its solution. Even outside of coding, if you just ask it for a walkthrough or tutorial on, say, how to fix something, it will routinely tell you to skip or ignore steps in order to get to the solution, regardless of the fact that skipping those steps may impact other things.

    Out of all the LLMs I’ve dealt with, yes, Claude acts as if it’s trying to speedrun a solution.