Hacker News | nialse's comments

To be fair, there is likely not much training data on the difficult conversations you need to handle in a senior position, pushback being one of them. The trouble for the agents is that it happens post hoc: they explain themselves and rationalise after the fact, rather than asking "help me understand" beforehand.


I think allowing role play is a fair perspective, and it's useful too, when explicit. It does not really make sense for AI to cosplay as human all the time, though.


The other day Codex on Mac gained the ability to control the UI. Will it close itself if instructed though? Maybe test that and make a benchmark. Closebench.


My point was more: will it stop the user from closing it?


Asking for code to manipulate the AST is another route. In Python it can do absolute magic.
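
For instance, here is a minimal sketch of the kind of transform I mean, using Python's ast module (the function name and the rename are made up for illustration, not from any real project):

    import ast

    source = "def old_name(x):\n    return x + 1\n\nprint(old_name(41))\n"

    class Renamer(ast.NodeTransformer):
        # Rename the function definition and every bare reference to it.
        def visit_FunctionDef(self, node):
            if node.name == "old_name":
                node.name = "new_name"
            self.generic_visit(node)  # recurse into the function body
            return node

        def visit_Name(self, node):
            if node.id == "old_name":
                node.id = "new_name"
            return node

    tree = Renamer().visit(ast.parse(source))
    print(ast.unparse(tree))  # Python 3.9+: prints the rewritten module

An agent can generate a script like this instead of hand-editing every call site, which is where the "magic" comes in.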


Glad to see others have discovered this! It's mind-boggling - the agent can do sheer wizardry.


Interesting. When I code, I want a boring tool that just does the work. A hammer. I think we agree that the tool should complete the assignment reliably, without skipping parts or turning an entirely implementable task into a discussion, though.


Sometimes I actually do want a discussion, and Claude just goes ahead without saying a word and implements it, which then has to be reverted.

We obviously have different expectations for the behavior of coding agents, so options to set the social behavior will become important.


I see your point. Many of my prompts for reasoning end with: No code. Planning mode is sort of the workaround for this specific situation. Sometimes it is useful for the AI agent just to think. It looks like I need a screwdriver in addition to the aforementioned hammer, a Pozidriv screwdriver to be precise.


That is probably the next step, and in practice it is much of what sub-agents already provide: a kind of tabula rasa. Context is not always an advantage. Sometimes it becomes the problem.

In long editing sessions with multiple iterations, the context can accumulate stale information, and that actively hurts model performance. Compaction is one way to deal with that. It strips out material that should be re-read from disk instead of being carried forward.
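
To make the idea concrete, here is a toy sketch in Python; it is not how Codex or Claude Code actually compact their context, just an illustration of replacing stale file snapshots with a note to re-read them from disk:

    def compact(messages):
        # messages: dicts like {"role": ..., "kind": ..., "path": ..., "text": ...}
        compacted = []
        for msg in messages:
            if msg.get("kind") == "file_snapshot":
                # Drop the stale contents; keep a pointer so the agent re-reads the file.
                compacted.append({
                    "role": msg["role"],
                    "kind": "note",
                    "text": "[" + msg["path"] + " omitted; re-read from disk before editing]",
                })
            else:
                compacted.append(msg)
        return compacted

    history = [
        {"role": "user", "kind": "text", "text": "Refactor utils.py"},
        {"role": "tool", "kind": "file_snapshot", "path": "utils.py", "text": "def f(): ..."},
    ]
    print(compact(history))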

A concrete example is iterative file editing with Codex. I rewrite parts of a file so they actually work and match the project’s style. Then Codex changes the code back to the version still sitting in its context. It does not stop to consider that, if an external edit was made, that edit is probably important.


I have the same experience of it reversing intentional changes I've made, but with Claude Code. I find that committing a change that I want to version control seems to stop that behaviour.

Long context as a disadvantage is pretty well discussed, and agent-native compaction has been inferior to having the agent intentionally build the documentation that I want it to use. So far this has been my LLM-coding superpower. There are also a few products whose entire purpose is to provide structure that overcomes compaction shortcomings.

When Geoff Huntley said that Claude Code's "Ralph loop" didn't meet his standards ("this aint it"), the major bone of contention, as far as I can see, was that it ran subagents in a loop inside Claude Code with native compaction, as opposed to starting from a completely empty context.

I do see hints that improving compaction is a major area of work for agent-makers. I'm not certain where my advantage goes at that point.


Agreed. I am asking for something beyond the current state of the art. My guess is that stronger RL on the model side, together with better harness support, will eventually make it possible. However, it's the part about framing the failure to complete a task as a communication mishap that really bothers me.


It's not necessarily "ignoring" instructions, it's the ironic effect of mentioning something not to focus on, which produces focus on said thing. The classic version is: "For the next minute, try not to think about a pink elephant. You can think about anything else you like, just not a pink elephant."

https://en.wikipedia.org/wiki/Ironic_process_theory


Yes, exactly. But for LLMs it's more that it's not really "thinking" about what it's saying per se; it's predicting the next token. Sure, in a super fancy way, but still predicting the next token. Context poisoning is real.


Agreed. We should not be anthropomorphising LLMs or having them mimic humans.


It's inherent in the way LLMs are built, from human-written texts, that they mimic humans. They have to. They're not solving problems from first principles.


Maybe we should change that? Of course symbolic AI was the holy grail until statistical AI came in and swept the floor. Maybe something else though.


I have the unformed idea that providing a structured interface for the human user on top of the chat interface for the AI, so that the human is not chatting back and forth, could be effective? At least for things that have a structure.


They ingest text written in first and third person and regurgitate in first person only, right?


Codex in this case. I didn't even think about mentioning it. I'll update the post if it's actually relevant. Which I guess it is.

EDIT: It's specifically GPT-5.4 High in the Codex harness.


Weird, for me it was too un-human at first, taking everything literally even if it doesn't make sense; I started being more precise with prompting, to the point where it felt like "metaprogramming in English".

Claude, on the other hand, was exactly as described in the article.


Also share the exact model/version if you haven't already.

