There are likely only so many patterns that can exist and still be usable as a shareable and human reversible email address. Maybe one could have the llm identify these edge cases and write regex/code to parse future instances to convert back to a usable format.
Yes, of course- my point was that the LLM could gradually write functions to parse most of the clever patterns as it comes across them. Use the LLM as a trapdoor when nothing else matches, and in the same text generation create a script to parse the same input next time.
I agree that the LLM is more versatile and retract what I wrote earlier saying that all patterns could be accounted for. I should have said most easy to figure out patterns.