Discussion about this post

Iioo

"LLM training regimes are designed to explicitly make it really hard for an LLM to ignore the system prompt or get around it in some way"

First time I've ever heard of this! Is there any paper describing how that works?
