Improving instruction hierarchy in frontier LLMs

3 views0 likes0 comments

Originally published byOpenAI Blog

IH-Challenge trains models to prioritize trusted instructions, improving instruction hierarchy, safety steerability, and resistance to prompt injection attacks.

Comments (0)

Be the first to comment!

🇺🇸

United States

NORTH AMERICA

More news from United States