This is the official repository for Generative Judge for Evaluating Alignment. We develop Auto-J, a new open-source generative judge that can effectively evaluate different LLMs on how they align to ...