Introduction:

The “src” folder contains the code for all experiments in the paper, although running it may take some effort. Sorry that the code is not organized more systematically. However, since the code is based on this repo and the official DPO repo, and the experiments themselves are easy to understand, it should be straightforward to write your own version using your favorite framework.

The training and probing data:

The training and evaluation pipeline:

Other explanations of the code: