It should be independent on any policy.
Curation: Reward Model / Verifier