It should be independent on any policy.

Curation: Reward Model / Verifier