rule-based reward model