Group‑Relative Policy Optimization