Reinforcement learning (RL)
===========================

.. currentmodule:: tunix

.. autosummary::

    GRPOConfig
    GRPOLearner
    RewardFn

    PPOConfig
    PPOLearner

    ClusterConfig
    RLCluster
    RLTrainingConfig
    Role
    RolloutConfig

-------

.. autoclass:: GRPOConfig

-------

.. autoclass:: GRPOLearner

-------

.. autoclass:: RewardFn

-------

.. autoclass:: PPOConfig

-------

.. autoclass:: PPOLearner

-------

.. autoclass:: ClusterConfig

-------

.. autoclass:: RLCluster

-------

.. autoclass:: RLTrainingConfig

-------

.. autoclass:: Role

-------

.. autoclass:: RolloutConfig

