Skip to content

Conversation

@orangeH25
Copy link
Contributor

This PR Add ROLL Docs with Ascend NPU.

ROLL: An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

@orangeH25
Copy link
Contributor Author

Hi @fffrog @noemotiovon @lowdy1, please take a look, thanks!

${custom_env.FrozenLake}
FrozenLakeThink:
${custom_env.FrozenLakeThink}

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In my opinion, it would be better to make as few changes as possible to the official configuration file. Some parameters should remain as they were.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Aside from the required backend adaptations, I only adjusted max_steps, batch sizes and related size parameters. These changes are only for the quick-start example—the goal is simply to make the pipeline runnable on common hardware and finish quickly.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Maybe that is acceptable. Also, please note that Ascend does not currently support colocate mode, and the device_mapping should be updated accordingly.

Copy link
Contributor

@noemotiovon noemotiovon left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM.

请根据已有昇腾产品型号及 CPU 架构等按照 :doc:`快速安装昇腾环境指引 <../ascend/quick_install>` 进行昇腾环境安装。

.. warning::
CANN 最低版本为 8.2.RC1,安装 CANN 时,请同时安装 Kernel 算子包以及 nnal 加速库软件包。
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is this limitation from vLLM-Ascend?


RuntimeError: CMake configuration failed: Command '['/pathto/miniconda3/envs/roll/bin/python3.10', '-m', 'pybind11', '--cmake']' returned non-zero exit status 2.

可尝试在 vllm-ascend 目录下 setup.py 文件 151-158 行进行如下修改并重新进行编译:
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Currently, installing vLLM version 0.8.4 is a bit complicated. Later, once version 0.11 is supported, we’ll also update the documentation.

- DeepSpeed
- vLLM
- Atlas 900 A2 PODc
* - RLVR
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please check the VLM script when you have time.

Copy link
Contributor

@fffrog fffrog left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@fffrog fffrog merged commit c43a3cf into Ascend:main Nov 25, 2025
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants