Skip to content

Commit

Permalink
Sync develop branch
Browse files Browse the repository at this point in the history
  • Loading branch information
samjwu committed Sep 27, 2024
2 parents 4943ec5 + 4795998 commit 6c272d3
Show file tree
Hide file tree
Showing 3 changed files with 55 additions and 10 deletions.
16 changes: 8 additions & 8 deletions docs/conf.py
Original file line number Diff line number Diff line change
Expand Up @@ -5,12 +5,12 @@
# https://www.sphinx-doc.org/en/master/usage/configuration.html

# ROCm version numbers
rocm_version = '6.2.1'
rocm_multi_versions = '6.2.1 6.2' # in 6.2, the folder names on repo.radeon.com use 6.2 for minor releases
rocm_multi_versions_package_versions = '6.2.1 6.2.0' # however, in multi, the packages use 6.2.0
rocm_directory_version = '6.2.1' # in 6.0 rocm was located in /opt/rocm-6.0.0
amdgpu_version = '6.2.1' # directory in https://repo.radeon.com/rocm/apt/ and https://repo.radeon.com/amdgpu-install/
amdgpu_install_version = '6.2.60201-1' # version in https://repo.radeon.com/amdgpu-install/6.0.2/ubuntu/jammy/
rocm_version = '6.2.2'
rocm_multi_versions = '6.2.2 6.2.1' # in 6.2, the folder names on repo.radeon.com use 6.2 for minor releases
rocm_multi_versions_package_versions = '6.2.2 6.2.1' # however, in multi, the packages use 6.2.0
rocm_directory_version = '6.2.2' # in 6.0 rocm was located in /opt/rocm-6.0.0
amdgpu_version = '6.2.2' # directory in https://repo.radeon.com/rocm/apt/ and https://repo.radeon.com/amdgpu-install/
amdgpu_install_version = '6.2.60202-1' # version in https://repo.radeon.com/amdgpu-install/6.0.2/ubuntu/jammy/


latex_engine = "xelatex"
Expand All @@ -26,8 +26,8 @@
project = "ROCm installation on Linux"
author = "Advanced Micro Devices, Inc."
copyright = "Copyright (c) 2024 Advanced Micro Devices, Inc. All rights reserved."
version = "6.2.1"
release = "6.2.1"
version = "6.2.2"
release = "6.2.2"
setting_all_article_info = True
all_article_info_os = ["linux"]
all_article_info_author = ""
Expand Down
47 changes: 45 additions & 2 deletions docs/install/3rd-party/pytorch-install.rst
Original file line number Diff line number Diff line change
Expand Up @@ -64,8 +64,7 @@ Using a Docker image with PyTorch pre-installed
.. note::

This will automatically download the image if it does not exist on the host. You can also pass the ``-v``
argument to mount any data directories from the host onto the container.
This will automatically download the image if it does not exist on the host. You can also pass the ``-v`` argument to mount any data directories from the host onto the container.

.. _install_pytorch_wheels:
.. _using-wheels-package:
Expand Down Expand Up @@ -502,3 +501,47 @@ ImageNet PyTorch example
.. code-block:: bash
python3 main.py
.. _troubleshooting-pytorch:

Troubleshooting
===============

* What to do if you get the following error when trying to run PyTorch:

.. code-block:: shell
hipErrorNoBinaryForGPU: Unable to find code object for all current devices!
The error denotes that the installation of PyTorch and/or other dependencies or libraries do not support the current GPU. To workaround this issue, use the following steps:

1. Confirm that the hardware supports the ROCm stack. Refer to :ref:`linux-support` and :ref:`windows-support`.

2. Determine the gfx target.

.. code-block:: shell
rocminfo | grep gfx
3. Check if PyTorch is compiled with the correct gfx target.

.. code-block:: shell
TORCHDIR=$( dirname $( python3 -c 'import torch; print(torch.__file__)' ) )
roc-obj-ls -v $TORCHDIR/lib/libtorch_hip.so # check for gfx target
.. note::
Recompile PyTorch with the right gfx target if compiling from the source if
the hardware is not supported.

* What if you are unable to access Docker or GPU in user accounts?

Ensure that the user is added to docker, video, and render Linux groups as described in :ref:`group_permissions`.

* Can you install PyTorch directly on bare metal?

Bare-metal installation of PyTorch is supported through wheels. For more information, see :ref:`using-wheels-package`.

* How do you profile PyTorch workloads?

Use the PyTorch Profiler as described in :ref:`mi300x-pytorch-profiler` to profile GPU kernels on ROCm.
2 changes: 2 additions & 0 deletions docs/install/prerequisites.rst
Original file line number Diff line number Diff line change
Expand Up @@ -190,6 +190,8 @@ To install for the currently active kernel run the command corresponding to your
sudo zypper install kernel-default-devel
.. _group_permissions:

Setting permissions for groups
================================================================

Expand Down

0 comments on commit 6c272d3

Please sign in to comment.