Skip to content

Llm train fp16

Llm train fp16 #78595

Triggered via pull request December 17, 2024 08:07
@ShawnXuanShawnXuan
review_requested #10603
llm_train_fp16
Status Failure
Total duration 43m 42s
Artifacts

test.yml

on: pull_request
Find build cache
27s
Find build cache
Collect information about PR and source
16s
Collect information about PR and source
Mirror third party dependencies
1m 1s
Mirror third party dependencies
License and format
1m 9s
License and format
Static analysis with clang on diff
43m 31s
Static analysis with clang on diff
Matrix: Build OneFlow
Find test cache
9s
Find test cache
Find test cache (distributed)
0s
Find test cache (distributed)
Matrix: Test suite
Matrix: Distributed test suite
Fit to window
Zoom out
Zoom in

Annotations

13 errors and 6 warnings
Find build cache
RPC failed; HTTP 503 curl 22 The requested URL returned error: 503
Find build cache
expected 'packfile'
License and format
Process completed with exit code 1.
License and format
Process completed with exit code 1.
License and format
Process completed with exit code 1.
Test suite (cpu-module)
Process completed with exit code 2.
Test suite (cuda-speed-test)
The job was canceled because "cpu-module" failed.
Test suite (cuda-misc)
The job was canceled because "cpu-module" failed.
Test suite (cuda-misc)
The operation was canceled.
Test suite (cuda-module)
The job was canceled because "cpu-module" failed.
Test suite (cuda-module)
The operation was canceled.
Test suite (cpu-misc)
The job was canceled because "cpu-module" failed.
Test suite (cpu-misc)
The operation was canceled.
Static analysis with clang on diff
Your workflow is using a version of actions/cache that is scheduled for deprecation, actions/cache@v2. Please update your workflow to use the latest version of actions/cache to avoid interruptions. Learn more: https://github.blog/changelog/2024-09-16-notice-of-upcoming-deprecations-and-changes-in-github-actions-services/
Find build cache
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
Build OneFlow (cpu-tsan)
Failed to download action 'https://api.github.com/repos/Oneflow-Inc/get-oneflow/tarball/0a72f25120eff941063c61b4d2755df3a3f07da9'. Error: The request was canceled due to the configured HttpClient.Timeout of 100 seconds elapsing.
Build OneFlow (cpu-tsan)
Back off 17.679 seconds before retry.
Build OneFlow (cpu)
docker-run-use-lld not supported for now
Find test cache
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636