Skip to content

Commit

Permalink
feat: change default value of retry in scheduler (#226)
Browse files Browse the repository at this point in the history
Signed-off-by: Gaius <[email protected]>
  • Loading branch information
gaius-qi authored Jan 11, 2024
1 parent f84408e commit bd55ced
Show file tree
Hide file tree
Showing 3 changed files with 30 additions and 28 deletions.
18 changes: 10 additions & 8 deletions charts/dragonfly/Chart.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -3,7 +3,7 @@ name: dragonfly
description: Dragonfly is an intelligent P2P based image and file distribution system
icon: https://raw.githubusercontent.com/dragonflyoss/Dragonfly2/main/docs/images/logo/dragonfly.svg
type: application
version: 1.1.22
version: 1.1.23
appVersion: 2.1.29
keywords:
- dragonfly
Expand All @@ -26,7 +26,9 @@ sources:

annotations:
artifacthub.io/changes: |
- Remove advertisePort in manager config.
- Change default value of retry in scheduler
- Update dragonfly image tag to v2.1.29.
- Update client image tag to v0.1.12.
artifacthub.io/links: |
- name: Chart Source
Expand All @@ -35,17 +37,17 @@ annotations:
url: https://github.com/dragonflyoss/Dragonfly2
artifacthub.io/images: |
- name: manager
image: dragonflyoss/manager:v2.1.28
image: dragonflyoss/manager:v2.1.29
- name: scheduler
image: dragonflyoss/scheduler:v2.1.28
image: dragonflyoss/scheduler:v2.1.29
- name: client
image: dragonflyoss/client:v0.1.10
image: dragonflyoss/client:v0.1.12
- name: seed-client
image: dragonflyoss/client:v0.1.10
image: dragonflyoss/client:v0.1.12
- name: dfdaemon
image: dragonflyoss/dfdaemon:v2.1.28
image: dragonflyoss/dfdaemon:v2.1.29
- name: trainer
image: dragonflyoss/scheduler:v2.1.28
image: dragonflyoss/scheduler:v2.1.29
- name: triton
image: nvcr.io/nvidia/tritonserver:23.06-py3
Expand Down
20 changes: 10 additions & 10 deletions charts/dragonfly/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -184,7 +184,7 @@ helm delete dragonfly --namespace dragonfly-system
| client.replicas | int | `3` | Number of Pods to launch. |
| client.resources | object | `{"limits":{"cpu":"2","memory":"4Gi"},"requests":{"cpu":"0","memory":"0"}}` | Pod resource requests and limits. |
| client.statefulsetAnnotations | object | `{}` | Statefulset annotations. |
| client.tag | string | `"v0.1.10"` | Image tag. |
| client.tag | string | `"v0.1.12"` | Image tag. |
| client.terminationGracePeriodSeconds | string | `nil` | Pod terminationGracePeriodSeconds. |
| client.tolerations | list | `[]` | List of node taints to tolerate. |
| clusterDomain | string | `"cluster.local"` | Install application cluster domain. |
Expand Down Expand Up @@ -306,7 +306,7 @@ helm delete dragonfly --namespace dragonfly-system
| dfdaemon.pullPolicy | string | `"IfNotPresent"` | Image pull policy. |
| dfdaemon.pullSecrets | list | `[]` (defaults to global.imagePullSecrets). | Image pull secrets. |
| dfdaemon.resources | object | `{"limits":{"cpu":"2","memory":"2Gi"},"requests":{"cpu":"0","memory":"0"}}` | Pod resource requests and limits. |
| dfdaemon.tag | string | `"v2.1.28"` | Image tag. |
| dfdaemon.tag | string | `"v2.1.29"` | Image tag. |
| dfdaemon.terminationGracePeriodSeconds | string | `nil` | Pod terminationGracePeriodSeconds. |
| dfdaemon.tolerations | list | `[]` | List of node taints to tolerate. |
| externalManager.grpcPort | int | `65003` | External GRPC service port. |
Expand Down Expand Up @@ -416,7 +416,7 @@ helm delete dragonfly --namespace dragonfly-system
| manager.service.annotations | object | `{}` | Service annotations. |
| manager.service.labels | object | `{}` | Service labels. |
| manager.service.type | string | `"ClusterIP"` | Service type. |
| manager.tag | string | `"v2.1.28"` | Image tag. |
| manager.tag | string | `"v2.1.29"` | Image tag. |
| manager.terminationGracePeriodSeconds | string | `nil` | Pod terminationGracePeriodSeconds. |
| manager.tolerations | list | `[]` | List of node taints to tolerate. |
| mysql.auth.database | string | `"manager"` | Mysql database name. |
Expand Down Expand Up @@ -459,9 +459,9 @@ helm delete dragonfly --namespace dragonfly-system
| scheduler.config.scheduler.gc.peerTTL | string | `"24h"` | peerTTL is the ttl of peer. If the peer has been downloaded by other peers, then PeerTTL will be reset. |
| scheduler.config.scheduler.gc.pieceDownloadTimeout | string | `"30m"` | pieceDownloadTimeout is the timeout of downloading piece. |
| scheduler.config.scheduler.gc.taskGCInterval | string | `"30m"` | taskGCInterval is the interval of task gc. If all the peers have been reclaimed in the task, then the task will also be reclaimed. |
| scheduler.config.scheduler.retryBackToSourceLimit | int | `30` | retryBackToSourceLimit reaches the limit, then the peer back-to-source. |
| scheduler.config.scheduler.retryInterval | string | `"100ms"` | Retry scheduling interval. |
| scheduler.config.scheduler.retryLimit | int | `40` | Retry scheduling limit times. |
| scheduler.config.scheduler.retryBackToSourceLimit | int | `10` | retryBackToSourceLimit reaches the limit, then the peer back-to-source. |
| scheduler.config.scheduler.retryInterval | string | `"300ms"` | Retry scheduling interval. |
| scheduler.config.scheduler.retryLimit | int | `15` | Retry scheduling limit times. |
| scheduler.config.security.autoIssueCert | bool | `false` | AutoIssueCert indicates to issue client certificates for all grpc call. If AutoIssueCert is false, any other option in Security will be ignored. |
| scheduler.config.security.caCert | string | `""` | CACert is the root CA certificate for all grpc tls handshake, it can be path or PEM format string. |
| scheduler.config.security.certSpec.dnsNames | list | `["dragonfly-scheduler","dragonfly-scheduler.dragonfly-system.svc","dragonfly-scheduler.dragonfly-system.svc.cluster.local"]` | DNSNames is a list of dns names be set on the certificate. |
Expand Down Expand Up @@ -520,7 +520,7 @@ helm delete dragonfly --namespace dragonfly-system
| scheduler.service.labels | object | `{}` | Service labels. |
| scheduler.service.type | string | `"ClusterIP"` | Service type. |
| scheduler.statefulsetAnnotations | object | `{}` | Statefulset annotations. |
| scheduler.tag | string | `"v2.1.28"` | Image tag. |
| scheduler.tag | string | `"v2.1.29"` | Image tag. |
| scheduler.terminationGracePeriodSeconds | string | `nil` | Pod terminationGracePeriodSeconds. |
| scheduler.tolerations | list | `[]` | List of node taints to tolerate. |
| seedClient.config.download.concurrentPieceCount | int | `10` | concurrentPieceCount is the number of concurrent pieces to download. |
Expand Down Expand Up @@ -586,7 +586,7 @@ helm delete dragonfly --namespace dragonfly-system
| seedClient.replicas | int | `3` | Number of Pods to launch. |
| seedClient.resources | object | `{"limits":{"cpu":"2","memory":"4Gi"},"requests":{"cpu":"0","memory":"0"}}` | Pod resource requests and limits. |
| seedClient.statefulsetAnnotations | object | `{}` | Statefulset annotations. |
| seedClient.tag | string | `"v0.1.10"` | Image tag. |
| seedClient.tag | string | `"v0.1.12"` | Image tag. |
| seedClient.terminationGracePeriodSeconds | string | `nil` | Pod terminationGracePeriodSeconds. |
| seedClient.tolerations | list | `[]` | List of node taints to tolerate. |
| seedPeer.config.aliveTime | string | `"0s"` | Daemon alive time, when sets 0s, daemon will not auto exit, it is useful for longtime running. |
Expand Down Expand Up @@ -688,7 +688,7 @@ helm delete dragonfly --namespace dragonfly-system
| seedPeer.replicas | int | `3` | Number of Pods to launch. |
| seedPeer.resources | object | `{"limits":{"cpu":"2","memory":"4Gi"},"requests":{"cpu":"0","memory":"0"}}` | Pod resource requests and limits. |
| seedPeer.statefulsetAnnotations | object | `{}` | Statefulset annotations. |
| seedPeer.tag | string | `"v2.1.28"` | Image tag. |
| seedPeer.tag | string | `"v2.1.29"` | Image tag. |
| seedPeer.terminationGracePeriodSeconds | string | `nil` | Pod terminationGracePeriodSeconds. |
| seedPeer.tolerations | list | `[]` | List of node taints to tolerate. |
| trainer.config.console | bool | `false` | Console shows log on console. |
Expand Down Expand Up @@ -747,7 +747,7 @@ helm delete dragonfly --namespace dragonfly-system
| trainer.service.annotations | object | `{}` | Service annotations. |
| trainer.service.labels | object | `{}` | Service labels. |
| trainer.service.type | string | `"ClusterIP"` | Service type. |
| trainer.tag | string | `"v2.1.28"` | Image tag. |
| trainer.tag | string | `"v2.1.29"` | Image tag. |
| trainer.terminationGracePeriodSeconds | string | `nil` | Pod terminationGracePeriodSeconds. |
| trainer.tolerations | list | `[]` | List of node taints to tolerate. |
| triton.aws | object | `{"accessKeyID":"","region":"","secretAccessKey":""}` | Credentials information. |
Expand Down
20 changes: 10 additions & 10 deletions charts/dragonfly/values.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -145,7 +145,7 @@ manager:
# -- Image repository.
image: dragonflyoss/manager
# -- Image tag.
tag: v2.1.28
tag: v2.1.29
# -- Image pull policy.
pullPolicy: IfNotPresent
# -- Image pull secrets.
Expand Down Expand Up @@ -418,7 +418,7 @@ scheduler:
# -- Image repository.
image: dragonflyoss/scheduler
# -- Image tag.
tag: v2.1.28
tag: v2.1.29
# -- Image pull policy.
pullPolicy: IfNotPresent
# -- Image pull secrets.
Expand Down Expand Up @@ -503,11 +503,11 @@ scheduler:
# -- backToSourceCount is single task allows the peer to back-to-source count.
backToSourceCount: 3
# -- retryBackToSourceLimit reaches the limit, then the peer back-to-source.
retryBackToSourceLimit: 30
retryBackToSourceLimit: 10
# -- Retry scheduling limit times.
retryLimit: 40
retryLimit: 15
# -- Retry scheduling interval.
retryInterval: 100ms
retryInterval: 300ms
gc:
# -- pieceDownloadTimeout is the timeout of downloading piece.
pieceDownloadTimeout: 30m
Expand Down Expand Up @@ -791,7 +791,7 @@ seedClient:
# -- Image repository.
image: dragonflyoss/client
# -- Image tag.
tag: v0.1.10
tag: v0.1.12
# -- Image pull policy.
pullPolicy: IfNotPresent
# -- Image pull secrets.
Expand Down Expand Up @@ -992,7 +992,7 @@ client:
# -- Image repository.
image: dragonflyoss/client
# -- Image tag.
tag: v0.1.10
tag: v0.1.12
# -- Image pull policy.
pullPolicy: IfNotPresent
# -- Image pull secrets.
Expand Down Expand Up @@ -1166,7 +1166,7 @@ seedPeer:
# -- Image repository.
image: dragonflyoss/dfdaemon
# -- Image tag.
tag: v2.1.28
tag: v2.1.29
# -- Image pull policy.
pullPolicy: IfNotPresent
# -- Image pull secrets.
Expand Down Expand Up @@ -1512,7 +1512,7 @@ dfdaemon:
# -- Image repository.
image: dragonflyoss/dfdaemon
# -- Image tag.
tag: v2.1.28
tag: v2.1.29
# -- Image pull policy.
pullPolicy: IfNotPresent
# -- Image pull secrets.
Expand Down Expand Up @@ -1946,7 +1946,7 @@ trainer:
# -- Image repository.
image: dragonflyoss/trainer
# -- Image tag.
tag: v2.1.28
tag: v2.1.29
# -- Image pull policy.
pullPolicy: IfNotPresent
# -- Image pull secrets.
Expand Down

0 comments on commit bd55ced

Please sign in to comment.