Current Limitations:
In obi, it is not enough to use index and labels to point to the same pod or node: when a deploy or node is updated, the same index may point to a different pod or node.
There is no resource or field that can monitor all nodes, or all pods under a deploy, even though schedulers often need exactly that.
OBIG example:
```yaml
apiVersion: arbiter.k8s.com.cn/v1alpha1
kind: ObservabilityIndicantGroup
metadata:
  name: metric-server-node-cpu
spec:
  obiHistoryLimit: 10 # How many additional instances of expired obi to keep, 0 means coincide with the actual resource updates, no expired obi to keep
  # Same as obi below, with only 2 minor differences:
  # 1. no `spec.targetRef.index`
  # 2. `spec.targetRef.kind` only support `Node` and `Deploy` now.
  metric:
    historyLimit: 1
    metricIntervalSeconds: 15
    metrics:
      cpu:
        aggregations:
          - time
        description: ""
        query: ""
        unit: 'm'
    timeRangeSeconds: 3600
  source: metric-server
  targetRef:
    group: ""
    kind: Node
    labels:
      "data-test": "data-test"
    name: ""
    namespace: ""
    version: v1
```
OBIG running logic:
The logic for running obig is as follows:
When an obig is created to monitor nodes, the observer queries the current set of nodes and creates one obi per node (the obi's spec.targetRef.name is the node name). When a new node appears, it adds a new obi to monitor it. When a node is deleted, it stops updating that node's obi and deletes it once the history length limit spec.obiHistoryLimit is exceeded.
An obig that monitors a deploy works the same way: the observer creates one obi per pod (the obi's spec.targetRef.name is the pod name).
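The reconcile step above can be sketched as follows. This is a hypothetical illustration, not the real observer code: `reconcile`, `makeObi`-style shapes, and the `targetName`/`expired` fields are all invented for the sketch, and the real controller would work against the Kubernetes API instead of plain arrays.

```javascript
// Sketch of one reconcile pass for a node-targeted obig.
// obig:         { obiHistoryLimit: number }
// currentNodes: array of node names currently in the cluster
// existingObis: array of { targetName, expired } owned by this obig
function reconcile(obig, currentNodes, existingObis) {
  const nodeSet = new Set(currentNodes);
  const toCreate = [];
  const toExpire = [];

  // Create an obi for every node that does not have a live one yet.
  for (const node of currentNodes) {
    if (!existingObis.some((o) => o.targetName === node && !o.expired)) {
      toCreate.push({ targetName: node, expired: false });
    }
  }

  // Stop updating obis whose node is gone: mark them expired.
  for (const obi of existingObis) {
    if (!obi.expired && !nodeSet.has(obi.targetName)) {
      toExpire.push(obi);
    }
  }

  // Delete expired obis beyond spec.obiHistoryLimit, oldest first.
  const expired = existingObis.filter((o) => o.expired).concat(toExpire);
  const toDelete = expired.slice(
    0,
    Math.max(0, expired.length - obig.obiHistoryLimit)
  );
  return { toCreate, toExpire, toDelete };
}
```

With obiHistoryLimit: 0, every expired obi is deleted immediately, matching the "coincide with the actual resource updates" comment in the example above.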
How arbiter-scheduler uses OBIG:
This lets the scheduler schedule based on actual resource usage:
When the arbiter-scheduler starts, it automatically creates one obig to monitor the nodes' actual cpu and memory usage.
The scheduler is only responsible for creating this obig; the observer creates and updates the obis. If this obig already exists, the one already created by the user is used.
The scheduler only consumes the obis created by this obig (identified by the obig in their ownerReferences), reading node.metric.cpu and node.metric.mem to update the metrics that the JS in the scheduler's Score CRD will use.
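A Score expression consuming those metrics might look like the sketch below. This is purely illustrative: the text only says the JS sees node.metric.cpu and node.metric.mem, so the weighting, the 0-100 range assumption, and the function shape are all invented for the example.

```javascript
// Hypothetical Score sketch: prefer nodes with lower actual usage.
// Assumes node.metric.cpu / node.metric.mem are usage percentages (0-100).
function score(node) {
  const cpuFree = 100 - node.metric.cpu;
  const memFree = 100 - node.metric.mem;
  // Average the free capacity so a node maxed out on one resource
  // still scores lower than a balanced, lightly loaded node.
  return Math.round((cpuFree + memFree) / 2);
}
```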
> In obi, it is not enough to use index and label to point to the same pod or node. When a deploy or node is updated, the same index may point to a different pod or node.
> The lack of a resource or field to meet the need to monitor all nodes or all pods under a deploy is a requirement often used in schedulers
We probably need an OBIG in a later phase, but for now I think we can keep it simple with the approach below:
For pods, we don't need metrics from a specific pod: it may be recreated at any time, and the restarted pod will probably have different metrics than the one before, so scheduling should not depend on a specific pod's metrics. In most cases the metrics should come from the service level for more general metrics, or we can use labels/annotations for resource traits.
For nodes, the name is more stable than for pods, and for now we can let users create a separate OBI for each node manually.
So, until we have more real user cases, we can keep it simple and make a better decision later.
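For reference, a manually created per-node OBI could look like the sketch below, mirroring the OBIG example from the issue body but pinning one node in spec.targetRef.name. The obi name, the node name node-1, and the index value are placeholders, not values from the source.

```yaml
apiVersion: arbiter.k8s.com.cn/v1alpha1
kind: ObservabilityIndicant
metadata:
  name: metric-server-node-cpu-node-1 # placeholder name
spec:
  metric:
    historyLimit: 1
    metricIntervalSeconds: 15
    metrics:
      cpu:
        aggregations:
          - time
        description: ""
        query: ""
        unit: 'm'
    timeRangeSeconds: 3600
  source: metric-server
  targetRef:
    group: ""
    index: 0
    kind: Node
    name: node-1 # the specific node to monitor (example name)
    namespace: ""
    version: v1
```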