You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: site-src/guides/epp-configuration/config-text.md
+33Lines changed: 33 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -26,6 +26,8 @@ schedulingProfiles:
26
26
- ....
27
27
saturationDetector:
28
28
...
29
+
data:
30
+
...
29
31
featureGates:
30
32
...
31
33
```
@@ -40,6 +42,9 @@ requests to pods. This section is described in more detail in the section [Confi
40
42
The saturationDetector section configures the saturation detector, which is used to determine if special
41
43
action needs to eb taken due to the system being overloaded or saturated. This section is described in more detail in the section [Saturation Detector configuration](#saturation-detector-configuration)
42
44
45
+
The data section configures the data layer, which is used to gather metrics and other data used in making scheduling decisions.
46
+
This section is described in more detail in the section [Data Layer configuration](#data-layer-configuration)
47
+
43
48
The featureGates sections allows the enablement of experimental features of the IGW. This section is
44
49
described in more detail in the section [Feature Gates](#feature-gates)
45
50
@@ -330,6 +335,34 @@ a value of `0.8` will be used.
330
335
metrics are older than this, it might be excluded from "good capacity" considerations or treated
331
336
as having no capacity for safety. This field is optional, if omitted a value of `200ms` will be used.
332
337
338
+
## Data Layer configuration
339
+
340
+
The Data Layer collects metrics and other data used in scheduling decisions made by the various configured
341
+
filters and plugins. The exact data collected varies by the DataSource and Extractors configured. The basic ones
342
+
collect Prometheus metrics from the Model Servers in the InferencePool.
343
+
344
+
The Data Layer is configured via the data section of the overall configuration. It has the following form:
345
+
346
+
```yaml
347
+
data:
348
+
sources:
349
+
- pluginRef: source1
350
+
extractors:
351
+
- extarctor1
352
+
- extractor2
353
+
```
354
+
355
+
The data section has one field *sources* which configures the set of DataSources to be used to gather the metrics
356
+
and other data used for scheduling.
357
+
358
+
Each entry in the sources list has the following fields:
359
+
360
+
- *pluginRef* is a reference to the name of the plugin instance to be used.
361
+
- *extractors* specifies the list of the extractor plugin instances, by name, to be used with this DataSource.
362
+
363
+
**Note**: The names of the plugin instances mentioned above, refer to plugin instances defined in the plugins section
364
+
of the configuration.
365
+
333
366
## Feature Gates
334
367
335
368
The Feature Gates section allows for the enabling of experimental features of the IGW. These experimental
0 commit comments