Skip to content

Commit 48c51a9

Browse files
committed
Updated configuration documentation
Signed-off-by: Shmuel Kallner <kallner@il.ibm.com>
1 parent a2a0cec commit 48c51a9

File tree

1 file changed

+33
-0
lines changed

1 file changed

+33
-0
lines changed

site-src/guides/epp-configuration/config-text.md

Lines changed: 33 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -26,6 +26,8 @@ schedulingProfiles:
2626
- ....
2727
saturationDetector:
2828
...
29+
data:
30+
...
2931
featureGates:
3032
...
3133
```
@@ -40,6 +42,9 @@ requests to pods. This section is described in more detail in the section [Confi
4042
The saturationDetector section configures the saturation detector, which is used to determine if special
4143
action needs to eb taken due to the system being overloaded or saturated. This section is described in more detail in the section [Saturation Detector configuration](#saturation-detector-configuration)
4244
45+
The data section configures the data layer, which is used to gather metrics and other data used in making scheduling decisions.
46+
This section is described in more detail in the section [Data Layer configuration](#data-layer-configuration)
47+
4348
The featureGates sections allows the enablement of experimental features of the IGW. This section is
4449
described in more detail in the section [Feature Gates](#feature-gates)
4550
@@ -330,6 +335,34 @@ a value of `0.8` will be used.
330335
metrics are older than this, it might be excluded from "good capacity" considerations or treated
331336
as having no capacity for safety. This field is optional, if omitted a value of `200ms` will be used.
332337

338+
## Data Layer configuration
339+
340+
The Data Layer collects metrics and other data used in scheduling decisions made by the various configured
341+
filters and plugins. The exact data collected varies by the DataSource and Extractors configured. The basic ones
342+
collect Prometheus metrics from the Model Servers in the InferencePool.
343+
344+
The Data Layer is configured via the data section of the overall configuration. It has the following form:
345+
346+
```yaml
347+
data:
348+
sources:
349+
- pluginRef: source1
350+
extractors:
351+
- extarctor1
352+
- extractor2
353+
```
354+
355+
The data section has one field *sources* which configures the set of DataSources to be used to gather the metrics
356+
and other data used for scheduling.
357+
358+
Each entry in the sources list has the following fields:
359+
360+
- *pluginRef* is a reference to the name of the plugin instance to be used.
361+
- *extractors* specifies the list of the extractor plugin instances, by name, to be used with this DataSource.
362+
363+
**Note**: The names of the plugin instances mentioned above, refer to plugin instances defined in the plugins section
364+
of the configuration.
365+
333366
## Feature Gates
334367

335368
The Feature Gates section allows for the enabling of experimental features of the IGW. These experimental

0 commit comments

Comments
 (0)