PreferredMinThroughput - TypeScript SDK

PreferredMinThroughput type definition

The TypeScript SDK and docs are currently in beta. Report issues on GitHub.

Preferred minimum throughput (in tokens per second). Can be a number (applies to p50) or an object with percentile-specific cutoffs. Endpoints below the threshold(s) may still be used, but are deprioritized in routing. When using fallback models, this may cause a fallback model to be used instead of the primary model if it meets the threshold.

Supported Types

number

1const value: number = 100;

models.PercentileThroughputCutoffs

1const value: models.PercentileThroughputCutoffs = {};

any

1const value: any = 100;