fireworks-prompt-tokens
, fireworks-server-time-to-first-token
).
Streaming requests: Only selected performance metrics, such as “fireworks-server-time-to-first-token,” are available because HTTP headers must be sent before the first token can be streamed. Use the perf_metrics_in_response
body parameter to include all metrics in the last SSE event of the response body.
perf_metrics_in_response
perf_metrics_in_response
parameter to true
in your request. This will include performance data in the response body under the perf_metrics
field.
perf_metrics
field in the final chunk (the one with finish_reason
set). This is because headers may not be accessible during streaming.