
Inference

POST /cr/gcloud_predict_firebase_base_oss

Primary prediction endpoint. Supports streaming responses and optional RAG retrieval.

Field              Type     Required  Description
prompt             string   yes       User prompt text.
query_only         string   yes       Query form for routing/retrieval.
model_name         string   no        Model name from /cr/get_allowed_models.
stream             boolean  no        Defaults to true.
use_rag            boolean  no        Enable retrieval context.
sequential_budget  integer  no        Sequential reasoning budget.
parallel_budget    integer  no        Parallel reasoning budget.
temperature        number   no        Sampling temperature.
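
As a rough illustration of the fields listed above, the following sketch assembles a request body in Python. The helper name build_payload and its validation rules are assumptions made for this example and are not part of the API; only the field names, types, and required flags come from the table.

```python
import json

# Field names and types follow the table above. The validation rules
# are an illustrative assumption, not documented server behavior.
REQUIRED = {"prompt": str, "query_only": str}
OPTIONAL = {
    "model_name": str,
    "stream": bool,
    "use_rag": bool,
    "sequential_budget": int,
    "parallel_budget": int,
    "temperature": (int, float),
}


def build_payload(prompt, query_only, **options):
    """Return a JSON-serializable request body (hypothetical helper)."""
    payload = {"prompt": prompt, "query_only": query_only}
    for name, expected in REQUIRED.items():
        if not isinstance(payload[name], expected):
            raise TypeError(f"{name} must be a string")
    for name, value in options.items():
        if name not in OPTIONAL:
            raise ValueError(f"unknown field: {name}")
        expected = OPTIONAL[name]
        # bool is a subclass of int in Python, so reject it for numeric fields.
        if expected is not bool and isinstance(value, bool):
            raise TypeError(f"{name} has the wrong type")
        if not isinstance(value, expected):
            raise TypeError(f"{name} has the wrong type")
        payload[name] = value
    return payload


# Example body with the two required fields and two optional ones.
body = json.dumps(
    build_payload(
        "Summarize the report.",
        "report summary",
        use_rag=True,
        temperature=0.2,
    )
)
```

The resulting string can be sent as the POST body with any HTTP client. Optional fields that are left out are simply omitted, so the server-side defaults (for example stream defaulting to true) apply.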

Returns

  • 200 OK as a streaming SSE response when stream=true.
  • Events typically include token chunks, followed by a closing final_result payload such as:
{"type":"final_result","predictions":{"0":{"answers":["..."],"thoughtTraces":["..."]}}}
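
To show how a client might consume the stream, here is a minimal Python sketch that scans SSE lines for the final_result event and returns its answers. It assumes each event arrives as a standard SSE "data:" line carrying one JSON object; that framing, the function name extract_final_answers, and the shape of the token event in the sample are assumptions. Only the final_result shape comes from the example above.

```python
import json


def extract_final_answers(lines):
    """Return the answers list from the first final_result event, or None.

    `lines` is any iterable of decoded text lines from the SSE stream.
    """
    for raw in lines:
        line = raw.strip()
        # Skip blank separators, comments, and non-data SSE fields.
        if not line.startswith("data:"):
            continue
        data = line[len("data:"):].strip()
        try:
            event = json.loads(data)
        except json.JSONDecodeError:
            continue  # Ignore payloads that are not valid JSON.
        if isinstance(event, dict) and event.get("type") == "final_result":
            predictions = event.get("predictions", {})
            # The documented example keys the first prediction as "0".
            return predictions.get("0", {}).get("answers", [])
    return None


sample = [
    'data: {"type":"token","text":"Hel"}',  # hypothetical chunk event shape
    "",
    'data: {"type":"final_result","predictions":'
    '{"0":{"answers":["Hello"],"thoughtTraces":["..."]}}}',
]
answers = extract_final_answers(sample)
```

In a real client the lines would come from the HTTP response body read incrementally. The parsing stays the same as long as the "data:" framing assumption holds.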