Throughput capacity typically depends on several factors:
  • Deployment type (serverless or on-demand)
  • Traffic patterns and request patterns
  • Hardware configuration
  • Model size and complexity