AI Gateway

Multi-model routing with smart selection and cost control

The Fiduci AI Gateway combines paid frontier models through the Vercel AI Gateway with free local Mistral inference, automatically routing each request to the best model for the task.

Model selection guide

Each task type maps to a recommended model, balancing capability against cost.

Task             Recommended model   Reason                   Cost
Code generation  gpt-4-turbo         Best at coding           $$
Deep analysis    claude-3-opus       Most thorough            $$$
Quick Q&A        gpt-3.5-turbo       Fast & cheap             $
Marketing copy   gpt-4               Creative & persuasive    $$
Data analysis    claude-3-sonnet     Excellent reasoning      $$
Local (free)     mistral (Ollama)    No API key needed        Free
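
The guide above can be read as a simple lookup table. The following is a minimal sketch, not the gateway's actual implementation: the task keys, the `recommend_model` function, and the choice of the local model as a fallback are all assumptions made for illustration.

```python
# Task-to-model table taken from the selection guide.
# The dictionary keys are hypothetical identifiers, not names used by the gateway.
RECOMMENDED_MODELS = {
    "code_generation": "gpt-4-turbo",
    "deep_analysis": "claude-3-opus",
    "quick_qa": "gpt-3.5-turbo",
    "marketing_copy": "gpt-4",
    "data_analysis": "claude-3-sonnet",
    "local": "mistral",
}

# Assumption: unknown task types fall back to the free local model.
DEFAULT_MODEL = "mistral"


def recommend_model(task: str) -> str:
    """Return the recommended model for a task type, or the default."""
    return RECOMMENDED_MODELS.get(task, DEFAULT_MODEL)
```

Falling back to the free model keeps an unrecognized task type from silently incurring premium costs; a stricter design could raise an error instead.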

Cost optimization strategies

Use the local model first

Run Mistral 7B locally on the M1 via Ollama for everyday prompts: inference is free and needs no API key.
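
As a rough illustration of calling the local model, the sketch below posts a prompt to Ollama's `/api/generate` endpoint on its default port (11434). It assumes Ollama is running locally and that the `mistral` model has already been pulled; the helper names `build_ollama_request` and `generate_local` are hypothetical and not part of the gateway.

```python
import json
import urllib.request

# Ollama's default local endpoint; adjust if the server runs elsewhere.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_ollama_request(prompt: str, model: str = "mistral") -> urllib.request.Request:
    """Build a non-streaming generate request for a local Ollama server."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate_local(prompt: str, model: str = "mistral", timeout: float = 60.0) -> str:
    """Send the prompt to Ollama and return the generated text."""
    request = build_ollama_request(prompt, model)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        # With "stream": False, Ollama returns one JSON object whose
        # "response" field holds the full completion.
        return json.loads(response.read())["response"]
```

No API key appears anywhere in the request, which is the point of the local-first strategy.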

Route by complexity

Send simple Q&A to a cheap model (gpt-3.5-turbo) and reserve premium models for deep analysis.
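
One way to approximate complexity-based routing is a cheap heuristic on the prompt itself. The sketch below is an assumption-laden illustration rather than the gateway's real logic: the keyword list, the 40-word threshold, and the function name `route_by_complexity` are all invented for the example.

```python
CHEAP_MODEL = "gpt-3.5-turbo"    # simple Q&A
PREMIUM_MODEL = "claude-3-opus"  # deep analysis

# Hypothetical signals that a prompt asks for in-depth analysis.
ANALYSIS_HINTS = ("analyze", "compare", "evaluate", "trade-off", "step by step")


def route_by_complexity(prompt: str, max_simple_words: int = 40) -> str:
    """Pick the cheap model for short, simple prompts and the premium one otherwise."""
    text = prompt.lower()
    is_long = len(text.split()) > max_simple_words
    wants_analysis = any(hint in text for hint in ANALYSIS_HINTS)
    return PREMIUM_MODEL if (is_long or wants_analysis) else CHEAP_MODEL
```

A keyword heuristic will misclassify some prompts; it is shown only because it costs nothing to evaluate before a request is sent.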

Batch processing

Group multiple prompts into a single batch request for better rates and higher throughput.
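
Grouping prompts can be sketched as plain chunking before submission. The function name `chunk_prompts` and the default batch size of 50 are assumptions for illustration; how a batch is actually submitted to the gateway is not shown here.

```python
from typing import Iterator, List


def chunk_prompts(prompts: List[str], batch_size: int = 50) -> Iterator[List[str]]:
    """Yield consecutive groups of at most batch_size prompts."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    for start in range(0, len(prompts), batch_size):
        yield prompts[start:start + batch_size]
```

For example, 120 prompts with the default size would be split into groups of 50, 50, and 20.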

Gateway capabilities

  • Access to 8 different AI models
  • Smart model selection by task
  • Cost tracking across all models
  • Local free inference (Ollama)
  • Batch processing (50+ prompts)
  • Streaming responses
  • 2-3 second response time on M1
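
Cost tracking across models can be pictured as a running total per model. The sketch below is a minimal illustration under stated assumptions: the `CostTracker` class is hypothetical, and the per-1,000-token prices passed to it are placeholders supplied by the caller, not real provider rates.

```python
from collections import defaultdict
from typing import Dict


class CostTracker:
    """Accumulate estimated spend per model from token counts."""

    def __init__(self, price_per_1k_tokens: Dict[str, float]) -> None:
        # Caller-supplied prices; the local model would be listed at 0.0.
        self.prices = dict(price_per_1k_tokens)
        self.totals: Dict[str, float] = defaultdict(float)

    def record(self, model: str, tokens: int) -> float:
        """Record one request and return its estimated cost."""
        if model not in self.prices:
            raise KeyError(f"no price configured for model {model!r}")
        cost = tokens / 1000 * self.prices[model]
        self.totals[model] += cost
        return cost

    def total(self) -> float:
        """Return the estimated spend across all models."""
        return sum(self.totals.values())
```

Raising on an unpriced model, rather than treating it as free, avoids undercounting spend when a new model is added without a configured rate.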