Skip to main content

Model Selection Guide

New models drop constantly, so this guide focuses on what's working well with Caret right now. We'll keep it updated as the landscape shifts.

💬Callout

New to model selection? Start with Module 2 of Caret's Learning Path for a comprehensive guide to choosing and configuring models.

What is an AI Model?

Think of an AI model as the "brain" that powers Caret. When you ask Caret to write code, fix bugs, or refactor your project, it's the model that actually understands your request and generates the response.

Key points:

  • Models are trained AI systems that understand natural language and code
  • Different models have different strengths some excel at complex reasoning, others prioritize speed or cost
  • You choose which model Caret uses like picking between different experts for different tasks
  • Models are accessed via API providers - companies like Anthropic, OpenAI, and OpenRouter host these models

Why it matters: The model you choose directly impacts Caret's capabilities, response quality, speed, and cost. A premium model might handle complex refactoring beautifully but cost more, while a budget model works great for routine tasks at a fraction of the price.

How to Select a Model in Caret

Follow these 5 simple steps to get Caret up and running with your preferred AI model:

Step 1: Open Caret Settings

First, you need to access Caret's configuration panel.

Two ways to open settings:

  • Quick method: Click the gear icon (⚙️) in the top-right corner of Caret's chat interface
  • Command palette: Press Cmd/Ctrl + Shift + P → type "Caret: Open Settings"
Caret Settings Panel

The settings panel will open, showing configuration options with "API Provider" at the top.

ℹ️Note

The settings panel remembers your last configuration, so you'll only need to set this up once.

Step 2: Select an API Provider

Choose your preferred AI provider from the dropdown menu.

Caret Settings Panel

Popular providers at a glance:

ProviderBest ForNotes
CaretEasiest setupNo API keys needed, access to multiple models including stealth models
OpenRouterValue seekersMultiple models, competitive pricing
AnthropicReliabilityClaude models, most dependable tool usage
OpenAILatest techGPT models
Google GeminiLarge contextGoogle's AI models
AWS BedrockEnterpriseAdvanced features
OllamaPrivacyRun models locally

See the full provider list for more options including Cerebras, Vertex AI, Azure, and more.

ℹ️Info

Recommended for beginners: Start with Caret as your provider - no API key management needed, instant access to multiple models, and occasional free inferencing through partner providers.

Step 3: Add Your API Key (or Sign In)

The next step depends on which provider you selected.

If you selected Caret as your provider:

  • No API key needed! Simply sign in with your Caret account
  • Click the Sign In button when prompted
  • You'll be redirected to app.caret.bot to authenticate
  • After signing in, return to your IDE

If you selected any other provider:

You'll need to get an API key from your chosen provider:

  1. Visit your provider's website to get an API key:

  2. Generate a new API key on the provider's website

  3. Copy the API key to your clipboard

  4. Paste your key in the "API Key" field in Caret settings

  5. Save automatically - Your key is stored securely in your editor's secrets storage

Caret API Selection
⚠️Warning

Payment required for most providers: Most providers need payment information before generating keys. You only pay for what you use (typically $0.01-$0.10 per coding task).

Step 4: Choose Your Model

Once your API key is added (or you've signed in), the "Model" dropdown becomes available.

Caret Model Selection

Quick model selection guide:

Your PriorityChoose This ModelWhy
Maximum reliabilityClaude Sonnet 4.5Most reliable tool usage, excellent at complex tasks
Best valueDeepSeek V3 or Qwen3 CoderGreat performance at budget prices
Fastest speedQwen3 Coder on CerebrasLightning-fast responses
Run locallyAny Ollama modelComplete privacy, no internet needed
Latest featuresGPT-5OpenAI's newest capabilities

Not sure which to pick? Start with Claude Sonnet 4.5 for reliability or DeepSeek V3 for value.

💡Tip

You can switch models at any time without losing your conversation. Try different models to find what works best for your specific tasks.

See the model comparison tables below for detailed specifications and pricing.

Current Top Models

Here are the most common picks right now:

  • Reliability: Claude Sonnet 4.5
  • Best value: DeepSeek V3 or Qwen3 Coder
  • Fastest: Qwen3 Coder on Cerebras
  • Local: Any Ollama-hosted model
  • Latest features: GPT-5

Step 5: Start Using Caret

Congratulations! You're all set up. Here's how to start coding with Caret:

  1. Type your request in the Caret chat box

    • Example: "Create a React component for a login form"
    • Example: "Debug this TypeScript error"
    • Example: "Refactor this function to be more efficient"
  2. Press Enter or click the send icon to submit

Choosing the Right Model

Selecting the right model involves balancing several factors. Use this framework to find your ideal match:

ℹ️Note

Pro tips: Configure separate models for Plan Mode and Act Mode. Make the most out the each model's strengths. For example, use a budget model for planning discussions and a premium model for implementation.

Key Selection Factors

FactorWhat to ConsiderRecommendation
Task ComplexitySimple fixes vs complex refactoringBudget models for routine tasks; Premium models for complex work
BudgetMonthly spending capacity$10-$30: Budget, $30-$100: Mid-tier, $100+: Premium
Context WindowProject size and file countSmall: 32K-128K, Medium: 128K-200K, Large: 400K+
SpeedResponse time requirementsInteractive: Fast models, Background: Reasoning models OK
Tool ReliabilityComplex operationsClaude excels at tool usage; Test others with your workflow
ProviderAccess and pricing needsOpenRouter: Many options, Direct: Faster/reliable, Local: Privacy

Model Comparison Resources

For detailed model comparisons, pricing, and performance metrics, see:

Open Source vs Closed Source

Open Source Advantages

  • Multiple providers compete to host them
  • Cheaper pricing due to competition
  • Provider choice - switch if one goes down
  • Faster innovation cycles

Open Source Models Available

  • Qwen3 Coder (Apache 2.0)
  • Z AI GLM 4.5 (MIT)
  • Kimi K2 (Open source)
  • DeepSeek series (Various licenses)

Quick Decision Matrix

If you want...Use this
Something that just worksClaude Sonnet 4.5
To save moneyDeepSeek V3 or Qwen3 variants
Huge context windowsGemini 2.5 Pro or Claude Sonnet 4.5
Open sourceQwen3 Coder, Z AI GLM 4.5, or Kimi K2
Latest techGPT-5
SpeedQwen3 Coder on Cerebras (fastest available)

What Others Are Using

Check OpenRouter's Caret usage stats to see real usage patterns from the community.