generated from mintlify/starter
-
Notifications
You must be signed in to change notification settings - Fork 6
Model Selection Guide #150
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Open
danstarns
wants to merge
11
commits into
main
Choose a base branch
from
model-selection-guide
base: main
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Open
Changes from all commits
Commits
Show all changes
11 commits
Select commit
Hold shift + click to select a range
15914e8
add Model Selection Guide
danstarns 8006249
typo
danstarns a2b064c
trunk
danstarns 015121e
image change
danstarns 30d6068
new lines in cards + trunk fixes
danstarns 8800748
Update agents/model-selection.mdx
danstarns 7a1f72b
Merge branch 'main' into model-selection-guide
danstarns 47f2fc7
Update agents/model-selection.mdx
danstarns ae1c162
updates, less long winded, default to GPT-4.1
danstarns 7f272ba
remove repomix
danstarns 3706034
remove repeated block
danstarns File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,281 @@ | ||
--- | ||
title: "Model Selection Guide" | ||
sidebarTitle: "Choose the Right Model" | ||
description: | ||
"Select the optimal model for your agent based on your goals and use case." | ||
--- | ||
|
||
Most agents work great with **GPT-4.1**, our default model. Generally it's | ||
powerful, reliable, and handles the majority of business use cases effectively. | ||
|
||
As you scale your agent and understand its value, you can always optimize for | ||
cost or explore specialized capabilities. | ||
|
||
 | ||
|
||
<Note> | ||
Hypermode provides access to the most popular open source and commercial | ||
models through our [Model Router](/model-router). We're constantly evaluating | ||
model usage and adding new models to our catalog based on demand. | ||
</Note> | ||
|
||
## Quick start: find your agent type | ||
|
||
Start with **GPT-4.1** for most use cases, then consider these alternatives for | ||
specific needs: | ||
|
||
<CardGroup cols={2}> | ||
<Card title="Sales & CRM Operations" icon="chart-line"> | ||
**Best for:** Lead qualification, call analysis, CRM updates | ||
<br /> | ||
**Recommended:** `GPT-4.1` (our default) | ||
<br /> | ||
**Why:** Excellent structured data extraction, business context understanding | ||
</Card> | ||
|
||
<Card title="Research & Analysis" icon="magnifying-glass"> | ||
**Best for:** Market research, competitor analysis, strategic insights | ||
<br /> | ||
**Recommended:** `O3` for complex reasoning, `GPT-4.1` for most research | ||
<br /> | ||
**Why:** Advanced reasoning, multi-source synthesis, deep analysis | ||
</Card> | ||
|
||
<Card title="Content & Marketing" icon="pen-fancy"> | ||
**Best for:** Social media, blogs, marketing campaigns | ||
<br /> | ||
**Recommended:** `Claude 4 Opus` for creative work, `GPT-4.1` for analysis | ||
<br /> | ||
**Why:** Creative writing, brand voice consistency, platform optimization | ||
</Card> | ||
|
||
<Card title="Data & Operations" icon="table"> | ||
**Best for:** Inventory tracking, spreadsheet analysis, reporting | ||
<br /> | ||
**Recommended:** `GPT-4o Mini` for high volume, `GPT-4.1` for complexity | ||
<br /> | ||
**Why:** Fast processing, cost-effective, reliable for routine tasks | ||
</Card> | ||
|
||
<Card title="Customer Support" icon="headset"> | ||
**Best for:** Scheduling, support tickets, real-time chat | ||
<br /> | ||
**Recommended:** `GPT-4o Mini` for real-time interactions | ||
<br /> | ||
**Why:** Low latency, consistent performance, natural conversation | ||
</Card> | ||
|
||
<Card title="Code & Development" icon="code"> | ||
**Best for:** GitHub bots, code reviews, API development | ||
<br/> | ||
**Recommended:** `GPT-4.1` or `Claude 4 Sonnet` | ||
<br/> | ||
**Why:** Strong code comprehension, security focus, fewer hallucinations | ||
</Card> | ||
</CardGroup> | ||
|
||
## When to consider alternatives | ||
|
||
After you've proven value with GPT-4.1, consider these alternatives for specific | ||
needs: | ||
|
||
### For creative and editorial work | ||
|
||
**Claude 4 Opus** - Superior for content creation, writing, and editorial tasks | ||
|
||
- Marketing copy and campaigns | ||
- Long-form content and storytelling | ||
- Brand voice consistency | ||
- Creative problem solving | ||
|
||
### For high-volume, cost-sensitive operations | ||
|
||
**GPT-4o Mini** - Fast and economical for routine tasks | ||
|
||
- Customer support and scheduling | ||
- Data processing and reporting | ||
- High-frequency operations (1000+ interactions/day) | ||
- Simple automation workflows | ||
|
||
### For complex reasoning and analysis | ||
|
||
**O3** - Advanced reasoning for sophisticated business challenges | ||
|
||
- Strategic planning and analysis | ||
- Complex research and insights | ||
- Multi-step problem solving | ||
- High-stakes decision support | ||
|
||
## Business-focused model recommendations | ||
|
||
### For sales and go-to-market teams | ||
|
||
<Card title="GTM Operations Agent" icon="handshake"> | ||
**Use Case:** Analyze sales calls, update CRM, qualify leads | ||
|
||
**Primary Choice:** `GPT-4.1` - Our default model with excellent structured data | ||
extraction | ||
|
||
**Alternative:** `Claude 4 Sonnet` - Superior business context understanding | ||
|
||
**Why These Work:** | ||
|
||
- Strong performance with sales terminology and CRM integration | ||
- Reliable field mapping and data accuracy | ||
- Professional communication tone | ||
|
||
**Example:** [Go-to-market Engineer](/agents/agent-gallery/gtm-engineer) | ||
updating Attio CRM from call transcripts | ||
|
||
</Card> | ||
|
||
### For marketing and content teams | ||
|
||
<Card title="Content Creation Agent" icon="pen-to-square"> | ||
**Use Case:** Social media posts, blog content, marketing campaigns | ||
|
||
**Primary Choice:** `Claude 4 Opus` - Superior creative capabilities and nuanced | ||
writing **Alternative:** `GPT-4.1` - Reliable for analysis and structured | ||
content | ||
|
||
**Why These Work:** | ||
|
||
- High-quality, engaging content generation | ||
- Brand voice consistency across platforms | ||
- Platform-specific content optimization | ||
|
||
**Example:** [Social Media Expert](/agents/agent-gallery/social-media) creating | ||
targeted content campaigns | ||
|
||
</Card> | ||
|
||
### For research and strategy teams | ||
|
||
<Card title="Market Research Agent" icon="chart-bar"> | ||
**Use Case:** Competitive analysis, industry trends, strategic insights | ||
|
||
**Primary Choice:** `O3` - Advanced reasoning with chain-of-thought processing | ||
|
||
**Alternative:** `Claude 4 Opus` - Excellent synthesis of multiple sources | ||
|
||
**Why These Work:** | ||
|
||
- Superior multi-step reasoning for complex analysis | ||
- Large context windows for extensive document processing | ||
- Strong capability for strategic insights | ||
|
||
**Example:** [Market Research Expert](/agents/agent-gallery/market-researcher) | ||
analyzing company intelligence | ||
|
||
</Card> | ||
|
||
### For operations and data teams | ||
|
||
<Card title="Data Processing Agent" icon="database"> | ||
**Use Case:** Inventory management, spreadsheet analysis, operational reporting | ||
|
||
**Primary Choice:** `GPT-4o Mini` - Cost-effective with reliable data handling | ||
|
||
**Alternative:** `GPT-4.1` - Better for complex data analysis | ||
|
||
**Why These Work:** | ||
|
||
- Fast processing for large volumes of data | ||
- Low cost per operation for routine tasks | ||
- Consistent performance for automated workflows | ||
|
||
**Example:** [Inventory Tracker](/agents/agent-gallery/inventory-tracker) | ||
monitoring stock levels and sales patterns | ||
|
||
</Card> | ||
|
||
### For customer success teams | ||
|
||
<Card title="Support and Scheduling Agent" icon="calendar-clock"> | ||
**Use Case:** Customer support, appointment scheduling, real-time assistance | ||
|
||
**Primary Choice:** `GPT-4o Mini` - Fast response times with natural language | ||
understanding | ||
|
||
**Alternative:** `GPT-4.1` - Better for complex customer issues | ||
|
||
**Why These Work:** | ||
|
||
- Sub-second response times for real-time interactions | ||
- Reliable performance under varying loads | ||
- Natural conversation flow and context understanding | ||
|
||
**Example:** | ||
[Workout Scheduling Agent](/agents/agent-gallery/workout-scheduling) managing | ||
calendar integration | ||
|
||
</Card> | ||
|
||
## Model capabilities at a glance | ||
|
||
| Model | Best For | When to Choose | | ||
| ----------------- | ------------------------- | ----------------------------------------- | | ||
| **GPT-4.1** | General business use | Start here for any use case (our default) | | ||
| **Claude 4 Opus** | Creative & editorial work | Writing, content, brand voice | | ||
| **O3** | Complex reasoning | Strategic decisions, deep analysis | | ||
| **GPT-4o Mini** | High-volume operations | Cost optimization, simple tasks | | ||
|
||
## Getting started | ||
|
||
1. **Create your agent** with GPT-4.1 (the default) | ||
2. **Focus on instructions and connections** to define your agent's role | ||
3. **Test with real examples** from your workflow | ||
4. **Refine and iterate** based on results | ||
5. **Consider alternatives** once you understand value and patterns | ||
|
||
<Tip> | ||
**Value first, optimization second.** GPT-4.1 handles 90% of business use | ||
cases effectively. Prove your agent's value before optimizing for cost or | ||
specialized capabilities. | ||
</Tip> | ||
|
||
## Need help choosing? | ||
|
||
**Most users:** Start with GPT-4.1 and focus on your agent's instructions and | ||
connections. | ||
|
||
- **Creative teams:** Consider Claude 4 Opus for writing and content creation. | ||
- **High-volume operations:** Try GPT-4o Mini for cost-effective automation. | ||
- **Strategic work:** Explore O3 for complex analysis and reasoning. | ||
|
||
Remember: you can always change your model choice as your needs evolve. The most | ||
important factors are clear instructions and the right connections for your | ||
agent's role. | ||
|
||
<Tip> | ||
Use the [Model Router](/model-router) to easily experiment with models from | ||
different providers without changing your integration code. The unified API | ||
makes it simple to switch between OpenAI, Anthropic, Google, and Meta models | ||
for systematic comparison. | ||
</Tip> | ||
|
||
--- | ||
|
||
## References and additional resources | ||
|
||
This guide is based on industry best practices and community insights from | ||
leading AI development communities. For deeper technical insights and ongoing | ||
discussions about model selection, see: | ||
|
||
- [LLM Developers: How Do You Pick the Right LLM?](https://www.reddit.com/r/LLMDevs/comments/1k6phix/how_do_you_guys_pick_the_right_llm_for_your/) | ||
- [Generative AI: How to Select an LLM for a Use Case](https://www.reddit.com/r/generativeAI/comments/1ay5drl/how_to_select_a_llm_for_a_use_case/) | ||
- [Choosing the Right Language Model for Your Use Case](https://medium.com/data-science/choosing-the-right-language-model-for-your-nlp-use-case-1288ef3c4929) | ||
- [GitHub Copilot AI Model Selection](https://docs.github.com/en/copilot/using-github-copilot/ai-models/choosing-the-right-ai-model-for-your-task) | ||
- [JetBrains AI: How to Choose the Right LLM](https://blog.jetbrains.com/ai/2025/03/how-to-choose-the-right-llm/#) | ||
- [How to Choose Right LLM for Your Organisation](https://www.linkedin.com/pulse/how-choose-right-llm-your-organisation-gagan-agrawal-ms0oc/) | ||
- [OpenAI Model Selection Guide](https://platform.openai.com/docs/guides/model-selection) | ||
- [How to Select Right LLM Model for Your Use Case](https://masteringllm.medium.com/how-to-select-right-llm-model-for-your-use-case-d4d9670e06f3) | ||
- [How to Choose an AI Model for Your Business](https://www.leewayhertz.com/how-to-choose-an-ai-model/) | ||
- [Choosing the Right LLM](https://www.labellerr.com/blog/chosing-the-right-llm/) | ||
- [Choosing the Best LLM Model: A Strategic Guide](https://medium.com/@social_65128/choosing-the-best-llm-model-a-strategic-guide-for-your-organizations-needs-f64794ead5e9) | ||
- [DataRobot: How to Choose the Right LLM for Your Use Case](https://www.datarobot.com/blog/how-to-choose-the-right-llm-for-your-use-case/) | ||
- [TechTarget: How to Choose the Right LLM for Your Needs](https://www.techtarget.com/searchenterpriseai/tip/How-to-choose-the-right-LLM-for-your-needs) | ||
- [LangDB: Choosing the Right LLM for the Job](https://blog.langdb.ai/choosing-the-right-llm-for-the-job-with-langdb) | ||
|
||
For the most current model availability and pricing, always refer to the | ||
[Hypermode Model Router documentation](/model-router). |
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.