gaschz/trilium

Fork 0

mirror of https://github.com/zadam/trilium.git synced 2025-12-05 15:04:24 +01:00

perf3ct 065740eabc

feat(docs): completely redo documentation

2025-08-21 15:55:44 +00:00

8.5 KiB

Vendored

Raw Blame History

Setup and Configuration

This guide provides step-by-step instructions for setting up AI features in Trilium Notes with each supported provider.

Prerequisites

Before configuring AI features, ensure you have:

Trilium Notes installed and running
An active internet connection (for cloud providers)
API keys for your chosen provider (OpenAI or Anthropic)
Ollama installed locally (for local AI option)

Initial Setup

Step 1: Access AI Settings

Open Trilium Notes
Navigate to the main menu
Select Options
Scroll down to find AI/LLM section
Click to expand the AI/LLM settings panel

Step 2: Enable AI Features

In the AI/LLM settings panel, locate the "Enable AI Features" checkbox
Check the box to enable AI functionality
You should see a confirmation message: "AI features enabled"

Provider-Specific Setup

OpenAI Configuration

Obtaining an API Key

Visit OpenAI Platform
Sign up or log in to your account
Navigate to API Keys section
Click Create new secret key
Copy the key immediately (it won't be shown again)
Store the key securely

Configuration Steps

In Trilium's AI/LLM settings, select OpenAI from the provider dropdown

Enter your configuration:

API Key: sk-...your-key-here...
Base URL: https://api.openai.com/v1 (default)
Default Model: gpt-4-turbo-preview (recommended)

Click Test Connection to verify setup

For embeddings, select:

Embedding Model: text-embedding-3-small (cost-effective)
or
Embedding Model: text-embedding-3-large (higher quality)

Model Selection Guide

Model	Best For	Context Window	Cost
gpt-4-turbo-preview	Complex reasoning, analysis	128K tokens	Higher
gpt-4	High-quality responses	8K tokens	Highest
gpt-3.5-turbo	Quick responses, general use	16K tokens	Lower

Cost Considerations

Chat: Approximately $0.01-0.03 per 1K tokens for GPT-4
Embeddings: ~$0.0001 per 1K tokens for ada-002
Monthly estimate: Light use ~$5-10, Heavy use ~$20-50

Anthropic Configuration

Obtaining an API Key

Visit Anthropic Console
Create an account or sign in
Navigate to API Keys
Generate a new API key
Copy and store securely

Configuration Steps

Select Anthropic from the provider dropdown

Enter your configuration:

API Key: sk-ant-...your-key-here...
Base URL: https://api.anthropic.com (default)
Default Model: claude-3-sonnet-20240229 (balanced)

Test the connection to ensure it works

Model Selection Guide

Model	Best For	Context Window	Speed
claude-3-opus	Most capable, complex tasks	200K tokens	Slower
claude-3-sonnet	Balanced performance	200K tokens	Medium
claude-3-haiku	Fast responses, simple tasks	200K tokens	Fastest

Cost Considerations

Opus: ~$15 per million input tokens
Sonnet: ~$3 per million input tokens
Haiku: ~$0.25 per million input tokens
Monthly estimate: Similar to OpenAI depending on usage

Ollama Configuration (Local AI)

Installing Ollama

Windows

Download from ollama.com
Run the installer
Open terminal and verify: ollama --version

macOS

# Using Homebrew
brew install ollama

# Or download from website
curl -fsSL https://ollama.com/install.sh | sh

Linux

curl -fsSL https://ollama.com/install.sh | sh

Downloading Models

Chat Models

# Recommended models for chat
ollama pull llama3        # 8B parameters, balanced
ollama pull mistral      # 7B parameters, fast
ollama pull phi3         # 3.8B parameters, lightweight
ollama pull qwen2:7b     # 7B parameters, multilingual

Embedding Models

# Recommended for embeddings
ollama pull mxbai-embed-large    # Best quality
ollama pull nomic-embed-text     # Good balance
ollama pull all-minilm           # Lightweight

Configuration Steps

Ensure Ollama is running:

ollama serve  # Usually starts automatically

In Trilium, select Ollama from the provider dropdown
Enter configuration:
```
Base URL: http://localhost:11434
```
Click Refresh Models button to load available models

Select your models:

Chat Model: llama3 (or your preferred model)
Embedding Model: mxbai-embed-large

Performance Optimization

System Requirements

Minimum: 8GB RAM for 7B models
Recommended: 16GB RAM for smooth operation
GPU: NVIDIA GPU with 6GB+ VRAM significantly improves speed

Model Selection by Hardware

RAM	Recommended Models
8GB	phi3, tinyllama
16GB	mistral, llama3:7b
32GB+	llama3:13b, mixtral

Embedding Configuration

What are Embeddings?

Embeddings are numerical representations of your notes that enable:

Semantic search (finding conceptually similar content)
Intelligent context selection for AI conversations
Discovering relationships between notes

Setting Up Embeddings

Choose an Embedding Provider
- Use the same provider as your chat model for simplicity
- Or mix providers (e.g., Ollama for embeddings, OpenAI for chat)
Select Embedding Model
- OpenAI: text-embedding-3-small (cost-effective)
- Ollama: mxbai-embed-large (high quality, local)
Generate Initial Embeddings
- Click Recreate All Embeddings button
- This process runs in the background
- Progress shown in embedding statistics

Monitoring Embedding Generation

The embedding statistics show:

Embeddings Status:
- Total Notes: 1,234
- Embedded: 1,200
- Pending: 34
- Failed: 0

Embeddings are automatically updated when notes are:

Created
Modified
Deleted

Model Parameters

Temperature

Controls randomness in responses:

0.0: Deterministic, focused
0.7: Balanced (default)
1.0: Creative, varied

Max Tokens

Maximum response length:

Default: 4000 tokens (~3000 words)
Increase for longer responses
Decrease to control costs

System Prompt

Customize AI behavior:

You are a helpful assistant integrated with my personal knowledge base.
Focus on accuracy and cite specific notes when providing information.

Advanced Settings

API Configuration

Custom Endpoints

For OpenAI-compatible services:

Base URL: https://your-custom-endpoint.com/v1
API Key: your-api-key

Proxy Configuration

If behind a corporate proxy:

Set system environment variables
Or configure in Trilium's network settings

Rate Limiting

Configure to avoid API limits:

Requests per minute: 60
Tokens per minute: 90000
Concurrent requests: 3

Timeout Settings

Connection timeout: 30 seconds
Request timeout: 120 seconds

Testing Your Configuration

Connection Test

Click Test Connection button
Expected response: "Connection successful"
If failed, check:
- API key validity
- Network connectivity
- Firewall settings

Feature Test

Open "Chat with Notes"
Type: "Hello, can you see my notes?"
The AI should respond and attempt to access your notes
Verify embeddings are being used for context

Troubleshooting Setup Issues

Common Issues and Solutions

Issue	Solution
"Invalid API Key"	Double-check key, ensure no extra spaces
"Connection timeout"	Check firewall, proxy settings
"Model not found"	Verify model name, pull model for Ollama
"Embeddings not generating"	Check provider settings, recreate embeddings
"Out of memory" (Ollama)	Use smaller model, close other applications

Logs and Diagnostics

Check logs for detailed error information:

Location: [Trilium Data]/logs/
Look for entries with [AI] or [LLM] tags

Migration Between Providers

To switch providers:

Export Important Chats (if needed)
Change Provider in settings
Update API Keys and configuration
Recreate Embeddings with new provider
Test Connection before use

Note: Embedding recreation is necessary when switching embedding providers to maintain search quality.

Next Steps

After successful setup:

Review Features and Usage to learn how to use AI features
Explore Advanced Configuration for optimization
Read Security and Privacy for best practices

8.5 KiB Vendored Raw Blame History

Setup and Configuration

Prerequisites

Initial Setup

Step 1: Access AI Settings

Step 2: Enable AI Features

Provider-Specific Setup

OpenAI Configuration

Obtaining an API Key

Configuration Steps

Model Selection Guide

Cost Considerations

Anthropic Configuration

Obtaining an API Key

Configuration Steps

Model Selection Guide

Cost Considerations

Ollama Configuration (Local AI)

Installing Ollama

Windows

macOS

Linux

Downloading Models

Chat Models

Embedding Models

Configuration Steps

Performance Optimization

System Requirements

Model Selection by Hardware

Embedding Configuration

What are Embeddings?

Setting Up Embeddings

Monitoring Embedding Generation

Model Parameters

Temperature

Max Tokens

System Prompt

Advanced Settings

API Configuration

Custom Endpoints

Proxy Configuration

Rate Limiting

Timeout Settings

Testing Your Configuration

Connection Test

Feature Test

Troubleshooting Setup Issues

Common Issues and Solutions

Logs and Diagnostics

Migration Between Providers

Next Steps

8.5 KiB

Vendored

Raw Blame History