ADR 0005: Ollama Integration Strategy

Status

Accepted

For our market analysis feature, we needed to integrate Ollama to perform AI-powered market comparisons. We had two main options:

Both options provide access to Ollama's functionality, but they differ in several aspects that influenced our decision.

We chose to use the direct Ollama REST API approach over OllamaJS.

Direct Control
- Full control over request/response handling
- Ability to implement custom retry logic
- Easier to debug network issues
- No additional dependency layer
Performance
- No overhead from client library abstractions
- Direct HTTP calls are more efficient
- Better control over timeouts and connection pooling
Flexibility
- Easier to adapt to API changes
- Can implement custom caching strategies
- More control over error handling
Deployment
- No additional npm dependencies
- Simpler deployment process
- Reduced package size

Additional Abstraction Layer
- Potential version compatibility issues
- Less control over underlying HTTP requests
- Dependency on third-party maintenance
Limited Customization
- Predefined error handling
- Fixed retry strategies
- Less flexibility in request configuration
Version Lock-in
- Need to wait for library updates for new Ollama features
- Potential conflicts with other dependencies

We implemented the Ollama integration using:

Example configuration:

OLLAMA_URL: 'http://ollama:11434';
OLLAMA_API_TIMEOUT: '60000';

Monitoring
- Implement detailed request/response logging
- Add performance metrics tracking
- Monitor model usage patterns
Optimization
- Cache frequently used prompts
- Implement connection pooling
- Add request queuing for high load
Features
- Support for streaming responses
- Batch request handling
- Model performance tracking