Ollama Chat Model node
The Ollama Chat Model node allows you use local Llama 2 models with conversational agents.
On this page, you’ll find the node parameters for the Ollama Chat Model node, and links to more resources.
Note
Credentials You can find authentication information for this node here.
Note
Parameter resolution in sub-nodes Sub-nodes behave differently to other nodes when processing multiple items using an expression.
Most nodes, including root nodes, take any number of items as input, process these items, and output the results. You can use expressions to refer to input items, and the node resolves the expression for each item in turn. For example, given an input of five name values, the expression {{ $json.name }} resolves to each name in turn.
In sub-nodes, the expression always resolves to the first item. For example, given an input of five name values, the expression {{ $json.name }} always resolves to the first name.
Node parameters
Model: Select the model that generates the completion. Choose from:
Llama2
Llama2 13B
Llama2 70B
Llama2 Uncensored
Refer to the Ollama Models Library documentation for more information about available models.
Node options
Sampling Temperature: Use this option to control the randomness of the sampling process. A higher temperature creates more diverse sampling, but increases the risk of hallucinations.
Top K: Enter the number of token choices the model uses to generate the next token.
Top P: Use this option to set the probability the completion should use. Use a lower value to ignore less probable options.
Templates and examples
Common issues
For common questions or issues and suggested solutions, refer to Common issues.
Self-hosted AI Starter Kit
New to working with AI and using self-hosted n8n? Try n8n’s self-hosted AI Starter Kit to get started with a proof-of-concept or demo playground using Ollama, Qdrant, and PostgreSQL.