1012 B

Raw Blame History

Ollama Autocoder

A simple to use Ollama autocompletion engine with options exposed and streaming functionality

Requirements

Ollama must be serving on the API endpoint applied in settings
- For installation of Ollama, visit ollama.ai
Ollama must have the model applied in settings installed.

How to Use

In a text document, press space. The option Autocomplete with Ollama will appear. Press enter to start generation.
- Alternatively, you can run the Autocomplete with Ollama command from the command pallete (or set a keybind).
After startup, the tokens will be streamed to your cursor.
To stop the generation early, press the "Cancel" button on the "Ollama Autocoder" notification
Once generation stops, the notification will disappear.

Notes

For fastest results, an Nvidia GPU or Apple Silicon is recommended. CPU still works on small models.
The prompt only sees behind the cursor. The model is unaware of text in front of its position.