This is a bilingual sentence embeddings model based on Sentence Transformers. It understands German, English, and code.
Falko Victor Habel b85c35b720 added example code and readme 2024-10-31 18:00:09 +01:00

Fabelous Embedder Mini

This repository contains a custom sentence transformer model named fabelous-mini-embedder, trained on 13 different programming languages and English.

Description

The model was built with the Sentence Transformers library, which provides an easy-to-use interface for computing sentence embeddings and for loading pre-trained models.

In addition to fabelous-mini-embedder, we also have a proprietary model called fabelous-embedder-base. Both models are trained on a large dataset covering various programming languages, including:

  • Python
  • Java
  • Go
  • C++
  • TypeScript

Example Usage

Here's how to use the model to generate sentence embeddings:

from sentence_transformers import SentenceTransformer

model = SentenceTransformer("fabelous-mini-embedder")

instruction = "This is an example sentence"
embeddings = model.encode(instruction)

print(embeddings)  # Output: (array of numerical embeddings)

The generated embeddings can be used for tasks like semantic search or classification.
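
As a rough illustration of the semantic search use case, the sketch below ranks a few documents against a query by cosine similarity. It is only a minimal, dependency-free example: the helper names cosine_similarity and rank_documents are hypothetical, and the toy 3-dimensional vectors stand in for real embeddings, which would come from model.encode(...) as in the example above.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def rank_documents(query_vec, doc_vecs):
    """Return (index, score) pairs sorted from most to least similar."""
    scored = [(i, cosine_similarity(query_vec, d)) for i, d in enumerate(doc_vecs)]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy vectors used purely for illustration (assumption: real embeddings
# have many more dimensions and are produced by model.encode).
doc_vecs = [
    [1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0],
    [0.7, 0.7, 0.0],
]
query_vec = [0.9, 0.1, 0.0]

ranking = rank_documents(query_vec, doc_vecs)
for index, score in ranking:
    print(index, round(score, 3))
```

The first pair in the ranking is the document closest in direction to the query. For larger collections, a vectorized approach (for example with NumPy) would likely be a better fit than these pure-Python loops.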

Future Improvements

We are currently working on the second generation of our model, focusing on:

  • Enhanced Multilingual Support: Adding support for German in addition to English.
  • Expanded Dataset: Increasing the code dataset significantly to improve model performance and accuracy.

We welcome contributions from the community to help us achieve these goals!

Installation

To install the required libraries, run:

pip install sentence-transformers