This is a set of mobile friendly LLAMA and GPT2-based GenAI LLM models for text line rewriting with iOS and WebGL wrappers beased on LLama.cpp.
We ship it with three model quantised resolutions, try it online entirely in your browser with the quantised (simplified) one :
- Automated NPC Dialogue stylisation demo (LLAMA based Q4_K_M ~110mb)
- Single Prompt Debug demo (LLAMA based Q8_0 ~170mb)
Inside the package, you will also find Q8_0, Q4 and original bf16 (~321mb) resolutions of our models ready to run inside a cross-platform game engine!
You give it:
```
<input> How are you today? <inputEnds>
<style> Pirate's Poetic Question <styleEnds>
<output>
```
It prints out: Hey there, how fares ye today?
We hope that this model and wrapper can help you save a day! (or, more realistically, 3 months with a dedicated team of 3 people in our estimates).
The bounded model inference wrapper is based on open-source model runtime and supports the latest 405b models, so you can try them out on platforms with enough RAM/time available.
The bundled wrapper allows: