LLaMA and other LLM locally on iOS and MacOS.

Absolutely free, open source and private.


One app for MacOS an iOS

Absolutely free

LLM Farm provides all features absolutely free of charge! No hidden fees, subscriptions or feature limitations - all features are available for use at no additional cost.

View on github
Google Design
Open Source

Open Source

The core is a Swift library based on llama.cpp, ggml and other open source projects that allows you to perform various inferences. A class hierarchy has been developed that allows you to add your own inference.

View Core repo

Features

Various inferences
LLaMA, Phi, Gemma
RWKV, Starcoder
Qwen, GPT2 + Cerebras
StableLM, And more
Various sampling methods
Temperature
Mirostat v1,v2
Greedy
Grammar
Metal
Metal acceleration makes it possible to run models directly on a mobile device.
Model settings templates
Allow you to quick configure downloaded model for each device
LoRA
LLM Farm offers the ability to connect to LLM LoRa adapters and train them with FineTune right on your mobile device.
Multimodal
LLM Farm supports multimodal models such as LLaVA, MobileVLM, etc.

Popular models

You can download this models from huginfaces

LLaMA 3

Popular model from Meta, device with 8GB Ram required.

8B
Phi 3

Popular model from Microsoft is a 3.8 billion-parameter, lightweight, state-of-the-art open model trained using the Phi-3 datasets

3B
Gemma

Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models.

2B
7B
MobileVLM

MobileVLM is a competent multimodal vision language model (MMVLM) targeted to run on mobile devices.

3B
Marx

This is a model, which are GPT-2 models intended to generate prompt texts for imaging AIs.

3B
TinyLlama

The TinyLlama project aims to pretrain a 1.1B Llama model on 3 trillion tokens.

1.1B
Qwen1.5

Popular Chinese model

4B