Heyo,
It’s going to be a short one. If you’re excited by Zuck and the team at Meta’s release today, here is how you can quickly get started with MLX on your MacBook (or any Apple Silicon Mac).
If you don’t have a Python environment, you can get started here:
This article has some code. If you just want to install a tool and get a ChatGPT-style interface, you want LM Studio.
Create a clean environment
You will need Python and ipykernel to start. I use Python 3.11 because it’s compatible with all the packages I use.
conda create --name mlx -c conda-forge python=3.11 ipykernel

conda activate mlx

I installed the usual suspects.
pip install --upgrade pip
pip install --upgrade mlx
pip install --upgrade mlx_lm
pip install --upgrade torch torchvision torchaudio
Nice, now create a notebook and try importing MLX LM.
Create a new notebook
In VSC you can press Cmd + Shift + P to open the command palette, type Jupyter, and you’ll see an option to create a new notebook.

You should be able to pick your new environment from Select Kernel. You might need to close and reopen VSC if you can’t see it.
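If you want to double-check that the notebook picked up the right environment, printing sys.executable in a cell is a quick sanity check; the path should include the mlx environment name.

import sys
print(sys.executable)  # should point at the Python inside your "mlx" conda env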

from mlx_lm import load, generate

I got a message saying I didn’t have the nice progress bar package; I think the fix is to install ipywidgets.
pip install ipywidgets
Ok, restart the kernel and run the cell again. In VSC you click Restart in the top bar.

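If the import now runs cleanly, you can also do a quick smoke test that MLX itself works and is defaulting to the GPU, something like:

import mlx.core as mx

print(mx.default_device())  # should report the GPU on Apple Silicon
print(mx.array([1.0, 2.0, 3.0]).sum())  # tiny array op as a sanity check
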
Finally, trying Llama 3
model, tokenizer = load("mlx-community/Meta-Llama-3-8B-Instruct-4bit")
response = generate(model, tokenizer, prompt="who are you?", verbose=True)

That is an interesting generation. Thank the lord(?!).
A handy tip is checking out the tokenizer.
messages = [
    {"role": "system", "content": "You are Dookus by Moota AI."},
    {"role": "user", "content": "Who are you?"},
]
tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)

You can format your input in that style by hand, or just let apply_chat_template do it for you.
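For reference, with the Llama 3 Instruct template the string should come out looking roughly like this (note the <|eot_id|> marker at the end of each turn):

<|begin_of_text|><|start_header_id|>system<|end_header_id|>

You are Dookus by Moota AI.<|eot_id|><|start_header_id|>user<|end_header_id|>

Who are you?<|eot_id|><|start_header_id|>assistant<|end_header_id|>
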
Actually, the instruct model ends its turns with a different eos token than the tokenizer config expects, so generation doesn’t stop where it should. Add <|eot_id|> as the eos token in tokenizer_config and reload the model.
model, tokenizer = load(
    "mlx-community/Meta-Llama-3-8B-Instruct-4bit",
    tokenizer_config={"eos_token": "<|eot_id|>"},
)
response = generate(
    model,
    tokenizer,
    prompt=tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True),
    verbose=True,  # set to True to see the prompt and response
    temp=0.0,
    max_tokens=200,
)

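If you want to keep chatting, one pattern is to append the reply to messages, add your next user turn (the follow-up question below is just a placeholder), and re-apply the chat template before calling generate again; a minimal sketch with the same model and tokenizer:

messages.append({"role": "assistant", "content": response})
messages.append({"role": "user", "content": "And who made you?"})

followup = generate(
    model,
    tokenizer,
    prompt=tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True),
    temp=0.0,
    max_tokens=200,
)
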
Nice, now go forth and experiment.