🪴 Anil's Garden

Alpaca: A Strong, Replicable Instruction-Following Model

19 Dec 2025, 1 min read

paper

Title: Alpaca: A Strong, Replicable Instruction-Following Model
Authors: Rohan Taori, Ishaan Gulrajani, Tianyi Zhang, Yann Dubois, Xuechen Li, Carlos Guestrin, Percy Liang, Tatsunori B. Hashimoto
Published: 2023-03-13
Link: https://crfm.stanford.edu/2023/03/13/alpaca.html

Abstract

We introduce Alpaca 7B, a model fine-tuned from the LLaMA 7B model on 52K instruction-following demonstrations. On our preliminary evaluation of single-turn instruction following, Alpaca behaves qualitatively similarly to OpenAI’s text-davinci-003, while being surprisingly small and easy/cheap to reproduce (<$600). Check out our code release on GitHub.

Backlinks

Instruction Tuning for Large Language Models: A Survey
Language Models
