Jurassic-1 Language Models
Jurassic-1 (J1) is the first generation in a series of large language models trained and made widely accessible by AI21 Labs. There are two versions of Jurassic-1, differing by size:
- J1-Jumbo, with 178B parameters, is the largest and most sophisticated language model ever released for general use by developers. Jumbo is the most capable model in the J1 family, but it's also the slowest and most expensive to run.
- J1-Grande, with 17B parameters, is faster and more affordable than Jumbo, and more capable than Large.
- J1-Large, with 7.5B parameters, is smaller, faster and more affordable but overall less capable than Jumbo, though still very effective for many use-cases.
A complete description of Jurassic-1, including benchmarks and quantitative comparisons with other models, can be found in our technical paper.
All J1 models were trained on a massive corpus of English text, making them highly versatile general purpose text-generators, capable of composing human-like text and solving complex tasks such as question answering, text classification and many others.
J1 can be applied to virtually any language task by crafting a suitable prompt, containing a description of the task and/or a few examples, a process commonly known as “prompt engineering”. If you’re looking for inspiration, you can find example use-cases implemented with prompt engineering in our blog post. Popular use-cases include generating marketing copy, powering chatbots and assisting creative writing.
With trial and error, you should be able to bootstrap a prompt that produces good results for your use-case. However, to achieve even better quality and scale-up your app, we recommend that you train a custom model.