Databricks Dolly
The world's first truly open instruction-tuned LLM.
Overview
Databricks' Dolly is an instruction-following large language model trained on the Databricks machine learning platform. It is based on the EleutherAI Pythia model family and fine-tuned on databricks-dolly-15k, a human-generated instruction-following dataset crowdsourced from Databricks employees. Dolly is licensed for commercial use, so organizations can use it to build their own instruction-following LLMs.
✨ Key Features
- Instruction-following capabilities
- Based on the EleutherAI Pythia model family
- Trained on a human-generated instruction dataset (databricks-dolly-15k)
- Licensed for commercial use
- Open-source training code and dataset
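
To make the instruction-dataset idea concrete, here is a minimal sketch of how a single record might be turned into a training prompt. The field names (`instruction`, `context`, `response`) match the published databricks-dolly-15k schema. The helper `format_record`, the section markers, and the intro wording are illustrative assumptions, not necessarily the exact template used in Databricks' training code.

```python
# Hypothetical helper: flatten one instruction record into a prompt string.
# The template below is an assumption made for illustration only.

INTRO = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request."
)


def format_record(record: dict) -> str:
    """Build a prompt from a record with instruction/context/response fields."""
    parts = [INTRO, "", "### Instruction:", record["instruction"].strip()]

    # Context is optional; many records carry an empty string here.
    context = (record.get("context") or "").strip()
    if context:
        parts += ["", "Input:", context]

    # During fine-tuning the response follows the marker; at inference
    # time it would be left empty for the model to complete.
    parts += ["", "### Response:", (record.get("response") or "").strip()]
    return "\n".join(parts)


# Made-up example record, not taken from the dataset.
example = {
    "instruction": "Name a primary color.",
    "context": "",
    "response": "Blue",
}
print(format_record(example))
```

Keeping the template in one function makes it easy to reuse the same layout for both fine-tuning data and inference prompts, which is typically what an instruction-tuned model expects.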
🎯 Key Differentiators
- Licensed for commercial use
- Trained on a high-quality, human-generated instruction dataset
Unique Value: Provides a truly open, commercially usable instruction-tuned LLM that organizations can customize into their own models.
🏆 Alternatives
Compared with alternatives, Dolly offers a commercially permissive license and a human-generated training dataset, which some other instruction-tuned models lack.
💻 Platforms
✅ Offline Mode Available
💰 Pricing
Free tier: Free for research and commercial use.
🔄 Similar Tools in Open Source LLMs
Meta Llama 3
A family of pretrained and instruction-tuned generative text models from Meta....
Mistral AI
A French company specializing in high-performance, efficient, and accessible large language models....
EleutherAI
A non-profit AI research group focused on open-source AI research and the development of large langu...
Qwen
A series of large language and multimodal models developed by Alibaba Cloud, with many variants dist...
Google Gemma
A family of lightweight, open models built from the same research and technology used to create the ...
Falcon
A family of open-source large language models available in various parameter sizes, released under t...