

# Model Collection

import { Callout, FileTree } from 'nextra-theme-docs'

<Callout emoji="⚠️">
  This section is under heavy development.
</Callout>

This section collects and summarizes notable and foundational LLMs.

## Models
| Model | Description |
| --- | --- |
| [BERT]( | Bidirectional Encoder Representations from Transformers |
| [RoBERTa]( | A Robustly Optimized BERT Pretraining Approach |
| [ALBERT]( | A Lite BERT for Self-supervised Learning of Language Representations |
| [XLNet]( | Generalized Autoregressive Pretraining for Language Understanding |
| [GPT]( | Improving Language Understanding by Generative Pre-Training |
| [GPT-2]( | Language Models are Unsupervised Multitask Learners |
| [GPT-3]( | Language Models are Few-Shot Learners |
| [T5]( | Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer |
| [CTRL]( | CTRL: A Conditional Transformer Language Model for Controllable Generation |
| [BART]( | Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension |
| [Chinchilla]( et al. 2022) | Shows that, for a given compute budget, the best performance is achieved not by the largest models but by smaller models trained on more data. |
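
The Chinchilla entry's claim can be made concrete with a rough calculation. The sketch below is not taken from this page. It assumes the commonly used approximation that training compute is about `6 * N * D` FLOPs (`N` parameters, `D` training tokens) and a rule of thumb of roughly 20 training tokens per parameter. The function name `compute_optimal_split` and both constants are illustrative assumptions, not values stated here.

```python
import math
from typing import Tuple

# Assumptions (not from this page):
#   training compute C is approximately 6 * N * D FLOPs
#   a compute-optimal model sees roughly 20 tokens per parameter
FLOPS_PER_PARAM_TOKEN = 6.0
TOKENS_PER_PARAM = 20.0


def compute_optimal_split(compute_flops: float) -> Tuple[float, float]:
    """Return (parameters, tokens) that spend compute_flops under the assumptions above."""
    if compute_flops <= 0:
        raise ValueError("compute_flops must be positive")
    # Solve C = 6 * N * (20 * N) for N, then derive D = 20 * N.
    params = math.sqrt(compute_flops / (FLOPS_PER_PARAM_TOKEN * TOKENS_PER_PARAM))
    tokens = TOKENS_PER_PARAM * params
    return params, tokens


if __name__ == "__main__":
    for budget in (1e21, 1e22, 1e23):
        n, d = compute_optimal_split(budget)
        print(f"C={budget:.0e} FLOPs: ~{n / 1e9:.1f}B params, ~{d / 1e9:.0f}B tokens")
```

Under these assumptions, both the parameter count and the token count grow with the square root of the budget. A tenfold increase in compute therefore raises each by about 3.2x, rather than putting all of it into model size.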