Properties

Pre-training Datasets

Scaling Laws

Emergent Abilities

Hallucination

Architectures

Mostly Transformers

GPT

Generative Pre-trained Transformers

[Figure: LLM family tree]