WikiAINeural NetworksTransformersLlmPropertiesPre-training Datasets Scaling Laws Emergent Abilities Hallucination ArchitecturesMostly TransformersGPT Generative Pre-trained TransformersAttentionTransformers