Ahead of AI - Scraped Articles Index

Scraped on: 2026-01-27T18:08:43.608Z

Source: https://magazine.sebastianraschka.com

Total Articles: 65


Articles

# Title Date Images Words Status
2 The State Of LLMs 2025: Progress, Problems, and Predictions JUL 19, 2025 28 ~6857 ✅ Complete
1 Categories of Inference-Time Scaling for Improved LLM Reasoning JUL 19, 2025 17 ~7335 ✅ Complete
3 LLM Research Papers: The 2025 List (July to December) JUL 19, 2025 16 ~3314 ✅ Complete
4 From DeepSeek V3 to V3.2: Architecture, Sparse Attention, and RL Updates JUL 19, 2025 22 ~5889 ✅ Complete
5 Beyond Standard LLMs JUL 19, 2025 28 ~7404 ✅ Complete
6 Understanding the 4 Main Approaches to LLM Evaluation (From Scratch) JUL 19, 2025 18 ~6473 ✅ Complete
7 Understanding and Implementing Qwen3 From Scratch JUL 19, 2025 21 ~7964 ✅ Complete
8 From GPT-2 to gpt-oss: Analyzing the Architectural Advances JUL 19, 2025 26 ~5647 ✅ Complete
9 The Big LLM Architecture Comparison FEB 5, 2025 60 ~12056 ✅ Complete
10 LLM Research Papers: The 2025 List (January to June) JUL 19, 2025 11 ~4119 ✅ Complete
11 Understanding and Coding the KV Cache in LLMs from Scratch JUL 19, 2025 15 ~3110 ✅ Complete
12 Coding LLMs from the Ground Up: A Complete Course JUL 19, 2025 1 ~1017 ✅ Complete
13 The State of Reinforcement Learning for LLM Reasoning JUL 19, 2025 36 ~7957 ✅ Complete
14 First Look at Reasoning From Scratch: Chapter 1 JUL 19, 2025 7 ~3935 ✅ Complete
15 The State of LLM Reasoning Model Inference JUL 19, 2025 26 ~4627 ✅ Complete
16 Understanding Reasoning LLMs JUL 19, 2025 18 ~4363 ✅ Complete
17 Noteworthy AI Research Papers of 2024 (Part Two) JUL 19, 2025 22 ~5751 ✅ Complete
18 Noteworthy AI Research Papers of 2024 (Part One) JUL 19, 2025 11 ~3402 ✅ Complete
19 LLM Research Papers: The 2024 List JUL 19, 2025 2 ~6515 ✅ Complete
20 Understanding Multimodal LLMs JUL 19, 2025 31 ~5021 ✅ Complete
21 Building A GPT-Style LLM Classifier From Scratch JUL 19, 2025 21 ~4475 ✅ Complete
22 Building LLMs from the Ground Up: A 3-hour Coding Workshop JUL 19, 2025 1 ~355 ✅ Complete
23 New LLM Pre-training and Post-training Paradigms JUL 19, 2025 21 ~4595 ✅ Complete
24 Instruction Pretraining LLMs JUL 19, 2025 18 ~6797 ✅ Complete
25 Developing an LLM: Building, Training, Finetuning JUL 19, 2025 0 ~204 ✅ Complete
26 LLM Research Insights: Instruction Masking and New LoRA Finetuning Experiments JUL 19, 2025 19 ~4398 ✅ Complete
27 How Good Are the Latest Open LLMs? And Is DPO Better Than PPO? JUL 19, 2025 19 ~5651 ✅ Complete
28 Using and Finetuning Pretrained Transformers JUL 19, 2025 9 ~3541 ✅ Complete
29 Tips for LLM Pretraining and Evaluating Reward Models JUL 19, 2025 15 ~5333 ✅ Complete
30 A LoRA Successor, Small Finetuned LLMs Vs Generalist LLMs, and Transparent LLM Research JUL 19, 2025 18 ~5229 ✅ Complete
31 Improving LoRA: Implementing Weight-Decomposed Low-Rank Adaptation (DoRA) from Scratch JUL 19, 2025 13 ~3241 ✅ Complete
32 Model Merging, Mixtures of Experts, and Towards Smaller LLMs JUL 19, 2025 21 ~5792 ✅ Complete
33 Understanding and Coding Self-Attention, Multi-Head Attention, Causal-Attention, and Cross-Attention in LLMs JUL 19, 2025 21 ~4997 ✅ Complete
34 Ten Noteworthy AI Research Papers of 2023 JUL 19, 2025 27 ~4553 ✅ Complete
35 Tackling Hallucinations, Boosting Reasoning Abilities, and New Insights into the Transformer Architecture JUL 19, 2025 19 ~5369 ✅ Complete
36 Practical Tips for Finetuning LLMs Using LoRA (Low-Rank Adaptation) JUL 19, 2025 17 ~3605 ✅ Complete
37 A Potential Successor to RLHF for Efficient LLM Alignment and the Resurgence of CNNs JUL 19, 2025 11 ~3559 ✅ Complete
38 AI and Open Source in 2023 JUL 19, 2025 18 ~3238 ✅ Complete
39 LLM Business and Busyness: Recent Company Investments and AI Adoption, New Small Openly Available LLMs, and LoRA Research JUL 19, 2025 14 ~3326 ✅ Complete
40 From Self-Alignment to LongLoRA JUL 19, 2025 23 ~1976 ✅ Complete
41 LLM Training: RLHF and Its Alternatives JUL 19, 2025 18 ~3362 ✅ Complete
42 The Missing Bits: Llama 2 Weights Have Changed JUL 19, 2025 10 ~1283 ✅ Complete
43 New Foundation Models: CodeLlama and other highlights in Open-Source AI JUL 19, 2025 22 ~4244 ✅ Complete
44 Llama 2, Flash-Attention 2, and More JUL 19, 2025 15 ~1250 ✅ Complete
45 Large Language Models and Nearest Neighbors JUL 19, 2025 11 ~3039 ✅ Complete
46 Long Contexts and Scaling Transformers to 1,000,000,000 Tokens JUL 19, 2025 23 ~1822 ✅ Complete
47 State of Computer Vision 2023: From Vision Transformers to Neural Radiance Fields JUL 19, 2025 18 ~3132 ✅ Complete
48 Accelerating PyTorch Model Training JUL 19, 2025 15 ~2002 ✅ Complete
49 Understanding Encoder And Decoder LLMs JUL 19, 2025 5 ~1566 ✅ Complete
50 Direct-Preference Optimization for Human Feedback and More JUL 19, 2025 25 ~1999 ✅ Complete
51 LLM Tuning & Dataset Perspectives JUL 19, 2025 19 ~3462 ✅ Complete
52 About LayerNorm Variants in the Original Transformer Paper, and Some Other Interesting Historical Tidbits About LLMs JUL 19, 2025 6 ~1156 ✅ Complete
53 Finetuning LLMs Efficiently with Adapters JUL 19, 2025 12 ~1830 ✅ Complete
54 Transformers for Long Inputs and Less Training Data JUL 19, 2025 13 ~1772 ✅ Complete
55 Insights from Large-Scale LLM Training Runs JUL 19, 2025 13 ~2923 ✅ Complete
56 Understanding Parameter-Efficient LLM Finetuning: Prompt Tuning And Prefix Tuning JUL 19, 2025 8 ~1016 ✅ Complete
57 Finetuning Large Language Models JUL 19, 2025 9 ~2246 ✅ Complete
58 Understanding Large Language Models JUL 19, 2025 21 ~3501 ✅ Complete
59 Large Language Models 3.0 JUL 19, 2025 16 ~4087 ✅ Complete
60 TrAIn Differently: Do We Need Reinforcement Learning with Human Feedback (RLHF)? JUL 19, 2025 17 ~4586 ✅ Complete
61 RevAIval of Ideas: From Next-Generation Convolutional Neural Networks to LLMs JUL 19, 2025 26 ~4682 ✅ Complete
62 Looking Back at 2022: A Big Year For AI JUL 19, 2025 15 ~3059 ✅ Complete
63 Launching Large Language Models and Open Source Software JUL 19, 2025 18 ~2936 ✅ Complete
64 Transformers, Fast and Slow: New Developments in Language Processing JUL 19, 2025 13 ~2953 ✅ Complete
65 A Diffusion of Innovations: Recent Developments in Generative Learning JUL 19, 2025 12 ~2110 ✅ Complete