Tags: #llm

Dec 3, 2024

LLM Inference Became A Systems Problem

How batching, caching, quantization, and speculative decoding changed serving economics.

8 min
Oct 7, 2024

Agentic Workflows Need State And Guardrails

Why useful LLM agents depend on tools, state machines, evals, and careful failure handling.

7 min
- llm
- agents
- tools
- automation
Aug 27, 2024

Multimodal LLMs Became Product Infrastructure

How vision, audio, and text models changed document workflows, support, and automation.

5 min
- llm
- multimodal
- vision
- automation
Jun 18, 2024

Small Language Models Found Their Lane

Why smaller LLMs became useful for routing, extraction, classification, and edge workflows.

5 min
- llm
- slm
- fine-tuning
- inference
Apr 9, 2024

Long Context Is Not A Retrieval Strategy

Longer context windows help, but they do not replace retrieval, ranking, and context design.

7 min
- llm
- rag
- long-context
- retrieval
Feb 14, 2024

Structured Outputs Made LLMs Easier To Ship

Why schemas, tool calls, and constrained decoding changed how production LLM apps are built.

6 min
- llm
- structured-outputs
- agents
- json
Dec 18, 2023

LLM Evals Became the New Unit Test

Why evaluation became a core skill for building reliable LLM applications.

9 min
- llm
- evaluation
- rag
- prompt-engineering
Oct 5, 2023

QLoRA Made Fine-Tuning Feel Practical

How LoRA and QLoRA changed the economics of adapting LLMs.

8 min
- llm
- fine-tuning
- qlora
- lora