@aiDotEngineer · English

🤖 AI Engineer

AI Engineer conference talks, workshops, and panels. Frontier labs, agents, RAG, evals, the AI engineering toolchain.

678 videos · 20 topics

Topics

🤖 Agents 110

LLM agents that plan, call tools, and act in loops. LangGraph, CrewAI, AutoGen, custom orchestration, multi-agent systems, agent reliability.

agents · multi-agent · mcp · workflows

💻 Code Generation 79

AI coding tools and agents. Cursor, Devin, Copilot internals, SWE-bench, agentic refactoring, repo-scale understanding.

coding-agents · code-generation · ai-coding · agents

📐 Evals 58

How to actually measure LLM and agent quality — golden sets, LLM-as-judge, regression gates, production tracing, observability.

evals · observability · agents · llm-as-judge

💼 AI Business 54

Going to market with AI. Pricing, GTM, build-vs-buy, moats, enterprise adoption, vertical agents, ROI stories.

ai-business · startups · enterprise-ai · hiring

🔎 RAG 48

Retrieval-augmented generation — chunking, embeddings, hybrid search, rerankers, citation, evaluation. The dominant pattern for grounding LLMs in private data.

rag · graphrag · neo4j · knowledge-graphs

💬 LLM Apps 47

End-to-end LLM-powered applications. Prompt + context plumbing, structured outputs, retry & repair, user feedback loops.

agents · llm-apps · evals · rag

🏗️ AI Infrastructure 36

GPU clusters, training stacks, autoscaling inference, data pipelines, feature stores, observability for AI workloads.

agents · infrastructure · gpus · ai-infrastructure

✨ Product & UX 30

Designing AI features users actually want. Latency, trust, streaming, citations, undo, the "AI moment" in a product.

product · design · multimodal · agents

🔌 MCP 30

Model Context Protocol — how clients (Claude, Cursor, IDEs) connect to servers that expose tools, resources, and prompts.

mcp · agents · anthropic · enterprise

🧠 Foundation Models 30

Frontier LLM training, architecture choices, scaling, post-training (SFT/RLHF/DPO), evaluation, releases from OpenAI, Anthropic, Google, Meta, Mistral, etc.

foundation-models · gemini · open-weights · gemma

⚡ Inference & Serving 28

Throughput and latency engineering. Continuous batching, paged attention, quantization, speculative decoding, vLLM/TensorRT/SGLang.

inference · on-device · gpu · nvidia

🎙️ Voice 26

Real-time voice AI. ASR (Whisper), TTS, turn detection, latency, voice agents for phones, support, accessibility.

voice · pipecat · agents · voice-ai

🎯 Fine-Tuning 20

Adapting pre-trained models — full SFT, LoRA/QLoRA, DPO, preference tuning. When fine-tuning beats prompting + RAG.

fine-tuning · rl · unsloth · lora

🛡️ Safety & Alignment 19

Prompt injection defenses, jailbreak resistance, hallucination mitigation, PII handling, red-teaming, responsible scaling.

security · safety · red-teaming · prompt-injection

🎨 Multimodal 18

Vision-language models, video understanding, image generation, multimodal agents. GPT-4V, Claude vision, Gemini, open-source VLMs.

multimodal · video-generation · veo · robotics

🛠️ Tools & Frameworks 16

The AI engineering toolchain — LangChain, LlamaIndex, DSPy, LangGraph, LangSmith, Braintrust, Inspect, AGENTS.md.

agents · typescript · dspy · tool-calling

✏️ Prompt Engineering 12

Prompting patterns — few-shot, chain-of-thought, ReAct, structured output, prompt management at scale.

prompt-engineering · evals · prompt-optimization · claude

🧮 Embeddings & Vector DBs 6

Embedding models, chunking, hybrid retrieval, vector store choice (Pinecone, Qdrant, Weaviate, pgvector), reranking.

embeddings · vector-search · recsys · multimodal

📦 Misc 6

Talks that span multiple themes, panels, opening keynotes, and general AI Engineer content.

community · robotics · humanoid · open-source

🔬 Research 4

Frontier research talks — new architectures, training techniques, theoretical insights, paper deep-dives.

agi · world-models · code-generation · meta-fair