#llm

14 results

tech Explainer

June 6, 2026

How AI Reshapes How You Think: The Cognitive Shift Beyond the Tool

AI tools change more than your speed — they change how you think. The shift from 'how to do it' to 'what to do' and 'is this right?' has real long-term implications for engineers.

#ai #cognitive-change #llm #productivity #thinking #knowledge-work

tech How-To

June 4, 2026

How to Use Codex, Hermes, and Other AI Coding Agents for Free (Long-Term)

OpenAI Codex CLI and multiple AI coding agents have free tiers. The key is understanding each tool's quota mechanism, how to combine them to extend free usage, and when paid tiers are actually worth it.

#openai-codex #ai-coding #agent #free-tier #developer-tools #llm

tech How-To

June 3, 2026

Building a Video Production AI Agent with LangGraph: Lesson 3

Build a video production AI Agent with LangGraph that handles research, scripting, and storyboarding — the key is state machine design and conditional edges for error handling.

#ai-agent #langgraph #python #llm #workflow #automation

tech How-To

May 28, 2026

AI Agent Bills Exploding? A Practical Guide to Model and Tool Selection

AI agent billing spikes come from three places: using a stronger model than the task requires, running tool-call loops with no depth limit, and wasting context window by passing the full history every round. The correct cost control strategy is matching model capability to task complexity, not using the strongest model for everything.

#ai #llm #cost-optimization #agent #engineering
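
As a rough illustration of the three cost controls named in the summary above, here is a minimal sketch. Everything in it is an assumption made for illustration: the tier names, the 1-10 complexity score, and the helpers `pick_model`, `trim_history`, and `run_tool_loop` are hypothetical and do not come from the article or from any vendor API.

```python
def pick_model(task_complexity: int) -> str:
    """Map a hypothetical 1-10 complexity score to the cheapest adequate tier."""
    if task_complexity <= 3:
        return "small"
    if task_complexity <= 7:
        return "medium"
    return "large"


def trim_history(messages, max_messages=6):
    """Keep the first message (assumed to be the system prompt) plus recent turns."""
    if len(messages) <= max_messages:
        return list(messages)
    # Slice from an explicit index so max_messages=1 keeps only the first message.
    recent = messages[len(messages) - (max_messages - 1):]
    return [messages[0]] + list(recent)


def run_tool_loop(step, max_depth=5):
    """Call step(depth) until it returns a non-None result or the depth cap is hit.

    Returns (result, number_of_calls); result is None if the cap was reached.
    """
    for depth in range(max_depth):
        result = step(depth)
        if result is not None:
            return result, depth + 1
    return None, max_depth
```

The depth cap bounds the worst-case number of calls per task, and trimming bounds the tokens sent per call. Which thresholds make sense depends on the workload, so treat the defaults here as placeholders.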

tech Explainer

May 23, 2026

DeepSeek V4: 1.6 Trillion Parameter Open-Source Model Challenges GPT-5, Runs on Huawei Chips

DeepSeek V4 is a 1.6T parameter MoE open-source model with 1M token context that claims to outperform GPT-5.2 on some benchmarks — and is DeepSeek's first model optimized for Huawei Ascend chips.

#ai #deepseek #llm #open-source #china-tech

tech Explainer

May 20, 2026

How AI Agents Work, and What Is Harness Engineering?

AI Agents let models perceive environments and act autonomously. Harness Engineering is the discipline that makes them reliable — the scaffolding that turns a smart-but-unpredictable model into a deployable engineering system.

#ai-agent #harness-engineering #llm #system-design #ai-engineering

tech How-To

May 19, 2026

I Built a Fully Automatic Mansplainer

Built an LLM-powered bot that explains anything with condescending overconfidence. 90% of the engineering went into system prompt design, not code.

#llm #prompt-engineering #side-project #chatbot #claude-api

tech Explainer

May 12, 2026

Goodbye, Reptile Warriors: Python's Role Shift in the Age of AI

Python is still the dominant language for AI development, but the rise of AI coding tools is blurring the line between 'writing Python code' and 'doing AI development' — this is what that shift actually means.

#python #ai #programming-languages #developer-tools #llm

tech Deep Dive

May 10, 2026

KV Cache: The Most Critical Optimization in LLM Inference

KV Cache cuts the per-step cost of autoregressive Transformer generation from O(n²), when attention over the full sequence is recomputed for every new token, to O(n), which is the core reason modern LLM inference is fast enough to be usable.

#kv-cache #llm #inference-optimization #transformer #ai #machine-learning
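
To make the caching idea in the summary above concrete, here is a toy single-head, single-layer sketch in plain Python. It is a simplification under stated assumptions, not how any production inference engine is written: the query is the raw token vector (no query projection), and the function names and projection counter are hypothetical, added only to show the saved work.

```python
import math


def project(x, matrix):
    """Multiply vector x by a matrix stored as a list of rows."""
    return [sum(xi * row[j] for xi, row in zip(x, matrix))
            for j in range(len(matrix[0]))]


def attend(query, keys, values):
    """Scaled dot-product attention for one query over all keys and values."""
    scale = math.sqrt(len(query))
    scores = [sum(q * k for q, k in zip(query, key)) / scale for key in keys]
    peak = max(scores)  # subtract the max for a numerically stable softmax
    weights = [math.exp(s - peak) for s in scores]
    total = sum(weights)
    return [sum(w * v[i] for w, v in zip(weights, values)) / total
            for i in range(len(values[0]))]


def decode_without_cache(tokens, w_k, w_v):
    """Re-project every earlier token at every step."""
    outputs, projections = [], 0
    for step in range(1, len(tokens) + 1):
        prefix = tokens[:step]
        keys = [project(t, w_k) for t in prefix]
        values = [project(t, w_v) for t in prefix]
        projections += 2 * step
        outputs.append(attend(prefix[-1], keys, values))
    return outputs, projections


def decode_with_cache(tokens, w_k, w_v):
    """Project each token once and append its key and value to the cache."""
    keys, values = [], []
    outputs, projections = [], 0
    for token in tokens:
        keys.append(project(token, w_k))
        values.append(project(token, w_v))
        projections += 2
        outputs.append(attend(token, keys, values))
    return outputs, projections
```

Both functions return the same attention outputs. For n tokens, the uncached version performs n(n+1) key/value projections in total while the cached one performs 2n, at the price of keeping every key and value in memory.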

tech Deep Dive

May 9, 2026

How DeepSeek V3 Challenged Billion-Dollar AI Systems for $5.6M

DeepSeek V3's 671B-parameter MoE architecture, trained on just 2.788M H800 GPU-hours, achieves near-GPT-4 performance across multiple benchmarks, with API pricing at one-tenth of OpenAI's equivalent.

#deepseek #ai #open-source-models #moe #llm

tech Explainer

May 9, 2026

OpenAI's o3, o4-mini, and GPT-4.1: The Good, the Bad, and the Insane

OpenAI released three models in spring 2025: GPT-4.1 for coding and instruction-following, o3 as the strongest reasoning model, and o4-mini hitting remarkable math and code performance at low cost — but the pricing strategy and API access limits left developers with mixed feelings.

#openai #o3 #o4-mini #gpt-4-1 #ai #llm

tech Explainer

May 7, 2026

Why Your AI Agent Gets Worse Over Time — Context Rot Explained

AI agents degrading over long sessions isn't a model problem — it's a context problem. As the context window fills with failed attempts, outdated code, and contradictory instructions, signal-to-noise ratio drops. The fix is treating context like RAM, not a filing cabinet.

#ai #agent #context-engineering #llm #prompt-engineering

tech Explainer

May 3, 2026

LLM Inference in Three Layers: Decoding, Workflow, and Reasoning

LLM output quality is determined at three distinct layers: token-level decoding strategy, task-level workflow design, and model-level reasoning capability. Knowing which layer your problem lives in is the fastest path to fixing it.

#ai #llm #inference #chain-of-thought #decoding-strategies #ai-agent #machine-learning

tech Explainer

April 28, 2026

What Games Can We Build with a Small Model (10B Active Parameters)?

Small language models around 10B parameters can run on local hardware in real time, enabling dynamic NPC dialogue, procedural narrative generation, and adaptive game content. Research shows SLMs approach large model quality on short, well-constrained creative tasks — the key is curated training data and constrained inference design.

#game-dev #llm #small-model #ai #npc #interactive-fiction