#LLM

16 posts

// posts tagged #llm
GLM 5.2 Outperforms Claude on Semgrep's Cybersecurity Benchmarks

GLM 5.2 from Zhipu AI tops Semgrep's internal cybersecurity benchmark suite, outranking Claude. What the numbers mean — and don't mean — for security teams.

Building AI Agents with MCP: Stop Writing Glue Code That Breaks

MCP standardizes how LLM agents invoke tools and APIs. Here's the architecture, the real productivity unlock, and what production deployments actually cost you.

Stop Trusting AI, Start Designing It: GraphRAG + MCP for Large Codebases

AI hallucinations aren't a trust problem — they're a design problem. Here's how GraphRAG and MCP reshape what AI can reliably do in production codebases.

AI Doesn't Hallucinate. Your Architecture Does.

Hallucination isn't a bug in LLMs—it's the mechanism. The real problem is misallocating non-determinism in your system design.

Structured Output From Local LLMs: Getting Reliable JSON with Ollama and Zod

Learn how to get schema-validated JSON from local LLMs using Ollama's structured output mode and Zod schemas. No more broken JSON responses.

GLM 5.2 Released: Zhipu's Open-Weights Model Challenges Closed AI

Zhipu AI drops GLM 5.2 with open weights, 1M context, and competitive benchmarks. Here's what developers need to know about China's latest flagship LLM.

GLM 5.2 Released: Zhipu AI's Latest Model Targets Developer Workflows

Zhipu AI announces GLM 5.2, the newest iteration of its open-weight language model series, bringing enhanced reasoning and coding capabilities to developers.

The Inverted LLM Stack: Teaching Cheap Models to Learn From Expensive Ones

Most agent frameworks assume expensive frontier models. One developer flipped the script—using cheap LLMs as workers and expensive ones as teachers.

NeuralBridge: Open-Source SDK Auto-Heals LLM Agent Failures in 19 Microseconds

New Apache 2.0 SDK promises 84% recovery rate for production LLM failures with minimal overhead. Self-healing agents without the complexity.

Building Production AI Pipelines: Lessons from Processing 10K+ Jobs Daily

A deep dive into the architecture, cost tradeoffs, and failure modes of scaling LLM-powered agent pipelines in production environments.