Artificial Intelligence | The Coders Blog | Home

on-device AI LiteRT NPU edge AI mobile real-time privacy embedded AI

Building Real-World On-Device AI with LiteRT and NPU

Learn how to implement powerful on-device AI applications using LiteRT and Neural Processing Units (NPUs).

The Coders Blog

May 6, 2026

AI PyTorch Google Colossus GCSF machine learning training performance distributed computing

Google Colossus on PyTorch via GCSF: Speeding Up AI Training

Discover how Google Colossus, integrated with PyTorch via GCSF, significantly accelerates AI model training.

The Coders Blog

May 6, 2026

Gemini embeddings multimodal AI RAG AI agents LLM computer vision retrieval augmented generation

Building with Gemini Embedding 2: Agentic Multimodal RAG

Harness Gemini Embedding 2 to create sophisticated agentic multimodal RAG systems for advanced AI applications.

The Coders Blog

May 6, 2026

LLM inference TPU Google AI acceleration performance machine learning large language models

3X Speed Boost: Supercharging LLM Inference on Google TPUs

Achieve a threefold increase in LLM inference speed by leveraging Google TPUs for optimized machine learning performance.

The Coders Blog

May 6, 2026

AWS Amazon WorkSpaces AI workflow cloud virtual desktop productivity modernization

AI Revolutionizes Workflows: Amazon WorkSpaces Embraces the Future

Amazon WorkSpaces now leverages AI to modernize workflows, enhancing productivity and efficiency for users.

The Coders Blog

May 6, 2026

deep learning AI theory neural networks machine learning AI research

A Theory of Deep Learning: Understanding the Fundamentals

Exploring a new theory that aims to provide a deeper understanding of the core principles behind deep learning.

The Coders Blog

May 6, 2026

Gemma 4 MTP LLM AI model release new technology deep learning

Gemma 4 MTP Released: A New Era for AI Models

The release of Gemma 4 MTP signifies a potential advancement in AI model capabilities and architecture.

The Coders Blog

May 6, 2026

DeepSeek V4 LLM inference AI costs performance metrics cost-effectiveness

DeepSeek V4: Measuring the 17x Cheaper LLM Inference

An analysis measuring the cost-effectiveness of DeepSeek V4, demonstrating a significant reduction in LLM inference expenses.

The Coders Blog

May 6, 2026

Qwen LLM quantization BF16 AI performance large language models

Qwen 3.6 27B Quantization: A Deep Dive into Quality

A detailed quality comparison of Qwen 3.6 27B quantizations, including BF16, explores performance trade-offs in large language models.

The Coders Blog

May 6, 2026

LLM inference Qwen MTP AI optimization

2.5x Faster LLM Inference: Qwen 3.6 27B Achieves Breakthrough with MTP

Achieve a significant speed-up in Large Language Model inference using Qwen 3.6 27B with the MTP optimization technique.

The Coders Blog

May 6, 2026

Google Cloud fraud detection reCAPTCHA cybersecurity AI

Google Cloud's Fraud Defense: The Next Generation of reCAPTCHA

Discover how Google Cloud is evolving its fraud defense with advanced AI and threat detection capabilities.

The Coders Blog

May 6, 2026

diffusion models generative AI machine learning deep learning mathematics

Unlocking Generative Power: Understanding the Integral of Diffusion Models

Delve into the mathematical underpinnings of diffusion models and their integrals for advanced AI generation.

The Coders Blog

May 6, 2026

hiring talent acquisition freelance specialized skills recruitment

AI-Native Startups and the Rise of Fractional Engineers

Examining the demand for fractional engineers within AI-native startups, reflecting a new model for acquiring specialized tech talent.

The Coders Blog

May 6, 2026

AI LLM knowledge base documentation data management

Hallucinopedia: Taming AI-Generated Knowledge

Showcasing Hallucinopedia, a new tool designed to effectively manage and curate information from AI models.

The Coders Blog

May 6, 2026

AI Claude Anthropic large language models compute cloud

Anthropic Expands Claude Access with Higher Usage Limits

Anthropic significantly raises usage limits for its Claude AI model and secures a compute deal, paving the way for broader AI adoption.

The Coders Blog

May 6, 2026

AI agents sandbox development tools transactional verifiable Show HN

Tilde.run: A New Transactional Agent Sandbox

Show HN: Tilde.run introduces a novel agent sandbox featuring transactional and verifiable capabilities for AI development.

The Coders Blog

May 6, 2026

AI agentic coding vibe coding software engineering LLMs

Vibe Coding vs. Agentic Engineering: A Collision Course for Software Teams

The rapid rise of vibe coding is colliding with agentic engineering, creating a critical juncture that demands scrutiny of accountability, quality, and human oversight.

The Coders Blog

May 6, 2026

Gemma 4 LLM AI inference performance optimization machine learning multi-token prediction deep learning

Gemma 4: Faster AI Inference Through Advanced Multi-Token Prediction

Explore how Gemma 4 achieves faster inference with innovative multi-token prediction techniques, boosting LLM performance.

The Coders Blog

May 6, 2026

Meta AI content moderation ethical AI social media platform policy governance

Zuckerberg Authorized Meta's AI Content Moderation: A Deep Dive

Unpacking the executive authorization behind Meta's AI-driven content moderation strategies and their implications.

The Coders Blog

May 6, 2026

AI accent modification call center speech AI customer experience voice modulation ethics

Telus AI: Altering Call Agent Accents for Customer Experience

Telus employs AI to modify call agent accents, raising questions about authenticity and standardization in customer service.

The Coders Blog

May 6, 2026

Building Real-World On-Device AI with LiteRT and NPU

Google Colossus on PyTorch via GCSF: Speeding Up AI Training

Building with Gemini Embedding 2: Agentic Multimodal RAG

3X Speed Boost: Supercharging LLM Inference on Google TPUs

AI Revolutionizes Workflows: Amazon WorkSpaces Embraces the Future

A Theory of Deep Learning: Understanding the Fundamentals

Gemma 4 MTP Released: A New Era for AI Models

DeepSeek V4: Measuring the 17x Cheaper LLM Inference

Qwen 3.6 27B Quantization: A Deep Dive into Quality

2.5x Faster LLM Inference: Qwen 3.6 27B Achieves Breakthrough with MTP

Google Cloud's Fraud Defense: The Next Generation of reCAPTCHA

Unlocking Generative Power: Understanding the Integral of Diffusion Models

AI-Native Startups and the Rise of Fractional Engineers

Hallucinopedia: Taming AI-Generated Knowledge

Anthropic Expands Claude Access with Higher Usage Limits

Tilde.run: A New Transactional Agent Sandbox

Vibe Coding vs. Agentic Engineering: A Collision Course for Software Teams

Gemma 4: Faster AI Inference Through Advanced Multi-Token Prediction

Zuckerberg Authorized Meta's AI Content Moderation: A Deep Dive

Telus AI: Altering Call Agent Accents for Customer Experience

Converters

Formatters

Encoder / Decoder

Generators

Design & Utility

Join out mailing list