Google Colossus on PyTorch via GCSF: Speeding Up AI Training
Discover how Google Colossus, integrated with PyTorch via GCSF, significantly accelerates AI model training.
Harness Gemini Embedding 2 to create sophisticated agentic multimodal RAG systems for advanced AI applications.
Achieve a threefold increase in LLM inference speed by leveraging Google TPUs for optimized machine learning performance.
A new theory aims to deepen our understanding of the core principles behind deep learning.
The release of Gemma 4 MTP signifies a potential advancement in AI model capabilities and architecture.
A detailed quality comparison of Qwen 3.6 27B quantizations, including BF16, explores performance trade-offs in large language models.
Achieve a significant speed-up in Large Language Model inference using Qwen 3.6 27B with the MTP optimization technique.
Delve into the mathematical underpinnings of diffusion models and their integrals for advanced AI generation.
Explore how Gemma 4 achieves faster inference with innovative multi-token prediction techniques, boosting LLM performance.
A comprehensive guide to the data, compute, and architectural considerations involved in building your own Large Language Model.
Don't let massive LLMs cripple your compute budget. Explore Intel's AutoRound, a cutting-edge quantization algorithm crucial for efficient, performant AI. Optimize your models today!
Grok 4.3 is here. We dive deep into x.ai's new model, dissecting its technical advancements, API changes, and what developers should know. Read our sharp take now!
A new report details the Shai-Hulud malware found in PyTorch Lightning, exposing the urgent need for robust supply chain security in ML development. Learn more.
Mistral's latest LLM, Medium 3.5, emphasizes remote agents. What does this mean for building scalable, intelligent AI applications? Read our deep dive.
Natural language limits current LLMs. This piece argues for a shift to vector space reasoning to unlock true intelligence and overcome scaling hurdles. Learn more.
A 13B LLM trained exclusively on pre-1931 text can still learn to code. This time-frozen AI challenges assumptions on data bias and reasoning. Discover the implications for future LLM development. Read more.
Introduction: The Evolving Landscape of Voice AI. The demand for natural, expressive, and scalable voice interactions …