luminary.blog
by Oz Akan

Technical


Posts in 2025

  • Understanding ML Numerical Formats

    Understanding INT4, INT8, FP16, BF16, and TF32 formats in machine learning - their precision, speed, and memory trade-offs for training and inference.

  • What do GPT-OSS and Gemma 3 really offer?

    GPT-OSS and Gemma 3: two new small-but-powerful language models pushing the boundaries.

  • What are Positional Embeddings?

    The mathematical technique that teaches AI models where each word sits in a sequence.

  • Words, Tokens and Embeddings

    How language models convert token IDs into meaningful vector representations that capture semantic relationships.

  • Subword Tokenization Algorithms

    Understanding the algorithms behind tokenization in Large Language Models.

  • What is LLM Inference?

    Understanding how Large Language Models generate text through the inference process.

  • CUDA Programming: An Introduction

    Getting started with CUDA programming: Hello Threads

  • A Tale of Software Rot

    A fictional story about systematically eliminating duplication to prevent software rot in a growing engineering organization.

  • SOLID Principles with Examples

    It is better with examples.

  • OAuth 2.0 Flows, Security and Best Practices

    Comprehensive OAuth 2.0 reference covering authorization flows, PKCE, token security, SPA patterns, and implementation best practices with detailed diagrams.
