<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/"><channel><title>Notes on knowledged.to</title><link>https://knowledged.to/tags/notes/</link><description>Recent content in Notes on knowledged.to</description><generator>Hugo</generator><language>en-us</language><lastBuildDate>Sat, 16 May 2026 21:38:50 +0530</lastBuildDate><atom:link href="https://knowledged.to/tags/notes/index.xml" rel="self" type="application/rss+xml"/><item><title>AI Agents in Go</title><link>https://knowledged.to/notes/ml/ai-agents-in-go/</link><pubDate>Sat, 16 May 2026 15:58:30 +0000</pubDate><guid>https://knowledged.to/notes/ml/ai-agents-in-go/</guid><description>Guide to building AI agents in Go using agent SDKs, with a minimal runnable example covering LLM integration, tools, and multi-agent workflows.</description></item><item><title>Open-weight Models</title><link>https://knowledged.to/notes/ml/open-weight-models/</link><pubDate>Sun, 26 Apr 2026 15:36:37 +0000</pubDate><guid>https://knowledged.to/notes/ml/open-weight-models/</guid><description>Explanation of open-weight models, their differences from closed and open-source models, and why they matter for local AI deployment and customization.</description></item><item><title>Cross-Entropy in AI</title><link>https://knowledged.to/notes/ml/cross-entropy-in-ai/</link><pubDate>Sat, 25 Apr 2026 16:47:31 +0000</pubDate><guid>https://knowledged.to/notes/ml/cross-entropy-in-ai/</guid><description>Explanation of cross-entropy as a loss function in AI, including intuition, formal definition, examples, and relationship to entropy and KL divergence</description></item><item><title>AI Prompts: System Prompt and Other Types</title><link>https://knowledged.to/notes/ml/ai-prompts/</link><pubDate>Thu, 16 Apr 2026 22:42:26 +0530</pubDate><guid>https://knowledged.to/notes/ml/ai-prompts/</guid><description>Overview of the different types of AI prompts including system, user, few-shot, zero-shot, chain-of-thought, meta, and retrieval-augmented prompts</description></item><item><title>Elastic Looped Transformers (ELT)</title><link>https://knowledged.to/notes/ml/elastic-looped-transformers/</link><pubDate>Thu, 16 Apr 2026 22:36:40 +0530</pubDate><guid>https://knowledged.to/notes/ml/elastic-looped-transformers/</guid><description>Overview of Elastic Looped Transformers, an adaptive compute architecture that loops a shallow transformer block multiple times to dynamically allocate compute based on input complexity</description></item><item><title>Tempo Framework</title><link>https://knowledged.to/notes/ml/tempo-framework/</link><pubDate>Thu, 16 Apr 2026 22:15:24 +0530</pubDate><guid>https://knowledged.to/notes/ml/tempo-framework/</guid><description>Overview of Tempo, a query-aware temporal compression framework for long-video understanding in multimodal AI, using a small VLM to filter relevant frames before passing a condensed representation to a large model</description></item><item><title>Memory-Augmented Architectures</title><link>https://knowledged.to/notes/ml/memory-augmented-architectures/</link><pubDate>Thu, 16 Apr 2026 22:04:57 +0530</pubDate><guid>https://knowledged.to/notes/ml/memory-augmented-architectures/</guid><description>Overview of memory-augmented neural network architectures that add dynamic external memory to models, covering NTMs, RAG, Memorizing Transformers, Titans, and practical implications for building persistent AI agents</description></item><item><title>Forward Pass and Single Pass in LLMs</title><link>https://knowledged.to/notes/ml/forward-pass-and-single-pass/</link><pubDate>Thu, 16 Apr 2026 21:19:49 +0530</pubDate><guid>https://knowledged.to/notes/ml/forward-pass-and-single-pass/</guid><description>Explanation of forward pass and single pass in LLMs, how transformer computation flows from embedding to output logits, and how speculative decoding exploits transformer parallelism to reduce large-model forward passes</description></item><item><title>Speculative Decoding</title><link>https://knowledged.to/notes/ml/speculative-decoding/</link><pubDate>Thu, 16 Apr 2026 20:54:05 +0530</pubDate><guid>https://knowledged.to/notes/ml/speculative-decoding/</guid><description>Explanation of speculative decoding, an inference optimization that uses a fast draft model to propose tokens verified in parallel by a large model, achieving 2–3x throughput gains with identical output quality</description></item><item><title>What Are Model Weights in an LLM?</title><link>https://knowledged.to/notes/ml/llm-model-weights/</link><pubDate>Mon, 13 Apr 2026 19:17:58 +0530</pubDate><guid>https://knowledged.to/notes/ml/llm-model-weights/</guid><description>Explanation of what model weights are in LLMs, how they encode learned behaviour, why parameter count matters, and how systems like Ollama load them into memory</description></item><item><title>GGUF Models</title><link>https://knowledged.to/notes/ml/gguf-models/</link><pubDate>Fri, 10 Apr 2026 09:11:52 +0530</pubDate><guid>https://knowledged.to/notes/ml/gguf-models/</guid><description>Overview of the GGUF binary format for storing and distributing LLMs locally, including quantization levels, key characteristics, and popular runtimes like llama.cpp and Ollama</description></item><item><title>Prompt Bias in AI</title><link>https://knowledged.to/notes/ml/prompt-bias-in-ai/</link><pubDate>Thu, 09 Apr 2026 20:58:42 +0530</pubDate><guid>https://knowledged.to/notes/ml/prompt-bias-in-ai/</guid><description>Explanation of prompt bias, how prompt wording and framing skew AI outputs, common forms including leading questions and assumption bias, and practical advice for writing neutral prompts</description></item><item><title>Primacy Bias in LLM Style Selection</title><link>https://knowledged.to/notes/ml/primacy-bias-in-llm-style-selection/</link><pubDate>Wed, 08 Apr 2026 21:43:53 +0530</pubDate><guid>https://knowledged.to/notes/ml/primacy-bias-in-llm-style-selection/</guid><description>Explanation of primacy bias in LLM selector prompts, how alphabetical candidate ordering caused over-selection of certain styles in BHQ, and fixes using deterministic non-lexicographic shuffling</description></item><item><title>Slack MCP Ideas</title><link>https://knowledged.to/notes/devops/slack-mcp-ideas/</link><pubDate>Wed, 08 Apr 2026 15:12:20 +0530</pubDate><guid>https://knowledged.to/notes/devops/slack-mcp-ideas/</guid><description>Ideas for using Slack MCP to monitor automation opportunities and identify duplicated efforts within an organisation</description></item><item><title>ELO Scoring for AI Models</title><link>https://knowledged.to/notes/ml/elo-scoring-for-ai-models/</link><pubDate>Tue, 07 Apr 2026 22:15:56 +0530</pubDate><guid>https://knowledged.to/notes/ml/elo-scoring-for-ai-models/</guid><description>Explanation of how ELO scoring is applied to rank AI models via human preference votes, including the math, strengths, weaknesses, and real-world use in Chatbot Arena</description></item><item><title>Knowledge Distillation</title><link>https://knowledged.to/notes/ml/knowledge-distillation/</link><pubDate>Mon, 06 Apr 2026 22:43:20 +0530</pubDate><guid>https://knowledged.to/notes/ml/knowledge-distillation/</guid><description>Overview of knowledge distillation, how student models learn from teacher model outputs, and applications including edge deployment, speculative decoding, and LLM training</description></item><item><title>Training-Free GRPO</title><link>https://knowledged.to/notes/ml/training-free-grpo/</link><pubDate>Mon, 06 Apr 2026 22:33:04 +0530</pubDate><guid>https://knowledged.to/notes/ml/training-free-grpo/</guid><description>Overview of Training-Free GRPO, a method that improves LLM agent performance by updating model context (experience library) instead of parameters, achieving RL-like gains at a fraction of the cost</description></item><item><title>Attention Mechanism</title><link>https://knowledged.to/notes/ml/attention-mechanism/</link><pubDate>Mon, 06 Apr 2026 22:18:46 +0530</pubDate><guid>https://knowledged.to/notes/ml/attention-mechanism/</guid><description>Explanation of the attention mechanism in AI, including self-attention, cross-attention, multi-head attention, and the mathematical formulation behind Transformers</description></item><item><title>Transformer Architecture</title><link>https://knowledged.to/notes/ml/transformer-architecture/</link><pubDate>Mon, 06 Apr 2026 22:15:42 +0530</pubDate><guid>https://knowledged.to/notes/ml/transformer-architecture/</guid><description>Overview of the Transformer architecture including self-attention, multi-head attention, positional encoding, encoder-decoder structure, and key variants like BERT, GPT, and T5</description></item><item><title>Recurrent Neural Networks (RNNs)</title><link>https://knowledged.to/notes/ml/recurrent-neural-networks/</link><pubDate>Mon, 06 Apr 2026 22:12:16 +0530</pubDate><guid>https://knowledged.to/notes/ml/recurrent-neural-networks/</guid><description>Overview of RNNs, their memory mechanism, common variants (LSTM, GRU), use cases, and how they compare to Transformers</description></item><item><title>RLHF and DPO: Aligning AI to Human Preferences</title><link>https://knowledged.to/notes/ml/rlhf-and-dpo/</link><pubDate>Mon, 06 Apr 2026 22:03:50 +0530</pubDate><guid>https://knowledged.to/notes/ml/rlhf-and-dpo/</guid><description>Comparison of RLHF and DPO alignment techniques, covering their pipelines, strengths, weaknesses, and where each is used in practice</description></item><item><title>Instruction Tuning</title><link>https://knowledged.to/notes/ml/instruction-tuning/</link><pubDate>Mon, 06 Apr 2026 21:54:49 +0530</pubDate><guid>https://knowledged.to/notes/ml/instruction-tuning/</guid><description>Overview of instruction tuning, how it works, dataset construction, and variants like RLHF, RLAIF, and DPO</description></item><item><title>Perplexity in Language Models</title><link>https://knowledged.to/notes/ml/perplexity-in-language-models/</link><pubDate>Mon, 06 Apr 2026 21:46:03 +0530</pubDate><guid>https://knowledged.to/notes/ml/perplexity-in-language-models/</guid><description>Explanation of perplexity as a language model evaluation metric, including the formula, intuition, caveats, and relationship to cross-entropy loss</description></item><item><title>Model Quantization</title><link>https://knowledged.to/notes/ml/model-quantization/</link><pubDate>Mon, 06 Apr 2026 21:39:37 +0530</pubDate><guid>https://knowledged.to/notes/ml/model-quantization/</guid><description>Overview of model quantization techniques, precision levels, and trade-offs for reducing neural network memory and improving inference speed</description></item><item><title>GCloud Quick Reference</title><link>https://knowledged.to/notes/devops/gcloud-quick-reference/</link><pubDate>Mon, 06 Apr 2026 19:06:46 +0530</pubDate><guid>https://knowledged.to/notes/devops/gcloud-quick-reference/</guid><description>Quick reference for common gcloud commands including authentication, project setup, and GKE cluster configuration</description></item><item><title>Kubernetes Port Forward</title><link>https://knowledged.to/notes/devops/kubernetes-port-forward/</link><pubDate>Mon, 06 Apr 2026 18:59:45 +0530</pubDate><guid>https://knowledged.to/notes/devops/kubernetes-port-forward/</guid><description>Quick reference for kubectl port-forward command syntax</description></item></channel></rss>