All Posts
Optimizing KV Cache Management in TensorRT-LLM for H100 GPUs
Tuned KV cache allocation and paging strategy in TensorRT-LLM to maximize concurrent sequences on H100 GPUs for Llama 3 70B at long context, measuring memory pressure, calculating theoretical limits, tuning memory fraction settings, comparing fixed vs paged strategies, and implementing prefix reuse.
Running Qwen 2.5 Coder Locally with OpenCode: A Private Offline AI Coding Assistant
A complete setup guide for running Qwen 2.5 Coder locally via Ollama and connecting it to OpenCode (an open-source AI coding CLI) to create a private, offline-capable AI pair programmer in your terminal.
Multi-Model Routing with Triton Inference Server for Efficient LLM Serving
Deployed Triton Inference Server with two models, routing simple queries to Llama 3 8B and complex queries to Llama 3 70B.
TensorRT-LLM Engine Build Flags Deep Dive for Llama 3 70B on H100 SXM5
A detailed analysis of `trtllm-build` flags for optimizing the inference engine configuration for Llama 3 70B on H100 SXM5.
Tensor Parallelism vs Pipeline Parallelism on 4x H100 GPUs for Llama 3 405B Inference
Configured tensor parallelism on 4x H100 NVLink for Llama 3 405B inference, comparing TP=2 vs TP=4 in terms of throughput, latency, and NVLink bandwidth utilization.
Speculative Decoding Latency Optimization on TensorRT-LLM with Llama 3 Models
Tuning speculative decoding using Llama 3 8B as the draft model and Llama 3 70B as the target to reduce decode latency.
Power Efficiency Tuning of NVIDIA GB10 for Sustained Inference
Measuring and optimizing tokens per watt on DGX Spark GB10 under sustained inference load.
CUDA Graph Capture for Low-Latency LLM Decode on RTX 4090
Using CUDA graph capture in TensorRT-LLM to eliminate kernel launch overhead during the decode phase of Llama 3 8B on RTX 4090.
Evaluating Inflight Batching and Scheduler Policies in TensorRT-LLM Triton Backend
Configured inflight batching in TensorRT-LLM Triton backend for Llama 3 8B model, tuning parameters to optimize GPU utilization and balance latency and throughput under bursty loads.
AWQ vs. FP8 Quantization: Balancing Accuracy and Throughput for Llama 3 70B
Comparing AWQ INT4 and FP8 quantization methods on the Llama 3 70B model across accuracy benchmarks (MMLU, HumanEval, GSM8K, MT-Bench) and inference throughput measurements on an H100 GPU.
End-to-End TensorRT-LLM FP8 Inference Optimization on NVIDIA GB10 Blackwell
A phased end-to-end TensorRT-LLM FP8 inference optimization on NVIDIA GB10 Blackwell, taking throughput from baseline to 3K+ tokens/sec.
Automated Content Publishing Pipeline with Local LLMs
Built an automated content publishing pipeline using local LLMs that processes raw session notes into structured markdown and publishes them to a blog GitHub repository if they meet publishability criteria.
Head of AI Products - Multi-Agent Systems Architect
A job posting seeking a strong background in AI agent architectures, orchestration, model selection, and deployment.
TensorRT Profiling Report
A report summarizing completed TensorRT profiling results.
FP8 Quantization on H100: A Practical Guide
End-to-end walkthrough of enabling FP8 precision in TensorRT-LLM for Llama-3 70B — including calibration, accuracy validation, and production results.
TensorRT-LLM vs vLLM: Production Comparison at Scale
A rigorous head-to-head benchmark of TRT-LLM and vLLM for production LLM serving — covering throughput, latency percentiles, memory efficiency, and operational complexity.
GPU Memory Bandwidth: What Your Profiler Isn't Telling You
A deep dive into H100 memory subsystem characteristics — HBM3 bandwidth ceilings, L2 cache behavior, and how to use Nsight Compute to find actual bottlenecks in transformer inference kernels.