A small Recursive Language Model: let any LLM run code on its context instead of stuffing it into the prompt.
-
Updated
May 21, 2026 - Python
A small Recursive Language Model: let any LLM run code on its context instead of stuffing it into the prompt.
Stateless LLM runtime that dynamically routes, loads, executes, and unloads models per request with bounded VRAM caching and intelligent model selection.
🎛️ All-in-one control center for Windows system tweaks and optimizations.
40x faster AI inference: ONNX to TensorRT optimization with FP16/INT8 quantization, multi-GPU support, and deployment
Amazing Latency Performance Audit
Comprehensive guide for tuning Linux network stack buffers (socket, TCP, qdisc, NIC rings) on RHEL/OEL 8. Includes detailed documentation, RTT-based buffer calculations, tuning profiles for low-latency and high-throughput scenarios, and production-ready shell scripts for validation and monitoring.
**Arc-Rides MAX FPS Booster** is an advanced Windows performance optimization tool designed to increase FPS, reduce lag, and boost gaming performance on Windows 10 and 11. This FPS booster optimizes system settings, disables unnecessary background processes, lowers latency, and maximizes PC speed for smooth, stable, high-performance gameplay.
Ultimate Windows optimization toolkit for gamers and power users. PowerShell scripts, registry tweaks, driver configs, debloat tools, telemetry blockers, and performance boosters. Reduce input lag, boost FPS, clean bloatware, disable tracking, and fine-tune Windows 10/11 for maximum gaming performance. One-click automation with backup support.
This repo focuses on latency-aware resource optimization for Kubernetes
SIMD-Accelerated RLNC Sidecar for Deterministic MEV & HFT Networking
A Delay-Aware Damping Framework for Latency-Constrained Feedback Systems.
Experimental weights optimization for high-speed inference on distributed clusters.
Latency-optimized IoT automation system using dual-core ESP32 and ESP8266 with OTA updates
AmazeGaming is a lightweight C# utility designed for real-time automatic optimization of gaming processes. It dynamically allocates system resources, minimizes input lag, and prioritizes the active gaming window to ensure peak performance.
AmazeAim is a lightweight system utility designed for competitive gamers to minimize mouse input lag and stabilize polling rate consistency on Windows.
Optimize Windows 10/11 with PowerShell tweaks, debloat tools, privacy fixes, and performance tuning for gamers and power users
UCB bandit-based model serving optimizer with automatic latency/accuracy/cost tradeoff. 86.7% P99 latency reduction with zero accuracy degradation in production simulation.
Memory-Block Protocol (MBP) for reducing latency in long-context GPT conversations
Real-time computer vision system using OpenCV/DLib for facial landmark tracking. Optimized for edge deployment with <30ms latency and 95%+ accuracy.
AI Cost and Latency Optimization Platform with Real-time Token Profiling and Smart Model Routing.
Add a description, image, and links to the latency-optimization topic page so that developers can more easily learn about it.
To associate your repository with the latency-optimization topic, visit your repo's landing page and select "manage topics."