- research
- journey
- bug-fixes
- reflection
-
Claude Code with local vLLM: client validation, model aliases, and a working settings.json
Run Claude Code against local vLLM without Anthropic API access: why common env-only recipes fail, the alias + settings.json pattern that works, and why this matters if you cannot register for or use the Claude API.
-
Stable tool calling for Qwen 3.5 27B/35B on vLLM: template, parser, and mixed-GPU fixes
Debugging notes on Jinja chat templates, qwen3_xml vs qwen3_coder parsers, mixed-GPU FP8 drift, and SFT-distilled checkpoints when running Qwen 3.5 27B/35B-class models for long agentic sessions on vLLM.
-
Workaround for Enabling NCCL P2P Communication for NVIDIA RTX 4090 Workstations
What NCCL P2P means, why it matters on multi-GPU workstations, how Resizable BAR fits in, and a concrete setup path for RTX 4090.
-
IELTS - After Class Note, Week 8
Final-week listening, writing, speaking, and reading reminders for IELTS.
-
IELTS - After Class Note, Week 7
Listening, writing task structure, and vocabulary from IELTS week 7.