artlu's Bear Blog

How I Use AI Today

After a brief conversation with my elderly parents, I thought it'd be useful to jot these techniques down.

All of the below is based on my current understanding, and subject to change.

Day-to-day chat

General privacy expectation is to have less privacy than web search (AI is more intrusive, more subject to data mining, monitoring, and censorship). Users are exposed to similar levels of pervasive, non-consensual tracking, as using Facebook, Instagram, and YouTube.

Research

One magic phrase. Use it without ever changing it, to get more impartial answers: "tldr - steelman - critique"

Doesn't need orchestration of multiple agents. Single ones have always been good enough for my uses

Interactive / Manual Coding

Cursor - mostly use Composer 1.5 - a nice middle ground between Devin, Moonshot Kimi K2, Claude Sonnet, Codex, Gemini

I can't trust models to implement new things. I 99% trust them to write detailed specs after interactive discussion, that I barely read again. Trust individual stages of TDD red-green-refactor loops. The refactor loop is the haziest, a bit of a cargo cult. Unpopular opinion: asking one model to review the other model is a cargo cult

GLM Plan is 90% of Claude subscription at 1/10th the cost, has been difficult in my usage to hit rate limits, but I am inefficiently vigilant about my consumption

Claude Code - preferred tool for big planning + "Ralph loop". Often run it inside Cursor so I can use familiar methods to see what it is doing, but I push myself to use IDE's less. In the long run, expect to use coding IDE's like we use disassemblers today. Not that worried about vendor lock-in for now, because Zhipu (and others!) offers a drop-in replacement plan

OpenCode - useful backup to Claude Code, that enforces generality and offsets vendor lock-in even more

skills / hooks / mcp's / frameworks:

Agents orchestration

sensitive data - since context is grabbed by every major LLM provider

Data

Other tools

Try again every so often: Codex (inside Cursor), Gemini CLI just to see what has changed

I'm about 10% python, 70% Typescript, 20% vanilla html + css. Astral uv and bun are awesome and highly agent-friendly. mypy is way more annoying to incorporate into an agentic workflow than tsc -b --noEmit