open_zoom/docs/progress.md at main · Cem-Kaya/open_zoom

Split the codebase into app, capture, common, d3d12, cuda, and ui modules with mirrored public headers.

Implement Media Foundation camera enumeration and per-camera mode discovery.

Build the CPU frame pipeline for conversion, rotation, zoom, blur, temporal smoothing, and debug compositing.

Bring up the Direct3D 12 presenter, including GPU texture readback.

Enable the CUDA interop processing path with CPU fallback.

Add persistent settings storage in %APPDATA%\OpenZoom\OpenZoom\settings.json.

Add processed photo capture and processed H.264 MP4 recording.

Add rotation-aware focus controls, joystick navigation, mouse pan, and wheel zoom/pan.

Add release-bundle scripting and a minimal validation harness entry point.

Introduce a two-stage quick-mode plus advanced-tuning UI model with promotable custom presets.

Add OCR/VLM assistive-mode scaffolding to the config model and UI.

Add a working assistive overlay plus OCR/VLM runtime hooks for live analysis.

Provide feedback