Building a filesystem-backed retrieval workflow similar to OpenViking — looking to exchange architecture experience #2151
Replies: 4 comments 2 replies
-
|
I’ve been thinking about a hybrid context workflow that may balance both retrieval efficiency and reasoning quality. One possible architecture: DeepSeek V4 Pro / Flash GPT / Claude The intuition is that these models may have different strengths:
So instead of asking one model to do everything, the pipeline becomes: Context Build / Index / Cache Framework Search / Broad Traversal Deep Reasoning / Synthesis / Final Output This may help balance:
In a way, this feels less like “model comparison” and more like: Hybrid Model Routing for Long Context Systems. Curious whether others building context-heavy workflows (RAG / memory / agent / knowledge systems) are exploring similar split-routing strategies. |
Beta Was this translation helpful? Give feedback.
-
|
Another observation from my long-context workflow: When context gets compressed, the system does not necessarily lose the retrieval framework. Repeated structural signals such as:
tend to survive compression because they appear repeatedly and function as high-weight context signals. So the retrieval framework itself gradually becomes part of the compressed long-term context. This means context compression is not only about reducing tokens. In other words: Index structure can become compression-resistant if it is repeated, stable, and central enough. |
Beta Was this translation helpful? Give feedback.
-
I also put together a small architecture sketch here: Mostly around cache, retrieval, compression, and long-context reasoning. Happy to exchange ideas if anyone is exploring similar workflows. |
Beta Was this translation helpful? Give feedback.
-
|
This architecture also seems to pair well with Windsurf Inline / Copilot Inline. Since they mainly operate on short local context windows, stable file/block structures make generated outputs much more pattern-consistent. In practice, local code similarity within a context segment becomes very high. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Hi OpenViking team,
I’m ZinanOu, currently building a local knowledge workflow with an architecture that feels quite similar to OpenViking.
Some overlapping ideas include:
My use case is more document / math knowledge workflow oriented, but I found many architectural ideas close to OpenViking’s retrieval and context design.
I’ve been documenting part of my work here:
https://no8ah.cn
I’d really love to exchange some architecture design experience, especially around:
@zhoujh01 if this overlaps with retrieval architecture, I’d really appreciate any thoughts or experience you’d be willing to share.
Thanks — looking forward to learning from the team.
Beta Was this translation helpful? Give feedback.
All reactions