PR: Complete Document Processing Pipeline #5
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "development_dioxus"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Summary
Complete document processing pipeline with hash-based change detection, local ONNX embeddings, URL-based git processing, and UI fixes.
Changes
Hash-Based Change Detection:
.ai/*.tomlfor each processed file.ai/→ reprocesseshero_embedder Integration:
generate_embeddings()URL-based Processing with Git Push:
.ai/→ Upload to Redis.ai/metadata to remote (useSKIP_PUSH=1to disable)docs_geomind,docs_mycelium,docs_owhUI Fixes:
Demo Scripts:
demo_basic.rs- Local collection processingdemo_geomind.rs- geomind/docs_geomind with git pushdemo_mycelium.rs- mycelium/docs_mycelium with git pushdemo_ourworld.rs- ourworld/docs_owh with git pushdemo_all.rs- Process all namespacesdemo-basic,demo-geomind,demo-mycelium,demo-ourworld,demo-all,demo-resetDocumentation:
docs/specs.mdwith full technical specificationsQuick Start
Next Steps (in TODO.md)
- Fix Cargo.toml dependencies to use git URLs instead of local paths - Rename binaries to hero_books and hero_books_client - Update port to 8883 per hero_ports registry - Add context-based RPC endpoint /api/{context}/books/rpc for hero_osis-sdk53b5227153to2c7183ad8d