Using a coding agent? Run this to install the Atulya docs skill:

npx skills add https://github.com/eight-atulya/atulya --skill atulya-docs

Codebases

Developer Intelligence

Deterministic code understanding before memory hydration

Atulya Codebases turns a repository into a reviewable ASD snapshot first, then hydrates memory only after explicit approval. That keeps code intelligence fast, mechanical, and token-efficient while preventing silent memory drift.

ASD First

Parse before memory

A codebase import creates a reviewable mechanical snapshot first instead of immediately mutating Atulya memory.

Review Gate

Human approval is explicit

Repo map, symbol search, and impact analysis are available before approval, while recall and reflect stay tied to the last approved snapshot.

Typed Surfaces

Chunk review and routing

ZIP import, GitHub import, refresh, review, chunks, files, symbols, impact, routing, research queue, and approve are all exposed as typed APIs that fit real developer workflows.

Operator UX

Control Plane review loop

The docs and UI are designed around the actual sequence a team follows: import, inspect, compare, approve, and only then hydrate memory.

The Core Promise

If you only remember one thing, remember this:

- ASD gives immediate structural understanding of the repo
- review routing decides which semantic chunks belong in memory versus research
- approval applies only the chunks already routed to memory
- Atulya memory remains stable until a human accepts the new snapshot
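
The promise above can be modeled in a few lines. This is a minimal sketch, not Atulya's implementation: `Route`, `Chunk`, `Bank`, and `approve` are hypothetical names invented for illustration, and only the four route labels come from this page.

```python
from dataclasses import dataclass, field
from enum import Enum


class Route(Enum):
    # The four route labels mirror the routing states named on this page.
    MEMORY = "memory"
    RESEARCH = "research"
    DISMISSED = "dismissed"
    UNROUTED = "unrouted"


@dataclass
class Chunk:
    chunk_id: str
    route: Route = Route.UNROUTED  # chunks start without a routing decision


@dataclass
class Bank:
    # Hypothetical stand-in for a memory bank; not the real data model.
    memory: list = field(default_factory=list)


def approve(chunks, bank):
    """Apply only the chunks already routed to memory; ignore all others."""
    applied = [c.chunk_id for c in chunks if c.route is Route.MEMORY]
    # Simplification: the approved set replaces what the bank holds.
    bank.memory = applied
    return applied


chunks = [
    Chunk("auth/session.py#login", Route.MEMORY),
    Chunk("core/graph.py#build", Route.RESEARCH),
    Chunk("scripts/legacy.py#main", Route.DISMISSED),
    Chunk("README.md#intro"),  # still unrouted
]
bank = Bank()

# Routing decisions alone never change memory.
memory_before_approval = list(bank.memory)

# Only an explicit approval step moves memory, and only for memory-routed chunks.
applied = approve(chunks, bank)
```

In this sketch the bank is empty until `approve` runs, and afterwards it holds only the one chunk routed to `memory`.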

Why Companies Care

Codebases is designed to improve developer efficiency in ways that compound:

- no persistent clone-heavy indexing workflow
- no import-time LLM indexing bill
- deterministic symbol and dependency extraction for supported languages
- review-before-memory so teams do not silently pollute reasoning state
- explicit refresh so GitHub-backed repos stay cheap when nothing changed

That combination is especially useful for:

- engineering teams reviewing large repos
- platform teams operating shared memory banks
- agentic coding workflows where token spend and auditability both matter

Why Coding Agents Get Faster

The feature matters most when you look at the full agent loop instead of only import.

| Agent need | Codebases answer |
| --- | --- |
| Understand repo structure quickly | ASD builds the repo map, symbols, chunks, and impact graph immediately |
| Avoid spending tokens on every import | Parsing stays mechanical first and memory hydration is explicit |
| Keep shared memory trustworthy | The latest snapshot is reviewable before it can affect `recall` or `reflect` |
| Stage only the valuable parts of a repo | Chunk routing separates `memory`, `research`, `dismissed`, and `unrouted` work |
| Choose between speed and richer memory formation | The memory modal now exposes `ASD Direct` versus `Retain Pipeline` |

Release Readiness At A Glance

| Surface | What is ready |
| --- | --- |
| Import | ZIP and public GitHub archive import |
| Parse | ASD-first snapshot creation with chunk graph and deterministic diagnostics |
| Review | Review queue, repo map, symbol search, impact, research queue, approved memory history |
| Memory publish | Explicit approval with `ASD Direct` or `Retain Pipeline` selection |
| Refresh | GitHub refresh with `noop` when the commit SHA has not changed |
| Operator UX | Progressive loading for large repos and modal detail flows instead of full-page overload |

How The Pieces Fit Together

| Layer | Primary job | Why it matters |
| --- | --- | --- |
| Archive import | Normalize ZIP and GitHub sources into one pipeline | Keeps ingestion predictable and clone-free |
| ASD mechanical parse | Build files, symbols, chunks, clusters, and graph edges | Gives coding agents structural understanding before memory |
| Review routing | Decide what belongs in `memory`, `research`, or nowhere | Prevents low-value code from polluting shared reasoning |
| Memory ingestion | Publish approved chunks with `ASD Direct` or `Retain Pipeline` | Lets teams balance speed against richer memory formation |
| Memory-backed reasoning | Power `recall` and `reflect` from the approved snapshot | Keeps shared reasoning conservative and auditable |

What ASD Owns

ASD is the proprietary mechanical code-intelligence layer. In v1 it owns:

- archive extraction and filtering
- path normalization
- language detection
- tree-sitter parsing for supported languages
- symbol extraction
- import and dependency edge construction
- normalized repo-map metadata

Deep parsing is strongest today for:

- Python
- JavaScript
- TypeScript
- JSX
- TSX

Unsupported languages still appear in the manifest and file map, but they receive no deep symbol or graph analysis.
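
The split between deep parsing and manifest-only files can be pictured as follows. This is a hedged sketch: the extension table and the `classify` function are assumptions made for illustration, and the real detector may rely on more than the file suffix.

```python
from pathlib import PurePosixPath

# Assumed suffix mapping for the languages listed above (illustrative only).
DEEP_PARSE_EXTENSIONS = {
    ".py": "Python",
    ".js": "JavaScript",
    ".ts": "TypeScript",
    ".jsx": "JSX",
    ".tsx": "TSX",
}


def classify(path):
    """Return a manifest entry saying whether a file gets deep parsing."""
    suffix = PurePosixPath(path).suffix.lower()
    language = DEEP_PARSE_EXTENSIONS.get(suffix)
    return {
        "path": path,
        "language": language,  # None when the language is not deeply supported
        "in_manifest": True,   # every file still appears in the file map
        "deep_parse": language is not None,
    }


entries = [classify(p) for p in ["src/app.tsx", "tools/build.py", "cmd/main.go"]]
```

Here `cmd/main.go` still produces a manifest entry, but with `deep_parse` set to `False`, which matches the behavior described for unsupported languages.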

How The System Is Split

| Layer | What it is for |
| --- | --- |
| `Codebases` | Deterministic code intelligence |
| `retain` | General memory ingestion |
| `recall` | Memory retrieval |
| `reflect` | Memory-backed reasoning |

This separation is intentional.

Code understanding should be mechanical first. Memory-backed reasoning should happen only after the reviewed source state is approved.

Operator Decision Table

| If the team needs... | Use this path |
| --- | --- |
| Exact deterministic code persistence with minimal overhead | `ASD Direct` |
| Richer semantic linking into Atulya memory | `Retain Pipeline` |
| Structural review without touching memory yet | Review Queue + Repo Map + Symbol Search + Impact |
| Deeper follow-up without publishing to memory | Research Queue |
| Trusted reasoning against the current approved repo state | `recall` and `reflect` after approval |
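
The first two rows of the table reduce to a single choice at approval time. The sketch below encodes that choice; `IngestMode` and `choose_ingest_mode` are hypothetical names, and only the two mode labels come from this page.

```python
from enum import Enum


class IngestMode(Enum):
    # Labels as they appear in the memory modal described on this page.
    ASD_DIRECT = "ASD Direct"
    RETAIN_PIPELINE = "Retain Pipeline"


def choose_ingest_mode(needs_rich_semantic_links):
    """Pick an ingest mode following the decision table above.

    Richer semantic linking into memory points to Retain Pipeline;
    otherwise the deterministic, low-overhead ASD Direct path is used.
    """
    if needs_rich_semantic_links:
        return IngestMode.RETAIN_PIPELINE
    return IngestMode.ASD_DIRECT


broad_sync = choose_ingest_mode(needs_rich_semantic_links=False)
core_subsystem = choose_ingest_mode(needs_rich_semantic_links=True)
```

A team could apply the same rule per approval: broad syncs default to `ASD Direct`, and a core subsystem opts into `Retain Pipeline`.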

Where To Go Next

| If you want to... | Start here |
| --- | --- |
| Understand the exact state machine | Codebases Lifecycle |
| Inspect the UI review loop | Codebases Control Plane |
| Integrate the endpoints directly | Codebases API |
| Optimize coding-agent workflows | Codebases For Coding Agents |
| Understand auto-triage, gold artifacts, and intent curation | Codebases Code Intelligence |

Practical Guidance

| Situation | Recommended move |
| --- | --- |
| Private or curated repo snapshot | ZIP import |
| Public repo with cheap explicit refresh | GitHub import |
| Large repo where only a few code regions matter | Route only high-value chunks to memory |
| Core subsystem that future agents should deeply understand | Approve through `Retain Pipeline` |
| Broad repo sync where cost and determinism matter more | Approve through `ASD Direct` |

Use ZIP import when:

- the repo is private
- you want an offline or curated snapshot
- you want exact archive control

Use GitHub import when:

- the repo is public
- you want explicit refresh against a ref
- you want a no-op result when the commit SHA has not changed
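
The no-op refresh behavior amounts to comparing the commit SHA the ref resolves to against the SHA of the last import. The following is a minimal sketch under that assumption: `RefreshResult` and `refresh` are hypothetical, and the `"parsed"` status label is invented here (only `noop` is named on this page).

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class RefreshResult:
    status: str      # "noop", or "parsed" (the latter is an illustrative label)
    commit_sha: str  # SHA the ref resolved to at refresh time


def refresh(last_imported_sha: Optional[str], resolved_sha: str) -> RefreshResult:
    """Skip all work when the ref still points at the already imported commit."""
    if last_imported_sha == resolved_sha:
        return RefreshResult(status="noop", commit_sha=resolved_sha)
    # A changed (or first-seen) commit would produce a new reviewable snapshot.
    return RefreshResult(status="parsed", commit_sha=resolved_sha)


unchanged = refresh("a1b2c3d", "a1b2c3d")
changed = refresh("a1b2c3d", "e4f5a6b")
first_import = refresh(None, "a1b2c3d")
```

The point of the check is cost: an unchanged commit returns immediately, so repeated refreshes against a quiet repository trigger no new parsing.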

Before Versus After Approval

| Stage | What developers can do | What memory sees |
| --- | --- | --- |
| Parsed but unapproved | Inspect repo map, symbols, chunks, impact, and routes | Memory stays on the previous approved snapshot, or stays empty if none exists |
| Routed for memory | Choose the best chunks and select the ingest mode | Memory still does not move until approval runs |
| Approved | Query approved chunk history and trust the new snapshot | `recall` and `reflect` use the approved codebase documents |
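
One way to picture the table is as two pointers: structural tools follow the latest parsed snapshot, while memory follows the last approved one. The sketch below is an illustrative model only; the `Codebase` class and its method names are assumptions, not Atulya's API.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Codebase:
    latest_snapshot: Optional[str] = None    # newest parsed snapshot
    approved_snapshot: Optional[str] = None  # snapshot that memory reflects

    def parse(self, snapshot_id: str) -> None:
        # Parsing creates a reviewable snapshot and leaves memory untouched.
        self.latest_snapshot = snapshot_id

    def approve(self) -> None:
        # Approval is the only step that moves the memory pointer.
        self.approved_snapshot = self.latest_snapshot

    def structural_view(self) -> Optional[str]:
        # Repo map, symbol search, and impact read the latest snapshot.
        return self.latest_snapshot

    def memory_view(self) -> Optional[str]:
        # recall and reflect read only the approved snapshot.
        return self.approved_snapshot


repo = Codebase()
repo.parse("snapshot-1")
seen_before_first_approval = repo.memory_view()  # nothing approved yet
repo.approve()
repo.parse("snapshot-2")  # new parse, not yet approved
```

After the second parse, `structural_view()` returns `snapshot-2` while `memory_view()` still returns `snapshot-1`, which is the conservative behavior the table describes.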

What This Is Not

Codebases is not trying to replace general memory operations.

Instead:

- Codebases is the precise structural layer
- retain, recall, and reflect stay the semantic and reasoning layer
- approval is the bridge between the two

That is what keeps the fast path efficient and the memory path trustworthy.
