Skip to main content

Codebases

Developer Intelligence

Deterministic code understanding before memory hydration

Atulya Codebases turns a repository into a reviewable ASD snapshot first, then hydrates memory only after explicit approval. That keeps code intelligence fast, mechanical, and token-efficient while preventing silent memory drift.

ASD First

Parse before memory

A codebase import creates a reviewable mechanical snapshot first instead of immediately mutating Atulya memory.

Review Gate

Human approval is explicit

Repo map, symbol search, and impact analysis are available before approval, while recall and reflect stay tied to the last approved snapshot.

Typed Surfaces

Chunk review and routing

ZIP import, GitHub import, refresh, review, chunks, files, symbols, impact, routing, research queue, and approve are all exposed as typed APIs that fit real developer workflows.

Operator UX

Control Plane review loop

The docs and UI are designed around the actual sequence a team follows: import, inspect, compare, approve, and only then hydrate memory.

The Core Promise

If you only remember one thing, remember this:

  • ASD gives immediate structural understanding of the repo
  • review routing decides which semantic chunks belong in memory versus research
  • approval applies only the chunks already routed to memory
  • Atulya memory remains stable until a human accepts the new snapshot

Why Companies Care

Codebases is designed to improve developer efficiency in ways that compound:

  • no persistent clone-heavy indexing workflow
  • no import-time LLM indexing bill
  • deterministic symbol and dependency extraction for supported languages
  • review-before-memory so teams do not silently pollute reasoning state
  • explicit refresh so GitHub-backed repos stay cheap when nothing changed

That combination is especially useful for:

  • engineering teams reviewing large repos
  • platform teams operating shared memory banks
  • agentic coding workflows where token spend and auditability both matter

Why Coding Agents Get Faster

The feature matters most when you look at the full agent loop instead of only import.

Agent needCodebases answer
Understand repo structure quicklyASD builds the repo map, symbols, chunks, and impact graph immediately
Avoid spending tokens on every importParsing stays mechanical first and memory hydration is explicit
Keep shared memory trustworthyThe latest snapshot is reviewable before it can affect recall or reflect
Stage only the valuable parts of a repoChunk routing separates memory, research, dismissed, and unrouted work
Choose between speed and richer memory formationThe memory modal now exposes ASD Direct versus Retain Pipeline

Release Readiness At A Glance

SurfaceWhat is ready
ImportZIP and public GitHub archive import
ParseASD-first snapshot creation with chunk graph and deterministic diagnostics
ReviewReview queue, repo map, symbol search, impact, research queue, approved memory history
Memory publishExplicit approval with ASD Direct or Retain Pipeline selection
RefreshGitHub refresh with noop when the commit SHA has not changed
Operator UXProgressive loading for large repos and modal detail flows instead of full-page overload

How The Pieces Fit Together

LayerPrimary jobWhy it matters
Archive importNormalize ZIP and GitHub sources into one pipelineKeeps ingestion predictable and clone-free
ASD mechanical parseBuild files, symbols, chunks, clusters, and graph edgesGives coding agents structural understanding before memory
Review routingDecide what belongs in memory, research, or nowherePrevents low-value code from polluting shared reasoning
Memory ingestionPublish approved chunks with ASD Direct or Retain PipelineLets teams balance speed against richer memory formation
Memory-backed reasoningPower recall and reflect from the approved snapshotKeeps shared reasoning conservative and auditable

What ASD Owns

ASD is the proprietary mechanical code-intelligence layer. In v1 it owns:

  • archive extraction and filtering
  • path normalization
  • language detection
  • tree-sitter parsing for supported languages
  • symbol extraction
  • import and dependency edge construction
  • normalized repo-map metadata

Deep parsing is strongest today for:

  • Python
  • JavaScript
  • TypeScript
  • JSX
  • TSX

Unsupported languages still appear in the manifest and file map, but they do not pretend to have deep graph intelligence.

How The System Is Split

LayerWhat it is for
CodebasesDeterministic code intelligence
retainGeneral memory ingestion
recallMemory retrieval
reflectMemory-backed reasoning

This separation is intentional.

Code understanding should be mechanical first. Memory-backed reasoning should happen only after the reviewed source state is approved.

Operator Decision Table

If the team needs...Use this path
Exact deterministic code persistence with minimal overheadASD Direct
Richer semantic linking into Atulya memoryRetain Pipeline
Structural review without touching memory yetReview Queue + Repo Map + Symbol Search + Impact
Deeper follow-up without publishing to memoryResearch Queue
Trusted reasoning against the current approved repo staterecall and reflect after approval

Where To Go Next

If you want to...Start here
Understand the exact state machineCodebases Lifecycle
Inspect the UI review loopCodebases Control Plane
Integrate the endpoints directlyCodebases API
Optimize coding-agent workflowsCodebases For Coding Agents
Understand auto-triage, gold artifacts, and intent curationCodebases Code Intelligence

Practical Guidance

SituationRecommended move
Private or curated repo snapshotZIP import
Public repo with cheap explicit refreshGitHub import
Large repo where only a few code regions matterRoute only high-value chunks to memory
Core subsystem that future agents should deeply understandApprove through Retain Pipeline
Broad repo sync where cost and determinism matter moreApprove through ASD Direct

Use ZIP import when:

  • the repo is private
  • you want an offline or curated snapshot
  • you want exact archive control

Use GitHub import when:

  • the repo is public
  • you want explicit refresh against a ref
  • you want a no-op result when the commit SHA has not changed

Before Versus After Approval

StageWhat developers can doWhat memory sees
Parsed but unapprovedInspect repo map, symbols, chunks, impact, and routesMemory stays on the older approved snapshot or empty
Routed for memoryChoose the best chunks and select the ingest modeMemory still does not move until approval runs
ApprovedQuery approved chunk history and trust the new snapshotrecall and reflect use the approved codebase documents

What This Is Not

Codebases is not trying to replace general memory operations.

Instead:

  • Codebases is the precise structural layer
  • retain, recall, and reflect stay the semantic and reasoning layer
  • approval is the bridge between the two

That is what keeps the fast path efficient and the memory path trustworthy.