Configuration Reference¶
This page documents all configuration types and their defaults across all languages.
Span¶
Byte and line/column range in source code.
Represents both byte offsets (for slicing) and human-readable line/column positions (for display and diagnostics).
| Field | Type | Default | Description |
|---|---|---|---|
| `start_byte` | `int` | — | Start byte |
| `end_byte` | `int` | — | End byte |
| `start_line` | `int` | — | Start line |
| `start_column` | `int` | — | Start column |
| `end_line` | `int` | — | End line |
| `end_column` | `int` | — | End column |
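As a rough illustration of how the two coordinate systems might be used together, the sketch below defines a stand-in `Span` dataclass with the fields listed above (it is not the library's own class), slices a source buffer with it, and formats a display position. It assumes that byte offsets index into the UTF-8 encoded source with an exclusive `end_byte`, and that lines and columns are zero-based. This page does not state either convention, so verify them against real output.

```python
from dataclasses import dataclass


@dataclass
class Span:
    """Stand-in mirroring the documented fields; not the library's class."""
    start_byte: int
    end_byte: int
    start_line: int
    start_column: int
    end_line: int
    end_column: int


def slice_source(source: bytes, span: Span) -> str:
    # Assumption: end_byte is exclusive, like a Python slice bound.
    return source[span.start_byte:span.end_byte].decode("utf-8")


def format_location(span: Span) -> str:
    # Assumption: lines and columns are zero-based, so add 1 for display.
    return f"{span.start_line + 1}:{span.start_column + 1}"


source = "x = 1\ndef f():\n    pass\n".encode("utf-8")
span = Span(start_byte=6, end_byte=14, start_line=1, start_column=0,
            end_line=1, end_column=8)
snippet = slice_source(source, span)
location = format_location(span)
```

Slicing the encoded bytes rather than the decoded string keeps the offsets valid when the source contains multi-byte characters.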
ProcessResult¶
Complete analysis result from processing a source file.
Contains metrics, structural analysis, imports/exports, comments,
docstrings, symbols, diagnostics, and optionally chunked code segments.
Fields are populated based on the ProcessConfig flags.
| Field | Type | Default | Description |
|---|---|---|---|
| `language` | `str` | — | Language name |
| `metrics` | `FileMetrics` | — | Aggregate file metrics |
| `structure` | `list[StructureItem]` | `[]` | Structural items (functions, classes, etc.) |
| `imports` | `list[ImportInfo]` | `[]` | Import statements |
| `exports` | `list[ExportInfo]` | `[]` | Export statements |
| `comments` | `list[CommentInfo]` | `[]` | Comments |
| `docstrings` | `list[DocstringInfo]` | `[]` | Docstrings |
| `symbols` | `list[SymbolInfo]` | `[]` | Symbol definitions |
| `diagnostics` | `list[Diagnostic]` | `[]` | Parse diagnostics |
| `chunks` | `list[CodeChunk]` | `[]` | Code chunks for embedding; populated only when chunking is enabled |
FileMetrics¶
Aggregate metrics for a source file.
| Field | Type | Default | Description |
|---|---|---|---|
| `total_lines` | `int` | — | Total lines |
| `code_lines` | `int` | — | Code lines |
| `comment_lines` | `int` | — | Comment lines |
| `blank_lines` | `int` | — | Blank lines |
| `total_bytes` | `int` | — | Total bytes |
| `node_count` | `int` | — | Number of nodes |
| `error_count` | `int` | — | Number of errors |
| `max_depth` | `int` | — | Maximum depth |
StructureItem¶
A structural item (function, class, struct, etc.) in source code.
| Field | Type | Default | Description |
|---|---|---|---|
| `kind` | `StructureKind` | `StructureKind.FUNCTION` | Kind of structural item |
| `name` | `str \| None` | `None` | Name of the item, if any |
| `visibility` | `str \| None` | `None` | Visibility |
| `span` | `Span` | — | Source span of the item |
| `children` | `list[StructureItem]` | `[]` | Nested child items |
| `decorators` | `list[str]` | `[]` | Decorators |
| `doc_comment` | `str \| None` | `None` | Doc comment |
| `signature` | `str \| None` | `None` | Signature |
| `body_span` | `Span \| None` | `None` | Source span of the item body, if any |
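Because `children` holds further `StructureItem` values, a structure list forms a tree. The sketch below walks such a tree recursively to build an indented outline. `Item` is a hypothetical stand-in carrying only three of the documented fields, with `kind` simplified to a plain string. It is not the library's class.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Item:
    """Hypothetical stand-in for StructureItem with only the fields used here."""
    kind: str
    name: Optional[str] = None
    children: list["Item"] = field(default_factory=list)


def outline(items: list[Item], depth: int = 0) -> list[str]:
    """Return one indented line per item, visiting children depth-first."""
    lines = []
    for item in items:
        label = item.name if item.name is not None else "<anonymous>"
        lines.append(f"{'  ' * depth}{item.kind} {label}")
        lines.extend(outline(item.children, depth + 1))
    return lines


tree = [
    Item("class", "Parser", [Item("method", "parse"), Item("method", "reset")]),
    Item("function", "main"),
]
result = outline(tree)
```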
CommentInfo¶
A comment extracted from source code.
| Field | Type | Default | Description |
|---|---|---|---|
| `text` | `str` | — | Comment text |
| `kind` | `CommentKind` | `CommentKind.LINE` | Kind of comment |
| `span` | `Span` | — | Source span of the comment |
| `associated_node` | `str \| None` | `None` | Associated node |
DocstringInfo¶
A docstring extracted from source code.
| Field | Type | Default | Description |
|---|---|---|---|
| `text` | `str` | — | Docstring text |
| `format` | `DocstringFormat` | `DocstringFormat.PYTHON_TRIPLE_QUOTE` | Docstring format |
| `span` | `Span` | — | Source span of the docstring |
| `associated_item` | `str \| None` | `None` | Associated item |
| `parsed_sections` | `list[DocSection]` | `[]` | Parsed sections |
DocSection¶
A section within a docstring (e.g., Args, Returns, Raises).
| Field | Type | Default | Description |
|---|---|---|---|
| `kind` | `str` | — | Kind |
| `name` | `str \| None` | `None` | The name |
| `description` | `str` | — | Human-readable description |
ImportInfo¶
An import statement extracted from source code.
| Field | Type | Default | Description |
|---|---|---|---|
| `source` | `str` | — | Source |
| `items` | `list[str]` | `[]` | Items |
| `alias` | `str \| None` | `None` | Alias |
| `is_wildcard` | `bool` | — | Whether the import is a wildcard import |
| `span` | `Span` | — | Source span of the import statement |
ExportInfo¶
An export statement extracted from source code.
| Field | Type | Default | Description |
|---|---|---|---|
| `name` | `str` | — | The name |
| `kind` | `ExportKind` | `ExportKind.NAMED` | Kind of export |
| `span` | `Span` | — | Source span of the export statement |
SymbolInfo¶
A symbol (variable, function, type, etc.) extracted from source code.
| Field | Type | Default | Description |
|---|---|---|---|
| `name` | `str` | — | The name |
| `kind` | `SymbolKind` | `SymbolKind.VARIABLE` | Kind of symbol |
| `span` | `Span` | — | Source span of the symbol |
| `type_annotation` | `str \| None` | `None` | Type annotation |
| `doc` | `str \| None` | `None` | Doc |
Diagnostic¶
A diagnostic (syntax error, missing node, etc.) from parsing.
| Field | Type | Default | Description |
|---|---|---|---|
| `message` | `str` | — | Message |
| `severity` | `DiagnosticSeverity` | `DiagnosticSeverity.ERROR` | Severity of the diagnostic |
| `span` | `Span` | — | Source span the diagnostic refers to |
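A common use of the `severity` field is to separate hard errors from softer findings. The sketch below shows one way to do that with stand-in types: `Severity` and `Diag` are hypothetical simplifications of `DiagnosticSeverity` and `Diagnostic` (the `span` field is omitted), not the library's classes.

```python
from dataclasses import dataclass
from enum import Enum


class Severity(Enum):
    """Stand-in for DiagnosticSeverity."""
    ERROR = "error"
    WARNING = "warning"
    INFO = "info"


@dataclass
class Diag:
    """Stand-in for Diagnostic, without the span field."""
    message: str
    severity: Severity = Severity.ERROR  # mirrors the documented default


def errors_only(diagnostics: list[Diag]) -> list[str]:
    """Keep only the messages of ERROR-level diagnostics."""
    return [d.message for d in diagnostics if d.severity is Severity.ERROR]


found = [Diag("missing ':'"), Diag("unused label", Severity.WARNING)]
messages = errors_only(found)
```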
CodeChunk¶
A chunk of source code with rich metadata.
| Field | Type | Default | Description |
|---|---|---|---|
| `content` | `str` | — | The extracted text content |
| `start_byte` | `int` | — | Start byte |
| `end_byte` | `int` | — | End byte |
| `start_line` | `int` | — | Start line |
| `end_line` | `int` | — | End line |
| `metadata` | `ChunkContext` | — | Chunk metadata |
ChunkContext¶
Metadata for a single chunk of source code.
| Field | Type | Default | Description |
|---|---|---|---|
| `language` | `str` | — | Language |
| `chunk_index` | `int` | — | Chunk index |
| `total_chunks` | `int` | — | Total chunks |
| `node_types` | `list[str]` | `[]` | Node types |
| `context_path` | `list[str]` | `[]` | Context path |
| `symbols_defined` | `list[str]` | `[]` | Symbols defined |
| `comments` | `list[CommentInfo]` | `[]` | Comments |
| `docstrings` | `list[DocstringInfo]` | `[]` | Docstrings |
| `has_error_nodes` | `bool` | — | Whether the chunk contains error nodes |
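Chunk metadata can be turned into a short header before a chunk is embedded or logged. The sketch below is one possible formatting, built on stand-in types: `Context` and `Chunk` are hypothetical reductions of `ChunkContext` and `CodeChunk` to the fields used here. It assumes `chunk_index` is zero-based and that `context_path` lists enclosing scopes from outermost to innermost. Neither is confirmed on this page.

```python
from dataclasses import dataclass, field


@dataclass
class Context:
    """Hypothetical stand-in for ChunkContext (subset of fields)."""
    language: str
    chunk_index: int
    total_chunks: int
    context_path: list[str] = field(default_factory=list)


@dataclass
class Chunk:
    """Hypothetical stand-in for CodeChunk (subset of fields)."""
    content: str
    start_line: int
    end_line: int
    metadata: Context


def describe(chunk: Chunk) -> str:
    ctx = chunk.metadata
    path = " > ".join(ctx.context_path) or "<top level>"
    # Assumption: chunk_index is zero-based, so add 1 for display.
    position = f"{ctx.chunk_index + 1}/{ctx.total_chunks}"
    return (f"[{ctx.language}] chunk {position} at {path} "
            f"(lines {chunk.start_line}-{chunk.end_line})")


chunk = Chunk("def parse(self): ...", 10, 12,
              Context("python", 0, 3, ["Parser", "parse"]))
header = describe(chunk)
```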
PackConfig¶
Configuration for the tree-sitter language pack.
Controls cache directory and which languages to pre-download. Can be loaded from a TOML file, constructed programmatically, or passed as a dict/object from language bindings.
| Field | Type | Default | Description |
|---|---|---|---|
| `cache_dir` | `str \| None` | `None` | Override default cache directory. Default: `~/.cache/tree-sitter-language-pack/v{version}/libs/` |
| `languages` | `list[str] \| None` | `[]` | Languages to pre-download on init. Each entry is a language name (e.g. `"python"`, `"rust"`). |
| `groups` | `list[str] \| None` | `[]` | Language groups to pre-download (e.g. `"web"`, `"systems"`, `"scripting"`). |
ProcessConfig¶
Configuration for the process() function.
Controls which analysis features are enabled and whether chunking is performed.
| Field | Type | Default | Description |
|---|---|---|---|
| `language` | `str` | — | Language name (required). |
| `structure` | `bool` | `True` | Extract structural items (functions, classes, etc.). Default: true. |
| `imports` | `bool` | `True` | Extract import statements. Default: true. |
| `exports` | `bool` | `True` | Extract export statements. Default: true. |
| `comments` | `bool` | `False` | Extract comments. Default: false. |
| `docstrings` | `bool` | `False` | Extract docstrings. Default: false. |
| `symbols` | `bool` | `False` | Extract symbol definitions. Default: false. |
| `diagnostics` | `bool` | `False` | Include parse diagnostics. Default: false. |
| `chunk_max_size` | `int \| None` | `None` | Maximum chunk size in bytes. `None` disables chunking. |
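To make the defaults concrete, the sketch below mirrors the table in a stand-in dataclass and lists which analyses a given configuration would switch on. `ProcessConfigSketch` and `enabled_features` are illustrative names, not part of the library. The real `ProcessConfig` constructor may differ.

```python
from dataclasses import asdict, dataclass
from typing import Optional


@dataclass
class ProcessConfigSketch:
    """Stand-in with the documented fields and defaults; not the library's class."""
    language: str
    structure: bool = True
    imports: bool = True
    exports: bool = True
    comments: bool = False
    docstrings: bool = False
    symbols: bool = False
    diagnostics: bool = False
    chunk_max_size: Optional[int] = None


def enabled_features(config: ProcessConfigSketch) -> list[str]:
    """Names of the boolean flags that are on, plus 'chunks' if chunking is set."""
    features = [name for name, value in asdict(config).items() if value is True]
    if config.chunk_max_size is not None:
        features.append("chunks")
    return features


defaults = enabled_features(ProcessConfigSketch(language="python"))
everything = enabled_features(ProcessConfigSketch(
    language="python", comments=True, docstrings=True,
    symbols=True, diagnostics=True, chunk_max_size=2000,
))
```

With only `language` set, the three default-on extractions are active and no chunks are produced, which matches the `None` default of `chunk_max_size`.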
Enums¶
CommentKind¶
The kind of a comment found in source code.
Distinguishes between single-line comments, block (multi-line) comments, and documentation comments.
| Variant | Description |
|---|---|
| `Line` | Single-line comment |
| `Block` | Block (multi-line) comment |
| `Doc` | Documentation comment |
DiagnosticSeverity¶
Severity level of a diagnostic produced during parsing.
Used to classify parse errors, warnings, and informational messages found in the syntax tree.
| Variant | Description |
|---|---|
| `Error` | Parse error |
| `Warning` | Warning |
| `Info` | Informational message |
DocstringFormat¶
The format of a docstring extracted from source code.
Identifies the docstring convention used, which varies by language
(e.g., Python triple-quoted strings, JSDoc, Rustdoc /// comments).
| Variant | Description |
|---|---|
| `PythonTripleQuote` | Python triple-quoted string |
| `JSDoc` | JSDoc |
| `Rustdoc` | Rustdoc `///` comment |
| `GoDoc` | GoDoc |
| `JavaDoc` | JavaDoc |
| `Other` | Any other format. Fields: `_0: String` |
ExportKind¶
The kind of an export statement found in source code.
Covers named exports, default exports, and re-exports from other modules.
| Variant | Description |
|---|---|
| `Named` | Named export |
| `Default` | Default export |
| `ReExport` | Re-export from another module |
StructureKind¶
The kind of structural item found in source code.
Categorizes top-level and nested declarations such as functions, classes,
structs, enums, traits, and more. Use Other for
language-specific constructs that do not fit a standard category.
| Variant | Description |
|---|---|
| `Function` | Function |
| `Method` | Method |
| `Class` | Class |
| `Struct` | Struct |
| `Interface` | Interface |
| `Enum` | Enum |
| `Module` | Module |
| `Trait` | Trait |
| `Impl` | Impl |
| `Namespace` | Namespace |
| `Other` | Language-specific construct that does not fit a standard category. Fields: `_0: String` |
SymbolKind¶
The kind of a symbol definition found in source code.
Categorizes symbol definitions such as variables, constants, functions, classes, types, interfaces, enums, and modules.
| Variant | Description |
|---|---|
| `Variable` | Variable |
| `Constant` | Constant |
| `Function` | Function |
| `Class` | Class |
| `Type` | Type |
| `Interface` | Interface |
| `Enum` | Enum |
| `Module` | Module |
| `Other` | Other. Fields: `_0: String` |