tenets.core.prompt Package¶
Prompt parsing and understanding system.
This package provides intelligent prompt analysis to extract intent, keywords, entities, temporal context, and external references from user queries. The parser supports various input formats including plain text, URLs (GitHub issues, JIRA tickets, Linear, Notion, etc.), and structured queries.
Core features:

- Intent detection (implement, debug, test, refactor, etc.)
- Keyword extraction using multiple algorithms (YAKE, TF-IDF, frequency)
- Entity recognition (classes, functions, files, APIs, databases)
- Temporal parsing (dates, ranges, recurring patterns)
- External source integration (GitHub, GitLab, JIRA, Linear, Asana, Notion)
- Intelligent caching with TTL management
- Programming pattern recognition
- Scope and focus area detection
The parser leverages centralized NLP components for:

- Keyword extraction via nlp.keyword_extractor
- Tokenization via nlp.tokenizer
- Stopword filtering via nlp.stopwords
- Programming patterns via nlp.programming_patterns
Example

```python
from tenets.core.prompt import PromptParser
from tenets.config import TenetsConfig

# Create parser with config
config = TenetsConfig()
parser = PromptParser(config)

# Parse a prompt
context = parser.parse("implement OAuth2 authentication for the API")
print(f"Intent: {context.intent}")
print(f"Keywords: {context.keywords}")
print(f"Task type: {context.task_type}")

# Parse from GitHub issue
context = parser.parse("https://github.com/org/repo/issues/123")
print(f"External source: {context.external_context['source']}")
print(f"Issue title: {context.text}")
```
Classes¶
AsanaHandler¶
Bases: ExternalSourceHandler
Handler for Asana tasks.
ExternalContent dataclass¶
ExternalContent(title: str, body: str, metadata: Dict[str, Any], source_type: str, url: str, cached_at: Optional[datetime] = None, ttl_hours: int = 24)
ExternalSourceHandler¶
Bases: ABC
Base class for external source handlers.
Initialize handler with optional cache.
PARAMETER | DESCRIPTION |
---|---|
cache_manager | Optional cache manager for caching fetched content |
Attributes¶
logger instance-attribute¶

cache instance-attribute¶
Functions¶
can_handle abstractmethod¶
Check if this handler can process the given URL.
extract_identifier abstractmethod¶

Extract the source-specific identifier from the URL.

fetch_content abstractmethod¶
Fetch content from the external source.
get_cached_content¶
Get cached content if available and valid.
PARAMETER | DESCRIPTION |
---|---|
url | URL to check cache for TYPE: str |

RETURNS | DESCRIPTION |
---|---|
Optional[ExternalContent] | Cached content or None if not cached/expired |
cache_content¶
Cache fetched content.
PARAMETER | DESCRIPTION |
---|---|
url | URL as cache key TYPE: str |
content | Content to cache TYPE: ExternalContent |
process¶
Process URL with caching support.
PARAMETER | DESCRIPTION |
---|---|
url | URL to process TYPE: str |

RETURNS | DESCRIPTION |
---|---|
Optional[ExternalContent] | External content or None if failed |
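For illustration, here is a minimal sketch of a custom handler built on this interface. The MyTrackerHandler class, its URL scheme, the import path, and the exact abstract-method signatures are assumptions inferred from the method names and the process flow above, not part of the package.

```python
import re
from typing import Optional

# Import path assumed from the external_sources module listed at the end of
# this page; the abstract method signatures below are likewise assumptions.
from tenets.core.prompt.external_sources import (
    ExternalContent,
    ExternalSourceHandler,
)


class MyTrackerHandler(ExternalSourceHandler):
    """Hypothetical handler for an imaginary issue tracker."""

    URL_PATTERN = re.compile(r"https://tracker\.example\.com/ticket/(\d+)")

    def can_handle(self, url: str) -> bool:
        # Claim only URLs matching this tracker's ticket scheme.
        return bool(self.URL_PATTERN.match(url))

    def extract_identifier(self, url: str) -> Optional[str]:
        match = self.URL_PATTERN.match(url)
        return match.group(1) if match else None

    def fetch_content(self, url: str) -> Optional[ExternalContent]:
        ticket_id = self.extract_identifier(url)
        if ticket_id is None:
            return None
        # A real handler would call the tracker's API here.
        return ExternalContent(
            title=f"Ticket {ticket_id}",
            body="(fetched ticket body)",
            metadata={"ticket_id": ticket_id},
            source_type="mytracker",
            url=url,
        )
```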
ExternalSourceManager¶
Manages all external source handlers.
Initialize with all available handlers.
PARAMETER | DESCRIPTION |
---|---|
cache_manager | Optional cache manager for handlers |
Attributes¶
logger instance-attribute¶

cache_manager instance-attribute¶

handlers instance-attribute¶
Functions¶
process_url¶
Process a URL with the appropriate handler.
PARAMETER | DESCRIPTION |
---|---|
url | URL to process TYPE: str |

RETURNS | DESCRIPTION |
---|---|
Optional[ExternalContent] | External content or None if no handler can process it |
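A brief usage sketch; the import path is assumed from the external_sources module listed at the end of this page.

```python
from tenets.core.prompt.external_sources import ExternalSourceManager  # path assumed

manager = ExternalSourceManager()  # cache_manager is optional
content = manager.process_url("https://github.com/org/repo/issues/123")
if content is not None:
    print(content.title, content.source_type)
```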
GitHubHandler¶
Bases: ExternalSourceHandler
Handler for GitHub issues, PRs, discussions, and gists.
GitLabHandler¶
Bases: ExternalSourceHandler
Handler for GitLab issues, MRs, and snippets.
JiraHandler¶
Bases: ExternalSourceHandler
Handler for JIRA tickets.
LinearHandler¶
Bases: ExternalSourceHandler
Handler for Linear issues.
NotionHandler¶
Bases: ExternalSourceHandler
Handler for Notion pages and databases.
CacheEntry dataclass¶
CacheEntry(key: str, value: Any, created_at: datetime, accessed_at: datetime, ttl_seconds: int, hit_count: int = 0, metadata: Dict[str, Any] = None)
PromptCache¶
PromptCache(cache_manager: Optional[Any] = None, enable_memory_cache: bool = True, enable_disk_cache: bool = True, memory_cache_size: int = 100)
Intelligent caching for prompt parsing operations.
Initialize prompt cache.
PARAMETER | DESCRIPTION |
---|---|
cache_manager | External cache manager to use |
enable_memory_cache | Whether to use in-memory caching TYPE: bool |
enable_disk_cache | Whether to use disk caching TYPE: bool |
memory_cache_size | Maximum items in memory cache TYPE: int |
Attributes¶
DEFAULT_TTLS class-attribute instance-attribute¶

TTL_MODIFIERS class-attribute instance-attribute¶

logger instance-attribute¶

cache_manager instance-attribute¶

enable_memory instance-attribute¶

enable_disk instance-attribute¶

memory_cache instance-attribute¶

memory_cache_size instance-attribute¶

stats instance-attribute¶
Functions¶
get¶
put¶
put(key: str, value: Any, ttl_seconds: Optional[int] = None, metadata: Optional[Dict[str, Any]] = None, write_disk: bool = True) -> None
cache_parsed_prompt¶
get_parsed_prompt¶
cache_external_content¶
get_external_content¶
cache_entities¶
get_entities¶
cache_intent¶
get_intent¶
invalidate¶
get_stats¶
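A small usage sketch of the cache. The put signature is documented above; get(key) and the import path are assumptions.

```python
from tenets.core.prompt.cache import PromptCache  # import path assumed

cache = PromptCache(enable_disk_cache=False, memory_cache_size=50)
cache.put("intent:fix-login-bug", "debug", ttl_seconds=3600)

value = cache.get("intent:fix-login-bug")  # get(key) signature assumed
print(value)              # 'debug' until the TTL expires
print(cache.get_stats())  # hit/miss statistics
```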
Entity dataclass¶
Entity(name: str, type: str, confidence: float, context: str = '', start_pos: int = -1, end_pos: int = -1, source: str = 'regex', metadata: Dict[str, Any] = dict())
Recognized entity with confidence and context.
Attributes¶
name instance-attribute¶

type instance-attribute¶

confidence instance-attribute¶

context class-attribute instance-attribute¶

start_pos class-attribute instance-attribute¶

end_pos class-attribute instance-attribute¶

source class-attribute instance-attribute¶

metadata class-attribute instance-attribute¶
EntityPatternMatcher¶
FuzzyEntityMatcher¶
HybridEntityRecognizer¶
HybridEntityRecognizer(use_nlp: bool = True, use_fuzzy: bool = True, patterns_file: Optional[Path] = None, spacy_model: str = 'en_core_web_sm', known_entities: Optional[Dict[str, List[str]]] = None)
Main entity recognizer combining all approaches.
Initialize hybrid entity recognizer.
PARAMETER | DESCRIPTION |
---|---|
use_nlp | Whether to use NLP-based NER TYPE: bool |
use_fuzzy | Whether to use fuzzy matching TYPE: bool |
patterns_file | Path to entity patterns JSON |
spacy_model | spaCy model name TYPE: str |
known_entities | Known entities for fuzzy matching |
Attributes¶
logger instance-attribute¶

pattern_matcher instance-attribute¶

nlp_recognizer instance-attribute¶

fuzzy_matcher instance-attribute¶

keyword_extractor instance-attribute¶
Functions¶
recognize¶
recognize(text: str, merge_overlapping: bool = True, min_confidence: float = 0.5) -> List[Entity]
Recognize entities using all available methods.
PARAMETER | DESCRIPTION |
---|---|
text | Text to extract entities from TYPE: str |
merge_overlapping | Whether to merge overlapping entities TYPE: bool |
min_confidence | Minimum confidence threshold TYPE: float |

RETURNS | DESCRIPTION |
---|---|
List[Entity] | List of recognized entities |
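Putting the documented signatures together, a usage sketch (import path assumed):

```python
from tenets.core.prompt.entity_recognizer import HybridEntityRecognizer  # path assumed

recognizer = HybridEntityRecognizer(use_nlp=False)  # patterns + fuzzy only
entities = recognizer.recognize(
    "refactor the UserAuth class in auth.py",
    min_confidence=0.6,
)
for entity in entities:
    print(f"{entity.type}: {entity.name} ({entity.confidence:.2f})")
```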
NLPEntityRecognizer¶
HybridIntentDetector¶
HybridIntentDetector(use_ml: bool = True, patterns_file: Optional[Path] = None, model_name: str = 'all-MiniLM-L6-v2')
Main intent detector combining pattern and ML approaches.
Initialize hybrid intent detector.
PARAMETER | DESCRIPTION |
---|---|
use_ml | Whether to use ML-based detection TYPE: bool |
patterns_file | Path to intent patterns JSON |
model_name | Embedding model name for ML TYPE: str |
Attributes¶
logger instance-attribute¶

pattern_detector instance-attribute¶

semantic_detector instance-attribute¶

keyword_extractor instance-attribute¶
Functions¶
detect¶
detect(text: str, combine_method: str = 'weighted', pattern_weight: float = 0.75, ml_weight: float = 0.25, min_confidence: float = 0.3) -> Intent
Detect the primary intent from text.
PARAMETER | DESCRIPTION |
---|---|
text | Text to analyze TYPE: str |
combine_method | How to combine results ('weighted', 'max', 'vote') TYPE: str |
pattern_weight | Weight for pattern-based detection TYPE: float |
ml_weight | Weight for ML-based detection TYPE: float |
min_confidence | Minimum confidence threshold TYPE: float |

RETURNS | DESCRIPTION |
---|---|
Intent | Primary intent detected |
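A usage sketch based on the signatures above (import path assumed):

```python
from tenets.core.prompt.intent_detector import HybridIntentDetector  # path assumed

detector = HybridIntentDetector(use_ml=False)  # pattern matching only
intent = detector.detect("fix the flaky login test", combine_method="max")
print(intent.type, intent.confidence)  # e.g. a 'debug' or 'test' intent with a score
```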
detect_multiple¶
Intent dataclass¶
Intent(type: str, confidence: float, evidence: List[str], keywords: List[str], metadata: Dict[str, Any], source: str)
PatternBasedDetector¶
SemanticIntentDetector¶
ML-based semantic intent detection using embeddings.
Initialize semantic intent detector.
PARAMETER | DESCRIPTION |
---|---|
model_name | Embedding model name TYPE: str |
PromptParser¶
PromptParser(config: TenetsConfig, cache_manager: Optional[Any] = None, use_cache: bool = True, use_ml: bool = None, use_nlp_ner: bool = None, use_fuzzy_matching: bool = True)
Comprehensive prompt parser with modular components and caching.
Attributes¶
config instance-attribute¶

logger instance-attribute¶

cache instance-attribute¶
Functions¶
parse¶
get_cache_stats¶
clear_cache¶
Clear all cached data.
This removes all cached parsing results, external content, entities, and intents from both memory and disk cache.
Example

```python
parser.clear_cache()
print("Cache cleared")
```
warm_cache¶
Pre-warm cache with common prompts.
This method pre-parses a list of common prompts to populate the cache, improving performance for frequently used queries.
PARAMETER | DESCRIPTION |
---|---|
common_prompts | List of common prompts to pre-parse |
Example

```python
common = [
    "implement authentication",
    "fix bug",
    "understand architecture",
]
parser.warm_cache(common)
```
TemporalExpression dataclass¶
TemporalExpression(text: str, type: str, start_date: Optional[datetime], end_date: Optional[datetime], is_relative: bool, is_recurring: bool, recurrence_pattern: Optional[str], confidence: float, metadata: Dict[str, Any])
Parsed temporal expression with metadata.
TemporalParser¶
Main temporal parser combining all approaches.
Initialize temporal parser.
PARAMETER | DESCRIPTION |
---|---|
patterns_file | Path to temporal patterns JSON file |
Attributes¶
logger instance-attribute¶

pattern_matcher instance-attribute¶
Functions¶
parse¶
Parse temporal expressions from text.
PARAMETER | DESCRIPTION |
---|---|
text | Text to parse TYPE: str |

RETURNS | DESCRIPTION |
---|---|
List[TemporalExpression] | List of temporal expressions |
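A usage sketch based on the documented parse() signature and the TemporalExpression fields (import path assumed):

```python
from tenets.core.prompt.temporal_parser import TemporalParser  # path assumed

parser = TemporalParser()
for expr in parser.parse("show changes from last week and the daily standup notes"):
    print(expr.text, expr.type, expr.is_recurring)
```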
TemporalPatternMatcher¶
Pattern-based temporal expression matching.
Initialize with temporal patterns.
PARAMETER | DESCRIPTION |
---|---|
patterns_file | Path to temporal patterns JSON file |
PromptContext dataclass¶
PromptContext(text: str, original: Optional[str] = None, keywords: list[str] = list(), task_type: str = 'general', intent: str = 'understand', entities: list[dict[str, Any]] = list(), file_patterns: list[str] = list(), focus_areas: list[str] = list(), temporal_context: Optional[dict[str, Any]] = None, scope: dict[str, Any] = dict(), external_context: Optional[dict[str, Any]] = None, metadata: dict[str, Any] = dict(), confidence_scores: dict[str, float] = dict(), session_id: Optional[str] = None, timestamp: datetime = datetime.now(), include_tests: bool = False)
Context extracted from user prompt.
Contains all information parsed from the prompt to guide file selection and ranking. This is the primary data structure that flows through the system after prompt parsing.
ATTRIBUTE | DESCRIPTION |
---|---|
text | The processed prompt text (cleaned and normalized) TYPE: str |
original | Original input (may be URL or raw text) |
keywords | Extracted keywords for searching |
task_type | Type of task detected TYPE: str |
intent | User intent classification TYPE: str |
entities | Named entities found (classes, functions, modules) |
file_patterns | File patterns to match (.py, test_, etc) |
focus_areas | Areas to focus on (auth, api, database, etc) |
temporal_context | Time-related context (recent, yesterday, etc) |
scope | Scope indicators (modules, directories, exclusions) |
external_context | Context from external sources (GitHub, JIRA) |
metadata | Additional metadata for processing |
confidence_scores | Confidence scores for various extractions |
session_id | Associated session if any |
timestamp | When context was created TYPE: datetime |
Attributes¶
text instance-attribute¶

original class-attribute instance-attribute¶

keywords class-attribute instance-attribute¶

task_type class-attribute instance-attribute¶

intent class-attribute instance-attribute¶

entities class-attribute instance-attribute¶

file_patterns class-attribute instance-attribute¶

focus_areas class-attribute instance-attribute¶

temporal_context class-attribute instance-attribute¶

scope class-attribute instance-attribute¶

external_context class-attribute instance-attribute¶

metadata class-attribute instance-attribute¶

confidence_scores class-attribute instance-attribute¶

session_id class-attribute instance-attribute¶

timestamp class-attribute instance-attribute¶

include_tests class-attribute instance-attribute¶
Functions¶
add_keyword¶
Add a keyword with confidence score.
add_entity¶
Add an entity with type and confidence.
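A brief sketch of building a context by hand. The positional argument order for add_keyword and add_entity is an assumption inferred from their descriptions, as is the import path.

```python
from tenets.core.prompt import PromptContext  # import path assumed

ctx = PromptContext(text="implement oauth2 login")

# Argument order below is assumed: (value, confidence) and (name, type, confidence).
ctx.add_keyword("oauth2", 0.9)
ctx.add_entity("UserAuth", "class", 0.8)
```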
from_dict classmethod¶
Create PromptContext from dictionary.
get_hash¶
Compute a deterministic cache key for this prompt context.
The hash incorporates the normalized prompt text, task type, and the ordered list of unique keywords. MD5 is chosen (with usedforsecurity=False) for speed; collision risk is acceptable for internal memoization.

RETURNS | DESCRIPTION |
---|---|
str | Hex digest suitable for use as an internal cache key. |
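As an illustration of the documented scheme (not the package's exact implementation), a standalone function computing such a key might look like this:

```python
import hashlib

def prompt_hash(text: str, task_type: str, keywords: list[str]) -> str:
    """Sketch of the documented scheme: normalized text + task type +
    ordered unique keywords, hashed with MD5 (usedforsecurity=False)."""
    unique = list(dict.fromkeys(keywords))  # de-duplicate, preserve order
    payload = "\x1f".join([text.strip().lower(), task_type, *unique])
    return hashlib.md5(payload.encode("utf-8"), usedforsecurity=False).hexdigest()
```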
TaskType¶
Bases: Enum
Types of tasks detected in prompts.
Attributes¶
FEATURE class-attribute instance-attribute¶

DEBUG class-attribute instance-attribute¶

TEST class-attribute instance-attribute¶

REFACTOR class-attribute instance-attribute¶

UNDERSTAND class-attribute instance-attribute¶

REVIEW class-attribute instance-attribute¶

DOCUMENT class-attribute instance-attribute¶

OPTIMIZE class-attribute instance-attribute¶

SECURITY class-attribute instance-attribute¶

ARCHITECTURE class-attribute instance-attribute¶

MIGRATION class-attribute instance-attribute¶

GENERAL class-attribute instance-attribute¶
Functions¶
create_parser¶
create_parser(config=None, use_cache: bool = True, use_ml: bool = None, cache_manager=None) -> PromptParser
Create a configured prompt parser.
Convenience function to quickly create a parser with sensible defaults. Uses centralized NLP components for all text processing.
PARAMETER | DESCRIPTION |
---|---|
config | Optional TenetsConfig instance (creates default if None) DEFAULT: None |
use_cache | Whether to enable caching (default: True) TYPE: bool |
use_ml | Whether to use ML features (None = auto-detect from config) TYPE: bool |
cache_manager | Optional cache manager for persistence DEFAULT: None |
RETURNS | DESCRIPTION |
---|---|
PromptParser | Configured PromptParser instance |
Example

```python
parser = create_parser()
context = parser.parse("add user authentication")
print(context.intent)
```
parse_prompt¶
parse_prompt(prompt: str, config=None, fetch_external: bool = True, use_cache: bool = False) -> Any
Parse a prompt without managing parser instances.
Convenience function for one-off prompt parsing. Uses centralized NLP components including keyword extraction and tokenization.
PARAMETER | DESCRIPTION |
---|---|
prompt | The prompt text or URL to parse TYPE: str |
config | Optional TenetsConfig instance DEFAULT: None |
fetch_external | Whether to fetch external content (default: True) TYPE: bool |
use_cache | Whether to use caching (default: False for one-off) TYPE: bool |
RETURNS | DESCRIPTION |
---|---|
Any | PromptContext with extracted information |
Example

```python
context = parse_prompt("implement caching layer")
print(f"Keywords: {context.keywords}")
print(f"Intent: {context.intent}")
```
extract_keywords¶
Extract keywords from text using NLP components.
Uses the centralized keyword extractor with YAKE/TF-IDF/frequency fallback chain for robust keyword extraction.
PARAMETER | DESCRIPTION |
---|---|
text | Input text to analyze TYPE: str |
max_keywords | Maximum number of keywords to extract TYPE: int |
RETURNS | DESCRIPTION |
---|---|
List[str] | List of extracted keywords |
Example

```python
keywords = extract_keywords("implement OAuth2 authentication")
print(keywords)  # ['oauth2', 'authentication', 'implement']
```
detect_intent¶
Detect user intent from prompt text.
PARAMETER | DESCRIPTION |
---|---|
prompt | The prompt text to analyze TYPE: str |
use_ml | Whether to use ML-based detection (requires ML dependencies) TYPE: bool |
RETURNS | DESCRIPTION |
---|---|
str | Intent type string (implement, debug, understand, etc.) |
Example

```python
intent = detect_intent("fix the authentication bug")
print(intent)  # 'debug'
```
extract_entities¶
extract_entities(text: str, min_confidence: float = 0.5, use_nlp: bool = False, use_fuzzy: bool = True) -> List[Dict[str, Any]]
Extract named entities from text.
Identifies classes, functions, files, modules, and other programming entities mentioned in the text.
PARAMETER | DESCRIPTION |
---|---|
text | Input text to analyze TYPE: str |
min_confidence | Minimum confidence threshold TYPE: float |
use_nlp | Whether to use NLP-based NER (requires spaCy) TYPE: bool |
use_fuzzy | Whether to use fuzzy matching TYPE: bool |
RETURNS | DESCRIPTION |
---|---|
List[Dict[str, Any]] | List of entity dictionaries with name, type, and confidence |
Example

```python
entities = extract_entities("update the UserAuth class in auth.py")
for entity in entities:
    print(f"{entity['type']}: {entity['name']}")
```
parse_external_reference¶
Parse an external reference URL.
Extracts information from GitHub issues, JIRA tickets, GitLab MRs, Linear issues, Asana tasks, Notion pages, and other external references.
PARAMETER | DESCRIPTION |
---|---|
url | URL to parse TYPE: str |
RETURNS | DESCRIPTION |
---|---|
Optional[Dict[str, Any]] | Dictionary with reference information or None if not recognized |
Example

```python
ref = parse_external_reference("https://github.com/org/repo/issues/123")
print(ref['type'])        # 'github'
print(ref['identifier'])  # 'org/repo#123'
```
extract_temporal¶
Extract temporal expressions from text.
Identifies dates, time ranges, relative dates, and recurring patterns.
PARAMETER | DESCRIPTION |
---|---|
text | Input text to analyze TYPE: str |
RETURNS | DESCRIPTION |
---|---|
List[Dict[str, Any]] | List of temporal expression dictionaries |
Example

```python
temporal = extract_temporal("changes from last week")
for expr in temporal:
    print(f"{expr['text']}: {expr['type']}")
```
Modules¶
- cache: Cache module
- entity_recognizer: Entity Recognizer module
- external_sources: External Sources module
- intent_detector: Intent Detector module
- normalizer: Normalizer module
- parser: Parser module
- temporal_parser: Temporal Parser module