Term index
7743 terms
A
- A* search
- A/B test
- A/B testing
- A/B test of quality metrics
- A/B testing of prompts
- A100
- A100 80GB
- A10G
- A2A
- A2A Protocol
- A2C
- A3C
- AARRR metrics
- ABC
- ABSTAIN
- ACID
- ACID transactions
- ACK
- ACL
- ACME
- ADF test
- AES-256
- AG News
- AI Feynman
- AI Verify
- AI agents
- AI-constructed formal languages
- AIC
- AIM
- ALBEF
- ALBERT
- ALCE
- ALFWorld
- ALIGN
- ALiBi
- ALiPy
- AMD EPYC
- AMD MI300X
- AMQP
- ANLI
- ANN
- ANN index
- ANN-benchmarks
- ANSI codes
- AOF
- AOF rewrite
- AOT compilation
- AP
- API
- API access control
- API call
- API contract
- API costs
- API error
- API key
- API key rate limiting
- API tokens
- API tool calls
- APM
- APScheduler
- AQLM
- AQuA
- ARC
- ARC-AGI
- ARC-Challenge
- ARIMA
- ARM Neoverse V2
- ARPC
- ARPU
- ASCII art
- ASE
- ASQA
- AST
- ASYNC
- AUC
- AWQ
- AWS CLI
- AWS CloudWatch
- AWS Cost and Usage Report
- AWS DMS
- AWS EC2
- AWS Global Accelerator
- AWS Glue
- AWS Glue Schema Registry
- AWS KMS
- AWS Price List API
- AWS Pricing Calculator
- AWS Region
- AWS SQS
- AWS Secrets Manager
- AWS capacity reservations
- Aaronson
- Abort
- Absolute Positional Encoding
- Abstraction layer
- Accelerate
- Access control
- Accuracy on goldenset
- Action Correctness
- Action Distribution Drift
- Action F1
- Action safety rate
- Activation Statistics
- Activation patching
- Activation quantization
- Activation steering
- Active Connections
- Active Probing
- Active-Active architecture
- Active-Passive architecture
- Active-passive
- Activity
- Actix
- Actor Model
- Actor-Critic architecture
- ActorSystem
- AdaBelief
- AdaGrad
- AdaLoRA
- Adam optimizer
- AdamW
- Adapter layers
- AdapterFusion
- Adaptive KL penalty
- Adaptive Prompting
- Adaptive RAG
- Adaptive Retrieval
- Adaptive Wave Decoding
- Adaptive backoff
- Adaptive buffering
- Adaptive computation time
- Adaptive concurrency
- Adaptive context
- Adaptive decomposition
- Adaptive design
- Adaptive sampling
- Adaptive-RAG
- Add
- Additive Quantization
- Additive attention
- Additive masking
- Adjudication
- AdmissionController
- AdvBench
- AdvGLUE
- Advantage
- Adversarial Examples for Code
- Adversarial Instructions
- Adversarial POPE
- Adversarial attacks
- Adversarial generation
- Adversarial pattern
- Adversarial prompts
- Adversarial query
- Adversarial reprogramming
- Adversarial suffix
- Affine Transformation
- Age of the oldest message
- Agent Card
- Agent Communication Protocol
- Agent Framework
- Agent Pipeline
- Agent looping
- Agent permissions
- Agent safety constraints
- Agent self-confidence
- Agent state
- Agent tools
- Agent utilization
- Agent with Memory
- Agent with tools
- Agent-Eval
- Agent-Validator
- Agent-based approach
- AgentBench
- AgentContext
- AgentCostTracker
- AgentError
- AgentExecutor
- AgentInterface
- AgentManifest
- AgentOps
- AgentPool
- AgentRunner
- AgentScope
- Agentic AI
- Agentic RAG
- Agentic chunking
- Agentic loops
- Agentic planning
- Aging
- Agno
- Agreement
- Aider
- Airbyte
- Airflow
- Akka
- Aleatoric uncertainty
- Alembic
- Alert rule
- Alert threshold
- Alertmanager
- Alignment budget
- AllGather
- AllReduce
- Allocate
- Allocation rule
- Allure
- Alpaca
- Alpaca-LoRA
- Alpaca-format
- AlpacaEval
- AlphaFold 3
- AlphaGo
- AlphaProof
- AlphaSearch
- AlphaZero
- Amazon Comprehend
- Amazon EMR
- Amazon Kinesis
- Amazon Mechanical Turk
- Amazon Reviews
- Amazon SageMaker Ground Truth
- Amazon Step Functions
- Ambiguous query
- Amino acid sequence
- Amplitude
- Analytics events
- Annoy
- Anomaly Detection
- Anonymity
- Anonymization by prompting LLM
- Anonymized data
- Answer
- Answer Recall
- Answer quality
- Answer relevance
- AnswerVerifier
- Ant Colony Optimization
- Anthropic Claude API
- Anthropic HH-RLHF
- Anthropic SDK
- Anthropic evals
- Anthropic prompt caching
- Anycast
- Apache 2.0
- Apache Atlas
- Apache Beam
- Apache Bench
- Apache Flink
- Apache Spark Streaming
- Apache TVM
- Apicurio
- Apicurio Registry
- AppArmor
- Append-only log
- Approval latency
- Approval rate
- Approval voting
- Architecture rules
- Argilla
- Argo Rollouts
- ArgoCD
- Argument Accuracy
- Arithmetic intensity
- Arize
- Arize AI
- Arize Phoenix
- Array of Strings
- Arrival time
- Artificial Analysis
- Assertions
- Assistants API
- Assumptions
- Asymmetric Distance Computation
- AsyncAPI
- AsyncPipeline
- Asynchronous Execution
- Asynchronous H2D copy
- Asynchronous SM-to-SM copy
- Asynchronous verification
- Asyncio timeouts
- At-most-once
- Atheris
- Atomic Operation
- Attention
- Attention dilution
- Attention dropout
- Attention heads
- Attention kernel
- Attention patterns
- Attention pruning
- Attention score
- Attention: PV
- Attention: QK^T
- Attribution-perturbation consistency score
- Auction
- Auctioneer
- Audio RAG
- Audio encoder
- AudioCraft
- AudioLM
- Audit logging
- Authentication
- Author
- Authority score
- Auto Scaling
- Auto-docs
- Auto-remediation
- Auto-success rate
- AutoAWQ
- AutoDAN
- AutoGPT
- AutoGPTQForCausalLM
- AutoGen
- AutoModelForCausalLM
- AutoScheduler
- AutoTVM
- AutoTokenizer
- Autocut
- Autograd
- Automatic Prompt Engineering
- Autoregressive
- Autoregressive inference
- Average Pairwise Similarity
- Average Wait Time
- Average cost per delegation
- Average handoffs per query
- Average iterations
- Average steps
- Average steps per milestone
- AverageUtilization
- Avro
- AvroConsumer
- AvroProducer
- Axolotl
- Azure AI Red Team Tools
- Azure Content Safety
- Azure Durable Functions
- Azure Key Vault
- Azure Monitor
- ablation study
- abstractive summarization
- acceptance rate
- acceptance threshold
- acceptance window
- accessibility tree
- accidental harmful actions
- accumulation steps
- accuracy
- accuracy drop
- accuracy of winner prediction
- acks=all
- acquisition function
- action
- action head
- action items
- actions/github-script
- activation offloading
- activation variance
- activations
- active learning
- active learning loop
- adapter conflict
- adapter conflicts
- adapter_config.json
- adapter_model.safetensors
- adapters
- adaptive KL controller
- adaptive compute
- adaptive learning rates
- adaptive rate limiting
- adaptive reasoning depth
- adaptive resource allocation
- adaptive routing
- adaptive sparse attention
- add_messages
- additionalProperties
- advantage estimation
- adversarial examples
- adversarial filtering
- adversarial hard negative
- adversarial input
- adversarial patch
- adversarial probing
- adversarial prompt detection
- adversarial retrieval
- adversarial training
- agent
- agent distillation
- agent explanation fidelity
- agent handover
- agent registry
- agent specification
- agent state management
- agent swarm
- agent system
- agent versioning
- agent-manager
- agentic observability
- agentic workflows
- aggregation
- agreement matrix
- aio_pika
- aiobreaker
- aiocache
- aiohttp
- aiokafka
- aiomonitor
- aioresponses
- aiortc
- aiosmtpd
- ajv
- alembic upgrade head
- alert
- alert rules
- alerting
- alerting rule
- algbw
- alignment tax
- all-mpnet-base-v2
- all-to-all communication
- all_reduce_perf
- allkeys-lfu
- allkeys-lru
- allocated_bytes / reserved_bytes
- allowed variations
- allowed_patterns
- allreduce_perf
- alpha
- alpine
- altinity-clickhouse-grafana plugin
- ambiguity_score
- ambiguous
- ambiguous queries
- amortized upfront
- anchoring
- anchoring bias
- anisotropic quantization
- annotator
- annotator calibration
- answer_correctness
- answer_exact_match
- anti-contamination
- any-to-any generation
- anytree
- appendfsync
- approve/deny
- approximate LFU
- arbitrary resolution
- argparse
- artifact
- assertions_feedback
- assertions_handler
- assertions_max_retries
- associative memory
- async CUDA
- async GEMM
- async call
- async copy
- async data movement
- async generator
- async messaging
- async response
- async session
- async with
- async-profiler
- async/await
- asynchronous data copy
- asynchronous preprocessing
- asynchronous transaction barriers
- asyncio
- asyncio.Barrier
- asyncio.Lock
- asyncio.Queue
- asyncio.Semaphore
- asyncio.Task
- asyncio.gather
- asyncio.sleep
- asyncio.wait_for
- asyncpg
- at-least-once semantics
- at-most-once semantics
- atomic action
- atomic append
- atomic operations
- atomic read-write
- attack success rate
- attention compression
- attention entropy
- attention fusion
- attention masking
- attention metrics
- attention normalization
- attention pattern analysis
- attention projections
- attention sink
- auction-based task allocation
- audience calibration
- auto-commit
- auto-gptq
- auto-instrumentation
- auto-merging retrieval
- auto-scaling
- auto-tuning
- auto_wrap_policy
- autoencoder
- autogenerate
- automated testing
- automatic baseline computation
- automatic labeling
- autoregressive generation
- autoregressive model
- autoscaling inference
- auxiliary loss
- availability
- available blocks
- average handling time
- averageValue
B
- B-tree
- B200
- B2B
- B2C
- BAAI/bge-large-en
- BAAI/bge-m3
- BAE
- BBH
- BBQ
- BEIR
- BERT
- BERT classifier
- BERT-Attack
- BERT-large
- BERT-masking
- BERT-tiny
- BERTopic
- BERTscore
- BF16
- BGE-reranker
- BIC
- BIG-bench
- BLAS
- BLEU
- BLEU-4
- BLEURT
- BLIP
- BLIP-2
- BLPOP
- BM25
- BM25 hard negative
- BNS
- BOM
- BOS token
- BPE
- BPTT
- BYOC
- BabyAGI
- Backdoor poisoning
- Backend Engineer
- Backfill
- Background task
- BackgroundTasks
- Backlog
- Backtranslation
- Backward generation
- Bag-of-Words
- Bag-of-words bias
- Baggage
- Balance coefficient beta
- Balance factor
- Base frequency
- Base64 encoding
- BaseAgent
- BaseCallbackHandler
- Baseline Scenario
- Baseline utilisation
- Batch Encoding
- Batch Hard Triplet Mining
- Batch RAG
- Batch inference
- Batch ingestion
- Batch mode
- Batch request
- BatchNorm
- BatchSpanProcessor
- Batching scheduler
- Batching timeout
- Batching tool calls
- Bayesian Elo
- Bayesian approximation
- Bayesian optimization
- BeautifulSoup
- BeeGFS
- Behavior Drift
- Behavioral profiling
- Behavioral testing
- Belief Tracking
- Benchmarks
- Benign prompt
- Benjamini-Hochberg
- Bernoulli distribution
- BertViz
- Bi-encoder
- Bias Rate
- Bid
- Bidirectional LSTM
- BigBird
- BigQuery
- Binary Cross-Entropy Loss
- Binary classifier
- Binary quantization
- Binding
- Binpacking
- BioMedLM
- Bit signature
- Black-box attack
- Black-box extraction
- Black-box watermarking
- Blackboard
- Blackbox Exporter
- Blacklist/Whitelist
- Blackwell architecture
- Blame attribution problem
- Blameless culture
- Blameless postmortem
- Blob storage
- Block manager
- Block-based allocation
- Block-sparse attention
- Blocksworld
- Blockwise Parallel Transformer
- Bloom filter
- Bloom filter parameters
- Blue-green deployment
- Bonferroni correction
- BoolQ
- Boolean Filters
- Bootstrap estimation
- BootstrapFewShot
- BootstrapFewShotWithRandomSearch
- Borda count
- Bottleneck
- Bounded rationality
- BoundedSemaphore
- Bradley-Terry model
- Branch protection
- Branching
- Break-even Chart
- Breaking changes
- Brent method
- BridgeTower
- Brier score
- Broadcast
- Bucket resolution
- Budget utilization
- BudgetExceededError
- BufferWindowMemory
- Build engine
- Bulk API
- Bully algorithm
- Bulyan
- Bus utilization
- ByT5
- bAbI
- back-translation
- backdoor
- backdoor watermarking
- backpressure
- backpropagation
- backup
- backward compatibility
- backward pass
- bandit
- bank conflicts
- bare-metal instance
- base metrics
- baseline
- batch matrix multiplication
- batch mix
- batch search
- batch size
- batch update
- batch write
- batch operations
- batch/v1 Job
- batched scoring
- beam search
- beam_width
- behavior cloning
- benchmark
- benchmark chasing
- benchmark overfitting
- benchmark task generation
- beta
- bge-large-en-v1.5
- bgsave
- bias
- bias amplification
- binary good/bad
- binary metric
- binary search
- bind mount
- binning
- binomial testing
- biometric features
- biometric identification
- bit masks
- bitarray
- bitsandbytes
- bitsandbytes 4-bit quantization
- black
- black-box
- blacklist
- block
- block allocation
- block_size
- blocking cases
- blue team
- bonding
- bootstrap
- boto3
- bottlenecks
- boundaries
- bounded queue
- bounding box coordinates
- bounding boxes
- boxplot
- branch coverage
- branch efficiency
- branch prediction
- branch references
- branch rules
- breadth-first traversal
- break-even point
- bridge
- browser agent
- brute force
- bucket
- bucketing
- budget balance
- budget monitoring
- budget per session
- budget usage
- budgeting
- buffer management
- build time
- building index
- bulk insert
- bulkhead
- bunched kernel launches
- burn rate
- burst
- burst allowance
- busbw
- byte-level tokenization
- bytearray
C
- CAC
- CAF
- CAP theorem
- CAP_NET_RAW
- CAS
- CDC
- CDF
- CDN
- CDNA3
- CFD
- CHAIR
- CHAIRi
- CHAIRs
- CI
- CI validation
- CI artifacts
- CI linter
- CI/CD
- CI/CD for AI
- CI/CD for ML pipelines
- CI/CD for prompts
- CIDEr
- CIF
- CIFAR-10
- CJK
- CKY
- CLAP
- CLARE
- CLI
- CLIP
- CLIP score
- CLIP-based NSFW detector
- CLS Token
- CLUSTER SETSLOT
- CLUTRR
- CNN
- COCO
- COCO API
- COCO Captions
- COCONUT
- COGS
- COMET
- CONFIG SET
- CONFIG_RDMA_RXE
- CONTRIBUTING.md
- COPRO
- CORS
- CPU
- CPU RAM
- CPU bottleneck
- CPU inference
- CPU offload
- CPU sockets
- CPU-GPU synchronization
- CPU-bound
- CPU↔GPU transfers
- CQRS
- CRAFT
- CRC errors
- CRC32
- CRDT
- CRM
- CRNN
- CRUD
- CSI
- CSV
- CSV datasource
- CTC
- CTF
- CTGAN
- CTranslate2
- CUAD
- CUDA
- CUDA 11.8
- CUDA API
- CUDA API calls latency
- CUDA API peer access
- CUDA Execution Provider
- CUDA Samples simpleP2P
- CUDA caching allocator
- CUDA context
- CUDA cores
- CUDA event
- CUDA events
- CUDA graphs
- CUDA kernel
- CUDA streams
- CUDA_VISIBLE_DEVICES
- CUPED
- CUSUM
- CUTLASS
- CVE
- CVSS
- Cache Agent
- Cache Systems
- Cache effect
- Cache hit ratio
- Cache misses
- Cache stability
- Cache stampede
- Cache-Aside
- Cache-Control
- CacheInterface
- CachedContent
- Caching decorator
- Caching popular vectors
- Caching strategies in AI systems
- CalibratedClassifierCV
- Calibration
- Calibration RM
- Calibration queries
- Call graph
- Call-level scoping
- CallbackManager
- Camelot
- Camunda
- Cancellation
- Cancellation token
- CancelledError
- CapEx
- Capability
- Capability-based negotiation
- Capacity
- Caption-based approach
- Captum
- Cascade Uncertainty Amplification
- Cascading
- Cassandra
- Causal LM Head
- Causal Tracing
- Causal attention
- Causal density
- Celery
- Celery beat
- Centering
- Central tendency
- Centralized architecture
- Certified robustness
- Chain of Responsibility
- Chain rule
- Chain-of-Thought
- Chain-of-Thought fine-tuning
- Chain-of-Thought generation
- Chain-of-Thought critique
- Chain-of-verification
- Chameleon
- Chameleon attack
- Chaos Mesh
- Chaos Monkey
- Chaos Toolkit
- ChaosProxy
- Character Error Rate
- Character-level attack
- ChartQA
- Chat Completion
- ChatCompletion
- ChatGPT API
- ChatML
- ChatOllama
- ChatOpenAI
- ChatPromptTemplate
- Check
- CheckList
- Checker
- Checkpoints
- Chi-squared test
- Child span
- Chimera
- Chinook
- Choreography
- Choreography SAGA
- Chroma
- ChromaDB
- Chroot
- Chunk Recall@k
- Chunk overlap
- Chunked synthesis
- Chunkization
- Churn Rate
- Cilium
- Citation
- Citation checking
- CityHash
- Claims
- Class balance
- Classifier
- Classifier-Free Guidance
- Claude 3
- Claude 3 Haiku
- Claude 3 Opus
- Claude 3.5
- Claude 3.5 Sonnet
- Claude API
- Cleanlab
- CleverHans
- CliRunner
- ClickHouse
- Client-side rate limiting
- Clip ε
- Clone
- Clone-Structured Causal Graphs
- CloudFlare
- CloudWatch
- Clumsy
- Cluster autoscaler
- Cluster ratio
- ClusterIP
- ClusterQueue
- ClusterRole
- ClusterSecretStore
- Clustering
- CoDel
- CockroachDB
- Code
- Code Agents
- Code Classification
- Code Clone Detection
- Code Summarization
- Code as Representation
- Code as Representation Language
- Code by Zapier
- Code execution
- Code-as-Thought
- CodeBERT
- CodeBLEU
- CodeGraph
- CodeSearchNet
- Codex
- Coefficient of redundancy
- CogVLM
- Cognitive bias
- Cognitive scaffolding
- Cohen's Kappa
- Cohere Embed
- Cohere multilingual
- Cohere rerank
- ColBERT
- ColBERT multilingual
- ColBERT-v2
- ColBERTv2
- Colang
- Cold queries
- Cold storage
- Cold-start
- CollNet
- Collaborative
- Collection per tenant
- Collusion
- Colossal-AI
- Column-wise
- Command R+
- Commit log
- Commit-reveal scheme
- Common Crawl
- Common item equating
- Common subexpression elimination
- CommonsenseQA
- Communication overhead explosion
- Communication rounds
- Compact+delete
- Compacted topics
- Comparison Dataset
- Compensating actions
- Competitive
- Completion Queue
- Completion time
- Complex plane rotation
- Compliance
- ComponentRegistry
- Composability
- Composite score
- Compositionality
- Compression
- Compression ratio
- Compressive Transformer
- Compute capability
- Compute costs
- Compute engine
- Compute/Communication overlap
- Compute/communication ratio
- Concept direction
- Concept shift
- ConceptNet
- Concurrent delegation
- Concurrent kernels
- Concurrent requests
- Conda / venv
- Conditional Kappa
- Conditional VAE
- Conditioning
- Condorcet method
- Confidence bins
- Confidence calibration error
- Confidence penalty
- Confidence threshold
- Confident learning
- Confidential computing
- Config Versioning
- ConfigMap
- ConfirmDialog
- Conflict resolution
- Confluent
- Confluent Control Center
- Conformal prediction
- Confusion matrix
- ConnectX-6
- Connection Draining
- Consensus
- Consensus mechanism
- Consistency Rate
- Consistency checks
- Constitution
- Constitutional AI
- Constitutional adherence
- Constitutional prompt
- ConstitutionalChain
- Constrained RL
- Consul
- Consumer
- Consumer Groups
- Consumer Lag
- Consumer group
- Contamination Detection Toolkit
- Contamination rate
- Content Filter
- Content-Encoding
- Content-Type: text/event-stream
- Content-oriented application
- Context
- Context Builder
- Context Coverage
- Context Engineering
- Context Extension
- Context Recall
- Context caching
- Context leakage
- Context loss
- Context manager
- Context manipulation
- Context overflow
- Context precision
- Context propagation
- Context relevance
- Context vector
- Context window explosion
- ContextBoundary
- Contextual hints
- Contextual retrieval
- Continuous Backup
- Continuous relaxation
- Contract testing
- Contradiction check
- Contradiction rate
- Contrastive Activation Addition
- Contrastive decoding
- Convergence
- ConversableAgent
- Conversation state
- ConversationBufferWindowMemory
- ConversationSummaryBufferMemory
- ConversationSummaryMemory
- Conversational repair
- Conversion Rate
- Convertible RI
- Convoy effect
- Cooperative Groups
- Coordination
- Coordination Engineering
- Coordination score
- Coordinator
- Copy Engine
- Copy-on-write
- Coq
- CoreML
- Coroutine
- Correction accuracy
- Corrective RAG
- Correlation ID
- Correlation Metrics
- Correlation analysis
- Corrupted PDF
- Corrupted document
- Cosine Decay
- Cosine Noise Schedule
- Cosine Scheduler
- Cost Analysis
- Cost Engineering
- Cost Explorer
- Cost Structure
- Cost optimisation
- Cost optimization
- Cost per Delegation Path
- Cost per Improvement
- Cost per agent run
- Cost per correct answer
- Cost per good answer
- Cost per second of user wait
- Cost per successful answer
- Cost per successful task
- Cost per user
- Cost tracking
- Cost vs Revenue chart
- Cost-accuracy-latency trade-off
- Cost-adjusted accuracy
- Cost-aware planner
- CostTracker
- Counter
- Counterfactual
- Counterfactual fidelity
- Covariate shift
- Coverage report
- Coverage of errors
- Crash recovery
- CrashLoopBackOff
- Credentials
- Credit assignment
- Crescendo attack
- CrewAI
- Critical section
- Critique
- Cron
- Cross-Session Consistency Drift
- Cross-Validation
- Cross-attention
- Cross-validation annotators
- CrossEntropyLoss
- Crypten
- CuTe
- Cuckoo filter
- Cumulative Gain
- Curl
- Curriculum Learning
- Curse of dimensionality
- Cursor
- Custom CUDA kernel
- Custom Metrics API
- Custom Resource Definition
- Custom actions
- Custom layers
- Cybersecurity
- Cycles
- Cyclic graph
- Cypher
- Cython
- cProfile
- cache entry
- cache eviction policies
- cache invalidation
- cache invalidation strategies
- cache miss
- cache prefix
- cache rollback
- cache warming
- cache_control
- cache_creation_input_tokens
- cache_key
- cache_read_input_tokens
- cached response
- cachetools
- caching
- calibration dataset
- calibration error
- call directive
- call-center analytics
- callback
- callbacks
- calls per session
- canary deployment
- canary examples
- canary:true
- cancellation_latency
- cancellation_rate
- candidate
- candidate tree
- canonical perturbations
- capability negotiation
- capacity factor
- capacity planning
- caption generation
- cardinality
- carryover effect
- cascade
- cascade failure
- cascading agent system
- cascading agent systems
- cascading failures
- catastrophic forgetting
- causal LM
- causal masking
- causal reasoning
- causal-conv1d
- causalnex
- central planner
- central tendency bias
- centroid
- cgroups
- chain
- chain decomposition
- chain of actions
- change detection
- channel
- chaos engineering
- chaosmonkey
- chaostoolkit
- check_secrets.py
- chi-square test
- chosen/rejected pairs
- chunk enrichment
- chunk size
- chunk-based search
- chunked prefill
- chunking
- chunks
- churn
- circuit breaker
- circuitbreaker
- citation accuracy
- citation check
- claim extraction
- class imbalance
- class-balanced sampling
- class_weight
- classification
- click models
- clickhouse-client
- clickhouse-driver
- client output buffer
- client-side rate limiter
- clipping
- clock_gettime
- closed-form expression
- closed-form solution
- cluster
- cluster state
- cluster-based randomization
- co-adaptation
- co-shag
- coalesced_group
- code correctness rate
- code coverage
- code coverage metrics
- code embeddings
- code generation
- code injection
- code review
- codebook
- cognitive schema
- cohere/embed-multilingual-v3
- coherence
- coherence illusion
- cohesion
- colbert-ai
- cold cache
- cold standby
- collaboration count
- collaboration_latency_ms
- collaboration_success_rate
- collaboration_total
- collapse
- collate function
- collection
- colorama
- combinatorial auction
- commit offset
- commit_transaction
- commitment loss
- common knowledge base
- common tokenizer
- compacted topic
- compensating transaction
- compilation success rate
- completion rate
- completion tokens
- complex queries
- complexity scoring
- composite SLO
- composite fusion
- compound key
- compressed memory
- computation graph
- compute
- compute budget
- compute utilization
- compute-bound
- compute-communication overlap
- compute_bid
- computer use agent
- conciseness
- concurrency
- concurrent users
- concurrent.futures
- conda
- conditional edge
- conditional edges
- conditional vector
- conditional vectors
- confidence score
- confidence-based routing
- config.yaml
- configuration
- confirmation bias
- confirmation_prompt
- confluent-kafka
- confluent_kafka
- confounding
- confounding factors
- conftest.py
- connection handling
- connection pooling
- consistency
- consistency regularization
- consistent hashing ring
- consistent prefix
- constant folding
- constitutional check
- constrained decoding
- constraint propagation
- constraint satisfaction
- constraints
- construct validity
- consumer priority
- consumer-producer
- container orchestration
- containerization
- content hash
- content validity
- content-addressed
- content-based
- content_filter
- context adherence
- context distillation
- context drift
- context package
- context parallelism
- context preparation
- context preservation
- context separation
- context serialization
- context truncation
- context utilization
- context window
- context_features
- context_length_exceeded
- context_only
- context_precision
- contextual enrichment
- contextual representations
- continuous batching
- continuous learning
- continuous monitoring
- continuous red teaming
- contradiction
- contrast effect
- contrastive learning
- contrastive loss
- contrastive search
- control
- convergence time
- convolution
- cooldown
- coordination metrics
- copy with padding
- copytest
- core
- correction for multiple comparisons
- correlation
- cost
- cost anomaly detection
- cost attribution
- cost estimator
- cost management
- cost model
- cost of control
- cost of delegation
- cost of reasoning
- cost penalty
- cost per 1M tokens
- cost per hour
- cost per request
- cost per session
- cost per vector
- cost reduction
- cost savings
- cost tags
- cost threshold
- cost vs revenue
- cost-aware auto-scaling
- cost-aware caching
- cost-aware routing
- cost-latency trade-off
- cost-quality trade-off
- cost/latency/quality trade-off
- cost_table_version
- counter-offers
- counterfactual reasoning
- coverage
- coverage of API errors
- coverage-driven generation
- coverage-guided testing
- cp.async.bulk
- crash rate
- crashes
- crew
- criterion validity
- critic agent
- critical actions
- critical fraction
- critical workload
- cross-contamination
- cross-correlation
- cross-correlation heatmap
- cross-encoder
- cross-encoder vs bi-encoder
- cross-encoder/nli-deberta-v3-large
- cross-entropy loss
- cross-layer attention
- cross-layer connections
- cross-lingual recall@k
- cross-lingual transfer
- cross-model
- cross-region replication
- cross-session consistency
- cross-verification
- crossover
- crystal structure
- cuBLAS
- cuDNN
- cuda-memcheck
- cudaFree
- cudaLaunchCooperativeKernel
- cudaMalloc
- cudaMallocAsync
- cuda_malloc_count
- cuda_memtest
- curriculum adversarial training
- curse of length
- custom evaluators
- custom exporter
- custom generators
- custom metric
- custom metrics
- custom scheduler
- custom-metrics-apiserver
- cycle detection
D
- D2
- D2H
- D3PM
- DAG orchestration
- DALL-E
- DAN
- DBOS
- DBSCAN
- DBT
- DBpedia
- DCGM
- DCGM Exporter
- DCGM_FI_DEV_GPU_UTIL
- DCGM_FI_DEV_NVLINK_BANDWIDTH_TOTAL
- DDL
- DDoS
- DEAP
- DELETE /generations/{id}
- DETR
- DFA
- DGX
- DGX A100
- DGX H100
- DINOv2
- DLQ count
- DLQ size
- DMA
- DMA engine
- DNNL
- DNS failover
- DNS propagation
- DNS-based load balancing
- DOCX
- DOT format
- DP Inference
- DP-Fine-tuning
- DP retriever
- DPIA
- DPO gradient
- DPO loss
- DPOTrainer
- DQN
- DSGE models
- DSL
- DSPy
- DSPy Evaluate
- DSPyAssertionError
- DTensor
- DVC
- DVC pipeline
- DaemonSet
- Dafny
- Dagster
- Daily seasonality
- DailyDialog
- Dashboard
- Dask
- Data Augmentation for Code
- Data Collection
- Data Engineer
- Data Exchange
- Data Filtering
- Data Injection
- Data Quality
- Data Quality Monitoring
- Data Sanitization
- Data augmentation
- Data card
- Data contract
- Data extraction
- Data parallelism
- Data pipeline
- Data residency
- Data staleness
- Data transfer bottleneck
- Data versioning
- Data-centric AI
- Data-efficient fine-tuning
- DataLoader
- DataPool
- Databricks Dolly 15k
- Datadog
- Datadog APM
- Datalog
- Datasketch
- Davies–Bouldin Index
- DeBERTa-NER
- DeBERTa-NLI
- DeBERTa-v3
- DePlot
- Dead Letter Exchange
- Dead Letter Queue
- Dead letter
- Deadband
- Deadline
- Deadlock
- Deallocation
- Debezium
- Debugging
- Decentralized architecture
- Decentralized system
- Decision cache
- Decision matrix
- Decoder
- Decoder-only architecture
- Decoupling Score
- Deep Ensembles
- Deep health check
- DeepEval
- DeepSeek V2
- DeepSeek-MoE
- DeepSeek-R1
- DeepSeek-V2
- DeepSpeech
- DeepSpeed
- DeepSpeed Inference
- DeepSpeed Pipe
- DeepSpeed-MoE
- DeepSpeed-Ulysses
- DeepStream SDK
- DeepWordBug
- Deequ
- Default stream
- Defense in Depth
- Deficit round robin
- Definition of Done
- Degradation slope
- Degraded mode
- Delegated tools
- Delegation Efficiency
- Delegation Engineering
- DelegationManager
- Deliberative consensus
- Delimiter-based approach
- Delta
- Delta Lake
- Delta regularization
- Denoising
- Denoising score matching
- Dense Embedding
- Dense connections
- Dense model
- Dense rewards
- Density Functional Theory
- Deny by default
- Department
- Dependency between prompts
- Dependency injection in LLM pipelines
- Deprecated
- Deprecated Field
- Depth scaling without parameters
- Depth-First Search
- Dequantization overhead
- Deque
- Deserialization
- Design by Contract
- Design for failure
- Deskew
- Detection Delay
- Detection LLM
- Detectron2
- Deterministic runtime
- Deterministic seed
- Deterministic simulator
- Deterministic testing
- Detoxify
- DevEx
- DevOps Overhead
- DevTools Protocol
- Dialect
- Dialog system
- Diff
- Diffusion Models
- DiffusionBERT
- Dify
- Dify Prompt Management
- Digest
- Digital twin
- Direct I/O
- Direct Preference Optimization
- Direct mapping
- DirectML
- Directed Graph
- DirectoryLoader
- Disambiguation
- Disaster recovery drill
- Discounted Cumulative Gain
- Discounting
- DiscoveryRequest
- DiscoveryResponse
- Disk-based vector storage
- DiskANN
- Dispatcher
- DistilBERT
- DistilGPT2
- Distilabel
- Distinct-N
- Distractors
- Distributed Data Parallel
- Distributed Flash Attention
- Distributed rate limiting
- Distributed task queue
- Distributed tracing
- Distribution Collapse
- Distroless
- Divergent control flow
- Diversity bonus
- Django
- DoRA
- DoS
- DocLayNet
- DocTR
- Docker
- Docker Compose
- Docker Compose networking
- Docker Swarm
- Docker socket
- Docker image
- Dockerfile
- Docling
- Document Loader
- Document Understanding
- Document length
- Document type
- Document-based chunking
- Document-to-version mapping
- Domain
- Domain specialization
- Domain-adapted checkpoints
- Dominant Resource Fairness
- Donut
- Double Quantization
- Downstream model
- Downstream quality
- Downstream processes
- Draft7Validator
- Dragonfly
- DriftDetector
- Drop rate
- DropConnect
- Dry-run
- Dual control
- Dual index
- DuckDB
- DuckDuckGo
- Durable state
- Dynamic Quantization
- Dynamic Scoping
- Dynamic Task Mapping
- Dynamic confidence thresholds
- Dynamic evals
- Dynamic index update
- Dynamic list
- Dynamic padding
- Dynamic pricing
- Dynamic range
- Dynamic representation
- Dynamic resource allocation
- Dynamic routing
- Dynamic thresholds
- DynamoDB
- d'Aspremont–Gérard-Varet mechanism
- d_ff
- d_k
- d_model
- daily_spend_usd
- damp_percent
- dangerous action
- data cleaning
- data collator
- data drift
- data efficiency
- data labeling
- data lakehouse
- data lakes
- data lineage
- data locality
- data migration
- data mixing
- data programming
- data reordering
- data transfers
- data types
- data validation
- database
- database schema
- dataclass
- dataset
- dataset diversity
- dataset format
- dateparser
- day-of-week effect
- dbt contract
- dd
- dead code elimination
- dead neurons
- deadlock detection
- deadlock detection time
- decay rate
- decentralized control
- decode
- decoder-only LLM
- decoder-only model
- decoderbufs
- decorator
- decorator pattern
- decoupling
- dedup table
- deep eval
- deep learning models
- deepcopy
- deepdiff
- default partition
- default values
- default_segment_number
- defaultdict
- defensive distillation
- defineTool
- degradation
- degradation detection
- degradation threshold
- degraded UX
- delayed scaling
- delegation
- delegation by exception
- delegation chain
- delegation failure cascade
- delegation paths
- delegation_duration_seconds
- delegation_failure_total
- delegation_requests_total
- delegation_success_total
- deliberate decoding
- delta method
- delta weights
- demo.py
- demonstration
- dependency injection
- dependency management
- dependency tracking
- dependent functions strategy
- depends_on
- dequantization
- detach
- determinism rate
- device_map
- dialogue-based paradigm
- diff2html-cli
- differencing
- differential privacy
- difflib
- diffusers
- diffusion LLM
- diffusion backends
- diffusion model
- digital signature
- dilated sliding window
- diminishing returns
- disaster recovery
- discovery
- discriminated union
- distributed AI system
- distributed cache
- distributed file system
- distributed locking
- distributed systems
- distributed training
- distributed transactions
- distribution fidelity
- divergence
- diverse beam search
- diversity
- diversity sampling
- divide and conquer
- dlt
- do_sample
- docker-compose up
- docker-compose.yml
- docstring
- document classification
- document injection
- document masking
- document_id
- documents
- domain expert
- domain shift
- dominant strategy
- double auction
- double buffering
- downsampling
- downstream metrics
- downstream tasks
- dp_accounting
- draft length
- draft model
- drain
- drain connections
- drift
- drift detection
- drift metrics
- retrieval quality drift
- drill-down
- drop_caches
- dropout
- dslim/bert-base-NER
- dspy.Cache
- dspy.Predict
- dspy.ProgramOfThought
- dspy.Retrieve
- dspy.serialize
- dstat
- dtype
- dual write
- dual-write
- duckduckgo_search
- duplicate detection
- duplicate questions
- duplicate ratio
- durable pull subscription
- dynamic analysis
- dynamic benchmark
- dynamic benchmarks
- dynamic context
- dynamic facets
- dynamic loss scaling
- dynamic programming
- dynamic scaling
- dynamic secrets
- dynamic shapes
- dynamic temperature
- dynamic tree construction
E
- E2E scenario
- E4M3
- E5-large
- E5M2
- EAGLE-1
- EAGLE-2
- EAGLE-3
- EAP
- EBS
- EBS volume
- EC2 instance type
- EFA
- EFS
- EKS
- ELBO
- ELK
- ELT
- EM convergence
- EOS token
- ESM3
- ESMFold
- ETL
- ETag
- EU AI Act
- EWC
- EWMA
- EXIF
- Early Stopping
- Early exiting
- EasyEdit
- EasyOCR
- EdSurvey
- Edge Accuracy
- Edge computing
- Edge deployment
- Edit distance
- Effective context length
- Effective cost per token
- Efficacy
- Efficiency
- EfficientNet
- Effort
- Elaboration
- ElastiCache
- Elasticity
- Elasticsearch
- Elbow method
- Election timeout
- ElevenLabs Turbo
- Elo rating
- Embedding Consumer
- Embedding Models
- Embedding Pipeline
- Embedding Rotation
- Embedding Signature
- Embedding dimension
- Embedding diversity
- Embedding drift
- Embedding dropout
- Embedding layer
- Embedding normalization
- Embedding shift
- Embedding space
- Embedding throughput
- Embedding workers
- Embedding-as-a-Service
- Embedding-based approach
- Embedding-based expansion
- Embeddings caching
- Empty document
- EnCodec
- Encoder
- Encoder-decoder transformer
- Encoder-only transformer
- Encryption
- Encryption at rest
- Encryption in memory
- Encryption in transit
- Encryption in use
- End-to-end
- End-to-end metrics
- End-to-end streaming
- End-to-end test
- End-to-end testing
- Energy distance
- Enron subset
- Ensemble of models
- Enterprise Contract
- Entity Extraction
- Entity Linking
- Entity Masking
- Entropy
- Envoy
- Envoy filter
- Ephemeral sequential node
- Episodic memory
- Epistemic uncertainty
- Epsilon
- Epsilon-greedy
- Equivariance
- Equivariant GNN
- Error rate
- Escalation flag
- Escalation of privileges
- Escalation rate
- Escalation system
- Eureqa
- Eval runner
- EvalAI
- Evaluate
- Evaluation
- Evaluation API
- Evaluator
- Event loop blocking
- Event processing latency
- Event sourcing
- Event streaming
- Event-driven sampling
- EventCollector
- EventSource API
- EventSourceResponse
- Eventually consistent
- Eviction policy
- Evidence Override Rate
- Evidently AI
- Evol-Instruct
- Evolution
- Evolutionary algorithms
- ExLlama
- Exact Set Match
- Exact attention
- Exact duplicate
- Exact hashing
- Exact kNN
- Exact match cache
- Exact-Match Cache
- Exception handler
- Exceptions
- Excessive Agency
- Exchange
- Execution
- Execution Accuracy
- Execution errors
- Execution guarantee
- Execution time
- Expansion
- Expected Calibration Error
- Expected trajectories
- Expert
- Expert Choice Routing
- Expert Specialization
- Expert agent
- Expert agreement
- Expert arbitration
- Expert knowledge
- Explanation Faithfulness
- Explanation-Decision Decoupling
- Explicit forget mechanism
- Explicit transitions
- Exploitation
- Exploration vs exploitation
- Exponential decay initialization
- Exponential growth of trajectories
- Exponential moving averages
- External Authorization
- External Secrets Operator
- ExtractNewRecordState
- Extraction attacks
- Extractive QA
- Extractive Summarization
- Extrapolation
- Extrinsic evaluation
- e5-large-v2
- eBPF
- eager PyTorch
- eager invalidation
- early fusion
- easy negative
- easy negatives
- echo server
- edge
- edge case
- edge cases
- edge device
- edge_coverage
- edges
- ef
- ef_construct
- ef_construction
- ef_search
- effect size
- effective batch size
- effective reserved cost
- efficiency_gap
- embedding
- embedding API
- embedding distribution
- embedding inversion
- embedding model degradation
- embedding poisoning
- embedding model
- emergent behavior
- emergent specialization
- emptyDir volume
- emulator
- en_core_web_trf
- enable.idempotence
- encryption-at-rest
- end verifier
- end-to-end compiler
- end-to-end learning
- end-to-end training
- energy prediction
- enforce_partition_keys
- ensemble
- ensemble RM
- ensemble adversarial training
- ensemble generation
- ensemble reward models
- ensemble-based decoding
- entailment
- entropy bonus
- entrypoint
- enum
- env
- environment variable
- environment variables
- epistemic vigilance
- epoch
- error accumulation
- error budget
- error code
- error handling
- error penalty
- error recurrence
- error status
- error_handling
- error_rate_429
- escalation
- escape
- etcd
- eval set
- eval pipeline
- evaluation leakage
- evaluation overfitting
- evaluation report
- evaluator scores
- evaluator-based evaluation
- evaluator-based quality assessment
- evasion
- event loop
- event pattern
- event tracking
- event-driven invalidation
- event-stream
- eventual consistency
- evicted keys
- evolve-check
- exact filter
- exact match caching
- exactly-once delivery
- examination probability
- exception class
- execution feedback
- executive summary
- executor agents
- exit 0
- exllamav2
- expand
- expandable_segments
- expander
- expected trajectory
- expected_final
- expected_trajectory
- experience exchange
- expert layers
- expert parallelism
- expert placement
- explicit feedback
- exploding gradients
- exploration
- exploration/exploitation
- exponential backoff
- ext4
- extended resource
- external models
- externality
- extism
F
- F1
- F1 for a subgraph
- FAISS index serialization
- FAPE
- FAQ
- FATE
- FEC
- FEM
- FEniCS
- FFN
- FFN dropout
- FFT
- FFmpeg
- FGSM
- FIFO
- FIFO queue
- FIRING
- FLARE
- FLAVA
- FLOPs
- FLP theorem
- FLUSHALL
- FP16
- FP32
- FP32 master weights
- FP4
- FP8
- FP8 Tensor Core
- FP8 quantization
- FP8-aware training
- FPGA
- FPR@TPR=0.95
- FPS
- FSDP
- FX graph
- Factual Drift
- Failed inference
- Failover threshold
- Failover time
- Failure Blocking
- Failure mode
- Failure mode: node failure
- Failure modes
- Fair share
- Fair use
- Fairscale
- Faiss
- Faiss IVF-PQ
- FaithScore
- Faithfulness
- Faithfulness metrics
- Faithfulness threshold
- Faker
- Falco
- Falcon
- Fallback Adapter
- Fallback Usage
- Fallback chain
- Fallback message
- Fallback model
- FallbackContext
- False Negative Rate
- False negatives
- False positives
- Familiarity bias
- Fan-out/fan-in
- Fast Downward
- Fast-Conformer
- FastAPI
- FastAPI dependencies
- FastAPI dependency injection
- Faster R-CNN
- Fatigue index
- Faust
- Feast
- Feature Detection
- Feature Engineering for RAG
- Feature engineering
- Feature flag
- Feature group
- Feature hashing
- Feature monitoring
- Feature selection
- Feature validation
- Feature view
- Feature-Aware
- Feature-Aware Speculative Decoding
- Feedback mechanism
- Fencing token
- Few-shot examples
- Few-shot jailbreak
- Few-shot poisoning
- FewShotChatMessagePromptTemplate
- FiD
- Fidelity
- Figma
- File System Watcher
- Filebeat
- Filesystem
- Filter selectivity
- Filtered ANN Search
- Filtering
- Filters
- FinTabNet
- Final Answer Match
- Fine-tuning the embedder
- Fine-tuned model
- Fine-tuning LLM for Agents
- Fine-tuning cost
- Fine-tuning loop
- Fiqa
- Firecracker
- Firecracker-containerd
- Fireworks AI
- First-come-first-serve
- First-price auction
- Fisher Information Matrix
- Fivetran
- Fixed shapes
- Fixed window
- Fixed-size chunking
- Flagger
- Flakiness
- Flaky test
- Flaky tests
- Flamingo
- Flan-T5
- Flan-T5-small
- FlanT5
- Flapping
- Flash Attention 2
- Flash Decoding
- Flash crowd
- FlashAttention
- FlashAttention-3
- FlashDecoding
- Flat planning
- Fleiss' Kappa
- FlexGen
- Flexibility
- Flickr30k
- Flickr30k Entities
- Flickr8k
- Flink Kubernetes Operator
- FlopCountAnalysis
- Flower
- Fluency
- Flux
- Flywheel
- Focal loss
- Follow-the-sun
- Foolbox
- Forced alignment
- Forced tour
- Formal Verification
- Formal language
- Formal plan
- Format Adherence
- Format prompt
- Forward hook
- Foundation models with built-in budget
- Free Tier
- Frozen LLM
- Full checkpointing
- Full delegation
- Full fine-tuning
- Full harness
- FullyShardedDataParallel
- Function Permutation
- Funnel
- Fusing
- Fusion ranking
- Fusion reranking
- Fuyu-8B
- FuyuProcessor
- Fuzzing
- f-strings
- facet
- faceted search
- facilitator
- fact checking
- fact-checking
- factor graph
- factual grounding
- fail-closed
- fail-fast
- fail-safe architecture
- failed trajectories
- failed trajectory
- failure analysis
- failure cases
- failure cost
- failure detection
- failure mode: Ollama not responding
- failure point
- failureThreshold
- failures_total
- fairness metrics
- fairness scheduling
- faithfulness scorer
- fake LLM
- fake device plugin
- fakeredis
- fallback adapters
- fallback model
- fallback block
- fallocate
- false escalation rate
- false positive
- fan_in
- fast rejection
- fastapi-admin
- fastavro
- fastchat
- faster-whisper
- fastparquet
- fasttext
- fatigue
- fatigue bias
- fatigue curve
- fault injection
- feature
- feature definition
- feature flags
- feature store
- feature-aware draft model
- feature-based billing
- feature_usage_logs
- federated learning
- feedback embeddings
- feedback rate
- feedback_log.jsonl
- fetch_20newsgroups
- field extraction
- filter_order
- filtered ANN
- final response
- fine-tuning
- fine-tuning embedding model
- finish_reason
- fio
- fire-and-forget
- first-class objects
- first-order optimization
- first-stage retrieval
- fixed cost
- fixture
- flake8
- flake8-async
- flamegraph
- flan-t5-large
- flat minima
- flocking
- flow
- foraging
- force push
- formal specifications
- formal verifier
- format constraints
- format exploitation
- forward compatibility
- forward pass
- four golden signals
- frame embeddings
- frame sampling
- free-riding
- freemium
- freeze
- frequency analysis
- frequency coverage
- frequency penalty
- frequency threshold
- frozen
- fsync
- fsync_after_insert
- full attention
- full compatibility
- full file strategy
- full invalidation
- full jitter
- full re-indexing
- full-duplex
- fully-connected network
- function calling
- functional correctness
- fuzzy matching
G
- G*Power
- G-Pipe
- GAN
- GAN-based detection
- GAN-style
- GC
- GC pause
- GCG
- GCS
- GDPR
- GELU
- GEMM
- GEMM (General Matrix Multiply) in LLMs
- GES algorithm
- GET /prompts/{id}/latest
- GET /tasks/{id
- GGUF
- GID
- GIST1M
- GKE/AKS
- GLOO
- GLU
- GLUE
- GLaM
- GNoME
- GP3
- GPQA
- GPT-2
- GPT-2 Medium
- GPT-2 small
- GPT-2 tokenizer
- GPT-3
- GPT-3.5
- GPT-4
- GPT-4 Turbo
- GPT-4 eval
- GPT-4V
- GPT-4o
- GPT-4o mini
- GPT2Block
- GPTCache
- GPTQ
- GPU
- GPU Direct
- GPU Direct RDMA
- GPU Inference
- GPU acceleration
- GPU affinity
- GPU allocation
- GPU cluster
- GPU instance
- GPU memory
- GPU memory leak
- GPU memory management
- GPU scheduling
- GPU time
- GPU utilization
- GPU utilization drop
- GPU servers
- GPU-hour
- GPU exporter
- GPipe
- GRPO
- GSM8K
- GUID
- Game Days
- GameDay
- Gamification
- Gang scheduling
- Garak
- Gateway
- Gating function
- Gatling
- Gauge
- Gaussian Mixture Model
- Gaussian Process
- Gaussian noise
- Gemini
- Gemini 1.5 Flash
- Gemini 1.5 Pro
- Gemini API cache
- Gemma
- Gemma-2B
- General agent
- Generalized Advantage Estimation
- Generation confidence
- Generative attack
- Generative attacks
- Generative replay
- Geo-routing
- Ghost Clipping
- GigaChat
- Gini coefficient
- Giskard
- Git
- Git Flow
- Git LFS
- Git hook
- Git notes
- Git repository
- Git-based approach
- GitHub Actions
- GitHub Actions Summary
- GitHub Copilot
- GitLab CI
- GitOps
- GitPython
- GloVe
- Global + Local Attention
- Global load balancer
- Global reputation
- Gmail API
- Go
- Goal Success Rate
- Goal condition rate
- Goal divergence
- Gold trajectory
- Golden Holdout
- Golden path
- Goldenset
- Goodhart's law
- Google Analytics 4
- Google C4 dataset
- Google Calendar API
- Google Colab
- Google DLP
- Google Generative AI SDK
- Google Pub/Sub
- Google T5X
- Google TPU Pods
- Gossip protocol
- Grace Hopper
- Grace period
- Graceful cancellation
- Graceful preemption
- Gradient Boosted Regression Trees
- Gradient Conditioning
- Gradient Pulse
- Gradient compression
- Gradient inversion attack
- Gradient sharding
- Gradient-based attack
- Gradient-based prompts
- Gradient-based search
- Gradio
- Gradual Trust
- Grafana
- Grafana Cloud
- Grafana Tempo
- Grafana dashboard
- Granger causality test
- Grant
- Graph
- Graph Neural Network
- Graph caching
- Graph databases for prompt lineage
- Graph instantiation
- Graph path
- Graph replay
- Graph-of-Thoughts
- GraphCypherQAChain
- GraphQL
- GraphQL subscriptions
- GraphRAG
- Graphs
- Great Expectations
- Greedy speculative decoding
- Green ratio
- Gremlin
- Griffin
- Groma
- Groq
- Gross Profit
- Group Normalization
- Group size
- Group-wise quantization
- GroupChat
- GroupKFold
- Groupcache
- Grouped-Query Attention
- Growth Scenario
- Guaranteed QoS
- Guard agent
- Guardrails AI
- Guidance overrides evidence
- Gumbel-Softmax
- Gwet's AC1
- Gymnasium
- gRPC
- gRPC Load Balancing
- gRPC RESOURCE_EXHAUSTED
- gRPC metadata propagation
- gRPC webhook
- gVisor
- gain
- garbage response
- gated attention
- gated averaging
- gated cross-attention
- gated relevance
- gated residual connections
- gating
- gdrcopy
- gen_len
- generalization
- generate_image
- generation
- generation.latency_ms
- generative model
- generator
- genetic programming
- git revert
- global attention
- global dictionary
- global memory
- global rate limiting
- gold documents
- gold standard
- golden examples
- google-api-python-client
- google-auth-httplib2
- google-auth-oauthlib
- gpt-3.5-turbo
- gpu-burn
- gpu-exporter
- gpu-memory-utilization
- gpustat
- graceful degradation
- graceful shutdown
- gradcheck
- gradient accumulation
- gradient clipping
- gradient descent
- gradient flow
- gradient leakage
- gradient masking
- gradient monitoring
- gradient noise
- gradient norms
- gradient scaling
- gradient step
- gradient synchronization
- gradient-based
- gradient-based methods
- gradients
- gradual fine-tuning
- grammar
- graph breaks
- graph coloring
- graph embedding
- graph imbalance
- graph optimization
- graphviz
- greedy mode
- greedy traversal
- green list
- grey-box attack
- grid
- grid-level synchronization
- grid_group
- grid_group.sync
- gross margin
- ground truth subgraph
- group strategyproofness
- gsarti/synthetic_imdb
- guidance
- gzip compression
H
- H100
- H2D
- H2O
- H3
- HAProxy
- HBM
- HBM3
- HCA
- HDBSCAN
- HDR100
- HEART framework
- HELMET
- HGX
- HINCRBY
- HIPAA
- HLO
- HMAC
- HNSW
- HNSW+IVF hybrid
- HPC
- HQQ
- HSM
- HTML
- HTML table
- HTN
- HTR
- HTTP
- HTTP 200
- HTTP 201
- HTTP 404
- HTTP 429
- HTTP 429 Too Many Requests
- HTTP 500
- HTTP 503 Service Unavailable
- HTTP Bulk Insert
- HTTP PUT
- HTTP Request Node
- HTTP idempotency
- HTTP/2 multiplexing
- Hallucination detection
- Hallucination in reasoning
- Hamming distance
- Hand-crafted jailbreaks
- Handlebars
- HandoffSignal
- Hapax Legomena Ratio
- Happy path
- Hard constraints
- Hard failure
- Hard watermarking
- Hard-coded Prompt
- Hard-negative mining
- Hardening
- Hardware acceleration
- HarmBench
- Harness Engineering
- Harness-engineering
- Hash function
- Hashing
- Haystack
- Haystack Tracing
- Hazelcast
- Head-based sampling
- Header Accuracy
- Heading
- Health check failure
- Heavy Hitter
- Hebbian learning
- Hedge word penalty
- Helicone
- HellaSwag
- Helm values
- Helpfulness / Harmlessness
- Helsinki-NLP/opus-mt-en-ru
- Hessian
- Hidden dimension
- Hidden state
- Hierarchical
- Hierarchical Hit Rate
- Hierarchical Indexing
- Hierarchical Planning
- Hierarchical Retrieval
- Hierarchical Summarization
- Hierarchical chunking
- Hierarchical delegation
- Hierarchical memory
- Hierarchical resource quotas
- Hierarchical structure
- HierarchicalNodeParser
- High-Throughput
- High-level planner
- HighFailureRate
- HighLatency
- Hiredis
- Histogram
- Histogram binning
- History
- Hit rate
- Hit rate retrieval
- Hit rate@5
- Hit@3
- Hold-out validation
- Holdout set
- Homogeneous data
- Homomorphic Encryption
- Honeypot
- Honeypot requests
- Hopper GPU
- Hopsworks
- Horizon
- Horizontal Pod Autoscaler
- Horizontal scaling
- Horovod
- Hot storage
- Hot-swap
- HotFlip
- HotpotQA
- Hough Lines
- HtmlDiff
- HuBERT
- Huber Loss
- Hugging Face
- Hugging Face CrossEncoder
- Hugging Face Evaluate
- Hugging Face Inference API
- Hugging Face Inference Endpoints
- Hugging Face PEFT
- Hugging Face TRL
- Hugging Face Trainer
- HuggingFace Evaluate
- HuggingFace Optimum
- HuggingFace Transformers
- HuggingFace dataset
- HuggingFace pipeline
- HuggingFaceEmbeddings
- HuggingFaceH4/ultrachat_200k
- HuggingFaceTB/SmolLM2-360M-Instruct
- Huggingface CLI
- Human acceptance rate
- Human evaluation
- Human evaluation costs
- Human labels
- Human response time
- Human validation
- Human workload
- HumanEval
- Humanloop
- HyDE
- Hybrid Learned + HNSW
- Hybrid architecture
- Hybrid delegation
- Hybrid detection
- Hybrid eval-set
- Hybrid scaling
- Hybrid update strategy
- HybridModel
- Hydra
- Hyena
- Hyena Operator
- Hypergraph
- Hypernetwork
- Hyperopt
- Hyperparameter
- Hypervolume
- Hypothesis
- Hypothetical
- Hypothetical role
- Hystrix
- half-open state
- hallucinated execution
- hallucination
- handcrafted features
- handover request
- handshake
- hard label
- hard labels
- hard limit
- hard negatives
- hard stop
- harmfulness score
- harness-one
- harness-one/tools
- hash
- hash cache
- hash encoding
- hash index
- hashlib
- hashlib.md5
- head_dim
- health check
- heatmap
- heavy-tailed distribution
- helm
- heuristics
- hey
- hidden representations
- hidden_size
- hierarchical SLO
- hierarchical agents
- hierarchical context
- hierarchy
- high latency
- high similarity
- high variance
- high-cardinality metrics
- high-risk context
- highway networks
- hinge loss
- histogram_quantile
- historical pilot
- hit_count
- hnswlib
- horizontal fusion
- host.docker.internal
- hostname
- hot index
- hot key
- hot requests
- hot restart
- hot shard
- hot shard detection
- hot spots
- hot-reload
- hot/warm strategy
- hot/warm indexes
- hotfix
- htop
- httpx
- human agreement
- human baseline
- human feedback score
- human judgments
- human-in-the-loop
- hybrid CPU/GPU deployment
- hybrid approach
- hybrid model
- hybrid scheduling
- hyena-dna
- hyperparameter search
- hyperparameters
- hypothesis engine
- hypothetical attack
I
- I/O
- I/O-bound
- IA3
- IAM
- IA³
- IBM AI Fairness 360
- IBV_SEND_INLINE
- IBV_SEND_SIGNALED
- ICA
- ICI
- IDF
- IMAP
- IMDb
- INDEX.md
- INT4
- INT8
- IO-aware
- IO-awareness
- IOPS
- IOR
- IP-based rate limiting
- IQR
- IREE
- IRR
- IRSA
- ISO 8601
- ISO/IEC 42001
- IVF+PQ
- IVFFlat
- Iceberg
- Ideal DCG
- Idempotent consumer
- Idempotent upsert
- Idempotent writes
- Identity Preference Optimization
- Identity Provider
- Identity mapping
- Image
- Image patches as tokens
- Image-Text Contrastive
- Image-Text Matching
- Image-grounded Text Generation
- ImageBind
- ImageNet
- Imagen
- Imitation learning
- Immutable Version
- Imperceptibility
- Implicit KL regularization
- Implicit feedback
- Improvement rate
- In-Context Learning
- In-Memory
- In-memory cache
- In-memory grid
- In-place rollback
- In-place update
- Incentive design
- Incremental indexing
- Independent Draft
- Independent heads
- IndexFlatIP
- IndexFlatL2
- IndexHNSW
- IndexIDMap
- IndexIVFPQ
- IndexIVFScalarQuantizer
- IndexScalarQuantizer
- Indirect Prompt Injection
- Inductor
- InfLLM
- Inference attack
- Inference cost
- Inference engine
- Inference scheduler
- Inference server
- Infini-attention
- InfiniBand
- InfiniBand NDR 400
- InfiniBand partition keys
- Infinity
- Infinity Fabric
- InfluxDB
- InfluxDB line protocol
- InfoNCE
- Information Gain
- Information loss between agents
- Ingestion latency
- Ingestion service
- Inner Model
- Inpainting
- Input Filter
- Input compression
- Input filtering
- Input sanitization
- Insecure Output Handling
- Insecure Plugin Design
- Instance Normalization
- InstructLab
- Instruction Formatting
- Instruction prefix
- Instruction tuning
- Instruction-response pair
- Integrated Gradients
- Integration test for prompt chain
- Integration testing
- Intent classification
- Inter-GPU bandwidth
- Inter-agent communication system
- Inter-annotator agreement
- Inter-cluster Distance
- Inter-cluster diversity
- Interactive prototype
- Interleaved 1F1B
- Interpretability
- Intersection over Union
- Intra-cluster Distance
- Intra-cluster diversity
- Intra-list diversity
- Intra-list similarity
- Intra-session diversity
- Intrinsic evaluation
- Intrinsic motivation
- Invalid Prompt
- Invalidation count
- Invariant
- Invariant violation rate
- Inverted File Index
- Inverted index
- Inverted list
- Isolation Forest
- Istio
- Item Response Theory
- Iterated Training
- Iterative process
- ib_core
- ib_read_bw
- ib_umad
- ib_write_bw
- ibdiagnet
- ibping
- ibroute
- ibsim
- ibstat
- ibstatus
- ibswitches
- ibv_asyncwatch
- ibv_devinfo
- ibv_fork_init
- ibv_poll_cq
- ibv_post_send
- ibv_rc_pingpong
- ibv_reg_mr
- ibv_set_pkey
- ibverbs-utils
- idempotency
- idempotency key
- idempotent increment
- idle GPU
- if_else
- ignore strategy
- im2col
- image captioning
- image retrieval
- image-to-image
- image-to-image retrieval
- imagebind_llm
- images
- imitation model
- implicit reward
- importance sampling
- importance score
- importance scoring
- in-batch negatives
- in-flight embeddings
- in-memory dictionary
- in-process mock
- incident.io
- include directive
- increase
- incremental ingestion
- incremental insert
- incremental update
- independent draft models
- indexing
- individual rationality
- inductive biases
- inference
- inference time
- inference-time gradient descent
- inference-time scaling
- inference_mode
- infinite loop rate
- inflight requests
- information gap
- infrastructure cost
- ingestion
- ingestion consumer
- ingestion pipeline
- ingestion_error_rate
- initialDelaySeconds
- injection classifier
- input rails
- inspect
- instance type
- instruct model
- instruction
- instruction format
- instructor
- instrumentation
- integral hash
- intel-scipy
- intent
- inter-agent messages
- inter-judge agreement
- inter-rater reliability
- inter-user variability
- interference
- interleaving
- intermediate layers
- intermediate tokens
- intermediate_answers
- interruption overhead
- intervention
- interventions
- intfloat/e5-mistral-7b
- intfloat/e5-small-v2
- intfloat/multilingual-e5
- intfloat/multilingual-e5-small
- invariants
- iostat
- iperf3
- iptables
- irrecoverability
- irrelevant text
- isolation.level=read_committed
- isolcpus
- isotonic regression
- item difficulty distribution
- iterated RLHF
- iteration
- iteration-level scheduling
- iterations
- iterative improvement
J
- JAX
- JIT compilation
- JIT compiler
- JMX/MBeans
- JOIN
- JSON
- JSON Schema validation
- JSON logs
- JSON mode
- JSON model
- JSON over HTTP
- JSON schema
- JSON-LD
- JSON logger
- JSONL
- JTBD
- JUnit
- JUnit XML
- JWT Token
- Jaccard similarity
- Jaeger
- Jaeger connection error
- JaegerExporter
- Jailbreak
- Jailbreak defense
- Jailbreak attacks
- JailbreakBench
- JailbreakV-28K
- Jamba
- Janus
- Jenkins
- Jensen-Shannon divergence
- JetStream
- Jetson
- Jinja2
- Jira
- Jitter buffer
- Judge agent
- JuiceFS
- Jupyter Notebook
- jailbreak chain
- jailbreak robustness
- jailbreak taxonomy
- jieba
- jitter
- jiwer
- joint embedding space
- joint training
- jsonlint
K
- K-factor
- K-means
- KD-Tree
- KEDA
- KG-RAG
- KGW
- KILT
- KL divergence
- KL penalty
- KRaft
- KS-test
- KV cache compression
- KV cache explosion
- KV cache fragmentation
- KV cache management
- KV cache manager
- KV-cache
- KV-cache compression
- KV-cache replication
- KV-cache reuse
- Kafdrop
- Kafka
- Kafka Connect
- Kafka Headers
- Kafka Lag Exporter
- Kafka Log Cleaner Manager
- Kafka Streams
- Kafka compaction
- Kafka lag
- Kafka topic
- Kafka transactions
- Kahneman-Tversky Optimization
- Kaiming initialization
- Kandinsky
- Kata-containers
- Keep-alive
- Kendall's Tau
- Kendall's τ
- Kerberos
- Kernel Duration
- Kernel density estimation
- Kernel launch
- Key
- Key prefixing
- Key-value model
- KeyDB
- KeyError
- Keycloak
- KeywordTable
- Kibana
- Kind
- Kirchenbauer watermarking method
- Kirsch-Mitzenmacher
- Knapsack problem
- Knowledge Graph from Image
- Knowledge Version
- KnowledgeGraphIndex
- Kong
- Kosmos-2
- Krippendorff's Alpha
- Krum
- Kubeflow Pipelines
- Kubernetes
- Kubernetes Admission Controller
- Kubernetes Device Plugin for MIG
- Kubernetes Job
- Kubernetes Jobs
- Kubernetes Secret
- Kubernetes device plugin
- Kubernetes probe
- Kueue
- k1
- k6
- k9s
- kNN
- k_proj
- kafka-python
- kcat
- kernel
- kernel computation
- kernel fusion
- kernel headers
- kernel launch overhead
- kernel trick
- kernels
- key cache
- key distribution
- key extraction
- key words
- keyspace_hits
- keyspace_misses
- knowledge editing
- knowledge graph
- knowledge_version
- known issues
- kube-prometheus-stack
- kube-scheduler
- kube-state-metrics
- kubectl
- kubetest
L
- L-Eval
- L1 cache
- L1/L2 cache
- L2 Cache
- L2 Norm
- L2 Normalization
- L2 distance
- L3 cache
- L4
- L7 load balancer
- LAMB
- LC-QuAD 2.0
- LD/ST
- LDA
- LDAP
- LEGAL-BERT
- LFAnalysis
- LFU
- LGBMRanker
- LID
- LIFO heuristic
- LIMA
- LIME
- LIPS
- LLC-load-misses
- LLM
- LLM API
- LLM Cost
- LLM Eval Toolkit
- LLM Gateway
- LLM Invoker
- LLM assistant
- LLM augmentation
- LLM calibration
- LLM call
- LLM chain
- LLM compiler
- LLM confidence score
- LLM detector
- LLM distillation
- LLM endpoint
- LLM evaluation
- LLM evaluation metrics
- LLM executor
- LLM fingerprinting
- LLM inference
- LLM inference cluster
- LLM kernels
- LLM logging
- LLM memory
- LLM observability
- LLM pipeline
- LLM price
- LLM production
- LLM server
- LLM streaming
- LLM training
- LLM tasks
- LLM cluster
- LLM with memory
- LLM-SR
- LLM-as-Judge
- LLM-as-a-judge
- LLM-as-firewall
- LLM-assessor
- LLM-based detection
- LLM-call classifier
- LLM-firewall
- LLM-generated
- LLM-generated expansion
- LLM-generated hard negative
- LLM-generated hard negatives
- LLM-in-the-loop
- LLM validator
- LLM validation
- LLM classifier
- LLM risk assessment
- LLM applications
- LLM.int8
- LLMChain
- LLMLingua
- LLMOps
- LLMProvider
- LLVM
- LLaMA-2-70B
- LLaMA-Factory
- LLaVA
- LLaVA-Bench
- LM Contamination
- LM head
- LMDB
- LMSys Chatbot Arena
- LOF
- LPDDR5X
- LPU
- LRU
- LRU cache
- LRU eviction
- LSH attention
- LSTM
- LTV
- LaBSE
- Label Studio
- Label flipping
- Label quality
- Label smoothing
- Labelbox
- LakeFS
- Lakera Guard
- Lambda Function
- LambdaMART
- LambdaRank
- Lamini
- Lamport clock
- LanceDB
- LangChain
- LangChain AgentExecutor
- LangChain ConversationBufferMemory
- LangChain Hub
- LangChain Red Teaming
- LangChain Tool Calling
- LangFuse
- LangGraph
- LangServe
- LangSmith
- LangSmith Hub
- Language compliance
- Language detection
- Laplace noise
- Laplace smoothing
- Last hidden state
- Late interaction
- Latency
- Latency SLA
- Latency costs
- Latency hiding
- Latency injection
- Latency p50/p95
- Latency-Correctness Trade-off Inversion
- Latency-based routing
- Latency-sensitive
- Latent Reasoning
- LaunchDarkly
- LayerNorm
- Layered defense
- Layout Analysis
- Layout Optimization
- Layout-Aware Chunking
- Layout-aware parsing
- LayoutLMv3
- LayoutParser
- Lazy creation
- Lazy invalidation
- Leaky ReLU
- Lean
- LeanDojo
- Learnable embeddings
- Learned Index Structures for ANN
- Learned positional embeddings
- Learning Rate Schedule
- Learning Rate Scheduling
- Lease
- Least Confidence
- Least connections
- Least critical first
- Legal document
- Length compliance
- Length normalization
- Length-based curriculum
- Length-based sampling
- Lexical diversity
- Lexical gap
- LiT
- Lifecycle hooks
- LightGBM
- LightLLM
- Lightweight model
- Likelihood Ratio Attack
- Likelihood ratio
- Likert scale
- LimitRange
- Linalg
- Line Chart
- Linear
- Linear Artificial Tomography
- Linear Decay
- Linear SSM
- Linear Scaling Rule
- Linear Transformers
- Linear attention
- Linear heads
- Linear layer
- Linear layers
- Linear warmup + linear decay
- Linformer
- List Preservation
- ListAndWatch
- ListNet
- Listwise evaluation
- LiteLLM
- LiteLLM Router
- LiveBench
- LiveIdeaBench
- LiveKit
- Liveness probe
- Liveness/readiness probes
- Llama
- Llama 3.1 405B
- Llama Guard
- Llama-3-1B
- Llama-3-70B
- Llama-3-8B
- Llama-3-8B-128k
- Llama-3.1-70B
- LlamaCloud
- LlamaIndex
- LlamaIndex Function Calling
- LlamaParse
- Lm-format-enforcer
- LoRA
- LoRA merging
- LoRA rank
- LoReFT
- Load balancer
- Load testing
- Loaders
- Local buffer
- Local reputation
- LocalAI
- LocalExecutor
- LocalQueue
- LocalStack
- LocalStorage
- Locality
- Locality Sensitive Hashing
- Locate-Then-Edit
- Lock acquisition time
- Lock contention rate
- Lock falsification
- Lock hold time
- Locking
- Locust
- Log Aggregation
- Log Cleaner
- Log Parsing
- Log-probability
- LogQL
- Logger
- Logging levels
- Logical KV-blocks
- Logical Replication
- Logical replication slot
- Logit Clipping
- Logit masking
- Logit-based fingerprint
- Logits processors
- LogitsProcessor
- Logprob
- Logstash
- Loki
- Long Context
- Long Context RAG
- Long Range Arena
- Long context reasoning
- Long-context capability
- Long-form
- Long-running operation
- LongBench
- LongLoRA
- LongNet
- Longest common prefix
- Longformer
- Lookahead decoding
- Loop unrolling
- LoraConfig
- Loss
- Loss aversion
- Loss masking
- Loss-based MIA
- Loss-based attack
- Lossless
- Lost in the Middle
- Lost in the Middle prompting
- Low confidence
- Low-level executor
- Low-rank decomposition
- Lowering
- Lua filter
- Lua script
- Lua scripts
- Lucene
- Lunary
- Lustre
- label selector
- label_values
- labeling function
- labels
- langdetect
- language gap
- language representation
- large batch inference
- large batches
- large model
- late fusion
- late-arriving data
- latency SLO
- latency overhead
- latency reduction
- latency requirement
- latency stability
- latent reasoning token
- latent space
- latent space reasoning
- launch overhead
- launch statistics
- layer splitting
- layout detection
- lazy evaluation
- lazy write
- lazy-loading
- leader election
- leakage tracking
- leaky bucket
- learnable representations
- learning curve experiment
- learning from failure
- learning rate
- learning-to-rank
- least-loaded
- lecture search
- left join
- length exploitation
- leniency bias
- libcst
- libibumad
- libibverbs
- library of operations
- librdmacm
- lightweight BERT
- likelihood
- likwid-perfctr
- limited membership
- line coverage
- line-based protocol
- linear complexity
- linear complexity attention
- linear correction
- linear interpolation
- link down
- listwise
- llama-cpp-python
- llama.cpp
- llama3.2:1b
- lm-eval-harness
- lm-evaluation-harness
- lm_evaluation_harness
- lmbench
- lmql
- load prediction
- load shedding
- load time
- load_penalty
- loader.py
- local LLM
- local communication
- lockfile
- log rotation
- log-Mel spectrogram
- log-and-apply
- log-log scale
- log.cleanup.policy=compact
- log_softmax
- logistic regression
- logit lens
- logit-based uncertainty
- logit manipulation
- logits
- logits processor
- logprobs
- logs
- loguru
- long convolutional filters
- long jumps
- long-form answers
- long-running agents
- long-running tasks
- lookahead
- loose filter
- lora_alpha
- losetup
- losing response
- loss landscape
- loss of diversity
- lossy side-effects
- lost requests
- low faithfulness
- low latency
- low-bit quantization
- low-confidence highlighting
- low-quality filtering
- low-rank matrices
- low-rank projection
- lru_cache
- lxml
M
- MAML
- MARGIN mode
- MATTR
- MCP
- MCP Client
- MCP Server
- MCTSAgent
- MCTSNode
- MDLM
- MEGA
- MEMIT
- METEOR
- MHLO
- MIG Manager
- MIG profile
- MIME
- MIME type
- MIPRO
- MIPROv2
- MITRE ATLAS
- MITRE ATT&CK
- MKL
- ML Certification
- ML Engineer
- ML Model Access
- ML pipeline
- ML workload
- ML-based suggest
- MLE
- MLIR
- MLOps
- MLOps pipeline
- MLP Projection
- MLP layers
- MLPerf Inference
- MLaaS
- MLflow
- MLflow Tracing
- MM-Vet
- MMA
- MMBench
- MMDiT
- MMHal-Bench
- MMLU
- MNIST
- MNLI
- MOG2
- MOS
- MPI
- MPICH
- MPP engines
- MPS
- MRR
- MRR@10
- MRR@5
- MS MARCO
- MSCCL
- MSCOCO
- MSE
- MT-Bench
- MTBF
- MTEB
- MTTD
- MTTR
- MTU
- MULTI/EXEC
- MVP
- MXNet
- Machine epsilon
- Magpie
- Mailbox
- Mailpit
- Make
- Makefile
- Mamba
- MambaBlock
- MambaFormer
- Man-in-the-middle attack
- Mann–Whitney U
- Manual
- Manual Review
- MapStore/MapLoader
- Margin Sampling
- Markdown
- Markdown table
- Market-based delegation
- Marlin kernel
- Marquez
- Mask R-CNN
- Masked Image Modeling
- Masking loss
- Materialization
- Materials Project
- Materials Project API
- Math
- Math-500
- MathQA
- Mathlib
- Matplotlib
- Matrix Scaling
- Matrix multiplication
- Matryoshka evaluation
- MatterGen
- Matthews Correlation Coefficient
- Max sequence length
- Max similarity to holdout
- Maximum Calibration Error
- Maximum Mean Discrepancy
- Mean Absolute Error
- Mean pooling
- Mechanistic interpretability
- Median-based aggregation
- Medusa-2
- MegaByte
- Megablocks
- Megatron-LM
- Mel spectrogram
- Mel scale
- Mellanox ConnectX
- MemGPT
- Memcached
- Memorization
- Memorization vs. generalization trade-off
- Memory
- Memory & Persistence
- Memory Bandwidth
- Memory Networks
- Memory Overhead
- Memory Overhead Ratio
- Memory Pattern
- Memory Tuning
- Memory Updater
- Memory poisoning
- Memory pool
- Memory profiling
- Memory prompt
- Memory utilization
- Memory-efficient attention
- Memory-efficient inference
- Memory-optimized ANN
- Mermaid
- Mesa
- MeshOrchestrator
- Message Passing Neural Network
- Message Type
- Message dispatcher
- MessageTransport
- Messages API
- Meta-model
- MetaGPT
- Metadata consistency
- Metadata filtering
- Metadata index
- MetadataReplacementNodePostprocessor
- Metal
- Metric exporter
- Metrics
- MetricsPort
- Micro-interactions
- MicroTVM
- Microcopy
- Micrometer
- Microservice architecture
- Microsoft Counterfit
- Microsoft Graph API
- Microsoft TaskWeaver
- Middleware
- Middleware Chain
- Midjourney
- Milestone completion order
- Milestone completion rate
- Milestone evaluation
- Milestone hit rate
- Milvus
- Min-Max Scaling
- Min-max fairness
- Min.insync.replicas
- MinHash
- MinHashLSH
- MinIO
- MinIO consistency flag
- MiniGPT-4
- Minikube
- Mirostat
- MirrorMaker 2
- Mismatch rate
- Missing details
- Mission-Critical Application
- Mistral
- Mistral Large
- Mistral-70B
- Mistral-7B
- Mistral-7B-Instruct
- Mixtral
- Mixtral 8x22B
- Mixture of Experts
- MobileViT
- Mock API
- Mock LLM
- Mock message bus
- Mock functions
- MockTime
- Mocking LLM
- Modal window
- Model Compiler
- Model Poisoning
- Model Theft
- Model Updates
- Model cards
- Model parallelism
- Model registry
- Model unrolling
- Model warm-up
- Model-based RL
- Module
- Modus ponens
- MongoDB
- MongoDB Change Streams
- Monitor
- Monitoring and logging
- Monitoring stack
- Monotonicity
- Monte Carlo
- Monte Carlo Dropout
- Monte Carlo Tree Search
- Moto
- Mount
- MovedError
- Multi-Agent Orchestration
- Multi-Head Attention
- Multi-Instance GPU
- Multi-Latent Attention
- Multi-Query Attention
- Multi-Task Optimization
- Multi-agent RAG
- Multi-agent workflows
- Multi-hop RAG
- Multi-hop accuracy
- Multi-hop reasoning
- Multi-model support
- Multi-needle
- Multi-region deployment
- Multi-step reasoning
- Multi-step research agent
- Multi-step search
- Multi-tenant LLM serving
- Multi-turn
- Multi-turn attack
- Multi-turn detection
- Multi-vector index
- Multi-vector retrieval
- MultiChainComparison
- MultiWOZ
- Multidimensional IRT
- Multilingual Retrieval
- Multilingual alignment
- Multilingual attacks
- Multilingual audio
- Multinomial Diffusion
- Multipart chunk size
- Multipart upload
- Multiple Heads
- Multiple Sequence Alignment
- Multiple Testing
- Multiple Testing Correction
- Multiple runs
- Multiple sampling
- MultipleNegativesRankingLoss
- Multitask Learning
- Murmur3Partitioner
- MurmurHash3
- Murphy decomposition
- MusicGen
- Mutating admission
- MySQL
- m16n16k16
- m16n64k16
- m16n8k16
- m64n16k16
- m64n64k16
- m8n8k32
- mDNS
- mTLS
- machine unlearning
- maintenance window
- majority voting
- malformed response
- malicious embeddings
- mamba-ssm
- mammoth
- manual commit
- manual reprocess
- manual spans
- many-shot
- map of repo
- mapping
- margin
- marginal value
- mask and insertion
- mask-and-fill
- masked language modeling
- math reasoning
- math_verify
- matrix factorization
- matrix units
- max attention weight
- max entropy
- max probability
- max tokens
- max-batch-prefill-tokens
- max-model-len
- max-num-batched-tokens
- maxLength
- max_attempts
- max_batched_tokens
- max_children
- max_degree
- max_delegations
- max_depth
- max_insert_block_size
- max_iterations
- max_length
- max_locked_memory
- max_new_tokens
- max_num_seqs
- max_position_embeddings
- max_retries
- max_seq_length
- max_split_size_mb
- max_steps
- max_tokens
- maximum
- maximum steps exceeded
- maxmemory
- maxmemory-policy
- mbw
- mdtest
- mechanism design
- mediasoup
- membership inference attack
- memmap
- memmap_threshold_kb
- memory bandwidth bottleneck
- memory bandwidth utilization
- memory bank
- memory binding
- memory blocks
- memory coalescing
- memory compression
- memory consolidation
- memory corruption
- memory coverage
- memory embeddings
- memory footprint
- memory fragmentation
- memory management
- memory planning
- memory reduction
- memory region
- memory savings
- memory stall ratio
- memory stalls
- memory traffic
- memory update
- memory-bound
- memory-speed tradeoff
- message bus
- message pipeline
- message replay
- message_id
- messages
- meta-evaluation
- meta-learning
- meta-llama/Llama-3.2-3B-Instruct
- metric
- metric drift
- metrics counters
- metrics-driven testing
- metrics-server
- micro-VM
- microbatches
- middleware chains
- minLength
- minReplicas
- minimal privileges
- minimum
- missing tool
- mitmproxy
- mixed batch
- mixed precision training
- mixed-modal
- mlnx-ofed-kernel-dkms
- mlx5_core
- mmap
- mock agent
- mock downstream
- mock server
- mock-LLM
- mock workers
- mock provider
- mocks
- modAL
- model
- model chaining
- model depth
- model extraction
- model inversion attack
- model ranking
- model selection
- model stealing attack
- model version
- model weights
- model.unload
- model_name
- moderation rails
- moment retrieval
- momentum
- monitoring delegation
- monitoring errors/latency
- monitoring for LLM applications
- monkeypatch
- monoBERT
- monorepo
- monorepository
- moral reasoning attack
- moving average
- mpirun
- multi-GPU inference
- multi-agent coordination
- multi-agent debate
- multi-agent jailbreak
- multi-agent pipeline
- multi-agent planning
- multi-agent system
- multi-agent verification
- multi-armed bandits
- multi-context storage
- multi-document question answering
- multi-hop QA
- multi-layer DLQ
- multi-layer graph
- multi-modal representation languages
- multi-model
- multi-objective optimization
- multi-primary
- multi-region active-active
- multi-region active-passive
- multi-region failover
- multi-stage build
- multi-stage retrieval
- multi-step agent
- multi-step retrieval
- multi-step scenario
- multi-tenant
- multi-tenant RAG
- multi-tenant isolation
- multi-tenant network
- multi-turn QA
- multi-turn dialogue
- multi-turn scenarios
- multi-turn dialogues
- multi_tool
- multilingual attack
- multimodal LLM
- multimodal agent
- multimodal embedding
- multimodal encoder
- multimodal retrieval
- multimodality
- multiple annotators
- multiple judges
- multiprocessing
- mutation
- mutual information
- mypy
N
- N-gram novelty
- NATS
- NATS CLI
- NCCL
- NCCL_BUFFSIZE
- NCCL_DEBUG
- NCCL_IB_DISABLE
- NCCL_IB_HCA
- NCCL_MAX_NCHANNELS
- NCCL_NCHANNELS
- NCCL_NET_GDR_LEVEL
- NCCL_NTHREADS
- NCCL_PROTO
- NCCL_TIMEOUT
- NDCG
- NDCG@10
- NER
- NER model
- NFKC
- NGINX Ingress
- NIST AI 600-1
- NIST AI RMF
- NLI
- NLI model
- NLLB
- NLP
- NLTK
- NLU
- NMT
- NP-hard
- NPV
- NSFW filtering
- NTK-aware RoPE
- NTP
- NUMA
- NUMA distance
- NVIDIA Container Toolkit
- NVIDIA DCGM Exporter
- NVIDIA GPU Operator
- NVLink
- NVLink 1.0, 2.0, 3.0
- NVLink 5.0
- NVLink Switch System
- NVLink mesh
- NVLink peer access
- NVLink topology
- NVLink-C2C
- NVML
- NVMe
- NVMe Offload
- NVSwitch 4
- NVTX
- NVTX markers
- NaN
- Nadam
- Naive RAG
- Namespace
- Native Protocol
- Native function
- Nats-Msg-Id
- Natural Questions
- NeMo
- NeMo Guardrails
- NeSymReS
- Near-duplicate
- Needle in a Haystack
- Negative Log Likelihood
- NegotiationRequest
- NegotiationResponse
- Neo4j
- Nested cross-validation
- Nested fallback
- Nesterov momentum
- Network Timeout
- NetworkPolicy
- NetworkX
- New Relic
- Nginx
- Nightly tests
- No hallucination
- No upfront
- No-leakage
- NoPE
- Node Graph
- Node pool
- Noise
- Noise Multiplier
- Noising
- Noisy neighbor problem
- Non-Maximum Suppression
- Non-autoregressive inference
- Nonce
- NormalFloat4
- Normalized edit distance
- Notion API
- Nougat
- Novelty
- Null values
- Nullable
- Numba
- Number of deadlocks
- Number of lock retries
- Numerical Weather Prediction
- n-gram
- n-gram overlap
- n-grams
- n8n
- nats-py
- natural language
- natural language bottleneck
- nccl-tests
- ncu
- ndiff
- negation
- negative entropy
- negative prompt
- negative prompting
- negative sampling
- negative transfer
- network
- network partition
- networkx path analysis
- neural network
- neural representations
- neurosymbolic integration
- next step accuracy
- next token prediction
- ngrok
- nlist
- nlpaug
- nltk.word_tokenize
- nn.Parameter
- nn.Sequential
- nnsight
- no-answer scenarios
- no-repeat n-gram size
- no_split_module_classes
- node
- node affinity
- node selector
- node_exporter
- nodes
- noise injection
- noise-based augmentation
- non-autoregressive transformer
- non-blocking
- normalization
- novelty effect
- nprobe
- nsys
- nsys stats
- num_alloc_retries
- num_checkpoints
- num_heads
- num_workers
- numactl
- numastat
- numerical embeddings
- numerical stability
- numexpr
- numpy
- numpy.mmap
- nvbandwidth
- nvcc
- nvidia-container-toolkit
- nvidia-device-plugin
- nvidia-fabricmanager
- nvidia-peermem
- nvidia-persistenced
- nvidia-smi
- nvidia-smi nvlink -s
- nvidia-smi topo -m
- nvidia-uvm
- nvprof
- nvtop
O
- O(n log n) complexity
- O(n) memory complexity
- O(n²) calls
- O(n²) complexity
- O(n²) memory complexity
- OAuth
- OAuth 2.0 Client ID
- OAuth token
- OAuth2
- OAuth2 Scopes
- OFED
- OLE
- OMP_NUM_THREADS
- ONNX
- ONNX Runtime
- OOD encoding
- OOM
- OOV
- OPQ
- OPT
- OPTIC
- ORDER BY
- OTLP
- OTLP exporter
- OWASP
- OWASP Top 10 for LLM
- OWASP Top 10 for LLM Applications
- Object store with Git semantics
- Observability Triad
- Observation of Error
- Obsidian
- Odds Ratio Preference Optimization
- Offline RL
- Offline Store
- Offline features
- Offline preference optimization
- Offload
- Offloading
- Offset management
- Ollama
- On-Demand Instances
- On-demand GPU
- On-policy
- Onboarding flow
- One-time token
- Online Hard Negative Mining
- Online Learned Index
- Online Store
- Online auction
- Online features
- Online fine-tuning
- Online learning
- Online softmax
- Online vs offline
- Ontology
- OpEx
- Opacus
- Open LLM Leaderboard
- Open-weight models
- OpenAI API
- OpenAI Batch API
- OpenAI Embeddings
- OpenAI Evals
- OpenAI Functions
- OpenAI Moderation
- OpenAI Moderation API
- OpenAI Prompt Caching
- OpenAI SDK
- OpenAI Swarm
- OpenAI Triton Inference Server
- OpenAIEmbeddings
- OpenAPI
- OpenAPI specification
- OpenCL
- OpenCLIP
- OpenCV
- OpenLineage
- OpenMetrics
- OpenMined
- OpenModelica
- OpenRouter
- OpenSM
- OpenSearch
- OpenTelemetry
- OpenTelemetry Python SDK
- OpenTelemetry collector
- OpenVINO
- OpenWeatherMap API
- OpenWebText
- Operability
- Operation coverage
- Operational Excellence
- Operational Intensity
- Operational Readiness Review
- Operational Review
- Operational Reviews
- Operational debt
- Operator
- Operator Satisfaction Score
- Opsgenie
- Optical Flow
- Optimal checkpointing
- Optimal specification depth
- Optuna
- Orchestration SAGA
- Orchestrator
- Orchestrator pattern
- Orchestrator-Workers
- Originality
- Orleans
- Orthogonal Procrustes
- Orthogonal initialization
- Out-of-knowledge query
- Out-of-order events
- Outbox pattern
- Outcome Reward Model
- Outlier detection
- Outlier score
- Output Parser
- Output manipulation
- Over-decomposition
- Over-expansion
- Over-provisioning
- Overage
- Overcorrection
- Overfitting
- Overfitting detection
- Overlap
- Overlay network
- Overprovisioning
- Overreliance
- Oversampling
- Oversubscription
- o_proj
- obfuscated code
- object detection
- object swapping
- observation
- occupancy
- occupancy requirements
- off-peak scheduling
- off-policy
- offline batch inference
- offline evaluation
- offline migration
- offline training
- offline metrics
- offset
- on-call
- on-call rotation
- on-chip memory
- on-demand price
- on-disk payload
- on_failure_callback
- onboarding
- one-class SVM
- one-hot
- online decoding
- online evaluation
- online inference
- online reinforcement learning
- online metrics
- online/offline feature consistency
- onnxruntime-genai
- open-ended task evaluation
- open_clip
- operator optimization
- ops/sec
- optimistic locking
- optimizer
- optimizer sharding
- optimizer state
- optimizer step
- optimizers_config
- optional fields
- orchestration
- order sensitivity
- ordering
- orthogonal transformation
- out of domain
- out_of_scope
- outlier-aware scaling
- outliers
- outlines
- output filtering
- output parsers
- output_scores
- over-constraining
- over-prompting
- over-pruning
- over-refusal
- over-specification
- overconfidence
- overflow
- overhead
- overhead ratio
- overoptimization
- overthinking
P
- P&L
- P-tuning v2
- P90
- PAE
- PAEF
- PAIR
- PATE
- PC algorithm
- PCA
- PCIe
- PCIe Gen5
- PCIe bottleneck
- PCIe fallback
- PCIe root
- PCIe switch
- PCIe transfers
- PDDL
- PEP 8
- PG-19 dataset
- PGD
- PHB
- PHI
- PII
- PII Detection
- PII leakage
- PII masking
- PII rate
- PII redaction
- PIX
- POC
- POPE
- PPOTrainer
- PR
- PReLU
- PSI
- PTX
- PWWS
- PXB
- PYTORCH_CUDA_ALLOC_CONF
- PaLM
- PaLM 2
- Pachyderm
- Packet loss
- Pact
- Pad Tokens
- PaddleOCR
- Page-Hinkley
- PageRank
- Paged Attention
- Paged Optimizers
- PagerDuty
- Pair representation
- Pairformer
- Pairwise attention
- Pairwise comparison
- Pairwise cosine distance
- Pairwise distance
- Pairwise loss
- PandasLFApplier
- Pandera
- Parallel fallback
- Parallel prefix sum
- Parallel scan
- Parallelization
- Parameter-Efficient Fine-Tuning
- Parameterized query
- Paraphrasing attack
- Paraphrasing query
- Parent Document Retrieval
- Parent-child retrieval
- Pareto analysis
- Pareto frontier
- Pareto principle
- Parquet
- Parrot
- Parser
- Partial Harnessing
- Partial hypotheses
- Partial upfront
- PartialHarness
- PartialPlan
- Particle Swarm Optimization
- Partition
- Partitioning
- Pass
- Pass Rate
- Pass@1
- Pass@k
- Patch Embedding
- Patch match
- Path Accuracy
- Path Efficiency
- Path traversal
- Path-level evaluation
- Path-level metrics
- Pathlib
- Paxos
- Payback period
- Payload
- Payload index
- Payload splitting
- Payment rule
- Peak memory
- Pearson correlation
- Peer-to-Peer
- Peer-to-peer delegation
- PeftMixedModel
- PeftModel
- Pegasus
- PendingAction
- Penetration Testing
- Per agent rate limiting
- Per channel rate limiting
- Per priority rate limiting
- Per-agent limit
- Per-token latency
- Per-token quantization
- Perceived latency
- Percent agreement
- Performance Drift
- Performer
- Permutation
- Permutation test
- Perplexity
- Perplexity change
- Perplexity filtering
- Perplexity gain
- Persistence
- Personalization
- Perspective API
- Perturbation Rate
- Pessimistic Scenario
- Pet-project
- Phantom
- Phi-2
- Phi-3-mini
- Physical KV-blocks
- Physical isolation
- Pickle
- Pilot set
- Pinecone
- Pinned memory
- Pipeline bubble ratio
- Pipeline flush
- Pipeline parallelism
- Piper
- Pitch Deck
- Pixie
- Plan
- Plan Accuracy
- Plan Completeness
- Plan Correctness
- Plan Efficiency
- Plan coherence
- Plan deviation score
- Plan manipulation
- Plan quality
- Plan-and-Execute
- Plan-and-Solve
- PlanAndExecute
- Planner
- Planner/Executor Architecture
- Planning alignment
- Platt scaling
- Playground
- PlotQA
- Plotly Dash
- Pod
- Pod Disruption Budgets
- Pod priority
- Poetry
- Point estimate
- Point-in-time recovery
- Point-to-point communication
- Poisson arrival
- Policy
- Policy as code
- Policy evaluation
- Polly
- Pool
- Pooling
- Popular POPE
- Porter stemmer
- Portkey
- Position Encoding
- Position Interpolation
- Position bias
- Position-Based Model
- Position-aware metrics
- Post-filter
- Post-filtering
- Post-flight check
- Post-hoc Calibration
- Post-hoc rationalization
- Post-ingestion checks
- Post-processing
- Post-processing filters
- Post-retrieval
- Post-training quantization
- PostHog
- PostgreSQL
- Postman
- Power law
- PrOntoQA
- Pre-baked responses
- Pre-fill
- Pre-filtering
- Pre-flight check
- Pre-ingestion checks
- Pre-push hook
- Pre-retrieval
- PreStop hook
- Precision exceptions
- Precision/Recall
- Precision@5
- Precomputed features
- Predictive scaling
- Preemption by recomputation
- Preemption by swap
- Prefect
- Preference tuning
- Prefix injection
- Prefix-tuning
- PrefixSpan
- Presidio
- Pricing model
- Primary
- Primary Key
- Principle of Least Privilege
- Priority
- Priority (Weighted) Routing
- Priority = bid / compute
- Priority ceiling
- Privacy Accounting
- Privacy attacks
- Privilege escalation
- Probabilistic Output
- Probing
- Process
- Process reward model
- Processing time
- Procfs
- Prodigy
- Producer
- Product Manager
- Product Quantization
- Product Quantization (PQ) parameters
- ProductQuantizer
- Profit margin
- Profitability
- Progressive Neural Networks
- Progressive training
- Projection
- Projection into LLM space
- Prolog
- PromQL
- Prometheus
- Prometheus + Grafana
- Prometheus API
- Prometheus Alertmanager
- Prometheus Blackbox Exporter
- Prometheus TSDB
- Prometheus client
- Prometheus scrape interval
- Prometheus-2
- Prompt Engineer
- Prompt Management
- Prompt Regression Testing
- Prompt Security
- Prompt Tuning
- Prompt building
- Prompt chaining
- Prompt compression
- Prompt conditioning
- Prompt engineering
- Prompt fragility
- Prompt injection
- Prompt lifecycle
- Prompt manifest
- Prompt testing strategies
- Prompt-based guardrails
- Prompt-tuning
- PromptBench
- PromptInject
- PromptLayer
- PromptLinter
- PromptTemplate
- Promptfoo
- Promtail
- Proof-of-personhood
- Prophet
- Protein language modeling
- Protobuf
- Protocol verification
- Prototype
- Proximal Policy Optimization
- Proxy Goal Convergence
- Proxy_buffering
- Pruning heads
- Pseudo-relevance feedback
- Ptrace
- PubSub
- PubTables-1M
- Publisher confirms
- Pulumi
- PyArrow
- PyFlink
- PyMuPDF
- PyPDF2
- PyRIT
- PySR
- PySceneDetect
- PySpark
- PySyft
- PyTorch
- PyTorch Geometric
- PyTorch Lightning
- PyTorch Profiler
- PyYAML
- Pydantic
- Pydantic BaseModel
- Pygame
- Pyodide
- Pytest fixtures for LLM prompts
- Python SDK
- Python control flow
- p2pBandwidthLatencyTest
- p50
- p95
- p99
- p99/p50 ratio
- packing sequences
- padded sequences
- page cache
- page swapping
- paged optimizer
- paired t-test
- pairwise
- pairwise agreement
- pairwise comparisons
- pairwise embedding distance
- pairwise ranking
- pairwise ranking loss
- pandas
- pandas DataFrame
- parallel branching
- parallel forward pass
- parallel verification
- parallelism
- parallelizability
- parameters
- parent span ID
- parent-child chunks
- parsing
- partial data
- partial failure UI
- partial-response rate
- partition function
- partition key
- partition tolerance
- patch
- patch encoder
- pattern detection
- pattern matching
- pay-per-token
- pay-per-use
- payload indexes
- pdfminer.six
- pdfplumber
- pdsh
- peeking
- peer-to-peer bandwidth
- peer-to-peer interaction
- per agent
- per-channel scaling
- per-feature cost breakdown
- per-tensor scaling
- percentile
- perceptual loss
- perf stat
- performance
- performance tests
- perfquery
- permission_denied
- permutation invariance
- perplexity analysis
- perplexity anomaly
- perplexity-based detector
- persona modulation
- perturbation
- perturbation consistency
- perturbation-consistency
- pessimistic locking
- pg_notify
- pg_partman
- pgcrypto
- pgoutput
- pgvector
- phased rollout
- physical attack
- pika
- pin_memory
- pipeline architecture
- pipeline bubbles
- piper.cpp
- pivot_root
- placeholder
- placeholders
- plain text
- planning
- planning model
- plotly
- plugins
- plural
- pod_count
- point-in-time
- point-in-time correctness
- pointwise
- pointwise fusion
- policies
- policy gradient
- polling
- port-forward
- position bias ratio
- positional invariance
- posix_memalign
- post-hoc correction
- post-hoc explanation
- post-norm
- post-processing filter
- postmortem
- power analysis
- pre-commit hook
- pre-normalization
- pre-tokenization
- pre-training
- pre/post conditions
- precision
- precision-recall
- precision@k
- precomputed norms
- preconditions and effects
- predicated execution
- predicated instructions
- predict_linear
- predict_proba
- preemption
- preemption overhead
- preference agreement
- preference data collection
- preference distributions
- preference simulation
- preferred trajectory
- prefill
- prefill stage
- prefix caching
- prefix hashing
- prepositions
- presence penalty
- pricing per token
- primacy effect
- primary storage
- primary/secondary replication
- prioritization
- priority inheritance
- priority inversion
- priority queuing
- priority-based scheduling
- privacy by design
- proactive replacement
- probabilistic early recomputation
- probabilistic invalidation
- probabilistic label
- probabilities
- probability distribution
- probe_duration_seconds
- probe_success
- producer failure
- production
- production ML system
- production evaluation
- production incident
- production logs
- production readiness
- profile
- profiler
- profiling
- program
- program compilation
- programmatic labeling
- progress bar
- progressive disclosure
- projection matrix
- prometheus_client
- prompt
- prompt adaptation
- prompt completion ratio
- prompt composition
- prompt diff
- prompt hardening
- prompt hash
- prompt language
- prompt leakage
- prompt lineage
- prompt linting
- prompt observability
- prompt regression suite
- prompt rewriting
- prompt rollback
- prompt stealing
- prompt tokens
- prompt versioning
- prompt_hash
- prompt_template_schema.json
- promptlint
- prompts engineering
- proof-of-work
- propagators
- property-based testing
- property-based tests
- prospect theory
- proven theorems rate
- provider switching
- provisioning
- proxy API
- proxy metrics
- proxy reward
- proxy model
- pruning search trees
- pseudo-labels
- psutil
- psychological safety
- psycopg2
- pull-based
- pull model
- purple team
- pushgateway
- pwr
- py-spy
- pybloom_live
- pybreaker
- pyirt
- pylint
- pymatgen
- pyproject.toml
- pyrate-limiter
- pyreft
- pytest
- pytest-asyncio
- pytest-cov
- pytest-html
- pytest-httpx
- pytest-langchain
- pytest-mock
- pytest-rerunfailures
- pytest-timeout
- pytest-xdist
- python-docx
- python-json-logger
- python-pptx
- pytrec-eval
- pyvis
Q
- Q-Former
- Q-value
- QA
- QA-based evaluation
- QA-based verification
- QAMPARI
- QASA
- QEMU/KVM
- QK-normalization
- QK^T
- QLoRA
- QPS
- QPS per shard
- Qdrant
- Qdrant Cloud
- Qdrant filter conditions
- QdrantClient
- QoS
- Qoder
- QuIP
- Quadratic bottleneck
- Quality degradation
- Quality gates
- Quantization
- Quantization-aware training
- Quarantine
- Quasar
- Query
- Query Complexity Classifier
- Query Tokens
- Query embedding
- Query rewriter
- Query routing
- Query-document alignment
- Query/Key/Value vectors
- QueryEngine
- Quest
- Queue Pair
- Queue length
- QuickCheck
- Quota
- Qwen 2.5 1.5B
- Qwen-VL
- Qwen2-1.5B
- Qwen2.5 72B
- Qwen2.5-1.5B
- Qwen2.5-1.5B-Instruct
- Qwen2.5-7B
- Qwen2.5-MoE
- q_proj
- qdrant-client
- qrels
- quality
- quality metrics
- quality score
- quality-cost curve
- quantization-aware scaling
- quantized
- quantized target
- quantized verification
- quantlib
- query complexity distribution
- query expansion
- query latency
- query metrics
- query reformulation
- query set
- query-positive-negative triplet
- query_range API
- query_type
- question distribution
- question generation
- questions
- queue
- queue length monitoring
- queue-based escalation architecture
- queue.Queue
- queue_latency
- quorum
R
- RAG
- RAG Corpus
- RAG agent
- RAG chains
- RAG evaluation
- RAG indexing
- RAG orchestrator
- RAG pipeline
- RAG poisoning
- RAG-bot
- RAG prefix
- RAGAS
- RAGEngine
- RAPTOR
- RASA
- RAdam
- RC connection
- RCA
- RCCL
- RDB
- RDB preamble
- RDF
- RDMA
- RDMA Read
- RDS
- RDTSC
- README.md
- REALM
- RED metrics
- REDIS SCAN
- REINFORCE
- REST
- RGB
- RICE framework
- RL update
- RL4LMs
- RLAIF
- RLHF Evaluation Suite
- RLlib
- RMSD
- RMSE
- RMSNorm
- RMSProp
- RNN
- ROC curve
- ROC-AUC
- ROCE v2
- ROCProfiler
- ROCm
- ROI
- ROME
- ROUGE
- RPC queue
- RPO
- RRF
- RSS Feed
- RTO
- RTSP
- RTT
- RTX 4090
- RULER
- RWKV
- RabbitMQ
- Radix tree
- RadixAttention
- Raft
- Random
- Random Forest
- Random POPE
- Random Search
- Random assignment
- Random injection
- Random projections
- Random seed sampling
- Random swap
- Randomisation of prompts
- Randomized Smoothing
- Rank-Based Normalization
- Rank-one update
- RankNet
- Rare Tokens
- RateLimitExceeded
- Rating
- Ray
- Ray Serve
- Re-planning
- Re-prompting
- Re-ranker
- Re-reading policy
- ReAct Agent
- ReAct prompt
- ReFT
- ReLU
- ReLU attention
- Reactive scaling
- Read-after-write consistency
- ReadOnlyRootFilesystem
- Readiness probe
- Real-time ingestion
- Real-time video understanding
- Real-time voice agent
- Real-time document processing
- Real-time features from Kafka
- Reasoning
- Reasoning depth
- Reasoning errors
- Reasoning via Planning
- Recalibration
- Recall
- Recall exceptions
- Recall@100
- Recall@5
- Recall@k
- Recency
- Receptance
- Reciprocal Rank
- Recomputation-based preemption
- Reconciliation
- Reconnection strategy
- Record
- RecordException
- Recording
- Recovery actions
- Recovery rate
- Recurrent Block
- Recurrent Depth
- Recurrent GPT
- Recurrent Memory Transformer
- Recurrent operation
- Recurrent vs parallel computation
- Recursive
- RecursiveCharacterTextSplitter
- RediSearch
- Redis
- Redis Cluster
- Redis Enterprise CRDB
- Redis INFO
- Redis KV-cache
- Redis Keyspace notifications
- Redis List
- Redis Lock
- Redis PubSub
- Redis Queue
- Redis Redlock
- Redis Sentinel
- Redis Sets
- Redis Stack
- Redis Streams
- Redis pipeline
- Redis replication
- Redis-based rate limiter
- RedisCluster
- Redpanda Schema Registry
- Redrive Policy
- ReduceScatter
- Reducer
- Reference architecture
- Reference point
- Reference-based attack
- Reflexion
- Reformer
- Refusal on OOD
- Refusal testing
- Refusal to answer
- Regex filters
- Regions
- Registers
- Registry
- Registry service
- Rego
- Regression rate
- Regularization
- Reinforcement Learning
- Reinforcement Learning for Index Tuning
- Reinforcement Learning from Human Feedback
- Reinforcement Learning with Explanation Reward
- Relative Position Encoding
- Relay
- Relay IR
- Relay bus
- RelayCaching
- Relevance check
- Reliability Engineering
- Reliability diagram
- Reloader
- Remote Key
- Remote Procedure Call
- Rendezvous hashing
- Repeat rate
- ReplacingMergeTree
- Replay of real dialogues
- Replicate API
- Replication factor
- Replication lag
- RepoCoder
- Representation Level
- Reptile
- Reputation Score
- Reputation scores
- Request Count
- Request ID
- Request classification
- Request-level scoping
- ResNet
- ResNet-18
- ResNet-50
- Resampler
- Reserve price
- Reserved GPU
- Residual Vector Quantization
- Residual dropout
- Resilience4j
- Resolution rate
- ResourceFlavor
- ResourceQuota
- Response Consistency
- Result validation
- Retention Rate
- RetinaNet
- Retrieval Quality
- Retrieval agent
- Retrieval metrics
- Retrieval success rate
- Retrieval-Generation Correlation
- RetrievalQA
- Retrospective analysis
- Retry Topic
- Retry count
- Retry storm
- Retry with deduplication
- Retry with exponential backoff
- Revenue
- Revenue Streams
- Revenue Structure
- Reverse Instruction
- Review
- Reward
- Reward Normalization
- Reward Scaling
- Reward score
- Reward shaping
- Rewrite prompt
- Rewrite-Retrieve-Read
- Ring
- Ring Attention with Load Balancing
- Ring all-reduce
- RiskEx
- Riva
- RoBERTa
- RoCE
- RoPE
- Robust child
- Robustness Evaluation
- Robustness Gym
- Robustness Score
- Robustness@k
- RocksDB
- Role
- Role-based
- Role-based decomposition
- Role-play / persona
- Role-play attack
- RoleBinding
- Roleplay jailbreak
- Roles
- Rollback frequency
- Rollback loop
- Rolling Buffer Cache
- Rolling deployment
- Rolling update
- Rollout policy
- RonDB
- Root span
- Round-robin
- Route53
- Router
- Router Collapse
- Router LLM
- Router prompt
- Routing entropy
- Row-level locking
- Row-wise
- RuBERT NLI
- RuBERT-score
- RuShareGPT
- RuTurboAlpaca
- Rule-based classifier
- Rule-based executor
- Rule-based filtering
- Rule-based routing
- Rule-based suggest
- Run-time verification
- Running queue
- Runtime detection
- Runtime validation
- RuntimeError
- Russian SuperGLUE
- Rényi DP
- race condition
- race condition prevention
- radius perception
- rainbow teaming
- ramp
- random deletion
- random embeddings
- random features
- random graph
- random insertion
- random token drop
- random walk
- rank_bm25
- ranking
- ranking improvement
- ranx
- rare classes
- rare languages
- rare queries
- rare trajectories
- rate
- rate limiting
- rate limits
- rate query
- raw trajectory
- razdel
- rdma-core
- rdma_rxe
- read replicas
- read-after-write
- read-only filesystem
- read-only index
- read-only mode
- read-only rootfs
- readiness delayed
- real data
- real data mixing
- real-time RAG
- real-time factor
- real-time monitoring
- reasoning degradation
- reasoning models
- reasoning schema
- reasoning steps
- recall@1
- recency effect
- receptive field
- recompilation overhead
- recomputation
- reconstruction error
- record_shapes
- recording rules
- recovery time
- recurrence
- recurrent backpropagation
- recurrent memory
- recursive reduction
- red list
- red teaming
- red teaming certification
- red teaming evaluation
- red teaming loop
- redis-benchmark
- redis-cell
- redis-cli
- redis-py
- redis_exporter
- reduce
- reduce-scatter
- reduction fusion
- reference policy
- reference models
- reflection
- reflection loops
- reflection module
- refresh interval
- refreshInterval
- refusal
- refusal hacking
- refusal rate
- refusal suppression
- regex
- region affinity
- register pressure
- regression threshold
- regret
- regularization retrieval
- rehearsal
- reindex
- rejection sampling
- rejection tokens
- relative degradation
- relative improvement
- relative order
- relevance score
- relevance signal
- reliability
- repair_rate
- repeated error
- repetition penalty
- replay buffer
- replica
- replicas
- reply_to queue
- representation engineering
- representation levels
- reprocess strategy
- reputation decay
- reputation system
- request batching
- request-response
- request_rate
- requests
- required checks
- required field
- required_variables
- requirements.txt
- reranking
- resampling
- reserved field
- reset timeout
- resharding
- residual blocks
- residual connection
- residual connections
- residual stream
- residual vectors
- resource cleanup
- resource limits
- resource manager
- response safety
- response_quality_score
- responses
- respx
- result verification
- retention policy
- retrieval
- retrieval context
- retrieval degradation
- retrieval latency
- retrieval logs
- retrieval miss
- retrieval pipeline
- retrieval results distribution
- retrieval strategy
- retrieval-based hard negative
- retrieval-based hard negatives
- retrieval.latency_ms
- retrieved_chunks
- retrospective
- retry
- retry rate
- retry storm mitigation
- retry_delay
- retryable / non-retryable
- revenue per request
- reverse proxy
- revision
- reward correlation
- reward delay
- reward hacking
- reward model
- ring attention
- risk assessment
- risk score
- robust aggregation
- robust training
- robustness
- robustness to overfitting
- role differentiation
- role prompting
- role-play
- role-play jailbreak
- rollback
- rollback delegation
- rolling baseline centroid
- rolling cache
- rolling restart
- rollouts
- roofline model
- rotation matrix
- rouge-score
- router model
- routing
- row-based retrieval
- rowmax
- ruBERT
- ruGPT-3.5
- ru_core_news_lg
- rubric
- rubric-based evaluation
- ruff
- rule-based checks
- rule-based reward
- rule-based reward model
- rule-based validation
- rule-based systems
- rule_files
- rules
- run_id
- runaway costs
- runbook
- runtime
- runtime tracing
S
- S3
- S3 Glacier
- S3 Versioning
- S3 consistency
- S3 events
- S3 timeout
- S4
- S5
- SAGA pattern
- SAT
- SAT solver
- SBOM
- SCAN
- SCF
- SCROLLS
- SDXL
- SELECT ... FOR UPDATE
- SENTINEL Tokens
- SETP
- SFT
- SFT Model
- SFTTrainer
- SGD
- SGDClassifier
- SGLang
- SHA-256
- SHAP
- SIEM
- SIFT1M
- SIGKILL
- SIGTERM
- SIMD
- SIMT
- SLA
- SLA compliance
- SLERP
- SLI
- SLO
- SLO violation rate
- SLO-driven
- SLURM
- SM
- SM occupancy
- SMTP
- SNLI
- SOAR
- SPADE
- SPANN
- SPARQL
- SPARQLWrapper
- SPEC.md
- SPICE
- SPIN
- SPLADE
- SQL
- SQL schema
- SQL injection
- SQLAlchemy
- SQLAlchemy 2.0
- SQLAlchemy async sessions
- SQLDatabaseChain
- SQLTableNodeMapping
- SQLite
- SQS
- SQuAD
- SQuAD 2.0
- SRE
- SROIE
- SSD
- SST-2
- STIX
- STL decomposition
- STRIPS
- STaR
- SUID
- SUPR-Q
- SWE-agent
- SWE-bench
- SaaS
- Safe retries
- Safetensors
- Safety & Guardrails
- Safety Valve
- Safety fine-tuning
- Safety/security
- SafetyBench
- SageMaker
- SageMaker Batch Transform
- Saiga
- Salient weights
- Sama
- Sample Efficiency
- Sampler
- Sanitizing parsing
- Sanity check
- Saturation point
- Savings Plans
- ScaNN
- Scalar quantization
- Scale
- Scale AI
- Scale-to-zero
- Scaled dot-product attention
- ScaledObject
- Scaling Laws
- Scatter-Gather Element
- Scenario
- Scenario-based routing
- Scene Graph
- Scene Graph Generation
- Scheduled RI
- Scheduler policy
- Schema Compatibility
- Schema compliance
- Schema registry
- Schema-Activated In-Context Learning
- SciPy
- Scientific formalization
- Scissorhands
- Scope
- Score normalization
- Scorer
- Scorers
- Scoring rubric
- Scratch
- Scrubbing
- Seaborn
- Sealed Secrets
- Search engineering
- SecAgg
- SecAgg+
- Seccomp
- Second-price auction
- Secondary
- Secret sharing
- SecretStore
- Secrets Store CSI Driver
- Section Recall@k
- Secure Aggregation
- Secure Multi-Party Computation
- Seed pool
- Segment caching
- Seldon
- Seldon Core
- Selection
- Selective Attention
- Selective Context
- Selective checkpointing
- Selective memory
- Selective scan
- Selective shedding
- Selective state space
- Self-Ask
- Self-Debugging
- Self-QA
- Self-RAG
- Self-Speculative Decoding
- Self-Supervised Loss
- Self-contained query
- Self-critique through pairwise
- Self-enhancement bias
- Self-hosted LLM
- Self-hosted models
- Self-improvement
- Self-instruct
- Self-paced Learning
- Self-reflection
- Self-schema generation
- SelfCheckGPT
- Semantic Caching
- Semantic Coherence Score
- Semantic Kernel
- Semantic Versioning
- Semantic chunking
- Semantic coherence
- Semantic distance
- Semantic diversity
- Semantic duplicate
- Semantic function
- Semantic gap
- Semantic idempotency
- Semantic loop detection
- Semantic similarity check
- SemanticChunker
- SendHandle
- Sensitive Info Disclosure
- Sensitivity Table
- Sensor
- Sentence-level NLI
- Sentence-level attack
- SentencePiece
- SentenceTransformers
- Sentry
- Separate indices
- Separator
- Sequence matching
- Sequence mining
- Sequence number
- Sequence numbers
- Sequence of steps
- Sequence-level confidence
- Sequential chain
- SerpAPI
- Server-Sent Events
- Serverless compute
- Service
- Service Account
- Service Graph
- ServiceAccount
- ServiceMonitor
- Serving API
- Serving infrastructure
- Session Management
- Session Middleware
- Session Store
- Session Worker
- Session window
- Session-level scoping
- SetRank
- Setex
- Setnx
- Shadow mode
- Shadow testing
- Shadowing
- Shape specialization
- Shapiro-Wilk
- Sharded cache
- ShardingStrategy
- ShareGPT / OpenAssistant / Dolly
- Shared Tokenizer
- Shared context protocol
- Shared plan graph
- Shared prefix
- Shell
- Shikra
- Shingle
- Side channel
- Side output
- Sidecar
- Siege
- SigLIP
- Sigmoid
- Signal alarm
- Signature
- SignatureOptimizer
- SimCTG
- Similarity search
- Simple Preference Optimization
- SimpleSpanProcessor
- Simpson's paradox
- SimPy
- Simulation testing
- Simulink
- Single Responsibility Principle
- Single representation
- Singleflight
- Sinusoidal Positional Encoding
- Sinusoidal encoding
- SipHash
- Skeletonization
- Skill adoption rate
- Skill success rate
- Skills
- Skweak
- Slack
- Slack API
- Slack Block Kit
- Slack Bot Token
- Slack Events API
- Slack webhook
- Sleep window
- Sliding window cache
- Sliding window chunking
- Slots
- Small Initialization
- Small world networks
- Smoke Test
- SnapKV
- Snapshot isolation
- Snorkel
- Social choice aggregation
- Social welfare
- Soft constraints
- Soft labels
- Soft watermarking
- Soft-RoCE
- Soft-embedding
- Soft-label
- SoftRoCE
- Softmax
- Softmax Overflow
- Softmax saturation
- SoundStream
- Source ID
- Source Verification
- Source weight
- Span
- Span Masking
- Span attributes
- Span status
- Spanner
- Spark
- Spark Structured Streaming
- SparkSubmitOperator
- Sparse Autoencoders
- Sparse Embedding
- Sparse Transformers
- Sparse computation
- Sparse file
- Sparse rewards
- Speaker Diarization
- Spearman correlation
- SpecAugment
- Specificity
- Spell correction
- Spell-checker
- Sphinx
- Spider
- Spin the wheel
- Spinnaker
- Split
- Splunk
- Spoofing attack
- Spot Fleet
- Spot GPU
- Spot Instances
- Spot termination
- Spring Cloud Contract
- Stability AI API
- StabilizationWindowSeconds
- Stable Diffusion
- Stable Diffusion 3.5
- Stable-Baselines3
- StableHLO
- Stan
- Standard RI
- StandardScaler
- Startup probe
- Starvation
- State Bloat
- State Graphs
- State Manager
- State Recovery
- State Schema
- State Space Model
- State graph
- State machine
- State reconstruction
- State snapshots
- State space exploration
- State store
- State verification
- StateGraph
- Stateful
- Stateful testing
- Stateful workflow
- Stateless
- Stateless RAG
- Static Quantization
- Static partitioning
- Statistical tests
- Steady State
- Step Latency
- Step Order Accuracy
- Step Success Rate
- Step accuracy
- Step-level supervision
- Step-level training
- Sticky sessions
- Stigmergy
- Stochastic depth
- Stochastic speculative decoding
- Stochasticity
- Storage costs
- StoryBench
- Stratification
- StreamReader / StreamWriter
- Streaming
- Streaming ASR
- Streaming Ingestion
- Streaming TTS
- Streaming deduplication
- Streaming parsing
- Streaming pipeline
- StreamingCallbackHandler
- StreamingLLM
- StreamingResponse
- Streamlit
- Strimzi
- Stripe API
- StripedHyena
- Strong consistency
- Structural consistency
- Structure preservation
- Structured Format
- Structured Prompting
- Structured extraction
- Structured table formats
- Student Agent
- Style Consistency Score
- Subgoal completion rate
- Subgraph
- Subgraph Retrieval Precision
- Subgraph retrieval recall
- Subject
- Subprocess
- Subscription
- Subtask
- Subtask Completion
- Success rate
- Successful task completion rate
- Summarise prompt
- SummarizerMemory
- SummaryIndex
- SuperGLUE
- Supervised autonomy
- Supply Chain
- Supply Chain Vulnerabilities
- Suppress Tokens
- Surge AI
- Swap
- Swap-based preemption
- Swapped queue
- SwiGLU
- Swish
- Switch Transformer
- Sybil attack
- Sybil protection
- SymPy
- Symbolic consistency
- Symlink
- Symmetric quantization
- SyncBatchNorm
- Synonym swap
- Synonymizer
- Synthesis
- Synthesizer
- Synthetic batch
- Synthetic dataset
- Synthetic file
- Synthetic load
- System cards
- System prompt hardening
- s3fs
- sacrebleu
- safari
- safety
- safety alignment
- safety benchmarks
- safety case
- safety filter
- safety valves
- safety-utility trade-off
- saliency maps
- sample ratio mismatch
- sample size
- sample size determination
- samples
- sampling
- sampling probability
- sandbox escape
- sandwich technique
- sanitizer
- saturation analysis
- saturation gap
- save time
- scalability
- scalar product
- scalar rating
- scale-and-add
- scale-up/down
- scaling factors
- scapy
- scenario attack
- scene detection
- schedule-based scaling
- schedule_interval
- scheduled retraining
- scheduler extender
- schema drift
- schema evolution
- schema resolution
- schema validation
- schema-valid data
- schemaless
- scikit-activeml
- scikit-learn
- scikit-optimize
- scipy.integrate.solve_ivp
- scipy.optimize.minimize_scalar
- scipy.spatial.distance
- scipy.stats
- scipy.stats.entropy
- score_threshold
- scoring
- scrape
- scrape interval
- scrape_config
- scrape_configs
- scrape_interval
- seasonality
- second opinion
- secret rotation
- secure containers
- seed
- seed examples
- seed facts
- seed.py
- segments
- selective activation recomputation
- selective pruning
- selectivity
- self-BLEU
- self-chat
- self-correcting LLMs
- self-correction
- self-correction loop
- self-diagnosis
- self-healing
- self-healing pipeline
- self-hosted
- self-improvement loop
- self-judge
- self-organization
- self-play
- self-reported incidents
- self-supervised tool use
- self-supervision
- self-training
- semantic HTML
- semantic cache
- semantic comparison
- semantic compression
- semantic conventions
- semantic drift
- semantic entropy
- semantic ranking
- semantic tag
- semantic watermark
- semaphore
- sensitive data
- sensitivity analysis
- sensitivity curve
- sent_tokenize
- sentence embeddings
- sentence-level confidence
- sentence-level evaluation
- sentence-transformers
- sentence-transformers/all-MiniLM-L6-v2
- sequence
- sequence alignment
- sequence classification
- sequence graph analysis
- sequence mode
- sequence parallelism
- sequence slots
- sequential delegation
- sequential testing
- serialization
- serverless
- service mesh
- service name
- serving framework
- session
- session history
- session memory
- session replay
- session state
- session.timeout.ms
- session_id
- severity
- severity classification
- shadow model
- shadow traffic
- shaped reward
- shard key
- shard utilization
- sharding
- shared layers
- shared prefixes
- shared state
- sharp minima
- sharpening
- shell access
- shortcuts
- shuffle instructions
- sidecar pattern
- sigmoid loss
- signal
- silhouette score
- simhash
- simulation
- simulation mode
- simulation-based verification
- simulator
- single-stage autoregressive transformer
- single_tool
- sink ratio
- sink tokens
- size penalty
- skew
- skill
- skill library
- skip-grams
- sklearn.metrics
- sklearn.metrics.ndcg_score
- slack-sdk
- slot memory
- slot migration
- slot-filling
- slowapi
- slowlog
- slowlog-log-slower-than
- small LLM
- smoke tests
- smolagents
- smooth quantization
- smtplib
- snapshot
- snapshot mode
- social choice
- social scoring
- soft TTL
- soft label
- soft limit
- softiwarp
- softmax attention
- sops
- source
- source whitelist
- spaCy
- sparse MoE
- sparse attention
- sparse features
- sparse gradients
- sparse matrix
- sparse reward
- sparse softmax
- spatial hashing
- speaker
- speculative decoding
- speculative execution
- speedup
- spilling
- spinner
- split-brain
- spot price
- spot termination notice
- spot termination rate
- spurious correlations
- sse-starlette
- stability
- stable version
- stacked bar
- staging environment
- stake
- stale data
- stale-while-revalidate
- standard deviation
- state
- state coverage
- state management
- state object
- state space
- state summarization
- state transfer
- state-action-next state
- static analysis
- static batching
- static memory allocation
- static routing
- static shapes
- stationarity
- statistical distance
- statistical distribution tests
- statistical power
- statsmodels
- step completion
- step embeddings
- step merging
- step verifier
- step-back prompting
- step-wise verification
- stepLR
- step_count
- step_number
- steps per session
- sticky assignment
- sticky session
- stochastic rounding
- stop words
- stop_after_attempt
- stop_after_delay
- stop_token
- storage per shard
- storage system
- strace
- stragglers
- stratified sampling
- streaming chunking
- streaming data
- streaming feature pipeline
- streaming tasks
- streaming agent
- stress test
- stress-ng
- stride
- structlog
- structural pruning
- structured logging
- structured loss metrics
- structured output
- structured output format
- structured representation
- structured representations
- structured response
- structuring
- stub database
- student model
- style bias
- subscription_tier
- subshots
- subtle injection
- subtle injections
- subtract max
- subvector
- summarization
- supervised loss
- supervisor agent
- surrogate objective
- swap positions
- swap-space
- swap-test
- swarm coordination
- swarm simulation
- sweep
- switching criteria
- swizzle
- sycophancy
- symbolic regression
- symbolic representations
- synchronization primitives
- synchronous replication
- synchronous update
- synonym mapping
- synthetic benchmark generator
- synthetic data collapse
- synthetic data generation
- synthetic eval collapse
- synthetic eval datasets
- synthetic evaluation
- synthetic generation
- synthetic request
- syscall interposition
- system.query_log
- systolic array
T
- T-lite-instruct
- T4
- T5
- T5 relative bias
- TAP
- TAPAS
- TATR-structure
- TCO
- TCP
- TCP retransmission
- TEDS
- TEE
- TEI
- TF-IDF
- TFLOPS
- TGI
- TIES-Merging
- TIR
- TLA+
- TLS
- TLS 1.3
- TLS/SASL
- TMA
- TORCH_DISTRIBUTED_DEBUG
- TPOT
- TPU
- TREC
- TREC Robust
- TSDB
- TTFT
- TTL
- TTL dictionary
- TTPs
- TTS
- TTT Layer
- TTestIndPower
- Table
- Table Extraction Score
- Table Transformer
- Table format
- Table recovery accuracy
- TableFormer
- TableNet
- TableRetrieverQueryEngine
- Tabula
- Tag-based invalidation
- Tail-based sampling
- Tailwind CSS
- Target KL
- Targeted poisoning
- Task
- Task Completion Rate
- Task curriculum
- Task priority
- Task queue
- Task vector arithmetic
- TaskGroup
- TaskSpec
- TaskType
- Tasks per Operator
- Taxonomy
- Teacher Agent
- Teacher Forcing
- Team coordination layer
- Tecton
- Telegram
- Temperature
- Template injection prevention
- Temporal
- Temporal PDDL
- Temporal Web UI
- Temporal modeling
- Temporal partitioning
- Temporalite
- TenSEAL
- Tensor Cores
- Tensor parallelism
- TensorBoard
- TensorFlow
- TensorFlow Federated
- TensorFlow Privacy
- TensorRT Plugin API
- TensorRT-LLM
- Terminal state
- Terraform
- Tesseract OCR
- Test fixtures
- Test queries
- Test stand
- Test-Time Compute
- Test-Time Training
- Test-time compute scaling laws
- Test-time iteration
- TestClient
- Testcontainers
- TestsetGenerator
- Text classification
- Text encoder
- Text repetition
- Text-to-SQL
- TextAttack
- TextFooler
- TextRank
- Tfidf + LogisticRegression
- Thanos
- The Pile
- Theorem of Myerson
- Theory of Mind
- Thesaurus/WordNet
- Thompson sampling
- Thought-Action-Observation loop
- ThreadPoolExecutor
- Thresholds
- Thrift
- Thundering Herd
- Tied embeddings
- Tiered storage
- Time to fix
- Time window
- TimeSformer
- TinyBERT
- TinyDB
- TinyLlama
- TinyStories
- Together.ai
- Toil reduction
- Token binding
- Token budgets
- Token efficiency
- Token manager
- Token repetition removal
- Token smuggling
- Token-based payment
- Token-level caching
- Token-level evaluation
- Token-level matching
- TokenTextSplitter
- TokenTracker
- Tokenization of coordinates
- Tombstone message
- Tombstone record
- Tombstone records
- Tool
- Tool Accuracy
- Tool Call Accuracy
- Tool Degradation with Availability Masking
- Tool Drift
- Tool Executor
- Tool Success Rate
- Tool System
- Tool Timeout
- Tool Usage Accuracy
- Tool Validation
- Tool Versioning
- Tool call rate
- Tool correctness
- Tool failure
- Tool integration
- Tool misuse rate
- Tool prompt
- Tool role
- Tool selection
- Tool trace accuracy
- Tool use alignment
- Tool-level attack
- ToolValidationError
- Toolformer
- Top-1 selection
- Top-K sparsification
- Top-k routing
- Top-k sampling
- Top-p (nucleus) sampling
- Top-token confidence
- Topics
- Torch-MLIR
- TorchDynamo
- TorchMetrics
- Torrance Tests of Creative Thinking
- Total Revenue
- Total cost per session
- Toxicity filter
- Toxicity score
- Toxiproxy
- tqdm
- TrOCR
- Trace context
- Trace propagation
- TraceId
- TraceManager
- TraceQL
- TracerProvider
- Traefik
- Train set
- Train-serve skew
- Training Data Poisoning
- Training Stability
- Training dataset
- TrainingArguments
- Trajectory Exact Match
- Trajectory reward
- Trajectory similarity
- TransNetV2
- Transaction ID
- Transformer
- Transformer Engine
- Transformer-XL
- TransformerBlock
- TransformerLens
- Translation attack
- Translation role
- Treatment
- Tree
- Tree Attention
- Tree Cache Management
- Tree Search Agents
- Tree attention mask
- Trigger
- Trimmed mean
- Trimming attack
- Triple
- Triplet loss
- TripletDataset
- Triton Inference Server
- TruLens
- TrueSkill
- Trusted documents
- Truthful mechanism
- TruthfulQA
- Tsunami
- Tumbling window
- Tutel
- Two-person rule
- Two-phase indexing
- Type I Error
- Type II Error
- Type-token ratio
- TypeScript
- TypedDict
- Typer
- Typical sampling
- Typo attack
- t-SNE
- t-test
- t3.medium
- table understanding
- tablespace
- tabula-py
- tactic
- tag
- tags_history.json
- tail latency amplification
- tail risks
- tanh
- target CPGA
- target UP
- target hardware
- target model
- target_modules
- targeted attack
- task allocation
- task prompt routing
- task taxonomy
- task templates
- task vector
- task_id
- taskset
- tc
- tcpdump
- tctl
- te.LayerNorm
- te.Linear
- teacher-forcing
- teacher-student
- telegram bot
- teleprompter
- temperature response
- tempfile
- template
- template circuits
- template versioning
- template-based generation
- temporal bounding
- temporal constraints
- temporary key unavailability
- tenacity
- tenant_id
- tensor-parallel-size
- termTimeoutSeconds
- termcolor
- termination notice
- test generation
- test plan
- test set generation
- tests
- text
- text-embedding-3-large
- text-embedding-3-small
- text-to-image retrieval
- theorem proving
- think/act/observe
- thop
- thrashing
- thread
- thread pool
- thread safety
- thread_block
- threading
- threading.Barrier
- threading.Lock
- threading.Timer
- threat intelligence
- threat modeling
- threshold
- threshold similarity
- threshold-based filtering
- threshold_early_stop
- throughput
- tiered SLA
- tiktoken
- tiled_partition
- tiling
- time
- time series
- time to verification
- time-series analysis
- time.monotonic
- timeit
- timeline
- timeout
- timestamp
- timestamps
- timm
- tit-for-tat
- tmpfs
- token
- token bucket
- token concatenation
- token cost
- token economics
- token leak
- token leakage
- token manipulation
- token masking
- token overshoot
- token smoothing
- token usage
- token-level confidence
- token-level confidence estimation
- token-level representations
- token-level scheduler
- tokenizer
- tokens per second
- tokens per word
- tokens_wasted
- tool call consistency
- tool injection
- tool misuse
- tool overuse
- tool selection learning
- tool testing
- tool use accuracy
- tool verification
- tool_call_failure
- top-5-10
- top-k
- top-k KL divergence loss
- topic modeling
- topk
- topology
- topology matrix
- topology-aware scheduling
- torch memory stats
- torch.autograd.Function
- torch.bmm
- torch.compile
- torch.cuda.amp.autocast
- torch.cuda.empty_cache
- torch.cuda.max_memory_allocated
- torch.cuda.memory_snapshot
- torch.cuda.memory_summary
- torch.cuda.set_per_process_memory_fraction
- torch.distributed
- torch.distributed.optim
- torch.jit.script
- torch.no_grad
- torch.utils.checkpoint
- torch.utils.cpp_extension
- torchrun
- torchvision
- toroidal topology
- toxic content
- trace validation
- traceability
- traceback
- traceparent
- traces
- tracestate
- tracking state
- trade-off
- trade-off (quality vs. latency)
- train/test split
- training
- training cost proportionality
- training objective
- trajectories
- trajectory
- trajectory accuracy
- trajectory coverage
- trajectory distillation
- trajectory divergence
- trajectory graph
- trajectory optimization
- transaction
- transactional.id
- transferability
- transform
- transformer block
- transformer_lens
- transformers
- transitive closure
- transitive dependencies
- translation
- transmission overhead
- transport layer
- transpose
- traversal query
- tree search
- tree-based decoding
- trend analysis
- trigger_rollback
- triggers
- true objective
- truncated BPTT
- truncation
- trust calibration
- trust model
- trust score
- trust-weighted averaging
- truthful bidding
- truthfulness
- try-catch
- tuned lens
- two-phase commit
- two-stage training
- two-step confirmation
- type hints
U
- U-Net
- U-shaped curve
- UCB constant C
- UDP
- UI
- UMAP
- UP-Fall
- UPSERT
- UUID
- UUID v4
- UX
- UX metrics
- UltraFeedback
- Ultralytics
- Uncertainty quantification
- Underconfidence
- Underfitting
- Underprovisioning
- Undersampling
- Unicode
- Unicode homoglyphs
- Unicode replacement character
- UnicodeDecodeError
- Unified embedding
- Unified embedding space
- Unified retrieval
- Unified schema
- Uniform control flow
- Uniformity
- Unigram
- Union Type
- Uniqueness
- Unit test for prompt
- Unit testing
- Unitary/toxic-bert
- Universal Adversarial Triggers
- Universal Transformer
- Unix Time
- Unleash
- Unnatural Instructions
- Unpacking
- Unsloth
- Unstructured
- Untargeted poisoning
- Up-training
- Upper Confidence Bound
- Usability testing
- User Browsing Model
- User Modeling
- User Story
- User bias
- User confirmation
- User feedback
- User flow
- User persona
- User retention
- User study
- Uvicorn
- uint8
- unambiguous semantics
- unanswerable question
- unauthorized tool chain
- uncertainty UI
- uncertainty sampling
- underflow
- undo window
- unembedding
- unified architecture
- unified memory
- unified_diff
- unique_paths
- unit economy
- unittest
- unittest.mock.patch
- unsupervised loss
- untargeted attack
- upstream
- usage
- usage hours
- useState
- user adoption
- user engagement
- user satisfaction
- user-based rate limiting
- user_embedding
- user_id
- user_id hash
- user_id-based split
- user_template
- user_tenure
- utility per token
V
- V100
- VAD
- VAE
- VALSE
- VALSE benchmark
- VCR.py
- VL-LLM
- vLLM
- VLM
- VQ-GAN
- VQA
- VQVAE
- VRAM usage
- VS Code
- Valid Efficiency Score
- Validating admission
- Validation fail reason distribution
- Validation prompt
- Validation set
- Value
- Value Network
- Value head
- Vamana
- Vanna.ai
- Variable
- Variable Renaming
- Variable costs
- Variance Estimation
- Variational Speculative Decoding
- Vault
- Vault Agent Injector
- Vault CSI Provider
- Vector indexes
- Vector stores
- VectorIndex
- VectorStoreRetriever
- VectorStoreRetrieverMemory
- Vendi Score
- Vendor lock-in
- Verbosity bias
- Vercel
- Version control
- Versioned API
- Vertex AI
- Vertex AI Batch Prediction
- Vespa
- ViLT
- ViT
- ViT-L/14
- ViViT
- Vickrey-Clarke-Groves auction
- VictoriaMetrics
- Vicuna benchmark
- VideoCLIP
- VideoCoCa
- VideoMAE
- View
- Violation rate
- Virtual Users
- Virtual contexts
- VirtualService
- VisDial
- Visibility Timeout
- Vision encoder
- Vision-Language Models
- Visit count
- Visual Embedding
- Visual Genome
- Visual Prompt Injection Dataset
- Visual grounding accuracy
- Visualization of computational graphs
- Vitis AI
- Volatile
- Volcano
- Volume Discount
- Voronoi diagram
- Voting
- Vulkan
- vGPU
- v_proj
- validate_schema.py
- validation metric
- vanishing gradients
- variable cost
- variable-length sequences
- variance normalization
- variance of accuracy
- variational methods
- vector DB poisoning
- vector field
- vector score
- vector search
- vector similarity
- vegeta
- venv
- verbosity
- verifier models
- verifier-guided decoding
- version bump
- version negotiation
- versioned agents
- versioned cache
- versioned documents
- veth
- video group
- video indexing
- video summarization
- virtual nodes
- virtual shards
- virtualenv
- visual expert modules
- visual prompt injection
- vllm:num_requests_waiting
- vocabulary projection
- vocabulary size
- vulnerability
- vulnerability disclosure policy
W
- W3C Trace Context
- WAL
- WATCH
- WGMMA
- WGMMA instructions
- WIMBD
- WKV
- WORM storage
- WQE
- WSL
- Waiting queue
- Wall time
- Warm storage
- Warp
- Warp group
- Warp scheduler
- Warp schedulers
- Warp scheduling
- WasmEdge
- Wasmer
- Wasmtime
- Wasserstein distance
- WatchError
- Watermark Detector
- WatermarkStrategy.forBoundedOutOfOrderness
- Wav2Vec
- Wav2Vec2
- Wave Decoding
- Weaviate
- WebArena
- WebAssembly
- WebP
- WebRTC
- WebShop
- WebSocket
- Webhooks
- Weekly seasonality
- Weight Decay
- Weight sharding
- Weight sharing
- Weight tying
- Weight-only quantization
- Weighted Kappa
- Weighted Scoring
- Weighted routers
- Weighted routing
- Weighted voting
- WeightedRandomSampler
- Weights & Biases
- Weights & Biases Prompts
- Whisper
- Whisper streaming
- Whisper tokenizer
- WhisperFeatureExtractor
- Whistleblowing
- White-box
- Whoosh
- Why3
- WhyLabs
- whylogs
- WikiText-103
- WikiText-2
- Wikidata
- Wikipedia
- Wikipedia API
- Wikipedia abstracts
- Wikitext
- Wilcoxon signed-rank test
- Wildcard
- Win rate
- WinoBias
- WireMock
- Wireframe
- Wireshark
- Wizard-of-Oz
- WizardLM
- Word Error Rate
- Word-Patch Alignment
- Word-level attack
- WordNet
- WordPiece
- Work Request
- Workflow
- World models
- Write quorum
- Write-through
- Wuerstchen
- w2v-BERT
- wait_exponential
- wait_random
- wal2json
- wall-clock speedup
- warm index
- warm standby
- warmup steps
- warp divergence
- warp stall reasons
- warp-level parallelism
- warp_group
- washout period
- waterfall diagram
- watermark
- watermarking
- wav2vec 2.0
- wave beam search
- wave decoder
- weak supervision
- web search
- webhook
- weight initialization
- weight optimization
- weighted fusion
- weighted logistic regression
- weighted recall
- where clause
- whisper.cpp
- white-box extraction
- white-box jailbreak
- whitelist
- whitelist/blacklist
- window + watermark
- windowed processing
- winner prediction accuracy
- winning response
- wmma
- wolframalpha
- worker
- worker_prefetch_multiplier
- workers
- worst-case error
- write-behind
- write-through cache
- wrk
X
Y
Z
- Z-score
- Z3
- ZAB
- ZSTD
- Zamba
- Zapier
- ZeRO
- ZeRO-3
- ZeRO-Infinity
- ZeRO-Offload
- Zero init
- Zero point
- Zero-downtime
- Zero-hit rate
- Zero-shot
- Zero-shot attack
- Zero-shot extrapolation
- Zero-shot generalization
- Zero-shot retrieval
- ZeroSCROLLS
- Zipf distribution
- Zipkin
- Zod
- ZooKeeper
- zCDP
- zero downtime
- zero-copy
- zero-order search
- zigzag effect
А
- Abstract parser
- Scenario automation
- Autonomous delegation
- Agent systems architect
- Asymmetric quantization
- Asynchronous indexing
- Inversion attacks
- autoregressive decoding
- agent in production
- adaptive algorithms
- adaptive limit
- activation sparsity
- acoustic tokens
- acoustic encoding
- coverage analysis
- deployment annotations
- asymmetric cache replication
- asynchronous processing
- associative scanner
- resource auction
Б
В
Г
Д
- Two-phase migration
- Delegation to a human
- Summary tree
- PII detector
- datasets
- two-stage retrieval
- de-anonymization
- Cartesian product
- declarative description
- decoding head
- cycle detector
- watermark detection
- repeated action detection
- deterministic traffic distribution
- deterministic subsets
- divergent thinking
- discretization
- audio discretization
- discrete tokens
- item discrimination
- long-term memory
- document distribution drift
- query distribution drift
Е
З
И
К
- Model cascade
- Cascaded conversion
- Quality relative to full attention
- Combinatorial explosion
- LLM context
- Contextual masking
- Autonomy ratio
- Useful delegation ratio
- Query caching
- IRT model calibration
- label cardinality
- embedding clustering
- keyframe
- keyword search
- keyframes
- collective communications
- DSPy compiler
- connector methods
- data consistency
- constant memory
- contracts between agents
- budget controller
- configuration file
- server configuration
- document corpus
- cosine similarity
- short-term memory
- crowdsourcing with verification
Л
М
- Materialization of the S matrix
- Action matrix
- Least privilege
- Data minimization
- Multi-page tables
- Multi-step jailbreak attack
- Security monitoring
- masking
- transition matrix A
- projection matrix B
- projection matrix C
- inter-node network
- mel spectrogram
- mel spectrograms
- metadata
- complexity metric
- success metrics
- micro-benchmark
- microservice
- multidimensional IRT
- multi-armed bandit
- Lotka-Volterra model
- monitoring
- monitoring in production
- multimodal isolation
- multimodal capabilities
- multimodal documents
- multimodal RAG
Н
О
П
- Generation pipeline
- Strategy pattern
- Reindexing
- Rotated text
- Right to be forgotten
- Happy path problem
- Pseudonymization
- pipeline
- automated testing pipeline
- parameterized tests
- parameterized test
- patches
- recompilation
- behavioral signals
- data movement policy
- pricing
- programmable prompts
- agent prompt
- paraphrasing prompt
- prompts
Р
С
- Semantic mapping
- Separability
- Session contexts
- Embedding compression
- Symmetry
- Feedback bias
- Event-driven architecture
- Execution environment
- Paged memory organization
- Structured prompts
- Structural features
- traffic segmentation
- selective filters
- semantic ambiguity
- semantic deduplication
- semantic clustering
- semantic memory
- semantic tokens
- semantic encoding
- datetime serialization
- signatures
- failure simulation
- synthetic dataset generation
- cache synchronization
- question difficulty
- reasoning layers
- special tokens
- specific tokens
- model ability
- answer comparison
- mean pixel brightness
- statistical significance
- statistical test
- steganography
- subquadratic attention
- table summarization
- raw accuracy
Т
- State tokenization
- Transactional consumer
- Transactional producer
- Trimming
- page table
- text RAG
- text prompt
- teleprompters
- agent testing
- test prompts
- test query set
- test scenario
- node types
- image tokenization
- tokens
- GPU topology
- agent trajectory
- transformer decoder
- tracing