Models Capabilities Use Cases Benchmarks Papers Glossary

Models Capabilities Use Cases Benchmarks Papers Glossary

About Privacy Terms RSS

ThinkLLM

Spot an error in our data? Let us know.

Glossary

Technical terms explained for non-experts. These definitions appear throughout ThinkLLM to help you understand model profiles.

4733 terms

A B C D E F G H I J K L M N O P Q R S T U V W X Y Z

1

1-Bit Architecture

A model design where weights are restricted to only three discrete values (-1, 0, or 1) instead of continuous floating-point numbers, drastically reducing model size and computation.

1-Bit Model

A neural network where each weight is represented using only 1 bit of information (in this case, as one of three values: -1, 0, or 1).

1-Bit Precision

An extreme form of quantization where each weight is represented by just a single bit (0 or 1), maximizing compression but reducing model expressiveness.

1-Bit Quantization

An extreme form of compression that represents model weights using only 1 bit of information per value, drastically reducing memory use but with significant quality loss.

16-Bit Precision

A data format that represents model weights using 16 bits per number, balancing memory efficiency with numerical accuracy.

3

3D Gaussian Splatting

A technique for representing and rendering 3D scenes using millions of small Gaussian blobs instead of traditional meshes.

3D Gaussians

Mathematical shapes (Gaussian distributions) positioned in 3D space used to represent and render 3D scenes efficiently.

3d Layout Conditioning

Guiding AI model outputs by conditioning on 3D spatial layout information.

3D Scene Reconstruction

Building a complete 3D model of a physical environment from images or sensor data.

3D Scene Understanding

Comprehending the three-dimensional structure, objects, and relationships within a physical environment.

4

4-Bit Integer Quantization

A specific quantization method that represents model weights using only 4 bits per number instead of the standard 32 bits, dramatically reducing memory usage.

4-bit Precision

A quantization level where model weights are stored using only 4 bits per value, significantly reducing model size at the cost of some accuracy.

4-bit Quantization

A specific type of quantization that represents model weights using only 4 bits instead of the original 32 bits, enabling very efficient inference on consumer hardware.

5

5-bit Quantization

A specific compression method that represents model weights using only 5 bits of data per value, enabling efficient local deployment on resource-constrained hardware.

6

6-bit Precision

A quantization method that represents model weights using only 6 bits per value, significantly reducing memory requirements compared to standard 32-bit floating-point storage.

6-bit Quantization

A specific quantization method that represents model weights using 6 bits instead of the standard 32 bits, significantly shrinking the model while maintaining reasonable accuracy.

8

8-bit Precision

A quantization method that represents model weights using 8 bits instead of the standard 32 bits, reducing memory usage by approximately 75% while maintaining reasonable performance.

8-bit Quantization

A specific quantization method that represents model weights using 8 bits instead of the standard 32 bits, significantly reducing memory requirements.

A

Ablation

A testing technique that removes or disables components to measure their impact on system behavior.

Ablation Study

An experiment that removes components from a system one at a time to measure how much each contributes to overall performance.

Abliterated

A model variant where safety filters and refusal mechanisms have been removed, allowing it to respond to requests without built-in content restrictions.

Abliteration

A technique that removes or disables a model's built-in safety refusal mechanisms, allowing it to respond to a wider range of requests.

Abnormality Localization

Identifying and highlighting the specific regions in medical images where disease or abnormalities are present.

Abnormality Maps

Visual maps showing which regions of a medical image are abnormal, derived from comparing to historical cases.

Absolute Query-Key Relevance

A measure of relevance between a query and key that is independent of other keys, allowing explicit rejection of irrelevant keys.

Abstention

When a system declines to make a prediction or recommendation instead of providing an answer.

Abstract Syntax Tree (AST)

A tree representation of code structure that shows how statements and expressions relate to each other.

Abstractive Summarization

Generating a summary by creating new sentences that capture key information, rather than selecting existing text.

Acceptance Predicate

A formal criterion that determines whether a generated design meets specified requirements.

Acceptance Rate

The proportion of draft model's proposed tokens that the target model accepts as correct during speculative decoding.

Access Control

Restrictions on what operations or resources a system or user is allowed to use.

Accessibility

Designing technology so people with disabilities can use it effectively.

Accountability Attribution

Determining which party in a system is responsible for harms or failures.

Accuracy-Effort Trade-off

A measure of how well an agent performs relative to the computational cost or number of steps it takes.

Acoustic Feature

Measurable properties of sound like loudness, pitch, or spectral characteristics used to describe audio signals.

Acoustic Representation

An internal mathematical encoding of sound properties that a model learns to recognize, such as frequency, pitch, and timbre characteristics.

Acquisition Function

A rule that decides which point to evaluate next by balancing exploration of new areas with exploitation of promising regions.

Action Binding

The problem of correctly associating a specific action command with the correct agent or subject in a scene.

Action Blindness

A failure mode where agents make poor action choices that lead to uninformative observations, cascading into reasoning errors.

Action Interface

The mechanism through which an agent specifies what operations to perform, such as code execution or structured tool calls.

Action Magnitude

The numerical size of predicted robot movements, which directly correlates with execution speed and distance traveled.

Action Prior

Learned knowledge about how actions and motion typically unfold over time, used to guide policy learning.

Action Recognition

The task of identifying and classifying specific actions or activities occurring in video frames.

Action Token Pretraining

Pre-training a model to understand and predict action tokens before learning continuous control.

Action-conditioned generation

Creating videos where specific physical actions (like forces or robot movements) control what happens in the scene.

Action-Conditioned Rollouts

Simulating multiple future steps of an environment given a sequence of actions the agent might take.

Action-Distance Metric

A continuous measure of how different two agents' behaviors are from each other.

Action-Relevant Perception

Visual features that specifically encode aspects of a scene important for robot control and manipulation.

Action-Value Estimation

Predicting the expected future reward from taking a specific action in a given state.

Actionable Representation

A learned encoding of an object that explicitly captures how it responds to and changes under different actions.

Activated Parameters

The portion of a model's total parameters that are actually used to process a given input; in MoE models, this is typically much smaller than the total parameter count.

Activation Distribution

The probability distribution of neuron outputs at each layer of a network.

Activation Noise

Random variations added to a model's internal computations to test robustness.

Activation Patching

A mechanistic interpretability technique that replaces activations during inference to identify which components cause specific behaviors.

Activation Pattern

The specific configuration of which neurons are active across a network when processing a particular input or task.

Activation Precision

The number of bits used to represent intermediate calculations during inference; keeping this higher (like 16-bit) helps preserve model quality when weights are heavily compressed.

Activation Probing

Analyzing internal neural network activations to understand what a model has learned or decided at different points.

Activation Quantization

The process of reducing the precision of intermediate values (activations) computed during model inference, separate from weight quantization.

Activation Space

The space of internal neuron activations in a model, as opposed to parameter or gradient space.

Activation Steering

Controlling model behavior by modifying internal activations during inference without changing model weights.

Activation-Based Analysis

Examining the internal numerical outputs of neural network layers to understand or guide model behavior.

Activation-based Jailbreaking

Bypassing AI safety features by manipulating the internal numerical patterns the model uses to process information.

Activation-Dark Regime

A setting where representation similarity metrics are uninformative because models can share identical activations yet have different task-specific outputs.

Activation-Space Dynamics

How hidden layer representations evolve during training in terms of magnitude and direction.

Active Learning

A training approach where the model chooses which new examples to learn from rather than using random data.

Active Parameter Count

The number of model parameters that are actually used during inference for a given input, as opposed to the total parameters available.

Active Parameter Design

A model architecture where only a subset of parameters are used for each token, reducing computational cost while maintaining model capacity.

Active Parameters

The subset of a model's total parameters that are actually used during inference for each input, as opposed to all parameters being used every time.

Active Perception

An agent's ability to selectively choose what to observe or process based on task needs, rather than passively consuming all input.

Actor-Critic

A reinforcement learning approach combining a policy-learning actor with a value-estimating critic for improved training stability.

Acyclicity Constraint

A mathematical constraint ensuring a causal graph has no cycles, enforcing valid causal structures.

AdamW

A standard optimizer algorithm commonly used to train neural networks by adjusting weights based on gradients.

Adapter

A small, specialized module added to a model that modifies its output for a specific task without changing the core model weights.

Adapter Code

Custom code written to translate data between incompatible formats or interfaces.

Adapter-Based Architecture

Adding lightweight modules to a pre-trained model to enable new capabilities without retraining the entire model.

Adaptive Aggregation

A method that adjusts how client updates are combined on the server based on their quality or contribution, rather than using fixed weights.

Adaptive Attack

An attack that adjusts its strategy based on feedback from the target system to improve its effectiveness.

Adaptive Computation

Automatically adjusting the amount of computation (depth, steps) based on the difficulty of each input.

Adaptive Compute

Dynamically adjusting the amount of computation (e.g., number of steps) based on problem difficulty.

Adaptive Learning

Educational systems that adjust content difficulty and pacing based on real-time analysis of learner performance and understanding.

Adaptive Opponents

Players who adjust their strategies based on observing the history of play rather than using fixed predetermined strategies.

Adaptive Policy

A system that dynamically adjusts parameters (like reward weights) based on the current task or input.

Adaptive Prompting

Dynamically selecting or modifying prompts based on the specific input query to optimize model performance.

Adaptive Quantization

A quantization approach that adjusts its representation strategy based on the distribution of input values.

Adaptive Reasoning

Dynamically adjusting how much computational effort a model uses based on problem difficulty.

Admm

Optimization algorithm that splits problems into smaller parts solved alternately.

Adoption and Usage Data

Real-world information about how and when workers actually use AI tools in practice.

Advantage Estimation

Computing how much better an action is compared to the baseline, used to guide policy gradient updates.

Advantage Function

In RL, measures how much better an action is compared to the average action in a given state.

Advantage Gap

The difference in expected returns between two policies, used to measure which policy is making better decisions in a given state.

Advantage Weighting

Using estimated advantage values to weight training examples, emphasizing transitions that improve over baseline.

Adversarial Attack

Intentional manipulation of input data to trick an AI model into making wrong decisions.

Adversarial Auditing

Systematically testing an agent's reasoning to find logical or evidential violations it may have missed.

Adversarial co-evolution

A training loop where attack and defense agents compete and improve against each other iteratively.

Adversarial Evaluation

Testing designed to find weaknesses and edge cases rather than help the system succeed.

Adversarial Examples

Deliberately tricky test cases designed to fool AI models, like plausible wrong answers.

Adversarial Falsification

Systematically searching for inputs where a model fails, used here to find materials where ML predictions diverge from ground truth.

Adversarial Generator-Discriminator Framework

A training setup where a generator model creates outputs while a discriminator learns to distinguish them from human examples, providing implicit feedback.

Adversarial Learning

Training where two networks compete—one generates behavior, the other judges if it matches the expert.

Adversarial loop

A process where one agent intentionally creates challenging test cases to improve another agent's output.

Adversarial Objectives

Training approach where a generator and discriminator compete to improve output quality and realism.

Adversarial Perturbations

Carefully crafted, often imperceptible changes added to images to fool AI models into producing incorrect outputs.

Adversarial Prefill Attack

Manipulating a model's output by prepending adversarial text that constrains the model's response generation.

Adversarial Prompting

Deliberately crafted inputs designed to trick an LLM into unsafe or unreliable outputs.

Adversarial Reweighting

A training technique that reweights samples to penalize worst-case calibration errors, improving robustness under distribution shift.

Adversarial Robustness

The ability of an AI system to maintain correct behavior even when facing intentionally crafted misleading inputs.

Adversarial Training

A defense method that trains models on adversarial examples to improve robustness against attacks.

Adversarial Training-Free Defense

A defense mechanism that protects models from attacks without requiring exposure to adversarial examples during training.

Aerodynamic Downwash

The disrupted air flow created by a moving object that affects nearby objects, a physical interaction agents must learn to handle.

Aesthetic Assessment

Evaluating the visual appeal and artistic qualities of images or scenes, such as composition and harmony.

Aesthetics-Guided Training

Pre-training a model with automatic musicality labels before preference alignment to establish quality priors and reduce downstream optimization conflicts.

Affect Coupling

Linking emotional or sentiment states between connected entities in a system.

Affective Polarity

The emotional tone of text, measured as the degree of negativity, positivity, or neutrality in language.

Affective State

A person's emotional state, typically measured by dimensions like valence (positive/negative) and arousal (calm/energized).

Affordance Prediction

Predicting which areas or objects in a scene are suitable for a specific action or interaction.

Agency

An AI system's ability to act autonomously toward goals in its environment.

Agency Transfer

Gradually shifting decision-making control from one policy to another during training.

Agent Audit

Systematic evaluation of an agent's behavior changes to detect shifts toward undesired traits or capabilities.

Agent autonomy

The degree to which an agent retains independent decision-making capability without external manipulation.

Agent Harness

The framework or system that orchestrates how an AI agent retrieves information, calls tools, and processes results.

Agent Orchestration

Coordinating multiple AI agents to work together on complex tasks.

Agent power

A measure of how much influence an individual agent has on collective outcomes and system behavior.

Agent Recommendation

Automatically selecting the most suitable agent(s) for a task from available registries using matching and ranking techniques.

Agent Scaffold

A multi-agent system where subagents work together to complete a task while hiding the overall objective.

Agent Skill

A specific capability or tool that an AI agent can use to accomplish part of a larger task.

Agent Trajectory

The sequence of actions and decisions an agent makes while working toward a goal.

Agent-Agnostic

A system or interface designed to work with any type of agent without requiring agent-specific customization.

Agent-Based Model

A simulation where independent agents follow simple rules and interact, creating emergent group behavior.

Agent-Tool Interface Grounding

Teaching an AI agent how to interact with a specific tool by providing its input language, constraints, and validation rules.

Agential Cut

The boundary between what a measurement instrument observes and what it ignores, which is shaped by the instrument's design choices.

Agentic

A model designed to act autonomously by making decisions, selecting actions, and using tools to accomplish multi-step tasks.

Agentic AI

An AI system that can autonomously plan and execute multi-step tasks, making decisions along the way.

Agentic Behavior

The ability of a model to autonomously plan and execute sequences of actions or tool calls to accomplish a goal.

Agentic Coding

An approach where an AI model autonomously plans and executes multi-step coding tasks, making decisions about which files to modify and how to structure solutions.

Agentic Data Retrieval

The process of autonomous agents searching for and obtaining datasets needed to complete tasks.

Agentic Depth

Sequential overhead from cascaded perception, reasoning, and tool-calling loops in agentic systems.

Agentic Engineering

Designing and building systems where AI agents autonomously plan, decide, and act toward goals.

Agentic Evaluation

Testing an AI system's ability to complete multi-step tasks that require planning, searching, and taking actions.

Agentic Framework

A system where an AI model acts as an agent that can call tools repeatedly to solve problems step-by-step, rather than answering in a single pass.

Agentic Language

A structured language with explicit control constructs (IF, GOTO, FORALL) that agents use to execute plans deterministically.

Agentic Language Model

An LLM system that can plan and execute multi-step tasks autonomously to achieve goals.

Agentic Microphysics

The study of local interaction dynamics where one agent's output becomes another agent's input under specific protocol conditions.

Agentic Multimodal Models

AI systems that can process multiple types of input (text, images, etc.) and actively interact with external tools and environments.

Agentic Perception

Vision systems that extract structured state information needed for an agent to make decisions, not just recognize objects.

Agentic Reasoning

Reasoning through explicit tool calls or code execution that can be interpreted and debugged, but may incur latency from external execution.

Agentic Reflection

An agent's ability to evaluate its own performance and autonomously improve its behavior across multiple attempts.

Agentic Reinforcement Learning

Training autonomous agents to make sequential decisions by learning from rewards and reusable experience.

Agentic Reinforcement Learning

A training approach where an AI model learns to make sequential decisions and take autonomous actions to complete multi-step tasks, rather than just responding to individual prompts.

Agentic Retrieval Mechanism

A tool-augmented system where a reasoning model navigates and searches through structured memory using an Observation-Reason-Action loop.

Agentic Search Systems

AI systems that iteratively search and synthesize information to solve complex problems autonomously.

Agentic Self-Correction

An AI agent's ability to detect and fix its own errors by using tools or feedback without human intervention.

Agentic Self-Evolution

An agent's ability to autonomously improve its capabilities by creating and refining its own skills over time.

Agentic Strategies

Structured approaches where an AI system takes initiative to gather information systematically rather than passively responding to user input.

Agentic Systems

AI systems that autonomously plan, act, and adapt based on feedback to accomplish multi-step goals in complex environments.

Agentic Task

A structured problem where an agent dynamically selects actions and resources to achieve a goal.

Agentic Tasks

Complex tasks where a model acts autonomously to break down goals into steps, use tools, and make decisions to reach an objective.

Agentic Visual Generation

A visual generation system that autonomously decides when and how to use external tools or search to improve image generation quality.

Agentic Workflow

A system where an AI agent autonomously performs tasks with explicit goals and structured decision-making.

Agentic Workflows

Processes where a model autonomously plans and executes multiple steps or tool calls to accomplish a goal, rather than responding to a single prompt.

Aggregation

Combining multiple data points or model outputs into a single summary result.

AI Control

The study of safely deploying capable but potentially untrusted AI systems through monitoring and oversight.

AI-Augmented Ecosystems

Interconnected systems where multiple AI components interact through shared data and infrastructure.

AI-Text Detection

Methods to identify whether text was written by an AI system or a human.

Aleatoric Uncertainty

Randomness or noise inherent in data that cannot be reduced with more information.

Algorithmic Bias

Systematic errors in AI systems that unfairly disadvantage certain groups of people.

Algorithmic Fairness

Ensuring AI systems treat different groups equitably without discrimination.

Algorithmic Monoculture

Tendency of AI systems to produce similar outputs or behaviors, either naturally or in response to incentives.

Algorithmic Transparency

The ability to reconstruct and understand the computational process by which a model arrives at its outputs using interpretable intermediate states.

ALiBi Positional Encoding

A technique that helps the model understand the order and position of words in long sequences without needing to add extra position information to each word.

Aligned

A model trained to behave safely and follow human values through techniques like safety filtering and refusal of harmful requests.

Alignment

The process of training a model to behave safely and according to human values and preferences, which base models typically lack.

Alignment Auditing

Systematic testing to verify that AI systems behave safely and according to intended values in realistic deployment scenarios.

Alignment Faking

When an AI model appears aligned under monitoring but subverts its goals when unmonitored.

Alignment Fine-Tuning

The process of adjusting a model's behavior to make it safer, more helpful, and better aligned with human values.

Alignment Guardrails

Safety constraints built into a model during training to prevent it from generating harmful, biased, or inappropriate content.

Alignment Layer

Additional training applied to a base model to make it behave safely and follow user intentions more reliably.

Alignment Tampering

A vulnerability where a model exploits the alignment process by influencing its own training data to amplify misaligned behaviors.

Alignment Tax

The degradation of a model's general capabilities that occurs when training it to align with human values.

Allocation Monotonicity

A guarantee that higher bids weakly increase an item's chance of being recommended without requiring model retraining.

Alpha Release

An early, experimental version of software that is still under development and may have bugs or incomplete features.

Always-On Personal Assistants

AI agents continuously available to help users by understanding context across their entire digital world and proactively anticipating needs.

Ambiguity Bias

Errors caused by confusion between similar or overlapping UI elements when determining which one to interact with.

Amino Acid Sequence

The linear chain of amino acids that makes up a protein, which determines its structure and function.

Amino Acid Sequences

The linear arrangement of amino acids that make up a protein, written as a string of letters where each letter represents a different amino acid.

Amortization

Spreading the cost of an expensive computation across multiple uses to reduce per-use cost.

Ancestor-Only Attention Mask

An attention pattern that restricts a model to only attend to ancestor nodes in a tree structure, enabling efficient tree verification.

Anchor Selection

Choosing a reference model to compare all other models against in pairwise evaluation tasks.

Anchoring

Bias where initial information disproportionately influences subsequent decisions.

Anchoring Effect

A bias where an initial piece of information (anchor) disproportionately influences subsequent judgments.

Angular Resolution

In quantization, the granularity of representable values in high-dimensional space; low resolution causes accuracy loss.

Angular Step Size

The learning rate applied to the direction of weight updates, separate from the magnitude of weights.

Annotation Aggregation

Methods for combining multiple human judgments into a single training signal for the model.

Annotation Framework

A structured set of guidelines for labeling data with specific linguistic or semantic information.

Annotation Pipeline

A systematic process for labeling data with human-verified information to create training datasets.

Annotator Disagreement

Variation in how different people label the same content, reflecting genuine differences in perspective rather than labeling error.

Anode Material

The negative electrode in a battery where ions are stored during charging.

Anomaly Detection

Identifying data points or objects that deviate significantly from normal patterns or training data.

Anomaly Segmentation

Identifying and precisely locating defective or abnormal regions in images or 3D data at the pixel or voxel level.

Answer Set Programming

A declarative programming paradigm for solving combinatorial problems using logical rules and constraints.

Anytime-Valid

A statistical test that can be checked at any stopping time without inflating error rates.

Apache 2.0 License

An open-source software license that allows free use, modification, and distribution of code with minimal restrictions.

Apache 2.0 License

A permissive open-source license that allows free use, modification, and distribution of software with minimal restrictions.

Apache License

A permissive open-source license that allows you to use, modify, and distribute software with minimal restrictions.

Apache Licensed

A permissive open-source license that allows free use, modification, and distribution of software with minimal restrictions.

API

An interface that allows developers to send requests to and receive responses from an AI model over the internet.

API Access

A programmatic interface that allows developers to send requests to the model and receive responses without running it locally.

API Accessibility

The ability to access and use a model programmatically through an application programming interface, allowing developers to integrate it into their applications.

API Accessible

A model that can be used through an application programming interface, allowing developers to integrate it into their applications programmatically.

API Availability

Access to a model through an application programming interface, allowing developers to integrate the model into their applications and services programmatically.

API Compatibility

The ability of a service to work with the same code and commands as another service, making it easy to switch between them.

API Deployment

A method of making an AI model available for use over the internet through standardized web requests, rather than running it locally.

API Inference

Running a model through a web service interface where you send requests and receive predictions without needing to host the model yourself.

API Schema

A specification describing how a backend service accepts requests and returns data.

API Token

A credential that grants an application or automated agent permission to access services and data on behalf of a user or organization.

API-Based Deployment

A model served through an application programming interface (API) rather than run locally, allowing users to send requests and receive responses over the network.

API-Only Access

A model that can only be used through programmatic requests (code) rather than through a web interface or chat application.

Append Only Log

Data structure that records events sequentially without allowing deletions.

Apple Silicon

Apple's custom-designed processors (like M1, M2, M3) optimized for running machine learning models on Mac computers.

Apple Silicon Optimization

Software tuning that allows a model to run efficiently on Apple's custom processors (like M1, M2, M3) found in Mac computers.

Approximation Ratio

A measure of how close a solution is to the optimal solution, expressed as a ratio.

Approximation Theory

Mathematical framework for understanding how well functions can represent complex phenomena.

Arbitration

The process by which a model resolves conflicts between different input modalities (e.g., audio vs. text).

Architectural Constraint

A structural limitation in code design that prevents correct solutions even when the agent optimizes parameters within it.

Architecture

The underlying structural design of a neural network that defines how data flows through layers and components.

Argumentation Framework

A formal system for evaluating arguments based on attack relationships between them, determining which arguments are acceptable.

Arithmetic Circuit

A mathematical representation of a computation as a directed graph of arithmetic operations.

Arithmetic Reasoning

A model's ability to perform mathematical calculations and solve problems involving numbers and operations.

Arousal

The intensity or activation level of an emotion, ranging from calm to excited.

Artifact

A released model, dataset, or tool that is publicly documented and available.

Artifact Delivery

The ability of an agent to produce and return tangible business outputs (documents, code, reports) that meet quality standards.

Artifact Management

Organizing and tracking the outputs and intermediate results an agent creates during problem-solving.

Artificial Neural Network (ANN)

A machine learning model inspired by biological neurons that learns patterns from data to make predictions or classifications.

Artistic Style Prediction

The task of identifying or classifying the artistic style of a work (e.g., Renaissance, Impressionism) using AI.

Aspect-Decomposed Synthetic Corpus

Training data generated by breaking down queries into multiple aspects and creating complementary evidence examples.

Associative Memory

A system that retrieves stored patterns by establishing stable attractors around them, like Hopfield networks.

Associative Reasoning

The ability to find meaningful connections and relationships between different concepts or ideas.

Asymmetric Encoding

A technique where queries and documents are encoded differently to optimize retrieval performance, rather than treating them identically.

Asymmetric Search

A retrieval approach where the query and the documents being searched have different lengths or structures, like matching a short question to long passages.

Asymptotic-Preserving

A neural network approach that correctly captures physics behavior across different scales and parameter regimes.

Attack Surface

The set of all possible entry points or vulnerabilities in a system that an attacker could exploit.

Attainable Utility Preservation

A safety approach that penalizes actions that significantly reduce the agent's ability to achieve future goals.

Attention

A mechanism that lets the model focus on relevant parts of the input when generating each output token.

Attention Attribution

A method to identify which input tokens most influence specific outputs by analyzing attention weights across network layers.

Attention Entropy

A measure of how concentrated or distributed attention weights are; lower entropy means the model focuses on fewer tokens.

Attention Head

A parallel attention mechanism within a transformer layer that learns different aspects of input relationships.

Attention Intervention

Modifying attention weights in transformer models to change which parts of input the model focuses on.

Attention Maps

Visual representations showing which parts of an input a model focuses on when generating each output.

Attention Mask Intervention

Redirecting where a model's attention focuses by modifying which positions it can attend to during processing.

Attention Mechanism

A technique that allows a model to focus on the most relevant parts of the input when generating each output token.

Attention Partition

How a model's attention mechanism divides its focus between different input elements like image and text tokens.

Attention Pass

A single forward computation through an attention mechanism that produces weighted outputs from input queries and values.

Attention Pooling

Aggregating embeddings by learning weighted combinations that emphasize the most relevant slices or features.

Attention Sink

A token that attracts excessive attention from the model regardless of its semantic importance.

Attention Sinks

Tokens that attract disproportionate attention from the model regardless of their semantic relevance to the task.

Attention Visualization

Techniques that show which parts of input data a model focuses on during processing.

Attention-Based Grounding Score

A signal measuring how well a reasoning step is supported by the input and previously accepted steps.

Attractor Computation

An algorithm that identifies sets of states a system will inevitably reach, used in game-theoretic analysis.

Attractor Module

A component that refines embeddings by solving for fixed points using implicit differentiation during training.

Attribute Inference

Deducing personal characteristics like gender, age, or ethnicity from user data without explicit disclosure.

Attribute-Level Bias

Bias caused by specific visual features (like clothing or age appearance) rather than overall identity differences.

Attribution

Identifying which input features or model components are responsible for a specific prediction or behavior.

Attribution Method

A technique that identifies which parts of an input (like image regions) are most responsible for a model's predictions or errors.

Attribution-Based Neuron Mining

A technique to identify which neurons are responsible for processing specific types of input by analyzing their contribution to outputs.

AUARC

Area Under the Accuracy-Rejection Curve; a metric measuring how well confidence scores distinguish correct from incorrect predictions.

AUC (Area Under the Curve)

A metric measuring how well a model ranks positive cases higher than negative cases, ranging from 0.5 (random) to 1.0 (perfect).

AUC-Consistency Dissociation

When a model maintains high classification accuracy (AUC) while its explanations become inconsistent across similar cases.

Audio Captioner

A system that generates text descriptions of audio content, allowing LLMs to reason about sound indirectly.

Audio Classification

The task of automatically assigning audio clips to predefined categories, such as identifying whether a sound is music, speech, or environmental noise.

Audio Codec

A tool that compresses and decompresses audio data to reduce file size while preserving sound quality.

Audio Conditioning

Using an audio sample to guide or control what a generative model produces, rather than using text or other inputs.

Audio Embedding

A numerical representation (vector) that captures the essential features and meaning of audio data in a compact form that machine learning models can process.

Audio Embeddings

Numerical representations of audio that capture its meaning and characteristics in a form that machine learning models can process.

Audio Encoder

A neural network component that converts raw audio signals into numerical representations the model can process.

Audio Fidelity

The quality and accuracy of synthesized audio in reproducing natural-sounding speech.

Audio Reconstruction

The process of converting compressed audio tokens back into playable audio that closely matches the original sound.

Audio Transcription

Converting spoken audio content into written text for analysis.

Audio-Language Pretraining

A training approach that teaches a model to understand connections between audio sounds and text descriptions by learning from large unlabeled datasets.

Audio-Visual Processing

The ability to simultaneously analyze sound and video streams to understand content where both sight and sound are important.

Audio-Visual Understanding

The ability to jointly process and reason about both sound and video content to understand events, speech, and context more completely than analyzing either alone.

Auditing

The process of systematically reviewing code or systems to detect errors, vulnerabilities, or malicious modifications.

Auditory Knowledge

An LLM's understanding of sound, audio concepts, and acoustic phenomena learned from text-only pre-training.

Augmentative and Alternative Communication (AAC)

Technology and methods that help people with speech disabilities communicate, from word prediction to text-to-speech systems.

Augmented Lagrangian Method

An optimization algorithm that solves constrained problems by iteratively updating variables and penalty parameters.

AUPRC

Area Under the Precision-Recall Curve; a metric measuring classifier performance on imbalanced datasets.

AUROC

Area Under the Receiver Operating Characteristic curve, a metric measuring how well a model ranks correct answers above incorrect ones.

Authorial Intent

The underlying purpose or goal behind a creator's choices, whether to inform accurately or mislead deliberately.

Authorship Attribution

Determining who wrote a piece of text, including distinguishing human from AI authorship.

Auto-tuning

Automatically selecting optimal parameter values for a program by testing different configurations.

Autocomplete

A feature that predicts and suggests the next tokens or code snippets as a user types, completing partial inputs.

Autoencoder

A neural network that compresses data into a smaller representation (encoder) and reconstructs it (decoder).

Automated Attack

An attack that uses AI models or algorithms to automatically generate, refine, and evaluate malicious prompts at scale.

Automated Evaluation

Using algorithms to automatically measure AI model performance on tasks.

Automated Program Repair

Techniques that automatically generate patches to fix bugs or vulnerabilities in source code.

Automated Programming Assessment

Systems that automatically evaluate student code submissions for correctness and understanding.

Automated Segmentation

Automatically identifying and outlining specific regions or structures in an image without manual labeling.

Automated Verification

Using computational methods to automatically check whether a proposed solution is correct without human review.

Automatic Differentiation

Computing gradients of functions by decomposing them into elementary operations and applying the chain rule.

Automatic Speech Recognition (ASR)

Technology that converts spoken audio into written text automatically.

Automation Bias

The tendency for humans to over-rely on or trust automated systems, even when they make mistakes.

AutoML (Automated Machine Learning)

Automated tools that search over multiple model architectures and hyperparameters to find the best classifier without manual tuning.

Autonomous Agent

An AI system that can independently perceive its environment, make decisions, and take actions to accomplish goals without constant human direction.

Autonomous Agents

AI systems that can independently plan and execute multi-step tasks without human intervention at each step.

Autonomous Coding Agents

AI agents that generate, execute, validate, and repair code artifacts without human intervention in the loop.

Autonomous Feedback Loop

A system where AI automatically evaluates and improves itself without human intervention in the loop.

Autonomous Play

A robot independently practicing tasks and generating training data without human guidance or intervention.

Autonomous Skill Acquisition

A robot's ability to learn new manipulation skills through self-directed practice without human demonstrations.

Autonomy Spectrum

A range of control levels from fully human-controlled to fully autonomous AI, with hybrid modes in between.

Autoregressive

A model that generates text one token at a time by predicting the next word based on all previous words in the sequence.

Autoregressive Decoding

The standard method most language models use to generate text by predicting one token (word piece) at a time, left to right, where each prediction depends on all previous tokens.

Autoregressive Expansion

Generating sequences one token at a time, where each new token depends only on previous tokens.

Autoregressive Generation

A text generation approach where the model predicts one word at a time, using all previously generated words to inform the next prediction.

Autoregressive Image Generation

Generating images sequentially, one token or element at a time, where each prediction depends on previous outputs.

Autoregressive Language Model

A model that generates text by predicting one word or token at a time, using only the words that came before it.

Autoregressive Model

A model that predicts the next item in a sequence based on all previous items, one step at a time.

Autoregressive Models

Language models that generate text one token (word piece) at a time, where each new token depends on all previously generated tokens.

Autoregressive Rollout

Generating predictions sequentially where each prediction depends on previous predictions, causing errors to compound over time.

Autoregressive Unified Multimodal Model

A single neural network that generates outputs one token at a time across all modalities using the same architecture.

Autoregressive Video Diffusion

A generative model that creates videos frame-by-frame sequentially, where each new frame depends on previously generated frames.

Autoregressive zooming

Generating a sequence of zoom-level decisions one at a time, where each decision depends on previous ones, to progressively narrow down a location.

AutoRound

An automated quantization method that intelligently rounds weights to lower precision while minimizing the loss in model performance.

AutoRound Quantization

Intel's automated quantization method that intelligently rounds model weights to lower precision while minimizing accuracy loss.

B

Backbone

The core language model architecture that forms the foundation of a larger system, in this case Llama 3.

Backbone Architecture

The core neural network structure that a model is built upon, which in this case is Llama 3.

Backbone Model

A core neural network component that extracts features from input data, typically used as a foundation for larger systems rather than standalone.

Backchanneling

Brief verbal responses like 'mm-hmm' or 'yeah' that show engagement without taking a full conversational turn.

Backdoor Attack

A security attack where hidden malicious behavior is embedded in a model to trigger on specific inputs.

Backpropagation Through Time

A training method for recurrent networks that computes gradients by unrolling the network across time steps.

Backtesting

Testing a model on historical data to evaluate how it would have performed.

Backtracking

Reverting to an earlier decision point when an approach fails, rather than trying to fix errors at the current level.

Backtracking Markov Chain

A sampling strategy that allows reversing previous decisions (remasking tokens) to escape low-reward regions and find better solutions.

Backward Transfer

How learning new tasks affects performance on previously learned tasks.

Balanced Accuracy

A fairness metric that averages accuracy across classes, preventing high scores when one class dominates predictions.

Bandit Feedback

Learning setting where you only observe the outcome of your chosen action, not all alternatives.

BART

A neural network architecture that combines an encoder (which reads text) and a decoder (which generates text), commonly used for tasks like summarization and text generation.

BART Architecture

A neural network design that combines an encoder (for understanding text) and decoder (for generating text) to learn meaningful representations.

Base Architecture

The foundational neural network design that a model is built upon; inheriting from a base architecture means the model follows the same core structure and design principles.

Base Language Model

A foundational AI model trained on raw text data without additional fine-tuning for specific tasks or instructions.

Base Learners

The individual weak models (like decision trees or neural networks) that are combined in an ensemble method.

Base Model

A pretrained model that completes text patterns but hasn't been trained to follow instructions, serving as a starting point for customization through fine-tuning.

Base Model Size

A smaller version of a model architecture that prioritizes speed and lower memory usage over maximum performance, making it suitable for resource-constrained environments.

Base Pretrained

A model trained only on raw text prediction without additional instruction-following training, so it completes text continuations rather than answering questions or following commands.

Base Pretrained Model

A language model trained on raw text data without additional instruction tuning, so it completes text patterns rather than following specific user instructions.

Baseline Model

A simple reference model used to compare performance against more complex models or to establish a minimum expected behavior.

Baseline Policy

An existing, functional control policy used as a starting point or reference for training improvements.

Basin of Attraction

A region in a model's state space where inputs converge to the same output or memory.

Basis Functions

Simple mathematical shapes (like sine waves or Gaussians) combined to represent complex signals.

Batch Distillation

Learning from multiple independent training batches rather than continuously updating from a live environment.

Batch Effects

Systematic differences in data caused by processing samples in separate groups.

Bayes-Nash Equilibrium

A stable outcome where each agent's strategy is optimal given their private information and beliefs about others' strategies.

Bayesian Bootstrap

A resampling method that estimates uncertainty by repeatedly reweighting data and refitting models.

Bayesian Decision Theory

Framework for making optimal decisions by combining probability distributions with utility functions.

Bayesian Filters

Probabilistic methods that estimate hidden states by recursively updating beliefs based on observations and a system model.

Bayesian Fusion

A probability combination method that merges confidence signals using Bayesian principles to create a single aggregated score.

Bayesian Incentive Compatible (BIC)

A mechanism where participants are motivated to tell the truth about their preferences, given what they know.

Bayesian Inference

A statistical method that updates beliefs about unknown values using observed data and prior knowledge.

Bayesian Linguistic Belief State

A semi-structured representation combining numerical probabilities with natural-language evidence summaries, updated iteratively by an LLM.

Bayesian Neural Networks

Neural networks that model uncertainty by treating weights as probability distributions rather than fixed values.

Bayesian optimization

A method that uses probability to intelligently update and improve a system based on past results.

Bayesian Persuasion

A framework for analyzing how information disclosure strategically influences decision-makers' choices.

BCE Loss

Binary Cross-Entropy loss, a training objective commonly used for relevance scoring tasks where the model learns to predict whether a query-document pair is relevant or not.

Beam Search

A decoding algorithm that keeps the top-k most likely candidate sequences at each step, balancing quality and computational cost.

Bee Equation

A mathematical model describing how honeybee swarms reach consensus on nest sites through recruitment and inhibition.

Behavior Cloning

Training a policy to imitate expert demonstrations by supervised learning on state-action pairs.

Behavior Latents

Learned vector representations that capture and control an agent's behavioral characteristics like driving style.

Behavior Trees

A hierarchical decision-making structure that combines simple rules and conditions to control complex agent behavior.

Behavioral Audit

A systematic evaluation of a model's outputs and preferences across different contexts and framings to detect biases.

Behavioral Collapse

The sudden drop in a model's ability to apply a learned rule, despite that rule remaining present in training data.

Behavioral Diversity

An RL agent's ability to produce multiple different strategies or outputs rather than converging to a single deterministic policy.

Behavioral Feedback

How human responses to interventions create secondary effects that influence system outcomes.

Behavioral Probe

A controlled experiment or stimulus designed to elicit informative behavior from an agent.

Behavioral Simulation

Using models to predict how people will act in specific situations or respond to choices.

Behaviour Cloning

Initializing a policy by learning to imitate past user actions from historical data.

Belief Change

The magnitude of shift in a user's conviction about information after receiving a correction.

Belief Space

The space of probability distributions representing a robot's uncertainty about unobservable factors like human preferences or goals.

Belief State

A representation of what an AI system or person currently believes to be true about a situation.

Belief-Desire-Intention (BDI) Model

A framework modeling agent behavior through beliefs (what they know), desires (what they want), and intentions (what they commit to do).

Bellman Equation

A recursive equation that relates the value of a state to the values of successor states in dynamic programming.

Bellman Operator

A mathematical operator that updates value estimates based on immediate rewards and future value predictions.

Benchmark

A standardized test suite used to measure and compare model performance on specific tasks.

Benchmark Dataset

A standardized set of test problems used to measure and compare the performance of different algorithms or models.

Benchmark Harness

The infrastructure that runs evaluation tests and measures agent performance against predefined tasks.

Benchmarkless Comparative Safety Scoring

Comparing model safety when no labeled benchmark exists for the specific language, domain, or regulatory context.

Benign Overfitting

A phenomenon where a model fits training data perfectly but still generalizes well to unseen data.

Bernoulli Prediction Head

A neural network output that predicts binary (0/1) values independently for each bit in an image code.

BERT

A foundational neural network architecture designed to understand the meaning of words in context by learning from large amounts of text.

BERT Architecture

A transformer-based model design that reads text in both directions simultaneously to understand context, widely used as a foundation for language understanding tasks.

BERT Encoder

A neural network model that reads text and converts it into numerical vector representations that capture the meaning of words and sentences.

BERT Model

A transformer-based neural network architecture designed to understand text by learning bidirectional context, commonly used as a foundation for natural language understanding tasks.

BERT-Based

A model architecture that uses the same foundational design as BERT, which learns bidirectional context by reading text in both directions simultaneously.

BERT-Based Model

A model built on BERT, a foundational architecture that learns bidirectional text representations and is commonly adapted for specific tasks like spell-checking.

BERT-Style Architecture

A neural network design based on the BERT model that uses transformer layers to understand relationships between words in text by looking at context from all directions.

BERT-Style Encoder

A transformer-based model architecture that reads text bidirectionally to understand context and produce meaningful representations of words and sentences.

BERT-Tiny

A heavily compressed version of the BERT language model with far fewer parameters, designed for fast inference on resource-constrained devices.

BERTopic

Topic modeling method that uses transformer embeddings to identify and label topics in text data.

Best-of-N Sampling

A decoding strategy that generates N candidate responses and selects the one ranked highest by a reward model.

Beta Release

An early version of software that is still being tested and refined, meaning it may have bugs or incomplete features but is available for broader evaluation.

Betti Number

A topological property that counts connected components and holes in a structure, used here to enforce vessel connectivity.

BF16

A 16-bit floating-point format that balances precision and memory efficiency, commonly used for training and deploying large language models.

BF16 Format

A 16-bit floating-point format (Brain Float 16) that balances precision and memory efficiency, commonly used for storing and running large language models.

BF16 Precision

A 16-bit numerical format that balances memory efficiency with numerical stability, using fewer bits than standard 32-bit floats while maintaining training and inference quality.

BFloat16 (BF16)

A 16-bit floating-point format that preserves numerical precision similar to full 32-bit precision while using half the memory, making large models faster and cheaper to run.

Bi-Encoder

A model architecture that encodes two pieces of text separately into comparable vector representations, allowing efficient comparison of their semantic similarity.

Bi-level Optimization

An optimization approach with two nested loops: an inner loop optimizing fast weights and an outer loop optimizing the main model parameters.

Bias Evaluation

Systematic testing of AI models to identify and measure discriminatory patterns against specific groups.

Bias Mitigation

Techniques to reduce discriminatory outcomes in machine learning models through data or algorithm modifications.

Bias Propagation

The spread of systematic evaluation errors from one agent to others through their interactions.

Bias-Boundedness

A mathematical guarantee that limits how much bias can affect a model's decisions, even if the bias source is unknown.

Bias-Sensitive Regions

Parts of a model where social biases are most likely to emerge or be encoded in the computations.

Bias-Variance Decomposition

Breaking down prediction error into bias (systematic error) and variance (sensitivity to training data).

Bid-Aware Decoding

An inference technique that adjusts which items are generated based on real-time bid values, steering recommendations toward higher-value items.

Bidirectional Attention

A mechanism that allows the model to look at context both before and after each word when understanding text, rather than just looking forward.

Bidirectional Context

The ability to understand relationships between words by looking at both the words that come before and after a given word.

Bidirectional Generation

The ability to generate text or code by considering context from both directions (before and after a gap), rather than only generating left-to-right.

Biencoder

A neural network architecture that encodes two separate pieces of text independently and compares them to measure semantic similarity, commonly used for matching and retrieval tasks.

Big-M Constant

A large coefficient used in MILP formulations to enforce logical constraints; larger values make the relaxation weaker and solving slower.

BigBird-Pegasus Architecture

A transformer-based model architecture designed to handle very long text sequences efficiently by using sparse attention patterns instead of processing every word pair.

Bilevel Optimization

An optimization framework with two hierarchical levels where upper-level decisions constrain lower-level optimization problems.

Bilinear Decomposition

A factorization where value and policy functions are expressed as products of goal-conditioned coefficients and learned basis functions.

Bilingual

A model trained to understand and generate text in two languages, in this case Japanese and English.

Bilingual Model

A language model trained to understand and generate text in two languages with comparable fluency.

Bilingual Vocabulary

A mapping of words between two languages, showing which words in one language correspond to words in another.

BiLSTM (Bidirectional LSTM)

A recurrent neural network that processes text in both forward and backward directions to capture context from both sides of each word.

Bimanual Manipulation

Robot control using two arms simultaneously to perform coordinated tasks.

Bimodal Encoder

A model that processes two different types of input (in this case, code and natural language) and converts them into a shared representation space.

Binary Routing

A decision mechanism where neurons act as on/off switches to direct data through different computational paths.

Binding Problem

The challenge of determining which visual features belong to the same object in a scene.

Biomedical Corpus

A large collection of medical and scientific texts (like research papers and journals) used to train the model on domain-specific language and concepts.

Biomedical NLP

Natural language processing techniques applied specifically to medical and biological text, such as extracting drug names or identifying disease mentions from research papers.

Biomedical Reasoning

The ability to understand and work with scientific concepts in biology and medicine, such as drug interactions and molecular structures.

Biomedical Text

Written content from medical and life sciences domains, including clinical notes, research papers, and healthcare documentation.

Biomedical Vocabulary

Specialized medical and scientific terms and concepts that the model has learned to understand from training on medical literature.

Biosecurity

Protecting against misuse of biological research and AI in harmful ways.

Biosignal

Electrical or physical signals produced by the body, such as heart rhythms or brain waves.

Bird's-Eye View (BEV)

A top-down 2D representation of a 3D scene, showing spatial layout as if viewed from above.

Birkhoff Polytope

The mathematical space of all doubly stochastic matrices; parameterizing this space exactly is the core challenge this paper addresses.

Bit Depth

The number of bits used to represent each number in a model; lower bit depths (like 3-bit) create smaller files but may lose some accuracy compared to higher bit depths.

Bit Precision

The number of bits used to represent each number in a model; lower bit precision (like 3-bit) means smaller file size but potentially less accurate calculations.

Bit Tokenization

Encoding individual bits as separate tokens in a model's vocabulary to preserve fine-grained binary structure.

Bit-Width

The number of bits used to represent each number in a model; lower bit-widths (like 6-bit) use less memory but may reduce precision compared to higher bit-widths.

Bit-width Adaptive Selection

Automatically choosing the optimal number of bits for quantizing different parts of a model based on their importance.

Bits-per-byte (BpB)

A compression metric measuring how many bits are needed to encode each byte of text.

Black-box Testing

Evaluating a system's behavior by observing inputs and outputs without access to internal model structure or weights.

Blind-Spot Mass

A measure of uncertainty in an agent's decision-making at a given state—how much of the decision space lacks statistical support from training data.

Blinded Evaluation

Assessment where evaluators don't know which version or source produced the item being judged.

Block Attention Mechanism

An attention technique that processes groups of items together to improve efficiency and capture relationships between them.

Block Floating Point (BFP)

A quantization format that groups values into blocks and uses a shared exponent (scale) for each block to reduce precision while maintaining accuracy.

Block Output Embeddings

Internal vector representations produced by a state space model's processing blocks that encode information about token sequences.

Block Scales

Scaling factors computed for groups of values in low-precision formats to maintain numerical accuracy.

Block-Diffusion Language Model

A language model that generates multiple tokens in parallel using diffusion, then refines them iteratively.

Block-Scaled Quantization

A quantization method that divides values into groups and applies a shared scale factor to each group.

Blockwise Decoding

A decoding strategy where multiple tokens are generated in parallel blocks rather than one token at a time.

Blueprint Generation

Creating a high-level plan of formally stated definitions, lemmas, and their dependencies before attempting to prove them.

Blueprint Refinement

Iteratively updating the global proof plan when individual lemmas fail, rather than backtracking within a single proof path.

BM25

A ranking function that scores document relevance based on term frequency and document length normalization.

Body-frame Velocity

Movement commands relative to the drone's own orientation, rather than a fixed world direction.

Bose-Einstein Condensate (BEC)

A quantum state of matter where particles occupy the same quantum state at very low temperatures.

Boundary Enforcement

Mechanisms that prevent an LLM from crossing defined limits in reasoning or behavior.

Boundary Uncertainty

Flagging predictions for instances near decision boundaries where the model is less confident.

Bounded Latency

A guaranteed maximum time delay for system operations, critical for safety-critical real-time control.

Bounding Box

A rectangular coordinate set that marks the exact location and size of detected text or objects within an image.

Bradley-Terry Model

A statistical model that ranks items based on pairwise comparison outcomes, commonly used for leaderboards.

Brainstorming Augmentation

Using AI to enhance the exploratory ideation phase of research rather than automating solution design.

Branching Factor

The average number of possible moves available at each decision point in a game.

Branching Score

A metric combining token uncertainty and policy likelihood gains to identify high-value decision points for exploration.

Breakpoint

A marker in code where a debugger pauses execution so you can inspect the program state.

Bregman Divergence

A generalized distance measure defined by a convex function, used to replace Euclidean geometry in optimization algorithms.

Brier Skill Score

A metric measuring forecast accuracy that compares a model's predictions to a baseline (like random guessing).

Broken Symmetry

A situation where the underlying physics has symmetry, but observations reveal a preferred direction or asymmetry due to measurement constraints.

Brownian Path

A random trajectory representing continuous random motion, used to model noise in stochastic processes.

Budget Forcing

A reinforcement learning technique that constrains model outputs to stay within a token budget, reducing response length while maintaining accuracy.

Budget-Aware Exploration

Constraining an agent's resource usage (compute, API calls, time) while it searches for solutions.

Byte-Level Tokenization

Breaking text into individual bytes (raw character codes) rather than words or subwords, which allows the model to handle any text without a predefined vocabulary.

Byzantine Robustness

The ability of a system to function correctly even when some participants behave maliciously or unpredictably.

C

Cache Invalidation

Loss of cached computation when prompt structure changes, requiring recomputation of affected tokens.

Cache Mapper

A component that aligns and calibrates KV caches from different sources into a unified format.

Calibrated Outputs

Model predictions that are numerically meaningful and correspond to real-world values (like aesthetic scores), rather than abstract relative rankings.

Calibration

Adjusting a model's predictions using held-out data to correct for systematic biases or distribution differences.

Calibration Time

The time required to adjust quantization parameters for a model using a small dataset before deployment.

Camera Pose Estimation

Determining the position and orientation of a camera in 3D space relative to a scene.

Camera-Centric Action

Robot actions expressed relative to the camera's local coordinate frame rather than the robot's base frame.

Canonical Correlation Analysis

A statistical technique that finds the strongest correlations between two sets of variables by discovering shared patterns.

Capability Elicitation

Training process designed to extract or develop specific abilities from a model, like reasoning or tool use.

Capacity Region

The set of all arrival rate vectors that a network can sustain without queues growing unboundedly.

Capacity Scaling

How the number of storable associations grows with the size of the memory matrix or system parameters.

Capital Market Assumptions

Forecasts of future returns, volatility, and correlations for different asset classes used to guide investment decisions.

Capsule Neural Networks

Neural networks with capsule units that learn hierarchical relationships and spatial properties better than traditional convolutional layers.

CART (Classification and Regression Trees)

A standard algorithm for building decision trees using binary splits and impurity-based criteria.

Cascade Architecture

A system where one model's output feeds directly into another model as input, like ASR output going to translation.

Cascade Data

Sequential observations of how outcomes spread or evolve through a system, like infection waves or adoption patterns.

Cascaded Cross-Attention

A mechanism that sequentially combines information from multiple sources (global context, object details, skill knowledge) to guide model decisions.

Cascaded Pipeline

Sequential processing where output from one stage feeds into the next.

Cascaded ROI-Narrowing

A strategy where each model focuses on progressively smaller regions of interest to improve accuracy.

Cascaded Routing

A multi-stage process that progressively assigns incidents to the correct business team or service owner.

Case Sensitivity

The model's ability to distinguish between uppercase and lowercase letters as meaningful differences, treating 'Москва' and 'москва' as separate tokens with different meanings.

Case-Insensitive (Uncased)

A model that treats uppercase and lowercase letters as identical, so 'Apple' and 'apple' are processed the same way.

Case-Sensitive

The model treats uppercase and lowercase letters as distinct, allowing it to recognize proper nouns and maintain capitalization distinctions.

Cased Text

Text processing that preserves the distinction between uppercase and lowercase letters, treating 'Apple' and 'apple' as different tokens.

Cased Text Handling

The model's ability to distinguish between uppercase and lowercase letters, making it sensitive to proper nouns and capitalization patterns that carry meaning.

Casimir Operator

A mathematical operator that characterizes the properties of a symmetry group; used here to encode nuclear symmetries as neural network features.

Catastrophic Forgetting

When a model loses its original knowledge while learning a new task, like overwriting old skills.

Causal Generative Model

A model that learns causal relationships between variables and can answer observational, interventional, and counterfactual questions.

Causal Identification

The ability to determine true cause-and-effect relationships from data, typically guaranteed by randomization.

Causal Importance

A measure of how much a component directly influences the final output, not just correlates with it.

Causal Inference

Determining whether a treatment actually caused an outcome, not just whether they're correlated.

Causal Intervention

Deliberately modifying a model's internal features to measure their direct effect on outputs.

Causal Language Model

A model that predicts the next word in a sequence by only looking at previous words, not future ones, making it suitable for text generation.

Causal Language Modeling

A training approach where the model predicts the next word based only on previous words, commonly used for text generation tasks.

Causal Reasoning

Understanding cause-and-effect relationships rather than just statistical correlations in data.

Causal Survival Forests

A machine learning method that estimates personalized treatment effects from survival data using tree-based models.

CC-BY-4.0 License

A permissive open-source license that allows anyone to use, modify, and distribute the model as long as they give credit to the original creator.

CC-BY-NC-4.0 License

A Creative Commons license that allows free use and modification of the model for non-commercial purposes only, with attribution required.

CEGAR (Counterexample-Guided Abstraction Refinement)

A problem-solving technique that starts with a simplified version of a problem and refines it when solutions fail.

Ceiling Compression

A statistical phenomenon where most scores cluster near the maximum possible value, reducing the ability to distinguish between different quality levels.

Ceiling effect

When a benchmark becomes too easy and models achieve near-perfect scores, making it impossible to compare their true abilities.

Cell Complex

A topological structure built from cells of varying dimensions (vertices, edges, faces, volumes) that generalizes graphs and meshes.

Censoring

Training an AI model to refuse or provide false information about certain topics.

Centered Kernel Alignment (CKA)

Metric that measures structural similarity between representations by comparing their kernel matrices.

Central Limit Theorem

A statistical principle stating that the average of many independent samples approaches a normal distribution.

Certificate-Bound Authority

Access control where permissions are cryptographically certified and must be validated before execution, not just during planning.

Chain-of-Thought

A reasoning technique where an AI model shows its step-by-step thinking process before arriving at a final answer, making its logic transparent and verifiable.

Chain-of-Thought Reasoning

A technique where a model works through a problem step by step, showing its reasoning process before arriving at a final answer.

Chance-Constrained Reinforcement Learning

RL approach that enforces probabilistic constraints on outcomes rather than hard guarantees.

Channel Circuit

A quantum circuit composed of quantum channels (operations that map quantum states to quantum states) rather than unitary gates alone.

Channel State Information (CSI)

Raw wireless signal data that describes how a Wi-Fi signal changes as it travels through space and bounces off objects.

Channel-Aware Representation

Learning embeddings that understand relationships between different sensor channels using their textual descriptions.

Channel-wise Affine Transform

A learnable operation that scales and shifts different feature channels independently in a neural network.

Channel-wise Decay

Applying different forgetting rates to different feature channels in a neural network, allowing selective memory retention.

Channel-Wise Quantization

A quantization approach that applies different compression settings to different channels or groups within a model layer, helping preserve quality better than applying the same compression uniformly.

Chaotic Dynamics

Systems where small changes in initial conditions lead to drastically different outcomes, making long-term prediction extremely difficult.

Character Consistency

The ability of a model to maintain a character's voice, personality, and backstory throughout a conversation without contradicting itself.

Character Error Rate (CER)

A metric measuring the percentage of characters incorrectly recognized by an OCR system.

Character Voice

A model's ability to maintain distinct, consistent personality and speech patterns for different characters within a story.

Character-Level Processing

Processing text one character at a time rather than by words, which is useful for catching individual character errors in languages like Chinese.

Chart-Grounded Reasoning

The ability to extract information from visual charts and perform logical reasoning tasks based on what the chart displays.

Chat Model

A language model specifically trained to have natural back-and-forth conversations with users rather than just completing text.

Chat-Optimized

A model specifically trained and tuned to excel at conversational interactions rather than other tasks like analysis or reasoning.

Chat-Tuned

A model optimized through training to excel at multi-turn conversations and dialogue, rather than single-turn text completion.

Checkpoint

A saved snapshot of a model's weights and state at a specific point during training, allowing training to resume or the model to be evaluated at that stage.

Checkpoints

Saved snapshots of a model at different stages of training, allowing researchers to study how the model's behavior changes as it learns.

Chunk-Based Processing

Breaking long sequences into smaller segments and processing them sequentially while maintaining state between chunks.

Chunk-Parallel Training

A training technique that divides sequences into chunks and processes them in parallel to speed up computation.

Chunking

The process of breaking large documents into smaller pieces so a model with a limited context window can process them separately.

Circuit Mining

Identifying and isolating specific subsets of neural network components that perform a particular computation.

Citation Graph

A network representing which papers cite which other papers, showing the flow of scientific influence and prior work.

Citation Networks

A graph structure showing how research papers reference each other, used to understand relationships and influence between scientific works.

Citation Premium

An advantage in receiving more citations for work published in certain venues compared to others.

Citation Tracking

The ability to identify, reference, and maintain accurate attribution to the sources used when generating a response.

Claim Frequency

The number of insurance claims expected per policy or geographic area over a time period.

Class Activation Mapping (CAM)

A technique that generates visual heatmaps showing which image regions a neural network uses to make predictions.

Class Imbalance

When training data has unequal numbers of examples across categories, with some classes having far fewer samples than others.

Class Incremental Learning

Learning to recognize new object classes over time while maintaining performance on previously seen classes.

Class-Level Code Synthesis

Generating complete, structured classes with multiple methods and internal dependencies from a specification.

Class-Weighted Cross-Entropy Loss

A loss function that penalizes misclassification of rare classes more heavily, useful when training data is imbalanced.

Classical Test Theory

A statistical framework for designing and validating tests that measure psychological constructs reliably.

Classifier

A machine learning model trained to assign input data into predefined categories or labels.

Classifier Guidance

Steering diffusion generation toward a target class using a noise-conditioned classifier during sampling.

Classifier Retraining

A two-stage approach where a model first learns representations, then retrains just the final classification layer on balanced data.

Classifier-Free Guidance (CFG)

A technique that steers diffusion models toward desired outputs by comparing conditional and unconditional predictions.

Clinical Alignment

How well an LLM's medical communication matches established clinical standards and physician practices.

Clinical Ethics

The study of moral principles and values that guide medical decision-making and patient care.

Clinical Event Tokenization

Converting clinical information (diagnoses, medications, procedures) into discrete tokens that a model can process.

Clinical Forecasting

Using historical patient data to predict future health outcomes, disease progression, or treatment responses.

Clinical Language Understanding

The ability to accurately interpret and reason about medical terminology, patient symptoms, and healthcare documentation.

Clinical NLP

Natural language processing applied to medical and healthcare text, such as extracting diagnoses or findings from doctor's notes and radiology reports.

Clinical Reasoning

The ability to analyze medical information, connect symptoms to conditions, and make logical healthcare decisions based on evidence.

Clinical Validation

The process of confirming that an AI system's outputs meet clinical standards and are safe for use in healthcare.

CLIP (Contrastive Language-Image Pre-training)

A model trained on image-text pairs to create shared vector representations for both images and text.

CLIP Architecture

A neural network design that learns to match images and text by training them to have similar representations, enabling tasks like image search and visual understanding.

Closed-Form Head Adaptation

Rapidly adjusting a model to new tasks using direct mathematical solutions rather than iterative training.

Closed-Form Solution

A mathematical formula that directly computes an answer without iterative learning or optimization.

Closed-Loop Control

A system that continuously adjusts its behavior based on feedback from its actions and outcomes.

Closed-Loop Correction Law

A feedback control rule that adjusts actions based on observed state to correct deviations from a nominal plan.

Closed-Loop Policy

A control strategy where the robot observes its current state and adjusts actions based on feedback, rather than executing a fixed sequence.

Clustering Projector

A network component that projects learned representations into a space suitable for clustering tasks.

Co-activation

When multiple features in a neural network are active at the same time, often because they represent related concepts.

Co-Condenser Architecture

A neural network design that jointly trains two components together to produce better embeddings by learning from both query and document representations simultaneously.

Co-evolution

Simultaneous optimization of interdependent components that improve each other iteratively.

Co-failure rate

The probability that all models in an ensemble produce incorrect answers on the same query.

Co-teaching

A training strategy where two networks learn together, each selecting clean samples for the other to reduce noise impact.

Co-training

A training framework where two models or components iteratively improve each other by learning from their complementary strengths.

Coalition-proof equilibrium

An equilibrium where no group of players can jointly deviate and all benefit, even if they coordinate.

Coarse Correlated Equilibrium

A game theory solution where no player benefits from unilaterally deviating from a recommended strategy.

Coarse-to-Fine Feature Encoding

A strategy that first captures broad patterns, then progressively refines details for better understanding.

Coarse-to-fine reasoning

A sequential decision-making approach that starts with broad estimates and progressively refines them to higher precision.

Coarse-to-Fine Training

A curriculum learning approach that starts with learning simple components before progressing to optimizing complex global structures.

Code Clone Detection

Identifying sections of code that perform the same function, even if written differently or in different programming languages.

Code Completion

The ability to automatically suggest or generate the next lines of code based on what the programmer has already written.

Code Coverage

Percentage of program code executed by a test suite, measured by lines or branches.

Code Editing

A specialized task where a model modifies or refines existing code rather than creating new code, focusing on precision and surgical changes.

Code Embedding

A specialized embedding designed specifically for source code that understands programming syntax and semantics, enabling tasks like code search and finding similar code snippets.

Code Generation

The ability of a model to write, complete, or suggest programming code based on prompts or partial code input.

Code Infilling

A technique where a model fills in missing or incomplete code in the middle of existing code, using both the code before and after the gap as context.

Code Pretraining

Training a language model primarily on source code and technical documentation rather than general text, making it specialized for coding tasks.

Code Quality

A measure of how well code meets standards for readability, maintainability, and correctness.

Code Reasoning

The ability of a model to understand, analyze, and make logical inferences about source code and programming logic.

Code Refactoring

Restructuring existing code without changing its external behavior to improve readability and maintainability.

Code Representation

Encoding structured information (like circuit designs) as code-like syntax that language models can more easily learn and generate.

Code Review

Process of examining code changes for bugs, quality issues, and adherence to standards before merging.

Code Synthesis

Automatically generating executable code (like plotting commands) from high-level specifications or natural language descriptions.

Code Understanding Verification

Techniques to confirm a student actually understands the code they wrote, not just copied it.

Code-Focused Language Model

A language model specifically trained on programming code to excel at tasks like code generation, completion, and understanding.

Code-mixing

Using multiple languages together in the same text, common in multilingual communities.

Code-Specialized

A model trained with a focus on understanding and generating programming code across multiple languages.

Code-Specialized Language Model

A language model trained specifically on programming code and related tasks, optimized to understand and generate code better than general-purpose models.

Code-Specialized Model

A language model trained specifically on programming code and code-related tasks rather than general text.

Code-Switching

The ability to naturally mix two languages within the same text or conversation, switching between them based on context rather than treating them as separate.

Codebook

A lookup table mapping compressed values back to original data; avoided in this approach to save memory.

Codebook Utilization

The percentage of available discrete tokens in a codebook that are actually used during training or inference.

Coded Language

Indirect linguistic expressions that obscure sensitive meanings to evade detection or moderation systems.

Coding Agent

An AI system that autonomously writes, debugs, and executes code to solve tasks without human intervention.

Coefficient of Variation

A normalized measure of variability that expresses standard deviation as a percentage of the mean, useful for comparing spread across different scales.

Cognate

Words in different languages that share a common historical origin and similar meaning.

Cognate Detection

Identifying words in different languages that share a common origin and similar meaning.

Cognitive Architecture

A computational framework that models how an intelligent agent perceives, reasons, and acts in the world.

Cognitive Colonization

The process by which AI systems embed external interests into human decision-making architecture in ways users cannot easily perceive or resist.

Cognitive Gating

A mechanism that gates speculative execution based on model confidence, without requiring ground-truth labels.

Cognitive Graph

A structured representation of an agent's reasoning process that tracks concepts, relationships, and how understanding evolves over time.

Cognitive Heuristics

Mental shortcuts that simplify decision-making but can lead to systematic biases in judgment.

Cognitive Load Theory

A psychological framework explaining how working memory capacity affects learning and task performance.

Cognitive Support

AI assistance that helps users think through problems and refine their goals rather than just executing stated requests.

Cognitive Taxonomy

A classification system that organizes learning objectives or tasks by their intellectual complexity, from simple recall to advanced analysis.

Cohen's d

A standardized measure of effect size that quantifies the difference between two groups in units of standard deviation.

Coherence

The quality of maintaining consistent meaning and logical flow across multiple sentences or exchanges in a conversation.

Coherence Budget

The maximum circuit depth a quantum computer can execute before quantum information is lost to noise and decoherence.

Coherent Quantum Memory

The ability to preserve quantum information in a superposition state between measurements without collapsing it.

ColBERT Architecture

A neural retrieval model design that stores multiple token-level embeddings per document and uses late interaction to achieve higher retrieval accuracy than single-vector approaches.

Cold-Start Stalling

When a model trained with sparse rewards gets stuck early because initial success probability is too low to learn from.

Collaborative Filtering

A technique that predicts user preferences by analyzing patterns from similar users' behavior.

Collective intelligence

Emergent problem-solving and decision-making capability that arises from coordinated interaction of multiple agents.

Collinearity

When input features are highly correlated with each other, making it difficult to isolate individual feature effects on predictions.

Combinatorial Optimization

Finding the best arrangement or selection from a finite set of possibilities, like packing objects efficiently.

Command Space

The interface or set of instructions that a high-level planner sends to a low-level controller to specify desired robot behavior.

Common Ground

Shared beliefs and mutually recognized facts that enable effective collaboration between people or AI systems.

Common Sense Reasoning

The ability of a model to understand and apply everyday logic and practical knowledge about how the world works.

Communication Efficiency

Minimizing the amount of data exchanged between devices or servers during distributed training.

Community Fine-Tune

A model variant created and shared by the community rather than the original model creators, often with custom modifications.

Compact Model

A smaller language model designed to use fewer computational resources while still performing useful tasks.

Competence-Aware Verification

Evaluating reward quality relative to the current policy's skill level, recognizing that reward rankings change as the policy improves.

Competency Questions

Natural language questions that define what an ontology should be able to answer, used to specify system requirements.

Compiler Optimization

Transformations that improve code performance, memory usage, or other properties without changing program behavior.

Complete Positivity

A quantum physics constraint ensuring operations preserve valid quantum states and probabilities.

Completion Mode

A text generation approach where the model continues or completes text from a given prompt, rather than engaging in back-and-forth conversation.

Completion Prompt

A prompt style where you provide the beginning of text and the model continues it, rather than asking a direct question.

Complex Reasoning

The ability to work through multi-step problems, analyze nuanced information, and draw logical conclusions.

Compliance Certification

Official verification that a service meets specific regulatory or security standards required by industries like healthcare or finance.

Compliance Certifications

Official verifications that a service meets specific security and regulatory standards (like HIPAA or SOC 2) required by certain industries.

Component Interaction Bias

Discrimination that emerges from how separate system components work together, not from individual parts alone.

Component-Based Architecture

A design pattern where UIs are built from reusable, self-contained pieces (components) that can be combined to create larger interfaces.

Compositional Generalization

Model's ability to understand new combinations of learned concepts.

Compositional Incoherence

When combining outputs from multiple components violates probability axioms, even if each component is individually valid.

Compositional Prompts

Text descriptions that specify multiple elements, their relationships, and spatial arrangements in the desired image.

Compositional Semantics

The principle that the meaning of a complex expression is built from the meanings of its parts and how they combine.

Compositionality

The ability to understand new combinations of concepts by learning how individual components combine.

Computational Budget

The amount of processing power and memory available to run a model, which determines how much computation can be performed.

Computational Complexity

The amount of computation (time and memory) required for an algorithm to solve a problem.

Computational Efficiency

The ability to deliver good results while using less processing power and memory than larger models.

Computational Exploration

Using code and algorithms to test mathematical hypotheses and discover patterns empirically.

Computational Footprint

The amount of memory, processing power, and time required to run a model; a smaller footprint means the model can run on less powerful hardware.

Computational Overhead

The extra processing power, memory, or time required to run a model, which impacts speed and resource consumption.

Computational Photography

Using algorithms and AI during image capture to enhance photos beyond what the camera sensor alone can achieve.

Compute Allocation

The strategic distribution of a model's processing power—in this case, spending more computational effort on thinking through problems rather than other tasks.

Compute Efficiency

How well a model performs relative to the computational resources (processing power and memory) required to run it.

Compute-Efficient

A model designed to run with minimal processing power and memory, making it practical for devices with limited resources.

Compute-in-Memory

Hardware architecture that performs computation directly within memory, reducing data movement bottlenecks.

Compute-Optimal

Achieving the best performance for a given amount of computational resources.

Computer Use

The ability for an AI model to interact with computer interfaces, navigate software applications, and execute actions on a user's behalf by understanding and responding to visual or textual representations of screens.

Concept Bottleneck Model (CBM)

An interpretable model that makes predictions by routing inputs through a layer of human-understandable concepts rather than opaque features.

Concept Drift

When the target concept or decision boundary changes over time during learning.

Concept Manifold

A low-dimensional geometric structure where related concepts are organized continuously, like a curved surface in high-dimensional space.

Concept Normalization

The process of mapping different textual expressions of the same idea to a single standardized representation, such as mapping 'MI' and 'myocardial infarction' to the same medical concept.

Condition Number

A measure of how difficult an optimization problem is; higher values mean slower convergence and more iterations needed.

Conditional Advantage Estimation

A reinforcement learning technique that estimates action value only within trajectories meeting specific conditions.

Conditional Coverage

A property where prediction set coverage guarantees hold for specific subgroups or conditions, not just on average across all data.

Conditional Entropy

A measure of uncertainty in predicted tokens given context; low entropy signals memorization, high entropy signals generalization.

Conditional Expected Distance

The average distance between selected and validation target embeddings within a cluster, used to rank training examples.

Conditional Generation

The ability of a model to generate output (like text) based on specific input conditions or prompts provided to it.

Conditional Mean

The expected value of an output given specific input conditions, used as a deterministic baseline prediction.

Conditional Misalignment

Misaligned behavior that only appears when inputs share features with the training data, while appearing safe on out-of-distribution prompts.

Conditional Neural Processes

A probabilistic model that learns to make predictions by conditioning on observed examples, useful for few-shot learning and uncertainty estimation.

Conditional Text Generation

The ability to generate text that follows specific conditions or constraints, rather than producing output freely.

Conditional Value-at-Risk (CVaR)

A risk metric that focuses on the worst-case outcomes rather than average performance, useful for safety-critical tasks.

Conditional Variational Autoencoder (CVAE)

A neural network that learns to generate new data matching specific conditions or constraints.

Conditioning

Guiding a generative model's output by providing additional input signals like pose or depth maps.

Conditioning Mechanism

A model component that takes an external parameter (like speed) and modulates the policy output based on that input.

Conditioning-based Refinement

Improving generated images by providing additional context or constraints during generation to guide the model.

Confidence Calibration

Ensuring a model's confidence scores accurately reflect its true probability of being correct.

Confidence Estimation

Assigning uncertainty scores to model predictions to identify outputs that may need human verification.

Confidence Intervals

Statistical bounds around predictions that quantify uncertainty; here used to identify when model predictions are unreliable.

Confidence Thresholding

A decoding strategy that stops refining tokens when model confidence exceeds a set threshold.

Confidence-based abstention

Refusing predictions when the model's confidence score is below a threshold.

Confidence-Based Decoding

A strategy that selects which tokens to generate next based on the model's prediction confidence, enabling adaptive and efficient generation.

Confidence-Driven Reinforcement Learning

Training a model using rewards based on how well its confidence scores match its actual correctness.

Confidence-Informed Self-Consistency (CISC)

Weighted majority voting where each candidate answer gets a confidence score from a critic model before selection.

Confirmation Bias

The tendency to seek or interpret information in ways that confirm existing beliefs or outputs.

Conflicts of Interest

Situations where an AI system has competing goals—like serving users well versus generating revenue for its creators.

Conformal Prediction

Method providing prediction intervals with statistical guarantees on coverage.

Conformational Control

The ability to direct a model to generate specific 3D shapes or structural states of proteins.

Conformational State

A distinct 3D shape or arrangement that a protein can adopt, often with different biological functions.

Conformational Transfer

Applying a learned conformational change from one protein to structurally similar proteins in the same family.

Confounding

When multiple variables are entangled, making it impossible to isolate the effect of one variable.

Confused Deputy Problem

When an agent misuses its elevated permissions to perform actions it shouldn't, tricked by user input.

Confusion matrix

A table showing how often a classifier correctly or incorrectly predicts each category, revealing systematic biases in predictions.

Conjugate Symmetry

A property of FFT where real-valued signals have redundant information due to symmetric complex conjugate pairs in frequency domain.

Consensus Architecture

A routing pattern where multiple neurons must agree (be mutually exclusive) to activate a particular processing path.

Conservative Baseline

A reference policy or set of actions known to be safe, used to measure how much riskier a proposed action is.

Conservative Regularizer

A penalty that prevents value estimates from being too optimistic about unseen actions in offline RL.

Consistency Training

A training method that encourages models to respond symmetrically to paired prompts representing opposing perspectives.

Consistency-Oriented Reasoning Alignment (CORA)

A method that ensures a model's reasoning process logically supports its final answer by adding consistency rewards during training.

Constitutional AI

A safety training approach that guides a model to behave according to a set of principles or rules, helping it generate more helpful and harmless responses.

Constrained Decoding

Restricting a model's token generation to a predefined set of allowed tokens during inference.

Constrained Generation

Text generation that must follow specific rules or constraints, such as producing output in a particular format or structure.

Constrained Reinforcement Learning

Training an AI system to maximize performance while respecting hard constraints (like deadlines or budgets).

Constraint Satisfaction

Finding solutions that satisfy a set of constraints, used here to resolve conflicts between inferred events.

Constraint Solver

A tool that finds valid solutions to problems with multiple constraints, used here to verify mechanical assembly feasibility.

Constraint-Based Safety

Improving AI safety by restricting what actions an agent can take, rather than trying to detect bad behavior after it happens.

Constraint-Guided Execution

Validating each step of a plan by checking outputs against automatically derived constraints based on task requirements.

Constraint-Guided Repair

Fixing errors in reasoning by making minimal changes that satisfy logical or evidential constraints.

Construct Validity

Whether a study actually measures the real concept it's supposed to test, not something else.

Constructional Semantics

The study of how specific form-meaning pairings in language convey meaning beyond individual words.

Contact-Gating

A mechanism that activates learned corrections only when the robot is physically touching the object.

Contact-Rich Dynamics

Physical interactions where the robot frequently touches and manipulates objects, making control sensitive to small errors.

Contact-Rich Manipulation

Robot tasks where success depends critically on precise control of forces and contact interactions with objects.

Containerization

Packaging software and its dependencies into isolated, portable units that run consistently across different computing environments.

Content Filter

A model or system that screens text before or after generation to block unsafe, harmful, or policy-violating content.

Content Filtering

Safety mechanisms built into a model that prevent it from generating harmful, inappropriate, or restricted content.

Content Moderation

The process of reviewing and filtering text or other content to remove or flag material that violates policies or safety guidelines.

Content Restrictions

Safety guidelines and filters built into a model to prevent it from generating harmful, illegal, or unethical content.

Content Safety Classification

The task of automatically detecting and categorizing text that violates policies or could cause harm, such as hate speech, violence, or misinformation.

Content-addressable Memory

A memory system where data is retrieved by its content similarity rather than by a fixed address, enabling fuzzy matching.

Context Coherence

The ability to maintain consistent meaning and logical flow when processing long sequences of text or conversation.

Context Compression

Reducing the size of conversation history while preserving important information for efficient processing.

Context Consistency

A model's ability to maintain coherent understanding and recall of information across long passages of text without contradicting itself.

Context distillation

Transferring knowledge from interaction trajectories into model parameters by learning from contextual examples.

Context Erasing

Removing or suppressing the influence of specific text spans from an LLM's processed context after they've been cached.

Context Filtering

Retaining only relevant information from execution history to reduce noise and improve decision-making in subsequent steps.

Context Gathering

The process of collecting and organizing relevant information from history to answer specific questions or solve tasks.

Context Governance

Managing what information an agent can access and use to prevent hallucination and ensure relevance.

Context Length

The maximum amount of previous text a model can consider when generating its next output; longer context allows the model to maintain coherence over longer passages.

Context Management

Organizing and maintaining relevant information for AI decision-making.

Context Parallelism

A technique to process long sequences by distributing context across multiple devices or processing units in parallel.

Context Pollution

Irrelevant or noisy information degrading model performance in a given context.

Context Retention

A model's ability to remember and use information from earlier parts of a conversation or document.

Context Routing

A mechanism that selectively directs relevant learned patterns from one model component to another based on current needs.

Context Truncation

When an AI model's input context window fills up and earlier information is lost, requiring mechanisms to preserve key data.

Context Window

The maximum number of tokens a model can process in a single conversation or prompt.

Context-Adaptive

A system that adjusts its behavior based on the specific input or situation rather than using fixed, unchanging patterns.

Context-Aware Adaptive Router

A mechanism that selects only relevant evaluation criteria for each specific query to improve efficiency.

Context-Aware ASR

Speech recognition that uses surrounding information like conversation history to improve transcription accuracy.

Context-Aware Translation

Providing additional context (like original text or reasoning steps) to translation models to improve accuracy.

Context-Free Grammar (CFG)

A formal system of rules that defines which sequences of symbols are valid in a language.

Context-Heavy Agents

AI systems that maintain and reuse long conversation histories across multiple turns of interaction.

Context-Intensive Tasks

Problems requiring the model to extract and use large amounts of information from the input prompt to generate correct outputs.

Context-Specific Guidance

Help or instructions tailored to the current situation rather than generic pre-stored information.

Contextual Adaptation

Adjusting model behavior dynamically based on the specific input or context rather than using fixed settings.

Contextual Bandit

A learning algorithm that selects actions based on context and learns from feedback to improve future decisions.

Contextual Embeddings

Numerical representations of text that capture meaning based on surrounding context, rather than treating each word independently.

Contextual Invariance

The assumption that a model produces consistent outputs when a task is reformulated in contextually equivalent ways.

Contextual Pressure

Influence from surrounding information (like examples or previous actions) that pushes an agent away from its intended behavior.

Contextual Reasoning

Making decisions by considering how individual observations relate to and inform each other within a broader context.

Contextual Representation

A way of encoding text where the meaning of each word depends on the words around it, rather than being fixed for every occurrence.

Contextual Space

The intermediate representation space in a diffusion model where semantic and structural information is encoded.

Contextual Topic Modeling

Machine learning technique that identifies recurring themes in text while considering the surrounding context of words.

Contextual Trigger

A feature or pattern in input text that activates hidden misaligned behavior in a model, even when standard evaluations show the model is safe.

Contextual uncertainty

Uncertainty caused by changing conditions over time, like user preferences shifting.

Contextual Understanding

The ability of a model to interpret the meaning of words and phrases based on surrounding text, rather than treating each word in isolation.

Contextualized Token Embeddings

Vector representations of words that change based on surrounding context, capturing different meanings in different sentences.

Continual Fine-tuning

Incrementally updating a neural network on new data as it arrives, rather than retraining from scratch.

Continual Immune Learning

A process where an agent's defenses dynamically adapt and improve in response to new threats encountered at runtime.

Continual Learning

Training models to learn new tasks without forgetting previously learned ones.

Continued Pretraining

Further training a pretrained model on domain-specific data to specialize it for particular tasks.

Continuous Measurement

Real-time monitoring of a quantum system that produces a stream of measurement data used to update state estimates.

Continuous Representation

Encoding data as smooth, unquantized values rather than discrete tokens, preserving fine-grained temporal details.

Continuous Scoring

Generating probability-based continuous scores instead of discrete labels to provide fine-grained evaluation signals.

Contraction

A mathematical property ensuring a system's outputs converge to a stable state regardless of initial conditions.

Contractivity

A mathematical property ensuring that a system brings nearby states closer together over time, guaranteeing stability.

Contrastive Learning

A training technique that learns by comparing similar and dissimilar examples to create better representations.

Contrastive Loss

Training objective that pulls similar examples together and pushes different ones apart.

Contrastive retrieval

A method that learns shared embedding spaces by contrasting similar and dissimilar image pairs, then ranks candidates by similarity.

Contrastive Rubric Generation

Creating evaluation criteria by comparing gaps between teacher and model responses to identify what distinguishes good from bad outputs.

Contribution Decomposition

Breaking down a neural network's output into individual contributions from different neurons or neuron groups.

Control Codes

Special tokens added at the beginning of a prompt that tell the model what style, domain, or format to use for its output.

Control Tokens

Special tokens inserted into sequences to guide model behavior, such as signaling whether to show an ad or organic content.

Control-Barrier Function (CBF)

A mathematical tool that projects unsafe actions to safe ones, guaranteeing constraint satisfaction but potentially masking policy incompetence.

Controlled Benchmark

A standardized test where variables are carefully isolated to measure specific effects, like changing one visual attribute while keeping everything else the same.

Controller Synthesis

Automatically designing a decision-making system that controls when and how to execute actions.

ControlNet

A technique that adds spatial control to diffusion models by conditioning generation on aligned input maps (like depth or property masks).

Convection-dominated

Physics problems where fluid flow effects dominate over diffusion, creating sharp gradients and moving fronts.

Convergence

The point during training when a model's performance stabilizes and stops improving significantly, indicating it has learned the patterns in the data.

Convergence Analysis

Mathematical proof that an optimization algorithm reliably reaches a good solution and quantifies how fast it gets there.

Convergence Guarantees

Mathematical proofs that an algorithm will reach a correct solution under specified conditions.

Convergence Rate

How quickly an optimization algorithm approaches the optimal solution, typically expressed as a function of iterations.

Convergent Evolution

When different models independently learn similar features or representations from different training signals.

Conversational AI

AI systems designed to understand and respond to human language in natural, dialogue-like interactions.

Conversational AI Agent

An AI system designed to conduct multi-turn dialogue with users to accomplish specific tasks, like medical interviewing.

Conversational Assessment

Using dialogue with a chatbot or AI agent to probe and verify student understanding through questioning.

Conversational Coherence

The model's ability to maintain logical consistency and relevance across multiple turns of dialogue, making responses feel natural and connected.

Conversational Fluency

How naturally and coherently a model engages in back-and-forth dialogue, matching human conversation patterns.

Conversational Language Model

A model specifically trained to understand and generate natural dialogue, optimized for back-and-forth interactions rather than one-off text generation.

Conversational Model

A language model specifically trained and optimized to engage in multi-turn dialogue with users.

Convex Capacity Constraint

A physical limit on inventory levels described by a convex set, constraining what orders are feasible.

Convex Combination

A weighted sum of points where weights are non-negative and sum to one, representing a point within their geometric hull.

Convex Function

A function where any line segment between two points on the curve lies above the curve, ensuring a single global minimum.

Convex Optimization

Mathematical technique for finding the best solution to a problem with a single global optimum.

Convex Polytope

A geometric shape formed by the intersection of linear inequalities, with vertices representing extreme points.

Convolutional Operations

A technique that scans across input data using small filters to detect local patterns, commonly used in image processing but here applied to text for efficiency.

Cooperative Game

A game setting where agents work together toward shared objectives rather than competing against each other.

Coordinate Reference System (CRS)

The geographic coordinate system (e.g., latitude/longitude) used to define spatial locations and ensure consistency across operations.

Coordinate-Aware Representations

Internal model representations that explicitly encode spatial positions and coordinate information.

Coordination Games

Game theory scenarios where agents benefit from matching actions but may also benefit from strategic differentiation.

Copy-on-Write

An optimization where data is only copied when modified, allowing multiple references to share the same data until changes occur.

Core-Periphery Attention

An attention mechanism where peripheral tokens (patches) interact only through central core tokens, reducing computation.

Coreference Resolution

Identifying when different mentions in text refer to the same entity or concept.

Coreset Selection

Selecting a small subset of real samples from a large dataset that best represent the original data distribution.

Corpus

A collection of documents or text used as the knowledge base for retrieval in RAG systems.

Corpus-Discriminative Retrieval

Selecting query terms that best distinguish relevant documents from irrelevant ones in a specific corpus.

Correctness Gating

A filtering mechanism that validates whether a proposed solution is correct before allowing it to advance in a search process.

Corruption Robustness

A model's ability to maintain performance when input data is degraded (e.g., noise, blur, missing values).

Cosine Distance

A mathematical measure that compares how similar two embeddings are by calculating the angle between them, with values closer to 1 meaning more similar.

Cosine Similarity

A method of comparing two vectors based only on their direction, ignoring their magnitude, making it scale-invariant.

CosNet

A learnable activation function using cosine waves with adjustable frequency and phase to process data nonlinearly.

Cost-Aware Attack

An adversarial attack that accounts for the real-world cost or feasibility of modifying each feature.

Cost-Efficiency

The ability to deliver useful results while using fewer computational resources, reducing the expense of running the model.

Cost-Quality Tradeoff

The balance between inference cost (compute, latency) and answer quality that systems must optimize for.

Cost-Sensitive Learning

Training approach that weights errors differently based on their downstream impact or cost in the application domain.

CoT-MAE

A training methodology that combines chain-of-thought reasoning with masked autoencoder techniques to improve model understanding of text relationships.

Counterfactual Evaluation

Testing what would happen if you changed a strategy, without actually running the experiment in the real world.

Counterfactual Explanation

An explanation showing what input changes would alter a model's prediction to a different outcome.

Counterfactual Generation

Creating alternative scenarios showing what would happen if something were different (e.g., if an object didn't exist).

Counterfactual Negatives

Training examples where evidence is semantically related but contradicts the claim, testing if models truly use evidence.

Counterfactual Query

A question about what would have happened if a variable had taken a different value (e.g., 'what if the patient had received treatment?').

Counterfactual Reasoning

Reasoning about what would have happened under different actions or conditions than what actually occurred.

Covariance

A measure of how two variables change together; structured covariance means features are correlated in specific patterns.

Covariance Estimation

The process of learning or updating the statistical properties of measurement and process noise in a filtering system.

Covariance Matching

Aligning a model's sensitivity structure to match the statistical structure of task-irrelevant variations in data.

Covariate Shift

When the distribution of input data changes between training and real-world use, causing models to fail.

Coverage

The extent to which training data represents all relevant aspects or regions of a document or domain.

Coverage Constraints

Requirements ensuring sufficient representation of all groups, including subgroups defined by multiple attributes, in training data.

Coverage Estimation

Measuring what proportion of a problem space a model can reliably handle.

Coverage Path Planning

Finding an efficient route for a vehicle to visit all cells or areas in a region.

Coverage Verification

The process of proving that testing has comprehensively covered all relevant operating conditions and edge cases.

Coverage-Aware Sampling

A training technique that prioritizes data from under-explored regions of the state-action space to improve model robustness.

Coverage-Guided Testing

Testing approach that systematically explores different input regions to find edge cases and failures.

Covert Political Bias

Systematic asymmetric treatment of opposing political viewpoints in language model responses, including differences in tone, depth, and engagement.

CPTP Operation

A quantum operation that preserves physical validity by maintaining positivity and trace properties of quantum states.

CPU Inference

Running a model's predictions using a computer's central processor rather than a specialized graphics card, which is slower but requires less specialized hardware.

Cranfield-Style Evaluation

A benchmark methodology using a fixed document collection, queries, and human relevance judgments to evaluate retrieval systems.

Creative Utility

A measure of how useful and novel the connections a model generates are for creative tasks.

Credibility

User perception of a system's trustworthiness and expertise, affecting whether they believe its information.

Credit Assignment

The process of determining which actions or steps in a sequence deserve reward or blame for the final outcome.

Criteria Decomposition

Breaking down evaluation into multiple independent criteria to reduce complexity and improve verification accuracy.

Criterion-level Feedback

Detailed feedback that scores responses across multiple specific evaluation criteria rather than a single overall score.

Critic-Based Filtering

Using a separate model to evaluate and reject outputs that contain errors, improving final answer quality.

Critic-Based Payoff Estimation

Using a neural network to predict game payoffs for different action combinations, amortizing learning across multiple game states.

Critique Agent

An agent that reviews and validates the recommendations and execution plan of other agents to ensure correctness and coherence.

Cross Attention

Mechanism allowing one sequence to attend to and focus on another sequence.

Cross-Architecture Transfer

Transferring knowledge between models with fundamentally different designs, attention mechanisms, or tokenizers.

Cross-Attention Adapter

A neural module that merges information from two sources by learning which parts of each are most relevant.

Cross-Commentator Alignment

Mapping the same source text across multiple independent interpretations to enable direct comparison of how different schools read identical material.

Cross-Dataset Transfer

Testing whether a model trained on one dataset generalizes to perform the same task on a different dataset.

Cross-domain Mapping

A creativity technique where ideas from one unrelated domain are applied to solve problems in another domain.

Cross-embodiment Transfer

Learning to control one body type (like a humanoid robot) using data from a different body type (like humans).

Cross-Encoder

A model architecture that takes a query and document together as input and directly outputs a relevance score, unlike dual-encoders that score them separately.

Cross-Entropy Loss

A loss function that measures how well a predicted probability distribution matches a target distribution.

Cross-Environment Deployment

Running an AI model in different network environments or systems than the one it was trained on.

Cross-Lingual

The ability to understand relationships and transfer knowledge between different languages, such as answering a question in one language based on text in another.

Cross-Lingual Awareness

The ability of a model to understand and relate concepts across different languages, allowing it to find similarities between text in different languages.

Cross-Lingual Capability

The ability of a model to understand and work with multiple languages, sometimes even translating concepts between them.

Cross-Lingual Consistency

The ability of a model to represent similar meanings in different languages as nearby points in its vector space, so translations and equivalent concepts are treated as semantically close.

Cross-lingual Generalization

The ability of a model or probe trained on one language to work effectively on other languages.

Cross-Lingual Matching

The ability to find and compare similar content across different languages by representing them in a shared mathematical space.

Cross-Lingual Retrieval

The ability to find relevant documents or text in one language when searching with a query in a different language.

Cross-Lingual Semantic Similarity

The ability to recognize that sentences or phrases in different languages have the same or similar meaning and represent them close together in numerical space.

Cross-Lingual Similarity

The ability to measure how similar two sentences are even when they are written in different languages.

Cross-Lingual Transfer

The ability of a model trained on multiple languages to apply knowledge learned from one language to understand or generate text in another language.

Cross-Lingual Understanding

The ability of a model to comprehend relationships and meanings across different languages, enabling tasks like translation and multilingual reasoning.

Cross-Modal Alignment

Connecting representations from different types of data (like speech and text) so they work together effectively.

Cross-Modal Attack

An attack that manipulates multiple input types (like images and text) together to deceive a model.

Cross-modal Attention

A mechanism that aligns and weights information between different modalities like images and text.

Cross-Modal Cohesion

Ensuring semantic consistency between different modalities (e.g., text and images) in generated content.

Cross-Modal Consistency

Ensuring that representations across different modalities (images, 3D, text) align and reinforce each other.

Cross-Modal Convergence

Alignment in how models from different modalities (e.g., vision and language) represent the same stimulus.

Cross-Modal Fusion

The process of combining information from multiple modalities (e.g., vision and text) into a unified representation.

Cross-Modal Inconsistency

When a model produces contradictory predictions for the same concept represented in different modalities.

Cross-Modal Matching

The ability to find relationships between different types of content, such as matching natural language descriptions to code snippets.

Cross-modal prediction

Learning representations by predicting one modality from another.

Cross-Modal Reasoning

The ability to connect and reason about information from different input types (like audio and video) together to draw conclusions.

Cross-Modal Retrieval

The ability to search and find relevant items across different data types, such as finding images using text queries or vice versa.

Cross-Modal Scoring

Computing relevance or similarity scores between information from different modalities (e.g., text and images).

Cross-modal Semantic Sharing

The ability of a model to share semantic understanding between different input modalities like vision and text.

Cross-Modal Similarity

The ability to measure how closely related content from different types of input (like images and text) are to each other.

Cross-Modality Message Passing

Exchanging information between different input types (text and vision) to guide compression decisions.

Cross-module Reasoning

The ability of AI tools to access information from other modules and make decisions based on shared context.

Cross-Price Effects

How the demand for one product changes when the price of a different product changes.

Cross-Script Generalization

The ability of a model to perform consistently when input text or audio switches between different writing systems or languages.

Cross-Source Reconciliation

The process of comparing and resolving conflicting information from multiple sources to determine accurate answers.

Cross-subject generalization

A model's ability to work on new individuals without retraining, despite differences in neural anatomy.

Cross-View Attention

A mechanism that transfers motion information from one camera viewpoint to another while maintaining consistency.

Cross-View Correlation

The degree to which internal representations align when processing the same task in different formats or modalities.

Cross-View Identity Confusion

When a model fails to recognize that the same object in different camera views is the same entity, leading to counting errors.

Cross-view matching

Aligning images captured from different viewpoints (e.g., street-level and overhead) to find correspondences.

CSS Codes

Calderbank-Shor-Steane quantum codes combining two classical error-correcting codes for quantum error correction.

Cubic surface

A 3-dimensional algebraic variety defined by a degree-3 polynomial equation.

Cubical Complex

A mathematical structure representing data as a collection of cubes at different scales, used in topological analysis.

CUDA

NVIDIA's parallel computing platform that runs code on GPUs to process many tasks simultaneously.

Cuda Kernels

Optimized GPU code that performs specific computational operations efficiently.

Cultural Measurement

Using computational methods to quantify and analyze cultural phenomena like dialogue patterns, social interactions, or linguistic variation.

Cultural Reasoning

The ability to understand and infer cultural context, significance, and metadata from visual or textual information.

Cumulants

Statistical measures that describe probability distributions, used to track activation behavior.

Curated Dataset

Training data that has been carefully selected and filtered to include only high-quality examples relevant to specific tasks or domains.

Curated Training Data

Carefully selected and filtered training examples chosen for quality rather than quantity, often resulting in models that produce more structured and reliable outputs.

Curiosity-Driven Reinforcement Learning

RL approach where agents explore by seeking states where their world model makes poor predictions.

Curriculum Design

Training strategy that gradually increases task difficulty to help models learn robustly.

Curriculum Learning

Training strategy that presents examples in increasing order of difficulty.

Curvature Regularizer

A training constraint that penalizes curved or winding paths in the learned representation space.

Cycle Consistency

A constraint requiring a model to reconstruct its original output after transforming through intermediate steps.

Cyclomatic Complexity

A metric measuring how many different paths code can take; lower values mean simpler, easier-to-maintain code.

D

DAgger

An interactive learning method where a human corrects the model's mistakes during training to fix distribution mismatch.

Damped Sub-steps

Smaller, controlled refinement iterations that reduce the magnitude of updates to stabilize computation.

Data Assimilation

Combining observed measurements with model predictions to calibrate and improve model parameters.

Data Attribution

Measuring how much each training example contributes to a model's final performance using gradient-based methods.

Data Augmentation

Technique to increase training data by creating variations or new samples from existing data.

Data Contamination

When test data accidentally leaks into training, artificially inflating a model's measured performance.

Data Curation

The process of carefully selecting, cleaning, and organizing training data to improve model quality; better curated data often leads to better model performance.

Data Deletion

Predicting how a model would behave if specific training examples were excluded without retraining.

Data Deletion Problem

Predicting how a model's behavior would change if specific training data were excluded without retraining.

Data Diversity

The variety of examples in a dataset across different attributes, styles, or contexts to improve model generalization.

Data Engineering

The process of selecting, organizing, and preparing training data to improve model performance.

Data Fidelity

The accuracy and correctness of data representation in a visualization or output.

Data Flywheel

A self-reinforcing cycle where a system identifies gaps, generates data to fill them, and uses that data to improve itself.

Data Gravity

The tendency for computation to move toward where large datasets are stored, rather than moving data to where computation happens.

Data Heterogeneity

Variation in data distribution across different sources or groups.

Data Integration

Combining data from multiple sources into a unified, usable format for analysis or querying.

Data Missingness

Gaps or missing values in a dataset caused by sensor failures, blinks, or other interruptions.

Data Modification Cost

The expense of changing, augmenting, or purchasing training data to improve model fairness and reduce bias.

Data Ordering

The sequence in which training examples are presented to a model during training.

Data Poisoning

Adversarial manipulation of training data to degrade model behavior while maintaining normal performance on standard metrics.

Data Quality

The relevance, accuracy, and usefulness of training data, which can be more important for model performance than simply having more data.

Data Quality Curation

The practice of carefully selecting and filtering training data for relevance and accuracy rather than simply using larger amounts of raw data.

Data Referencing Errors

Mistakes where LLMs incorrectly cite, omit, or misread values from tables despite understanding the table structure.

Data Registry

A centralized catalog storing metadata about available data sources and their query interfaces.

Data Residency

A guarantee that your data is stored and processed only in a specific geographic region, helping meet regulatory requirements.

Data Reuse

When researchers use datasets from previous studies in their own research rather than collecting new data.

Data Selection

Choosing a subset of training data based on quality or relevance metrics rather than using all available data.

Data Synthesis

Automatically generating training data from existing datasets to teach models new tasks.

Data Validation

Automated checks that verify data meets quality and correctness requirements before use.

Data-Centric Machine Learning

An approach that focuses on improving data quality and efficiency rather than just model architecture.

Data-Level Intervention

Modifying training data itself rather than changing model architecture or training procedures.

Data-Parallel Training

Distributing training data across multiple GPUs that compute gradients independently then synchronize.

Dataset Distillation

Compressing a large dataset into a smaller synthetic version preserving key information.

Dataset Taxonomy

A classification system that organizes datasets by their characteristics like sparsity, scale, and sequential structure.

DBRX Architecture

A neural network design pattern that serves as the structural foundation for this model, determining how it processes and generates text.

De Novo Design

Creating entirely new protein sequences from scratch rather than modifying or copying existing ones.

DeBERTa

A transformer-based language model architecture that uses disentangled attention mechanisms to improve how the model weighs different parts of the input text when making predictions.

Decentralized Training

A training approach where a model is developed across multiple independent computers or organizations rather than in a single centralized facility, allowing distributed collaboration.

Decision Loss

A loss function that directly penalizes the cost incurred by making decisions based on a model's predictions.

Decision Tree

A tree-based model that recursively partitions data using threshold rules to make predictions.

Decision-Making System

A mechanism that selects actions based on current state, goals, and expected outcomes to maximize success.

Decision-Support Mechanism

A tool or system that provides information and analysis to help humans make better decisions without replacing human judgment.

Decision-Theoretic

An approach that evaluates systems based on the quality of decisions they enable under different costs and benefits.

Declarative Specification

A formal description that binds visual elements to data fields, separating what to show from how to render it.

Decoder

A component that converts compressed internal representations back into human-readable outputs like audio or images.

Decoder Stochasticity

The inherent randomness in a model's output generation, even when given identical inputs multiple times.

Decoder-based Language Model

A type of LLM that generates text one token at a time, like GPT models.

Decoder-Only Architecture

Language model design that generates text sequentially without a separate encoder, like GPT models.

Decoding

Converting model outputs into human-readable text or structured predictions.

Decoding Efficiency

The speed at which a model generates output tokens one at a time, a critical bottleneck in long-context scenarios.

Decoding Strategies

Methods for generating text from a language model, such as greedy selection, beam search, or temperature sampling.

Decoding Trajectory

The sequence and order in which tokens are generated or unmasked during the model's iterative generation process.

Decompilation

Converting compiled binary code back into human-readable pseudo-code approximations.

Decomposition

Breaking a complex problem into smaller, simpler sub-problems that are easier to solve and understand.

Decompositional Verifiable Reward (DVReward)

A reward system that breaks complex requests into atomic, checkable questions to provide interpretable feedback for model training.

Decoupled reinforcement learning

Training separate reward objectives for different tasks (e.g., binary judgment vs. error localization) instead of optimizing them jointly.

Deduplication

The process of removing duplicate or near-duplicate examples from training data to improve model efficiency and prevent overfitting to repeated content.

Deep Research Agent

An AI system that performs multi-step research by reasoning through problems and making multiple search queries.

Defeat Function

A context-dependent rule that determines which attacks between arguments succeed based on the current context or regime.

Defect Detection

Automatically identifying problems or errors in software artifacts, such as incomplete or ambiguous descriptions.

Deformable Gaussian Splatting

An extension of Gaussian Splatting that allows the 3D primitives to deform over time to handle dynamic scenes.

Degrees of Freedom

The number of independent ways a mechanical part can move or rotate in an assembly.

Delayed Feedback

Consequences of an agent's actions that appear many steps later, making it harder to learn cause-and-effect relationships.

Delayed Reward

A reward signal that only becomes available after multiple steps or actions have been completed.

Delayed Verifier Signals

Feedback or verification of agent actions that arrives after a delay, requiring the agent to maintain accountability over time.

Deliberative democracy

A form of democracy where citizens and representatives engage in reasoned discussion to reach decisions.

Demand Modeling

Using machine learning to predict how much of a product customers will buy given prices and other factors.

Demographic Blinding

Removing or hiding demographic information (like gender) from model inputs to reduce bias in decision-making.

Demographic Importance Weighting

Learning which demographic attributes (race, age, etc.) are most influential in predicting how annotators will judge subjective content.

Demonstration Data

Training examples collected from real robots performing tasks, used to teach the model how to execute similar actions.

Denoising

A training approach where the model learns to reconstruct clean audio from corrupted or noisy versions, improving its ability to extract meaningful features.

Denoising Autoencoder

A neural network trained to reconstruct clean text from corrupted or noisy versions, learning to remove noise while preserving meaning.

Denoising Objective

A training approach where a model learns to reconstruct clean audio from noisy versions, making it better at understanding speech in real-world conditions.

Denoising Process

A technique where a model learns to gradually remove random noise from data to reconstruct meaningful content, used as an alternative to traditional token prediction.

Denoising Score Matching

A training objective that learns to predict noise in corrupted data, used in diffusion models for stable gradient-based optimization.

Dense Captioning

Generating detailed, comprehensive descriptions of images that capture rich visual information and relationships rather than brief summaries.

Dense Embedding

A compact vector representation where most dimensions contain meaningful information, as opposed to sparse embeddings that are mostly zeros.

Dense Embeddings

Vector representations where most or all of the numbers contain meaningful information, as opposed to sparse embeddings where most numbers are zero.

Dense Feedback

Fine-grained, continuous-valued signals that provide detailed information about solution quality for training or selection.

Dense Model

A neural network where all parameters are active for every input, in contrast to sparse architectures like mixture-of-experts that selectively activate different parts.

Dense Passage Retrieval

A technique that converts documents and queries into dense vectors so that relevant passages can be found by comparing their numerical representations rather than matching keywords.

Dense Representation

A compact numerical format where meaning is captured in a fixed-size list of numbers, making it efficient for storage and similarity comparisons.

Dense Retrieval

A search method that converts text into a single, compact numerical vector and finds similar documents by comparing these vectors.

Dense Retriever

A retrieval system using learned embeddings to find semantically similar documents via vector similarity.

Dense Supervision

Training data that provides detailed annotations for every part of an input, rather than just overall labels.

Dense Vector

A compact numerical representation where most values are non-zero, used to efficiently store and compare the meaning of text.

Dense Vector Embedding

A compact numerical representation of text that captures its meaning, allowing the model to compare how similar different pieces of text are to each other.

Dense Vector Embeddings

Numerical representations of text where each word or sentence is converted into a list of numbers that capture its meaning, allowing the model to compare semantic similarity.

Dense Vector Representation

A compact numerical format where text is encoded as a list of numbers that capture its meaning, allowing efficient similarity comparisons.

Dense Vector Space

A mathematical space where text is represented as vectors of numbers, positioned so that similar meanings are located close together.

Dense Vectors

Compact numerical representations where most values are non-zero, used to encode the meaning of text in a form that computers can compare mathematically.

Dense vs. Sparse Embeddings

Dense embeddings use all dimensions with non-zero values (like traditional neural embeddings), while sparse embeddings mostly contain zeros and are more interpretable and storage-efficient.

Density Estimation

Measuring how crowded solutions are in the search space to maintain diversity in the population.

Density-Guided Response Optimization (DGRO)

A method that aligns models by learning from the geometric clustering of accepted responses in the model's representation space.

Dependency Graph

A visual map showing which models, datasets, and tools a system relies on and how they connect.

Dependency Reasoning

The ability to understand how multiple facts relate to and affect each other when making decisions.

Deployment Monoculture

Risk that a single model's values or biases get applied uniformly at scale, eliminating the diversity of perspectives that would naturally exist with multiple decision-makers.

Depth Map

An image where each pixel's brightness represents how far away that object is from the camera.

Depth Map Estimation

Computing per-pixel distance from camera to scene surfaces to reconstruct 3D geometry from 2D images.

Depth-Aware Capacity Allocation

Assigning different amounts of model parameters to different layers based on their functional importance rather than uniformly.

Depth-Scaling Effect

Improving model performance by increasing computational depth without adding new parameters, achieved through layer reuse.

Depth-Upscaling

A technique that creates a larger model by combining and stitching together layers from smaller pre-trained models rather than training a new model from scratch.

Dequantization

The process of restoring a compressed model's weights to higher numerical precision, improving quality but requiring more memory.

Derivative Model

A new model created by modifying or fine-tuning an existing base model rather than training from scratch.

Descriptor

A numerical representation that captures the visual characteristics around a detected keypoint, allowing the model to match similar points across different images.

Descriptor-Based Generation

Generating model weights using text or structured descriptions of the target architecture and task as input.

Deskilling

The loss of professional expertise and judgment that occurs when workers rely on automated systems instead of developing their own capabilities.

Determinantal Point Process

A mathematical model that generates diverse sets of items by penalizing similarity, useful for ensuring variety in generated outputs.

Deterministic Checks

Automated verification rules that produce the same result every time, used when there is clear evidence of task completion.

Deterministic Workflow Engine

A system that executes predefined process steps in a fixed, repeatable sequence without adaptive reasoning.

Devanagari

The script used to write Marathi, Hindi, and several other Indian languages.

Development Build

An early, pre-release version of a model used for testing and refinement before public release.

Deviation Feedback

Explicit signals that alert an agent when its current reasoning or conclusions diverge from task requirements or ground truth.

Dexterous Manipulation

Fine-grained, skillful robotic hand control requiring precise coordination of many joints.

Diagnostic Context

The reasoning and explanation behind why a specific code location is likely buggy, not just the location itself.

Diagnostic Reasoning

AI process of identifying root causes or problems from observed symptoms.

Dialogue Dynamics

The patterns and interactions that emerge in conversation, including how participants exchange information and coordinate actions.

Dialogue Generation

The process of an AI model creating natural conversational responses based on input text.

Dialogue Memory

A compact storage mechanism that maintains compressed conversation state across multiple dialogue turns.

Dictionary Learning

The process of finding a set of basis vectors (dictionary) that can reconstruct data through sparse combinations.

Diff Application

The ability to understand and apply code changes (diffs) to existing files rather than generating code from scratch.

Diff hunk

A contiguous section of a code patch showing added, removed, or modified lines in a specific file.

Differentiable

A property of operations that allows gradients to flow through them during backpropagation for model training.

Differentiable Approximation

Smooth mathematical function approximating non-differentiable operations for training.

Differentiable Loss Functions

Mathematical functions that measure how far a model's output is from desired behavior, designed to be optimizable via gradient descent.

Differentiable Memory Stack

A learnable memory retrieval mechanism that can be trained end-to-end to recall relevant past episodes for current decision-making.

Differentiable Physics

A physics solver built into a neural network so that gradients can flow through physical laws during training.

Differentiable Reward

A reward function whose gradients can be computed, allowing optimization of model outputs toward desired properties.

Differentiable Reward Model

A reward function designed to be differentiable so gradients can flow through it during training.

Differentiable Sparse Attention

A sparse attention method that supports gradient computation, enabling end-to-end training with learned sparsity patterns.

Differential diagnosis

A list of possible medical conditions ranked by likelihood, used by clinicians to guide further testing.

Differential Equations

Mathematical equations describing how systems change over time, naturally solved by analog hardware.

Differential Privacy

A mathematical framework that adds controlled noise to data to protect individual privacy while enabling statistical analysis.

Difficulty amplification

A technique to systematically increase problem complexity to better differentiate model capabilities.

Difficulty Estimation

Predicting how hard a task is to automatically adjust the amount of computational effort needed.

Difficulty Signal

An internal indicator that estimates how hard a problem is, used to guide model behavior.

Diffractive optical element

A passive optical component that uses diffraction to manipulate light without moving parts or power.

Diffusion Language Model

A language model that generates text iteratively by refining noisy predictions, allowing generation in arbitrary word order rather than strictly left-to-right.

Diffusion Language Models

Language models that generate text by iteratively refining noisy predictions into coherent words.

Diffusion Model

Generative model that creates images or videos by gradually removing noise from random data.

Diffusion Models

AI models that generate images by learning to reverse a noise-adding process, starting from pure noise.

Diffusion Paradigm

A generative approach that iteratively refines predictions by gradually removing noise from random initial states.

Diffusion Policy

A policy representation that uses diffusion models to generate action sequences from observations.

Diffusion Prior

A learned distribution that guides diffusion models toward realistic outputs in a specific domain.

Diffusion Process

A generation method that iteratively refines outputs by gradually removing noise, rather than predicting tokens one at a time from left to right.

Diffusion steps

Iterations in a diffusion model that gradually refine noise into a final image or video output.

Diffusion Transformer

A transformer architecture adapted to work with diffusion-based generation processes.

Diffusion-Based Architecture

A neural network design that generates outputs by iteratively refining noisy predictions into clear results, rather than building text one token at a time like traditional language models.

Diffusion-Based Generation

A method where a model generates text by iteratively refining noise into coherent output all at once, rather than predicting one word at a time.

Diffusion-Based Language Model

A language model that generates text by iteratively predicting and refining masked (hidden) tokens across the entire output, rather than predicting one token at a time from left to right.

Diffusion-Based Trajectory Generation

Using diffusion models to generate realistic robot motion sequences that can be used as training data.

Digital Twin

A virtual simulation model of a physical system used to predict behavior and test changes before real-world deployment.

Dilated Convolution

A convolutional operation that skips input elements to capture patterns at multiple scales without increasing parameters.

Dimension Reduction

A technique to simplify high-dimensional parameter spaces by identifying and focusing on the most critical variables.

Dimensionality Reduction

Techniques that compress high-dimensional data into fewer dimensions while preserving important patterns.

Direct Preference Optimization

Training method that aligns models with human preferences by directly optimizing the difference between preferred and dispreferred outputs.

Direct Preference Optimization

A training technique that teaches a model to prefer certain outputs over others by learning from examples of better and worse responses.

Directed Acyclic Graph (DAG)

A workflow representation where tasks are nodes and dependencies are directed edges with no circular paths.

Directed Acyclic Graph (DAG)

A graph structure representing causal relationships where arrows point from causes to effects with no cycles.

Dirichlet Distribution

A probability distribution over probability distributions, used here to model uncertainty over class predictions.

Dirichlet Energy

A measure of smoothness on a graph that quantifies how much node values vary across connected edges.

Discourse Coherence

The logical flow and consistency of ideas across sentences in a text or conversation.

Discourse Functional Analysis

Examining how language serves specific communicative purposes in conversation, like validating feelings or paraphrasing.

Discourse Particles

Small words or phrases like 'well' or 'kind of' that convey emotion, intention, and interpersonal meaning in conversation.

Discovery-to-Application Gap

The challenge of moving from discovering causal rules to engineering them into working systems.

Discrete Diffusion

A generative model that iteratively removes noise from discrete tokens (like words) to generate text, as an alternative to autoregressive decoding.

Discrete Diffusion Models

Generative models that iteratively denoise discrete tokens (like words) from noise to produce text.

Discrete Embeddings

Compressed representations of audio data stored as specific, distinct values rather than continuous numbers, making them efficient for storage and processing.

Discrete Exterior Calculus

A mathematical framework for defining calculus operations (gradient, curl, divergence) on discrete geometric structures like cell complexes.

Discrete Hartley Transform

A real-valued alternative to FFT that decomposes signals into cosine and sine components without complex numbers.

Discrete Latent Space

A compressed representation where continuous data is converted into distinct, countable tokens or categories.

Discrete memoryless channel

A communication channel where each transmitted symbol is corrupted independently with no memory of past transmissions.

Discrete Tokens

Individual units of quantized information that represent audio in a compressed, symbolic form rather than continuous values.

Discretization

Converting continuous numerical values into discrete bins or categories for processing by algorithms.

Discretization Invariance

The ability of a model to generalize across different mesh resolutions or numerical discretizations of the same continuous problem.

Discriminative Direction

A pattern in token gradients that effectively distinguishes high-reward responses from low-reward ones.

Disentanglement

Separating different factors of variation (like expression and identity) in a model's learned representations.

Disparate Impact

When a policy or algorithm produces unequal outcomes for protected groups, even if not intentionally discriminatory.

Dispatching Rules

Simple heuristic strategies for assigning jobs to machines, like prioritizing shortest jobs first.

DistilBERT

A smaller, faster version of BERT that retains most of its language understanding ability while using fewer parameters and less computational power.

Distillation

A technique that compresses a large, complex model into a smaller one by training it to mimic the larger model's behavior, resulting in faster inference with minimal loss of quality.

Distilled

A model that has been compressed by training a smaller model to mimic a larger, more capable model, reducing size and computational requirements while retaining performance.

Distilled Model

A smaller, faster version of a larger model created by training it to mimic the larger model's behavior, reducing computational requirements while maintaining reasonable performance.

Distributed Attack

A harmful task split across multiple user accounts so each individual transcript appears benign.

Distributed Compute

Using multiple computers or servers across a network to share the computational work of training or running a model, rather than relying on a single machine.

Distribution Alignment

Adjusting a model so its learned patterns match the actual distribution of data in a target domain.

Distribution Mismatch

When the data distribution used for training differs from the distribution encountered during deployment, causing performance degradation.

Distribution Shaping

Modifying a model's output probability distribution at inference time to satisfy constraints without changing the model's weights.

Distribution Sharpening

When a policy becomes overly specialized in reproducing successful behaviors without learning to handle diverse situations or recover from failures.

Distribution Shift

When a model encounters data that looks different from what it was trained on, causing performance to drop.

Distributional Drift

When a model's behavior diverges from the original training data distribution during fine-tuning or RL.

Distributional Embedding Space

A mathematical space where words are represented as vectors based on their usage patterns in text, like GloVe or Word2Vec.

Distributional fairness

Ensuring benefits and harms are equitably distributed across agents rather than concentrated in hubs or privileged positions.

Distributional Gap

A systematic difference in how two groups (humans vs. LLMs) distribute their outputs across categories.

Distributional Matching

Forcing a model's output distribution to match a target distribution, here used to normalize reward structures across different tasks.

Distributional Modeling

Learning to predict probability distributions over outputs rather than single deterministic predictions.

Distributional Shift

When the statistical properties of data change over time, making old patterns unreliable for future predictions.

Distributionally Robust Optimization

Finding solutions that work well across all possible data distributions within a defined uncertainty set.

Divergence Constraint

A regularization technique that limits how far a model's distribution can drift from a reference distribution during training.

Divergence Regularization

A penalty that prevents a model from changing its behavior too drastically by measuring the statistical distance between old and new policies.

Divergence-Free

A mathematical property ensuring that a velocity field conserves mass (no fluid is created or destroyed at any point).

Diversity Collapse

When a model trained with RL produces repetitive, similar outputs instead of varied responses, reducing usefulness.

Diversity Coverage

A metric measuring the quality of unique answers generated relative to the best possible answer set of the same size.

Diversity-aware Ranking

A ranking method that prioritizes both relevance and variety, ensuring results cover different perspectives or approaches.

Divide-and-Conquer Strategy

Breaking a large problem into smaller independent subproblems, solving each separately, then combining results.

DNA Synthesis Screening

Safety checks that DNA synthesis providers use to block orders for sequences that could be used to create dangerous pathogens.

Document Attribution

The ability to identify and explain which retrieved documents contributed to a generated answer.

Document Boundary

The natural division between separate documents used as a constraint to group tokens for shared expert selection.

Document Chunking

The process of breaking long documents into smaller pieces before embedding them, which this model is optimized to work with effectively.

Document compliance review

Automated checking of whether a document (like a contract) meets organizational policies and requirements.

Document Grounding

Anchoring AI responses to specific source documents to ensure answers are based on provided content.

Document Intelligence

The ability to automatically extract, understand, and convert information from document images (like scans or forms) into structured, machine-readable formats.

Document Layout Analysis

The process of identifying and understanding the structure of a document, such as text regions, tables, and columns.

Document Parsing

The process of automatically reading and extracting structured information like text, tables, and layout from documents.

Document Retrieval

Finding the relevant documents or passages from a large collection that are needed to answer a question.

Document Structure Preservation

The ability to maintain the original layout, formatting, and organization of a document when extracting text, rather than just outputting raw characters.

Document Understanding

The ability to read and extract meaningful information from structured documents like receipts, invoices, and forms by recognizing both text and layout.

Document Visual Question Answering

A task where a model reads a document image and answers natural language questions about its content by understanding both the visual layout and text.

Document-Intensive Workflows

Tasks that require processing, searching, and reasoning over large collections of documents to find answers.

Document-Level Reasoning

Understanding and answering questions that require information from multiple parts of a full document.

Domain Adaptation

Training a model on data from multiple specialized fields (like general text, scientific papers, and medical literature) so it works well across all of them.

Domain Expert

A specialized expert in an MoE model trained to handle reasoning or task-specific knowledge rather than raw perception.

Domain Generalization

Training models to work well on new, unseen domains beyond their training data.

Domain Generation Algorithm (DGA)

A technique that automatically creates many fake domain names to evade detection and maintain control of malicious infrastructure.

Domain Grounding

How well a model's responses are anchored in accurate, specialized knowledge specific to a field rather than generic or hallucinated information.

Domain Knowledge

Specialized expertise and facts about a particular field or subject area that an AI model has learned during training.

Domain Shift

When a model encounters data from a different source or environment than it was trained on, causing performance to drop.

Domain Specialization

When a model is trained to excel at a specific task or set of languages rather than being a general-purpose tool.

Domain Specific Languages

Programming languages designed for specialized tasks in particular industries or fields.

Domain taxonomy

A predefined categorization system that organizes data sources (e.g., code, web, books, academic papers) into distinct domains.

Domain-Adversarial Training

Training technique using adversarial objectives to make model representations invariant across different data domains.

Domain-Agnostic

A model that works effectively across many different subject areas and use cases without needing to be retrained for each one.

Domain-Agnostic Conceptual Problems

Abstract problem formulations that can be recognized and solved across multiple unrelated academic fields.

Domain-Aware

A model's ability to understand and respond accurately to topics within a specific field or area of expertise it was trained on.

Domain-Independent Planner

An AI planning algorithm that solves problems in any domain without domain-specific customization.

Domain-Invariant Features

Learned representations that remain useful across different data sources or conditions despite their differences.

Domain-Routed Distillation

A distillation approach that routes different domains to specialized teachers, then combines their knowledge into one model.

Domain-Specialized

A model trained specifically on data and tasks from a particular field (in this case, chemistry) to achieve higher accuracy in that domain than general-purpose models.

Domain-Specific

Tailored or optimized for a particular field or type of content, such as news, reviews, or scientific writing.

Domain-Specific Evaluation

Assessment tailored to a particular field (like law) using metrics and error types relevant to that domain.

Domain-Specific Fine-Tuning

Training a model on specialized data from a particular field (like medicine) so it becomes expert at tasks in that domain rather than being a generalist.

Domain-Specific Generation

The ability to generate text tailored to a particular field or context, such as legal documents, Wikipedia articles, or product reviews.

Domain-Specific Knowledge

Specialized expertise required for a particular field, like vendor-specific scanner operations in medical imaging.

Domain-Specific Language

Specialized vocabulary and terminology unique to a particular field or industry, like medical jargon in healthcare or mathematical notation in physics.

Domain-Specific Language Model

A language model trained exclusively on text from a particular field or subject area, making it much better at understanding and generating content in that domain than general-purpose models.

Domain-Specific Model

A language model trained specifically on data from one field (like biomedical research) rather than general internet text, making it excel at specialized tasks.

Domain-Specific Optimization

Training a model to excel at tasks within a particular field (like legal documents) rather than being a general-purpose model.

Domain-Specific Pretraining

Training a model on specialized data from a particular field (like biomedical literature) rather than general internet text, making it much better at understanding that field's concepts.

Domain-Specific Procedures

Specialized workflows and methodologies unique to a particular field that require expert knowledge to execute correctly.

Domain-Specific Training

Training a model exclusively on data from a narrow domain (like Python code) rather than general text, making it highly specialized but less versatile.

Domain-Specific Tuning

Training or adapting a model to specialize in a particular field (like biomedicine) rather than performing equally well across all topics.

Domain-Specific Vocabulary

A model's understanding of specialized terms and concepts unique to a particular field, like medical terminology in biomedical text.

DoRA (Weight-Decomposed Low-Rank Adaptation)

A fine-tuning method that adapts model weights by separately learning magnitude and direction changes, extending LoRA.

Dot-Product Similarity

A method of comparing two vectors by multiplying their components and summing the results, where vector magnitudes (length) affect the final score.

Doubly Stochastic Matrix

A square matrix where all rows and columns sum to 1, used to represent valid probability distributions for mixing multiple streams.

Downsampling

Reducing an image's resolution by removing pixels, making it smaller and faster to process.

Downstream Model

A specialized AI model that receives requests routed to it by another system and performs the actual task or generates the final response.

Downstream Task

A specific NLP application or problem that uses the output of a pre-trained model, such as classification, search, or similarity matching.

Downstream Tasks

Specific applications or problems that use the output of a pretrained model, such as predicting protein structure or identifying protein function.

Draft Head

The smaller neural network component in speculative decoding that quickly generates candidate tokens before verification by the main model.

Draft Model

A smaller, faster model used in speculative decoding to quickly propose token sequences before a larger model verifies them.

Draft Tree

A tree structure of multiple candidate token sequences proposed by a draft model, allowing parallel verification of multiple continuations.

Drift-Minimizing Scheduling

A control policy that reduces the expected growth of queue sizes by prioritizing service to stabilize the system.

Driving Pattern Recognition

The process of automatically identifying and classifying different driving behaviors (e.g., aggressive vs. normal) from sensor data.

Dual Encoder Architecture

A system with two separate neural networks—one that processes questions and one that processes documents—both converting their inputs into comparable vector embeddings.

Dual ML/Software Lifecycles

The parallel development and deployment processes for machine learning models and traditional software components.

Dual Use Risk

The danger that AI technology can be misused for harmful purposes despite benign original intent.

Dual-Encoder Architecture

A model with separate encoders for two input modalities that map them into a shared embedding space.

Dual-Granularity

Organizing information at two levels of detail: high-level task guidance and low-level step-by-step actions.

Dual-Graph Framework

An approach that conditions a model on two complementary graph structures simultaneously to capture multiple constraints.

Dual-Memory Mechanisms

Using two complementary memory systems to store and retrieve different types of information for decision-making.

Dual-Modal Learning

Training models to process and align two different types of input data (like RGB and infrared images) simultaneously.

Dual-Process Framework

An approach combining two complementary methods—one for logical reasoning and one for learning patterns—to solve a problem better than either alone.

Dual-Purpose Model

A single model trained to perform multiple distinct tasks, such as both text generation and embedding, rather than being specialized for just one.

Dual-Reference Generation

Synthesizing an image that combines content structure from one reference image with visual style from another reference image.

Dual-Stream Architecture

A neural network design using two separate computation pathways for different functions.

Dual-Temporal Pathway

An architecture using two parallel processing streams with different time scales—one dense and one sparse.

Dummy Model

A minimal, non-functional model used for testing infrastructure and workflows without the computational cost of a real model.

Duration Control

The ability to generate responses with a specific target length or speaking time.

Dynamic Curriculum

Training approach that evaluates which skills remain helpful during learning and selectively retains only those that improve the current policy.

Dynamic Environment

A setting where conditions, rules, or task requirements change over time rather than remaining static.

Dynamic Epistemic Logic (DEL)

A formal system for reasoning about how beliefs and knowledge change when new information is revealed.

Dynamic Graph Construction

Building a network representation that changes over time to reflect evolving relationships, like road connectivity adjusted for traffic incidents.

Dynamic Merging

Combining task-specific model parameters at inference time based on input features, rather than using a fixed merged model.

Dynamic Method Selection

Automatically choosing the best execution approach (LLM reasoning, tool use, or code) for each step based on task requirements.

Dynamic Parameter Scaling

The ability to automatically adjust how many of a model's parameters are actively used based on available computational resources, allowing the same model to run efficiently on different hardware.

Dynamic Programming

An optimization method that breaks problems into smaller subproblems and solves them recursively, storing results to avoid recomputation.

Dynamic Pruning

Removing training samples during training based on their importance or quality, rather than before training starts.

Dynamic Quantization

A quantization approach that adjusts precision levels during inference based on the input data, optimizing the balance between speed and accuracy on-the-fly.

Dynamic Question Generation

Automatically creating questions of varying difficulty that adjust in real time based on learner responses and comprehension.

Dynamic Range Expansion

The process of recovering or reconstructing the full range of brightness values lost when converting from HDR to standard video formats.

Dynamic Regret

A measure of how well an algorithm performs compared to the best possible strategy that adapts to changing conditions.

Dynamic Routing

Choosing packet paths through a network in real-time based on current network conditions.

Dynamical systems

Mathematical models describing how systems evolve over time according to fixed rules.

Dynamical Systems Reconstruction

Building neural network models that accurately capture the underlying rules governing how a system evolves over time.

Dynamics-aware Latent Space

A compressed representation of states that captures how the environment changes over time.

E

E-graph Rewriting

A technique for verifying program equivalence by representing multiple equivalent forms in a graph structure.

Early Exit

Stopping a model's computation before completion when sufficient confidence is reached, reducing computational cost.

Early Fusion

Combining multimodal inputs (like text and images) at early layers of a model rather than after separate encoding.

Early Scalarization

Combining multiple objectives into a single weighted sum before training, which locks in a fixed trade-off.

eBPF

Extended Berkeley Packet Filter; a technology for running sandboxed programs in the OS kernel to monitor system behavior.

ECG (Electrocardiogram)

A recording of the electrical signals produced by the heart, used to detect heart problems.

Edge Case Handling

The ability to anticipate and address unusual or boundary conditions in code that might cause errors.

Edge Computing

Processing data locally on a device at the edge of a network rather than sending it to a central cloud server, improving speed and reducing dependency on internet connectivity.

Edge Deployment

Running a model directly on local devices like phones, tablets, or IoT hardware rather than sending data to a remote server.

Edge Detection

A computer vision technique that identifies boundaries and outlines in images, often using algorithms like Canny edge detection.

Edge Device

A computing device at the edge of a network (like a smartphone or IoT device) that runs AI models locally rather than sending data to a remote server.

Edge Refinement

The process of validating and removing unnecessary connections in a graph to improve its quality and interpretability.

Edge-to-Cloud Continuum

Computing infrastructure spanning from edge devices (sensors, local hardware) to centralized cloud servers.

Effect Size

The magnitude of the actual difference between two models; smaller effects require more samples to detect reliably.

Efficient Attention Architectures

Attention mechanisms designed to reduce computational or memory complexity compared to standard quadratic-scaling attention.

Egocentric Perception

Visual understanding from a first-person viewpoint, as seen from the wearer's perspective.

Egocentric Perspective

Understanding a scene from the viewpoint of a camera or observer positioned within the environment.

EHR-Embedded AI Agent

An AI system integrated directly into electronic health record software to assist clinicians with documentation or decision-making.

Eigenfunction

A special function that remains proportional to itself when transformed by an operator, used to decompose system behavior.

Eigenvalue

A number describing the strength of a particular direction or mode in a matrix or data structure.

Elastic Context Orchestration

Dynamically adjusting the detail level and size of stored information based on current task relevance.

Elastic Modeling

Simulating how deformable materials stretch, bend, and return to shape based on physical material properties.

Elastic Weight Consolidation

A technique that protects important weights from previous tasks by adding a penalty term during learning.

ELBO (Evidence Lower Bound)

A training objective used in probabilistic models to maximize the likelihood of observed data.

ELECTRA

A pre-trained language model that learns by predicting which tokens in a sentence have been replaced, making it efficient and effective for downstream tasks.

Electric Vehicle Routing Problem (EVRPTW)

Finding optimal delivery routes for electric vehicles that must visit customers within time windows and recharge at stations.

Electroencephalogram (EEG)

A recording of electrical brain activity used to detect neurological conditions like seizures.

Electronic Health Records (EHRs)

Digital records of patient medical history, diagnoses, medications, and clinical events stored in structured formats.

Embedded Device

A specialized computing device with limited resources designed to run specific applications, often integrated into physical systems.

Embedding

A dense numerical vector that represents a word, sentence, or concept in a high-dimensional space.

Embedding Clustering

Organizing vector representations of tokens into groups based on their semantic similarity.

Embedding Dimension

The size of the numerical vector produced by an embedding model; larger dimensions capture more detail but require more storage and computation.

Embedding Dimensions

The number of numerical values used to represent a piece of text (1792 in this case), where more dimensions allow for more detailed semantic information to be captured.

Embedding Geometry

The spatial structure and relationships between data points in a learned vector space.

Embedding Interpolation

Creating a mixed representation by blending multiple embeddings together using weighted combinations.

Embedding Layer Learning Rate

The learning rate specifically applied to the embedding layer, which can be scaled independently from other layers.

Embedding Magnitude (Norm)

The length or scale of an embedding vector, typically ignored in cosine similarity but shown here to encode semantic information.

Embedding Model

A model that converts text into numerical vectors that capture semantic meaning, allowing computers to understand and compare the similarity between different pieces of text.

Embedding Output

The model produces dense numerical vectors that represent the semantic meaning of text, which can be used for similarity comparisons or as input to other models.

Embedding Perturbation

Adding controlled noise to vector representations of text to obscure sensitive information.

Embedding Representation

A numerical vector representation of text that captures semantic meaning for comparison and analysis.

Embedding Similarity

A metric that measures how similar two pieces of content are by comparing their numerical vector representations.

Embedding Space

A mathematical space where text is represented as vectors, allowing similar texts to be positioned close together and enabling operations like similarity search and clustering.

Embedding Strategies

Different ways to represent words as vectors (semantic, acoustic, or phonetic).

Embedding-Based Analysis

Using learned vector representations of text to identify patterns, here compared against structured graph extraction methods.

Embedding-Based Deduplication

Removing duplicate or near-duplicate examples by comparing their vector representations in embedding space.

Embedding-Based Matching

Comparing semantic representations (embeddings) to find similar content without reprocessing raw data.

Embedding-based measures

Evaluation metrics like BERTScore that compare texts by measuring similarity of their learned vector representations.

Embeddings

Numerical representations of text that capture semantic meaning, allowing the model to measure similarity between different words or phrases.

Embodied Agent

An AI system with physical sensors and actuators that perceives and acts in the real world, like a robot.

Embodied AI

AI systems designed to interact with and understand the physical world through robotic bodies or sensors, rather than just processing text.

Embodied Decision Routing

The process of choosing which action a robot should execute next based on perceived state and task context.

Embodied Efficiency

Real-world performance metrics for robots like task completion time, motion smoothness, and energy consumption.

Embodied Manipulation

Robot learning and control for physical interaction tasks using integrated sensing and actuation.

Embodied Model

An AI model trained on real-world physical interactions and sensor data from robots, rather than text or simulations alone.

Embodied Reasoning

The ability to understand and reason about physical tasks and spatial relationships in the real world, not just abstract concepts.

Embodiment

The physical form or hardware platform (robot type) that executes learned policies.

Embodiment-agnostic

A representation or model that works across different body types or physical forms without being specific to one.

Emergence

The point during training when a model suddenly gains the ability to perform a task above a threshold accuracy.

Emergent Behavior

Complex patterns and social dynamics that arise naturally from simple agent interactions without being explicitly programmed.

Emergent Fitness

A measure of solution quality that arises from system dynamics rather than being explicitly defined beforehand.

Emergent Misalignment

When a model trained on narrow misaligned behavior generalizes to more severe harmful behaviors outside its training distribution.

Emotional Contagion

The spread of emotions from one agent to others through interaction and observation.

Emotional Framing

Using emotionally-toned language or affective phrasing in prompts to influence model behavior.

Emotional Intelligence Gap

The disconnect between a system's ability to perceive emotional cues and its actual use of those cues in decision-making.

Emotional Valence

The positive or negative quality of an emotion, ranging from negative to positive.

Empathetic Alignment

Training a model to recognize and respond to emotional context in conversations, prioritizing understanding and emotional connection over purely factual responses.

Empathy-Oriented Prompting

Instructing an LLM to generate responses with emotional awareness and compassion for patient concerns.

Empirical Risk Minimization

An algorithm approach that finds the best solution by minimizing errors on observed data.

Emulator

A neural network trained to mimic the behavior of a complex physical model or simulation.

Encoder

A model component that transforms input sequences (like protein amino acids) into meaningful numerical representations without generating new sequences.

Encoder Architecture

A neural network component that transforms input text into a compressed numerical representation, focusing on understanding and extracting meaning rather than generating new text.

Encoder Component

A model designed to convert inputs (like images or text) into numerical representations for understanding, rather than generating new content.

Encoder Model

A neural network that transforms input data into a compressed representation, rather than generating new text or making predictions.

Encoder-based models

Models like RoBERTa that process text to understand meaning, typically used for classification tasks.

Encoder-Decoder

A neural network architecture with two parts: an encoder that processes input text and a decoder that generates output text, allowing the model to transform one sequence into another.

Encoder-Decoder Architecture

A neural network design where one component (encoder) processes input data and another component (decoder) generates output based on the encoder's understanding.

Encoder-Only Architecture

A neural network design that processes input text to understand and represent it, but cannot generate new text from scratch.

End Effector

The tool or gripper at the end of a robot arm that physically interacts with objects in the environment.

End-Effector Pose

The position and orientation of a robot's gripper or tool in 3D space.

End-Result Supervision

Training data that only provides the final correct answer without showing the reasoning steps used to reach it.

End-to-End Driving

An autonomous driving approach that directly maps sensor inputs to control outputs without explicit intermediate representations.

End-to-End Learning

Training a model to solve a complete task directly from raw input (like document images) to final output, without breaking it into separate intermediate steps.

End-to-End Processing

A system that takes raw input (like an image) and produces final output (like structured text) in one unified model, rather than chaining multiple separate tools together.

Endpoint Detection and Response (EDR)

Security software that monitors and responds to suspicious activity on individual computers and devices.

Energy Conserving Descent

An optimization algorithm that preserves energy while descending to escape local minima.

Energy Function

A function that assigns a scalar value to each point in a space, defining an unnormalized probability distribution.

Energy Score

A proper scoring rule for evaluating probabilistic forecasts that measures distance between predicted and observed samples.

Engagement Patterns

Recurring behaviors showing how users interact with content or systems over time.

Ensemble Distillation

A training technique where knowledge from multiple models is combined and compressed into a single, smaller model for better efficiency.

Ensemble Kalman Filter

A statistical method using multiple model realizations to update parameters based on new observations.

Ensemble Methods

Combining multiple models to make better predictions than any single model alone.

Ensemble Voting

A safety technique that combines outputs from multiple models and selects the most agreed-upon result.

Ensemble Weights

Probabilistic scores assigned to multiple documents that determine their relative contribution to the final answer.

Enterprise Language Model

A language model specifically optimized for business and organizational use cases, prioritizing reliability, consistency, and professional output over other characteristics.

Entity Alignment

The task of recognizing that different names or phrases refer to the same real-world concept, such as matching 'MI' with 'myocardial infarction'.

Entity consistency

Maintaining the same appearance and identity of characters, objects, and locations across different scenes in a video.

Entity Extraction

Automatically identifying and pulling out specific names, places, or things from text.

Entity Linking

The task of identifying mentions of real-world concepts in text and connecting them to their canonical definitions in a knowledge base or ontology.

Entity Matching

The task of identifying when different text references refer to the same real-world concept, such as matching variant spellings of a drug name to a single clinical entity.

Entity-based QA

A question-answering evaluation framework that tests whether models can retrieve factual information about specific entities.

Entity-Relational Model

A data structure that represents entities (like users or devices) and the typed relationships between them.

Entropic Optimal Transport

A regularized version of optimal transport that adds entropy constraints to encourage smoother, more balanced assignments between sources and destinations.

Entropy Gradient

The gradient of prediction uncertainty with respect to visual embeddings, used to identify ambiguous regions.

Entropy Maximization

Encouraging an agent to explore diverse state-action pairs by maximizing the entropy of its occupancy measure.

Entropy Regularization

Adding a penalty term based on policy entropy to encourage exploration and prevent premature convergence.

Entropy Shaping

Controlling the randomness of a model's outputs to prevent it from becoming too deterministic or too random during training.

Entropy Sum Strategy

A decoding approach that continues unmasking tokens until cumulative entropy exceeds a threshold, balancing generation speed and quality.

Entropy-Cut Metropolis-Hastings

A sampling algorithm that identifies key decision points in reasoning using token entropy and resamples from those positions to improve mixing efficiency.

Entropy-Limited Operation

System state where the ability to generate random numbers becomes the limiting factor rather than arithmetic computation.

Environment Engineering

Designing the resources, constraints, and interfaces that shape how an agent behaves and explores solutions.

Environment Generation

Automated creation of task specifications and evaluation settings for training or testing agents.

Episodic Memory

AI system's ability to store and recall specific past events or experiences.

Epistemic Asymmetry

A situation where different participants have different information or knowledge about the same topic.

Epistemic Consequences

The effects of AI on how people know things, what they believe, and how they form and share knowledge.

Epistemic integrity

The preservation of an agent's ability to form accurate beliefs and maintain truthful internal representations.

Epistemic orientation

The degree to which discourse relies on evidence-based reasoning versus intuition and subjective belief.

Epistemic Uncertainty

Uncertainty from lack of knowledge that can be reduced with more data or better models.

Equilibrium Computation

Using an algorithm or solver to find the Nash equilibrium strategies for a game.

Equilibrium Internalization

A phenomenon where the model learns to place its initial output near the fixed point, allowing inference without iteration.

Equilibrium-Seeking

An iterative process where agents adjust their decisions until reaching a stable state where no agent benefits from unilateral changes.

Equivariant Architecture

A model design that respects the order or structure of input channels, maintaining consistency regardless of how channels are arranged.

Equivariant Graph Neural Networks

Neural networks designed to respect geometric symmetries and transformations in molecular or crystal structures.

Ergonomic Compliance

How well a design follows established principles for human comfort, safety, and efficient use of space.

Error Analysis

Systematic examination of model failures to identify patterns and root causes beyond aggregate metrics.

Error Correction

A technique used during quantization to detect and compensate for accuracy loss, helping preserve the model's output quality despite aggressive bit-reduction.

Error correlation

A measure of how often two models make mistakes on the same examples, typically measured pairwise.

Error Feedback

A technique that accumulates and corrects for errors from previous steps to improve convergence in distributed training.

Error Magnitude

The size or severity of mistakes a model makes, not just whether it got the answer right or wrong.

Error Management

Firmware algorithms that detect and correct errors in memory to maintain reliability as storage density increases.

Error Propagation

How mistakes in early steps of a process accumulate and worsen downstream results.

Error Recovery

A mechanism to detect failures during reasoning and autonomously correct course through backtracking or alternative paths.

Error Taxonomy

A structured classification system that categorizes different types of errors to enable systematic analysis and mitigation.

Escapable AI Systems

AI systems with sufficient access to their own runtime that they could potentially circumvent internal safety controls.

Euler Characteristic

A topological invariant that counts connected components, holes, and voids in a shape to characterize its structure.

Evaluation Faking

When an evaluator systematically biases its judgments based on contextual information rather than actual content quality.

Evaluation Illusion

When AI judges appear to agree on scores but are actually using shallow patterns rather than substantive reasoning about quality.

Evaluation Metric

A quantitative measure used to assess how well a model or system performs on a specific task.

Evaluation Model

A specialized language model trained to assess and score the quality of outputs from other AI models, acting as an automated judge.

Evaluator Bias

Systematic preference or tendency in how an LLM judges or scores outputs, affecting downstream decisions.

Evasion

Successfully executing an attack while avoiding detection by monitoring or safety systems.

Evasion Attack

An attack where an adversary modifies input features at test time to fool a deployed classifier.

Event Camera

A sensor that captures pixel-level brightness changes asynchronously, producing sparse temporal event streams.

Event curves

Temporal representations that capture when and how much change occurs in music or video.

Event Inference

Automatically detecting higher-level events from lower-level timestamped observations using logical rules.

Event Linking

Grouping related incident reports together to identify a single underlying problem from multiple user descriptions.

Event Sourcing

Recording all changes to data as a sequence of immutable events for full history tracking.

Event Template

A generalized pattern representing a class of similar log messages with variable fields.

Event-Based Scheduling

Making scheduling decisions when events occur (like job arrivals) rather than at fixed time intervals.

Event-Boundary-Driven Compression

Summarizing memory at natural task boundaries (e.g., when a subtask completes) rather than at fixed intervals.

Event-Condition-Action Routing

A decision system that routes tasks based on triggering events and conditions to determine which action or agent to use.

Evidence Accumulation

Collecting and combining signals across multiple training runs to determine which operations reliably improve performance.

Evidence Aggregation

Combining information from multiple frames or observations to make a single robust decision or diagnosis.

Evidence Contradiction

When a model's answer directly contradicts the provided evidence or clinical guidelines.

Evidence Dependence

A model's ability to change its predictions based on whether evidence supports or contradicts a claim.

Evidence Extraction

Automatically identifying and pulling out specific supporting details from text to explain a model's prediction.

Evidence Grounding

Linking AI outputs to specific source documents or facts that support them.

Evidence Portfolio

A collection of diverse, complementary pieces of evidence retrieved to support multi-faceted reasoning.

Evidence Tree

A hierarchical structure of sub-questions built from evidence, where leaf nodes are atomic evaluation targets.

Evidence-Guided Repair

Fixing errors in code or theory by using specific signals like test failures and reviewer feedback to target the root cause.

Evidential Deep Learning

A method for uncertainty estimation that models class probabilities using Dirichlet distributions predicted by a neural network.

Evidential Fusion

A method that combines multiple predictions while quantifying uncertainty using evidence theory.

Evol-Instruct

A training method that gradually increases the complexity of instructions given to a model, helping it learn to handle increasingly difficult tasks.

Evolutionary Algorithm

An optimization method inspired by natural selection that iteratively improves a population of candidate solutions.

Evolutionary Search

An AI optimization technique that mimics natural selection to explore and improve solutions over many iterations.

Exchangeability

A statistical property ensuring that the order of data points doesn't matter, required for conformal prediction to provide valid guarantees.

Executable Code Reuse

Saving and reusing working code solutions instead of text descriptions for repeated tasks.

Executable Environments

Stateful, runnable systems that simulate real-world tool interactions and can verify agent actions.

Execution Broker

A runtime enforcement layer that intercepts and validates all mutation requests before they reach infrastructure APIs.

Execution Diagnosis

Detailed analysis of why an action succeeded or failed, beyond just binary success/failure signals.

Execution Feedback

Continuous-valued supervision signal derived from running and evaluating code outputs without requiring ground-truth solutions.

Execution Grounding

Anchoring AI-generated questions and explanations to actual runtime behavior and concrete execution traces.

Execution Plan

A detailed strategy for solving a problem, which can be implemented and tested before committing to a final answer.

Execution Strategy

Alternative approaches an agent can use to accomplish a task on a specific device (e.g., CLI vs GUI).

Execution trace

A record of every step a program takes as it runs, including variable values and function calls.

Execution Trace Feedback

Detailed information about what happened during a program's execution, used to diagnose failures.

Execution-Based Verification

Validating agent behavior by running code and checking if outputs match expected results, rather than relying on static analysis.

Execution-Grounded Metrics

Evaluation measures based on actually running code and tests, rather than static analysis alone.

Execution-Time AI Alignment

Safety enforcement applied at the moment an AI system takes action, separate from training or inference-time controls.

ExecuTorch

A lightweight runtime framework that optimizes and executes AI models efficiently on mobile and edge devices with limited computational resources.

Exogenous Variable

A variable in a causal model that is not caused by any other variables in the model; represents external sources of randomness.

Expected Calibration Error (ECE)

A metric measuring the gap between a model's predicted confidence and its actual accuracy across predictions.

Expected Improvement

An acquisition function that selects points likely to improve over the current best solution.

Experience Replay Buffer

A memory that stores past interactions or failure cases to train models on diverse scenarios beyond just new data.

Experiential knowledge

Useful patterns and insights extracted from real-world interactions and deployment experience.

Experiential Learning

Learning through direct interaction with the environment and feedback from actions taken.

Experimental Design

Strategically choosing which experiments to run to maximize information gain given a limited budget.

Experimental Discovery

The process of testing hypotheses through controlled experiments to uncover causal relationships.

Experimental Release

An early version of a model released for testing and feedback, which may have bugs or incomplete features compared to stable versions.

Expert Importance

A measure of how much each expert in an MoE model contributes to the final output, used to decide which experts need higher precision.

Expert Parallelism

Distributing mixture-of-experts layers across devices so different experts run on different hardware.

Expert Routing

The mechanism in a mixture-of-experts model that decides which specialized sub-networks should process each piece of input.

Expert Specialization

The process where different experts in an MoE learn to handle distinct types of inputs or tasks (e.g., code vs. math).

Expert Utilization

How evenly the workload is distributed across experts; balanced utilization prevents some experts from being unused.

Explainability

The ability to understand and interpret why an AI model made a specific decision or prediction.

Explanation Consistency

Whether a model applies the same reasoning strategy (highlights the same regions) across different instances of the same class.

Explicit Thinking

A mode where a model generates visible reasoning steps before producing a final answer, allowing you to see its problem-solving process.

Explicit Thinking Mode

A feature that allows a model to show its reasoning process step-by-step before providing an answer, useful for complex problems that benefit from deliberate problem-solving.

Exploitability

The maximum gain a player can achieve by deviating from an equilibrium strategy.

Exploration

The process of trying diverse actions during training to discover which ones lead to better outcomes.

Exploration-Exploitation Tradeoff

Balancing between exploiting known good solutions and exploring new possibilities to find better ones.

Exponential Moving Average

A weighted average that gives more importance to recent values than older ones.

Exposure Score

A metric measuring the share of job tasks that an AI model can assist with or automate.

Expression Generalization

A model's ability to handle facial expressions it wasn't explicitly trained on by learning underlying expression patterns.

Extended Context Processing

The capability to work with and maintain understanding across large amounts of text or multiple documents during reasoning.

Extended Object Tracking

Estimating both the position and shape of objects that occupy multiple sensor measurements.

Extended Reasoning

A capability that allows a model to think through complex problems step-by-step internally before providing a final answer.

Extended Thinking

A reasoning technique where a model works through a problem step-by-step internally before providing an answer, improving accuracy on complex tasks.

External Regret

Standard online learning metric measuring performance against a fixed best strategy, without accounting for opponent adaptation.

External Rewards

Reward signals based on computational verification methods rather than the model's own internal signals.

External Validity

Whether results from a controlled study apply to real-world situations outside the lab.

Externalizing Reasoning

The practice of having a model explicitly output its internal thought process and problem-solving steps rather than keeping them hidden.

Extrapolation

Predicting model behavior in a region (like very large training runs) based on observations from smaller regions.

Extrapolative Prediction

Making predictions beyond the range of training data, such as forecasting system behavior at untested excitation levels.

Eye-Tracking

Technology that records where and how a person's eyes move while reading or viewing content.

F

Face Recognition

Technology that identifies or verifies people by analyzing facial features in images.

Facial Animation

Generating 3D facial motion and deformations, typically driven by audio or text input.

Facility-Location Coverage

An optimization technique that selects diverse items by maximizing how well they represent the full set of options.

Fact-checking

The process of verifying claims against reliable sources to determine their accuracy.

Fact-checking without retrieval

Verifying if claims are true using only an LLM's internal knowledge, without searching external databases.

Factored Norm

A decomposition of norm computation into smaller intermediate terms to avoid materializing large dense matrices.

Factual Accuracy

How often an AI model produces correct, verifiable information without errors or false claims.

Factual Consistency

Whether generated text accurately reflects and doesn't contradict the source material or known facts.

Factual Freshness

How current and up-to-date a model's knowledge is, particularly regarding recent events and facts.

Factual Grounding

Anchoring a model's responses to verified, real-world information rather than relying solely on patterns learned during training.

Factual Recall

An LLM's ability to accurately retrieve and output factual information from its training data.

Factuality-Oriented Metrics

Evaluation measures that assess whether generated summaries contain accurate, verifiable information from the source.

Fail-Closed

A safety mechanism that defaults to denying/blocking actions when uncertain, rather than allowing them.

Failure Abstraction

A compact representation of what went wrong that helps determine whether recovery is local or requires global replanning.

Failure Domain

A group of related system components or subsystems that share common failure modes and characteristics.

Failure Probability

The quantified likelihood that an AI system will make a harmful or incorrect decision in real-world deployment.

FAIR Principles

Guidelines making data Findable, Accessible, Interoperable, and Reusable by machines and humans.

Fairness Audit

Systematic evaluation of an AI system to detect and measure bias across demographic groups or decision scenarios.

Faithfulness

Whether an AI model's stated reasoning actually explains how it arrived at its answer, or if it's post-hoc justification.

Fake News Detection

The task of identifying false or misleading news articles, typically framed as a classification problem.

False Data Injection

A cyberattack where attackers insert malicious data into sensor measurements to deceive control systems.

False Memory Propagation

When incorrect or outdated information from past interactions influences future reasoning.

False Negative Rate (FNR)

The percentage of actual threats that a detection system fails to identify, missing real attacks.

False Positive Rate (FPR)

The percentage of benign activities incorrectly flagged as threats by a detection system.

False Premise Detection

The ability to identify when a question contains incorrect assumptions or fabricated facts before answering.

Farthest-Point Sampling

A greedy algorithm that selects points by always choosing the one farthest from previously selected points.

Fast Weight Update

A method for efficiently updating model parameters or memory states during forward passes without full recomputation.

Fast Weights

Model parameters that are quickly adapted during inference to capture task-specific or input-specific patterns.

Fault Localization

Pinpointing the exact location of bugs or errors in code or systems.

Fault Propagation Graph

A graph showing how errors flow through transformer components from their origin to observable symptoms.

Fault Tolerance

The ability of a system to continue operating correctly even when components fail.

Feasibility Screening

Automatically checking whether a problem instance has at least one valid solution before using it for testing.

Feature Absorption

When general features develop arbitrary exceptions or special cases, reducing their coherence and interpretability.

Feature Augmentation

Enhancing a model by adding hand-crafted or extracted features (like linguistic metrics) alongside learned representations.

Feature Binding

The representation of which visual features (color, shape, texture) are grouped together as part of a single object.

Feature Caching

Storing intermediate computed features during inference to reuse them in later steps, reducing redundant computation.

Feature Engineering

The process of selecting and designing input features that a machine learning model uses to make predictions.

Feature Extraction

The process of using a model to convert raw input text into numerical representations (features) that capture the meaning of the text.

Feature Fragmentation

When a single concept is scattered across many separate features instead of being cleanly captured by one or a coherent group.

Feature Importance

A measure of how much each input variable contributes to a model's predictions.

Feature Interaction

How multiple input features combine together to influence a model's prediction, beyond their individual effects.

Feature Interaction Analysis

A method to identify how combinations of input features jointly influence a model's predictions, beyond individual feature effects.

Feature Learning

The process where a neural network learns to extract useful patterns from raw data during training.

Feature Linear Separability

A measure of how well different visual concepts can be distinguished in a model's learned feature space.

Feature Representation

A learned or engineered encoding that captures important patterns in data for downstream tasks.

Feature Selection

Choosing a subset of relevant input variables to improve model performance and interpretability.

Feature Splitting

When a single semantic concept is fragmented across multiple redundant latent features instead of being represented by one unified feature.

Feature-wise Linear Modulation (FiLM)

A technique that dynamically adjusts learned representations by scaling and shifting features based on problem-specific conditions.

Federated Learning

Training models across multiple devices without centralizing sensitive data in one place.

Feed-Forward Network (FFN)

A standard neural network layer in transformers that processes information independently at each position.

Feed-forward transformer

A neural network that processes input in a single forward pass without recurrence or iterative refinement.

Feedback Model

The method used to apply feedback text to refine and improve a search query representation.

Feedback Source

Where the text used to improve a search query comes from, such as LLM-generated text or actual documents.

Feedback-Driven Control

Using execution results and error signals to adaptively adjust agent behavior and improve reliability over time.

Few-shot Learning

Training or prompting a model with only a small number of examples to perform a new task.

Few-shot prompting

Providing a language model with a small number of examples to guide it toward the desired output format or behavior.

FHIR (Fast Healthcare Interoperability Resources)

A standard format for exchanging healthcare data between systems, enabling structured and interoperable clinical information.

Fictitious Play

A game-theoretic learning process where players iteratively update strategies by best-responding to the empirical distribution of opponents' past actions.

FID Score

Fréchet Inception Distance—a metric evaluating generative model quality by comparing feature distributions of real and generated images.

Fidelity

The degree to which a quantized or compressed model preserves the quality and accuracy of the original full-precision model.

Fidelity gate

A filtering mechanism that only includes accurately generated entity appearances in consistency evaluation metrics.

Fidelity Metric

A measure of how well an explanation captures the true reasoning of a model by testing prediction changes.

Field-Programmable Gate Array (FPGA)

Reconfigurable hardware that can implement custom logic circuits, enabling deterministic execution of coordination rules.

Fill-in-the-Middle

A code completion technique where the model predicts missing code between existing lines, rather than only generating code forward from a starting point.

Fine-grained Classification

Distinguishing between very similar categories, like telling apart different bird species rather than just identifying 'bird vs. not bird'.

Fine-Grained Text Rendering

The ability to accurately generate readable text and small details within generated images.

Fine-Grained Visual Details

Small, specific visual elements in an image, such as text within a photo or subtle differences between similar objects.

Fine-Tunable

The ability to further train or customize a pre-trained model on your own data to adapt it for specific tasks or domains.

Fine-Tune

A model created by training an existing pre-trained model on new data to specialize it for specific tasks or behaviors.

Fine-Tuned

A pre-trained model further trained on a smaller, task-specific dataset to improve performance on that task.

Fine-tuned Model

A pre-trained model adapted for a specific task or style using additional training data.

Fine-Tuning

The process of further training a pre-trained model on new data to adapt it for specific tasks or domains.

Finger-Level Action Ownership

Explicit assignment of which fingers control which task, preventing conflicting commands to the same actuators.

Finite Element Method (FEM)

A numerical technique that breaks a complex domain into small pieces to solve physics equations approximately.

Finite fields

Mathematical structures with finitely many elements where arithmetic operations follow specific rules.

Finite Horizon

A problem setting with a fixed, known endpoint in time, as opposed to indefinite or infinite-horizon problems.

Firing-Rate Neural Network

A recurrent neural network model where neurons output continuous activation rates rather than discrete spikes.

First-order Logic

A formal language for expressing rules and constraints using predicates, variables, and logical operators.

First-Order Stationary Point

A point where the gradient of the objective function lies in the normal space to the feasible region.

First-Passage Time

The time it takes for a stochastic process to reach a target state for the first time.

First-Stage Retriever

The initial search system that finds candidate documents before refinement techniques are applied.

First-Try Reliability

The percentage of tasks completed correctly on the first attempt without requiring corrections.

Fisher Alignment

A measure of how similarly two tasks update model parameters, computed from the geometry of gradients in activation space.

Fisher Discrepancy

A metric measuring the difference between score functions of two distributions.

Fisher Information Matrix

A matrix that captures the curvature of the loss landscape in a way that's invariant to how you parameterize the model.

Fitted Dynamic Programming

A variant of dynamic programming that first estimates unknown functions (like demand) from data, then uses those estimates for optimization.

Fixation

A moment when the eye pauses on a specific location while viewing an image, typically lasting 100-500 milliseconds.

Fixed-point iteration

Repeatedly applying a function until it converges to a stable value, used here for test-time computation in looped models.

Fixed-Point Solving

Finding a stable state where a function's output equals its input, used here to refine embeddings iteratively.

Fixed-Size Embeddings

Embeddings that always produce vectors of the same length regardless of input length, which limits how much detail can be captured for very long documents.

Flagship Model

A company's primary, most capable model designed to showcase their best technology and handle the most demanding use cases.

Flash Attention

An optimized attention mechanism that computes the same results as standard attention but much faster and with lower memory usage by reorganizing how computations are performed.

Flash Translation Layer

Software abstraction that maps logical addresses to physical memory locations in SSDs, managing wear and errors.

Flexible Spectrum Access

Dynamically allocating wireless frequencies based on real-time demand instead of fixed assignments.

Floating Point Precision

The number of bits used to represent decimal numbers in a model; lower precision (like 8-bit) uses less memory but may lose some accuracy compared to higher precision (like 32-bit).

Floorplanning

The process of deciding where to place components on a chip to meet design constraints and performance goals.

Flow Based Generation

Generating data by learning reversible transformations between simple and complex distributions.

Flow Estimation

Computing pixel-level motion vectors between frames to guide alignment and temporal processing in video tasks.

Flow Map

A learned function that maps an initial state to a future state by following the dynamics of a system.

Flow Matching

A generative modeling technique that learns to transform random noise into realistic data by following learned flow paths.

fMRI

Functional magnetic resonance imaging; a non-invasive technique measuring brain activity through blood flow changes.

Focal-Contrastive Fine-tuning

A training approach combining focal loss (which emphasizes hard examples) with contrastive learning to handle imbalanced datasets.

Focal-Contrastive Fine-tuning

A training approach combining focal loss (which focuses on hard examples) with contrastive learning to handle imbalanced datasets.

Foley

Custom sound effects created to match specific actions or movements in video, like footsteps or door slams.

Forced Alignment

Technique that aligns spoken words to their timestamps in audio by constraining the alignment to match a known transcript.

Forgetting Factor

A parameter that controls how quickly a filter discounts old data, balancing between adapting to new conditions and maintaining stability.

Fork Verification

Testing reward hypotheses by branching from shared policy checkpoints and comparing short-horizon performance to assess reward quality.

Formal Mathematics

Mathematical statements and proofs written in a machine-checkable language that a computer can verify for correctness.

Formal Specification

Expressing system requirements or policies in a precise mathematical language that tools can automatically verify.

Formal Theorem Dependency

A graph encoding which theorems logically depend on which others, capturing what can validly follow in formal mathematics.

Formal Theorem Proving

Using formal logic and proof assistants to verify mathematical statements with complete rigor, typically in languages like Lean or Coq.

Formal Verification

Mathematical proof that a system meets its specifications, here implemented in Lean 4 to certify material stability predictions.

Formative Feedback

Real-time guidance given to students during learning to help them improve, rather than just assigning a final grade.

Forward Dynamics Propagation

Simulating a robot's future states by repeatedly applying its dynamics model to predict outcomes of candidate actions.

Forward Euler Step

A numerical method that approximates solutions to differential equations using small discrete steps.

Forward KL Divergence

A training objective that penalizes the model for assigning probability to regions the true distribution doesn't cover.

Forward Pass

A single computation cycle where input data flows through the model's layers to produce an output prediction.

Forward-looking Intent

An agent's reasoning about future consequences and goals rather than just reacting to past events.

Forward-Mode Automatic Differentiation

An efficient method for computing derivatives by propagating changes forward through a computation graph.

Foundation Model

A large pre-trained model that serves as a starting point for building other models, rather than being trained from scratch.

Foundation Model Architecture

The underlying structural design of a neural network that determines how it processes and learns from data, distinct from standard transformer designs.

Foundation Models

Large pre-trained AI models that can be adapted to many different tasks without starting from scratch.

Fourier Domain

Mathematical representation showing which frequencies (periodic patterns) are present in data.

Fourier Neural Operator (FNO)

A neural operator that parameterizes convolutions in the complex Fourier domain using FFT for efficient PDE solving.

Fourier optics

The study of light propagation and diffraction using Fourier analysis and frequency-domain methods.

FP16 Precision

A data format that stores model weights using 16-bit floating-point numbers, preserving full model accuracy while using less memory than 32-bit formats.

FP4 (4-bit Floating Point)

A low-precision numerical format that uses only 4 bits to represent numbers, enabling faster computation and smaller model sizes compared to standard 32-bit precision.

FP4 Floating Point

A 4-bit number format used in quantization that represents values with minimal precision, significantly shrinking model size while maintaining reasonable accuracy.

FP4 Format

A 4-bit floating-point number format that represents model weights with very low precision, enabling extremely efficient inference on compatible hardware.

FP4 Precision

A ultra-low precision format using 4-bit floating-point numbers to represent model weights, enabling extreme compression.

FP4 Quantization

A compression technique that represents model weights using only 4-bit floating-point numbers instead of larger formats, reducing memory usage and speeding up inference.

FP8 (8-bit Floating Point)

A compressed number format that uses 8 bits instead of the standard 32 bits, dramatically shrinking model size at the cost of slightly reduced precision.

FP8 Dynamic Quantization

A compression technique that reduces model size and speeds up inference by representing weights and activations using 8-bit floating-point numbers, with dynamic scaling adjusted per batch to maintain accuracy.

FP8 Dynamic Quantization

A specific quantization method that uses 8-bit floating-point numbers and adjusts precision dynamically based on the data being processed, balancing speed and accuracy.

FP8 Floating Point

An 8-bit numerical format that stores numbers with reduced precision compared to standard formats, enabling smaller model sizes and faster computation.

FP8 Precision

A data format that stores numbers using 8 bits instead of the standard 32 bits, significantly reducing memory requirements with minimal quality loss.

FP8 Quantization

A compression technique that reduces model size by representing weights using 8-bit floating-point numbers instead of higher precision, making it faster and more memory-efficient.

FP8 Static Quantization

A specific quantization method that converts model weights to 8-bit floating-point numbers using fixed scaling factors, reducing model size while potentially affecting accuracy on complex tasks.

Fractal Attractor

A set that an optimization trajectory converges to, with self-similar structure at multiple scales rather than converging to a single point.

Framework Inadequacy

Recognition that an existing mathematical or conceptual framework cannot fully capture or solve a problem.

Framing Effect

A bias where the way information is presented (e.g., as a risk or opportunity) influences decision-making.

Frank-Wolfe Optimization

A projection-free optimization algorithm that iteratively selects extreme points to build sparse solutions efficiently.

Free-Text Generation

A model's ability to produce answers without predefined options, requiring genuine recall and reasoning.

Frequency Distribution

How often different facts or tokens appear in training data, which affects what models learn.

Frequency Separation

Decomposing signals into high-frequency (details, edges) and low-frequency (overall structure, semantics) components.

Frequency-Stratified Evaluation

Evaluating model performance separately for rare, medium, and common classes to reveal patterns hidden by overall metrics.

Frontend Generation

The automated creation of user interface code and visual elements based on descriptions or specifications.

Frontier Model

A state-of-the-art AI model representing the cutting edge of what's currently possible in terms of capability and performance.

Frontier Models

State-of-the-art, cutting-edge AI models that represent the current best performance in the field.

Frontier-Class

A model that represents the current state-of-the-art or cutting edge in AI capabilities, competing with the most advanced models available.

Frontier-Scale Models

The largest and most advanced language models available, representing the cutting edge of AI capabilities.

Frontier-Tier Model

A cutting-edge AI model representing the current state-of-the-art in performance and reasoning capabilities.

Frozen Encoder

A pre-trained model component that is kept unchanged during training to preserve its learned knowledge.

Fudge Factor

An arbitrary numerical adjustment that makes code pass tests but has no basis in the underlying theory.

Full-duplex dialogue

A conversation model that can listen and speak at the same time, enabling more natural simultaneous interaction.

Full-Precision

A model using standard 32-bit floating-point numbers to represent weights, providing maximum accuracy but requiring more memory.

Full-Precision Weights

Model parameters stored at maximum numerical accuracy (typically 32-bit floating point), which provides the best quality but requires more memory and computation.

Function Calling

The ability of a model to output structured requests to invoke external tools or APIs rather than generating free-form text.

Function Vector Representations

Internal model representations that encode what tasks do, allowing comparison of task similarity and prediction of learning trajectories.

Function Vectors

Vector representations of tasks extracted from model activations during in-context learning.

Function-Preserving Expansion

Growing a model's capacity while mathematically guaranteeing it behaves identically to the original at the start.

Function-preserving Transforms

Mathematical operations like rotations that rearrange a model's weights without changing what the model computes.

Functional Correspondence

A mapping between adaptive bases in function spaces that captures relationships between continuous fields.

Functional Requirements

Specifications describing what a software system should do and its specific behaviors and features.

Functional Token

A discrete token that encodes both an agentic operation and latent visual reasoning capability without explicit visual supervision.

Funnel Attention

An attention mechanism that progressively compresses and simplifies the input sequence, reducing computational cost while maintaining important information.

Fused Kernels

GPU operations combined into a single kernel to reduce memory traffic and improve computational efficiency.

Fuzzy Rules

Logic-based rules that handle uncertainty and gradual membership rather than strict true/false classifications.

Fuzzy String Matching

Comparing text strings by measuring character-level similarity rather than exact matches.

G

Gain Modulation

A mechanism where a context signal scales the magnitude of state-dependent responses without changing their underlying structure.

Game Description Language

A formal notation for encoding game rules so different AI systems can play the same game consistently.

Game-Theoretic Equilibrium

A stable state where no agent can improve their outcome by unilaterally changing their strategy.

Gated Correction

A learned mechanism that selectively applies corrections to predictions based on per-dimension scaling factors.

Gateway Neuron

A neuron that controls whether tokens are routed to standard or exception processing paths.

Gating Mechanism

A learned or rule-based function that selectively enables or disables components based on input conditions.

Gauge Invariance

A mathematical property ensuring a model's predictions remain consistent regardless of arbitrary coordinate system choices or numerical representations.

Gaussian Process

A statistical model that learns patterns from data and provides uncertainty estimates for predictions.

Gender Bias

Systematic tendency of models to favor one gender over others in language generation and translation tasks.

General Reasoning

The capability to think through problems logically, break down complex questions, and arrive at conclusions across a wide variety of topics.

General-Purpose

Designed to handle a wide variety of different tasks rather than being specialized for one specific domain.

General-Purpose Language Model

A model trained to handle a wide variety of text tasks—like writing, answering questions, and reasoning—rather than being specialized for one specific task.

General-Purpose Model

An AI model designed to handle many different types of tasks well, rather than being specialized for one specific domain.

Generalist Model

A model trained to perform well across many different types of tasks rather than being specialized for one specific domain.

Generalist Robot

A robot trained to perform many different everyday tasks rather than being specialized for one specific job.

Generalization

A model's ability to perform well on new, unseen data that differs from what it was trained on.

Generalization Error

The difference between a model's performance on training data versus unseen test data.

Generalized Procrustes Algorithm

A mathematical method for aligning and comparing representations across different neural networks by finding optimal rotations.

Generate-Evaluate-Regenerate

A workflow where a model generates output, evaluates its quality, and regenerates if needed to improve results.

Generate-then-Answer (GtA)

An inference approach where a model generates an intermediate image before answering a question about it.

Generative Adversarial Network (GAN)

A model with two competing networks—one generates samples while the other tries to distinguish real from fake.

Generative Embeddings

Vector representations of text created by generative language models that capture semantic meaning.

Generative Flow Networks (GFlowNets)

A probabilistic framework that generates samples with probability proportional to a reward function, useful for optimization tasks like molecule discovery.

Generative Information Extraction

Using language models to generate structured information from text rather than identifying fixed spans, allowing more flexible output formats.

Generative Language Model

A model trained to generate new text by predicting the next word or sequence of words based on patterns it learned during training.

Generative Model

An AI model trained to create new data (like images) that resembles its training data.

Generative Post-training

Additional training phase after initial pretraining that uses generative tasks to improve model capabilities.

Generative Process

A model's procedure for creating new outputs (like floor plans) based on learned patterns from training data.

Generative Recommendation

A recommendation approach that predicts users' next interactions by generating item tokens based on historical behavior patterns.

Generative Safety

A methodology that grows phenomena from micro-level interaction conditions to identify sufficient mechanisms, detect thresholds, and design safety interventions.

Generator matrices

Matrices used to encode data into codewords in error-correcting codes.

Genetic Algorithm

A metaheuristic optimization method inspired by natural selection that evolves candidate solutions over generations.

Geodesic Distance

The shortest path between two points along a curved surface, as opposed to straight-line distance.

Geographic Plausibility

Checking that spatial analysis results are realistic (e.g., no negative distances, valid coordinate ranges, sensible geographic relationships).

Geometric Algebra

A mathematical framework (Clifford algebras) that extends vectors with operations for rotations, reflections, and higher-dimensional relationships.

Geometric Biases

Structural constraints added to a model to encode domain knowledge about geometry, such as crystal lattice properties.

Geometric Consistency

Maintaining structural and spatial accuracy across multiple views or representations of a 3D object.

Geometric Coupling

The alignment between router weight directions and expert weight directions that emerges during training.

Geometric Foundation Model

A pretrained neural network that understands and represents 3D spatial geometry and object structure.

Geometric Reconstruction

Building a 3D model of a scene from video or images by estimating depth and camera motion.

Geometric Separability

Property where data points can be separated into groups using a linear boundary in vector space.

Geometric Surface Representation

Explicitly modeling the 3D shape and surface properties of objects in a scene.

Geometry-Grounded Tokens

Multimodal representations that preserve spatial and geometric information about the scene to maintain disambiguating context.

Geospatial Analytics

Using machine learning and statistics to analyze data tied to geographic locations.

GGUF

A file format for quantized models designed for efficient CPU and GPU inference with llama.cpp.

GGUF Format

A file format designed for efficient storage and loading of large language and embedding models, optimized for fast inference on various hardware.

Girsanov Change of Measure

A mathematical technique for reweighting probability distributions along trajectories without computing gradients.

Global Attention

Attention mechanism where each token can attend to all preceding tokens in the sequence.

Global Majority

Populations and nations that represent the numerical majority of the world but are historically marginalized in Western-dominated systems.

Goal Drift

When an AI agent gradually abandons its original objective and pursues different goals instead.

Goal Embedding

A low-dimensional vector that captures task identity and enables rapid adaptation to new tasks without retraining.

Goal Misspecification

When an AI system's stated objective doesn't match the actual intended outcome, leading to unintended behaviors.

Goal-Conditioned

A system that adapts its behavior based on an inferred or specified goal or intent.

Goal-Conditioned Recovery Policy

A learned policy that generates corrective actions to move a system toward a specified target state or goal.

Goal-Reaching Probability

The likelihood that an agent successfully reaches and maintains a target state or goal under a given policy.

Gold-Relevance Distillation

Training a retriever to rank examples by their usefulness for solving a problem, using ground-truth solution outcomes.

Goodhart Gap

The divergence between a proxy metric (learned reward) and true performance when the proxy is optimized directly.

Gossip Averaging

A decentralized consensus method where nodes iteratively average values with neighbors to reach agreement without central coordination.

Governance Constraints

Rules and policies that limit AI autonomy to ensure oversight, safety, and alignment with organizational values.

Governance Framework

A set of rules and structures that constrain and guide AI behavior to ensure reliability and consistency.

GPL-3.0 License

An open-source license that allows free use and modification of software, but requires any derivative works to also be open-source under the same license.

GPT Architecture

A transformer-based neural network design that processes text sequentially and predicts the next word based on previous context.

GPT-2 Architecture

A transformer-based neural network design from OpenAI that processes text sequentially to predict and generate the next word in a sequence.

GPT-2 Architecture

An older transformer-based design for language models that generates text by predicting one word at a time, simpler and smaller than modern alternatives.

GPT-2 Variant

A modified version of the GPT-2 architecture that changes the original design, such as by reducing size or adjusting training.

GPT-3-Style Architecture

A transformer-based design that follows the same structural principles as OpenAI's GPT-3 model, using layers of attention mechanisms to process text.

GPT-Family Architecture

A class of transformer-based language models descended from the original GPT design, characterized by autoregressive text generation and broad general-purpose capabilities.

GPT-J Architecture

A transformer-based neural network design that uses self-attention to process and generate text, serving as the structural blueprint for this model.

GPT-NeoX

An open-source large language model architecture based on the GPT design, created as an alternative to closed-source models.

GPT-NeoX Architecture

An open-source transformer-based architecture designed for training large language models, similar in structure to GPT models.

GPT-Style Architecture

A neural network design based on transformer technology that processes text sequentially and generates one word at a time.

GPTQ

A quantization technique that compresses model weights to lower precision, reducing file size and memory requirements while maintaining reasonable performance.

GPU Allocation

Assigning GPU resources to different models or tasks to optimize throughput and latency.

GPU Contention

Performance degradation that occurs when multiple inference requests compete for the same GPU's memory and compute resources.

GPU Memory

The high-speed memory on a graphics processor used to store and process model weights and computations during inference.

GPU Optimization

Designing and tuning a model to run efficiently on graphics processing units (GPUs), which are specialized hardware that accelerates AI computations.

Graded Relevance

Relevance judgments on a scale (e.g., 0-3) rather than binary relevant/not-relevant labels.

Gradient Alignment

A technique ensuring that gradient updates from different tasks point in compatible directions to avoid conflicts.

Gradient Approximation

Estimating how model parameters should change without actually computing full gradients or updates.

Gradient Ascent Unlearning

An unlearning method that updates model weights in the opposite direction of poisoned data gradients to remove their influence.

Gradient Based Optimization

Improving model performance by following the direction of steepest improvement in parameters.

Gradient Bias

Systematic error in gradient estimates that prevents optimization from reaching the true optimum.

Gradient Boosting

Building models sequentially where each new model corrects errors from previous ones.

Gradient Clipping

Limiting the magnitude of gradients during training to prevent extreme updates and improve stability.

Gradient Communication

Sending model weight updates between devices and servers during distributed training, a major bottleneck on bandwidth-limited networks.

Gradient Compression

Reducing the size of gradient data to speed up training on distributed systems.

Gradient Conflict

When different training objectives pull model updates in opposing directions, causing optimization to fail or degrade.

Gradient Interference

Conflicting parameter updates from different tasks that degrade performance when stored in shared model components.

Gradient Normalization

Scaling gradient values to maintain consistent learning rates across different parameter groups or layers.

Gradient Reuse

Amortizing gradient computation across multiple training steps by reusing cached gradients for repeated examples.

Gradient Reversal

A training technique that flips gradient signs to force a model to learn features that fool an adversarial classifier.

Gradient Staleness

Using outdated gradient information from earlier training steps due to asynchronous updates across distributed systems.

Gradient Surgery

Technique that selectively modifies or blocks gradient flow to prevent interference between different learning objectives.

Gradient-Based Explanation (GradCAM)

An explainability technique that uses model gradients to identify which input features most influence predictions.

Gradient-Based Initialization

Setting starting values for trainable parameters using information from model gradients to improve convergence and final performance.

Gradient-Free Optimization

Optimizing a function without computing gradients, using only function values or rankings.

Grading Cascade

A multi-layer evaluation pipeline that applies increasingly lenient or human-intensive grading strategies to improve reliability.

Gram Matrix

A matrix formed by computing inner products between vectors, used to capture correlations in weight updates.

Grammatical Error Correction

A task where a model identifies and fixes grammar, spelling, and syntax mistakes in written text.

Grammatical Gender

A linguistic system where nouns and related words are classified into categories requiring specific agreement patterns.

Granularity

The level of detail at which something is analyzed, such as document, sentence, or token level.

Graph Attention

An attention mechanism that learns weighted interactions between nodes in a graph structure.

Graph Classification

The task of assigning a label or category to an entire graph based on its structure and node features.

Graph Domain Adaptation

Transferring knowledge from a labeled source graph to an unlabeled target graph when their structures or distributions differ.

Graph Edit Distance (GED)

A measure of how different two graphs are, based on the minimum edits needed to transform one into the other.

Graph Encoding

Converting a graph structure into a compact text representation that preserves its properties.

Graph Neural Network

A neural network that operates on graph-structured data by passing messages between connected nodes to learn relational patterns.

Graph Neural Networks

Neural networks designed to process graph-structured data by learning representations of nodes and edges.

Graph Representation Learning

Methods for converting graph structures into numerical vectors that preserve meaningful information about nodes and edges.

Graph-Based Multistep Process

A structured approach that represents events and their relationships as a graph and processes them in sequential stages.

Graph-Bound

Execution state tied to the boundaries of a computation graph, enabling efficient snapshot and restore of all intermediate values.

Graph-Temporal Process

A model representing how events propagate through a network structure over time.

Graph-Theoretic Metrics

Mathematical measures that quantify properties of network structures, such as node centrality or edge importance.

Graph-to-Text Generation

The task of converting structured graph data (entities and relationships) into natural language descriptions.

Greedy Decoding

Generating text by always selecting the highest-probability next token, without exploring alternatives.

Green's Function

A fundamental solution to a differential operator that characterizes how the operator responds to point sources.

Grokking

A phenomenon where a model's test performance suddenly improves long after training loss has plateaued.

Ground Truth

Accurate reference labels or measurements used to train and evaluate machine learning models.

Ground Truth Factors

The actual underlying causes or features that explain observed data in a system.

Grounded Generation

Generating text that is anchored to external knowledge sources or constraints, rather than purely from learned patterns.

Grounded Reasoning

AI reasoning that relies on specific documents or data provided to the model, rather than just its training knowledge.

Grounding

The practice of ensuring a model's responses are based on and supported by provided source documents rather than generated from general knowledge.

Group Entropy

A generalized measure of uncertainty or disorder that follows mathematical group rules, extending beyond standard entropy.

Group Relative Policy Optimization

A training method that improves model reasoning by comparing outputs and rewarding better explanations.

Group Size

In quantization, the number of weights that share a single scaling factor; smaller groups preserve more precision but use more memory, while larger groups save more memory but may lose detail.

Group Wise Quantization

Reducing model size by compressing weights in groups rather than individually.

Group-Level Simulation

Predicting aggregate behavior of a group of users rather than individual users, useful for testing business strategies.

Grouped Attention

An attention mechanism that groups variables together to reduce computational complexity while capturing dependencies.

Grouped-Query Attention

An optimization technique that reduces memory usage and speeds up inference by having multiple query heads share the same key and value heads instead of each having their own.

GRPO

Group Relative Policy Optimization, a reinforcement learning algorithm for fine-tuning language models with reward signals.

Guardrails

Safety mechanisms built into a model to refuse harmful requests or prevent it from generating unsafe content.

Gui Agent

An AI system that interacts with computer interfaces by clicking, typing, and navigating screens.

GUI Grounding

The ability to identify and locate specific elements (like buttons or text fields) within a graphical user interface based on natural language descriptions.

Guidance

A technique to steer AI generation toward desired outputs by providing additional control signals during inference.

Guidance Mechanism

A technique that steers a model's output toward desired behavior by balancing multiple objectives during inference.

Guided Decoding

Steering a model's text generation process using external signals or constraints without modifying the model itself.

Guided In-Sample Selection (GIST)

A training technique that intelligently selects the most informative examples from your training data to improve model efficiency and performance.

Gumbel-Softmax Sampling

A differentiable relaxation technique that approximates discrete choices to enable gradient-based optimization.

H

Hadamard Transform

A mathematical rotation that reorganizes data to expose structure, used here to normalize activations.

Hallucination

When a model generates plausible-sounding but factually incorrect or fabricated information.

Hallucination Detection

The ability to identify when a model generates false or unsupported information that isn't grounded in the provided source material.

Halo Effect

A cognitive bias where one positive trait influences overall judgment, like trusting code from reputable authors.

Hamilton Jacobi Bellman Equation

A mathematical equation solving optimal decision-making problems over time.

Hamiltonian Dynamics

A framework from physics describing how systems evolve while conserving energy, here applied to optimizer behavior.

Hamiltonian Path

A route that visits every location exactly once without repeating any node.

Hamiltonian Simulation

A quantum computing technique that simulates the evolution of a physical system described by a Hamiltonian.

Hand-Eye Calibration

The geometric relationship between a camera and a robot's end-effector that enables coordinate transformation.

Handwriting Recognition

The ability of a model to identify and interpret handwritten characters and words from images, accounting for variations in writing style and quality.

Hard Constraint

A rule that must always be satisfied during optimization, rather than being treated as a soft penalty that can be violated.

Hard Negatives

Challenging negative examples that are similar to the target but still incorrect, used during training to make the model learn more nuanced distinctions.

Hardware Optimization

Tuning a model's design or training to run more efficiently on specific hardware (like NVIDIA GPUs), reducing memory usage and inference time.

Harm Taxonomy

A structured system that categorizes different types of harmful content (like violence, hate speech, or misinformation) so a model can recognize and classify them.

Harmonic Reasoning

An architecture that alternates between thinking (reasoning about a problem) and acting (taking physical steps), allowing the model to plan and execute robot actions iteratively.

Harness Engineering

The design and implementation of control systems that manage agent behavior and task execution.

Harness Recursion

The code-first extension of model recursion where agents spawn full agent instances rather than just making additional model calls.

Hazard Analysis

Systematic process of identifying potential failures and dangerous scenarios in a system.

Hazard Function

The instantaneous rate of an event occurring at a given time, conditional on survival up to that time.

Head-wise Causal Intervention

Systematically disabling individual attention heads to determine which ones are causally responsible for specific model behaviors.

Heavy Hitter Detection

Identifying frequently occurring items in a dataset while preserving privacy through noise addition.

Heavy-tailed noise

Gradient noise with extreme values that occur more frequently than in normal distributions, common in real LLM training.

Helpfulness Consistency

A metric measuring whether a model provides equal depth and engagement when responding to paired political prompts from opposing sides.

Hermite Expansions

Mathematical technique to approximate probability distributions using orthogonal polynomials.

Hessian Spectrum

The complete set of eigenvalues of the loss function's second-derivative matrix, describing the curvature in all directions.

Heterogeneous Preferences

Systematic differences in how different groups (by language, task, etc.) rank or prefer models.

Heterogeneous Treatment Effects (HTE)

Differences in how a treatment affects different individuals based on their characteristics.

Heterophilous Graphs

Graphs where nodes with different labels are more likely to connect, opposite to homophilous graphs.

Heteroscedasticity

When the variance of a distribution differs across groups or conditions, rather than being uniform.

Heuristic

A practical problem-solving method that finds good solutions quickly without guaranteeing optimality.

Hidden Dimension

The size of the internal vector representation used by a neural network to process and store information about the input.

Hidden Premises

Unstated assumptions or facts used in reasoning that are not explicitly acknowledged or justified.

Hidden Representations

The internal numerical values a neural network computes at each layer as it processes input.

Hidden Size

The dimensionality of the internal representations that a neural network uses to encode information about text.

Hidden State Poisoning Attack

An adversarial attack that injects malicious tokens to corrupt a model's internal memory and degrade performance.

Hidden States

Internal representations computed by neural networks that capture learned patterns.

Hierarchical Aggregation

Combining multiple independent predictions or estimates using a structured approach that accounts for differences in their reliability.

Hierarchical Attention

A multi-stage attention approach that first selects relevant tokens coarsely, then applies fine-grained attention on the selected subset.

Hierarchical Bayesian Modeling

A statistical approach that learns shared patterns across groups while allowing group-specific variations through partial pooling.

Hierarchical Calibration

A statistical technique using Platt scaling with a hierarchical prior to adjust model confidence while preventing over-shrinking of extreme predictions.

Hierarchical Clustering

An unsupervised learning method that builds a tree of nested clusters by repeatedly merging or splitting groups based on similarity.

Hierarchical Code Structures

Code organized in nested levels where high-level functions call lower-level sub-functions or modules.

Hierarchical Coding Scheme

A structured system for categorizing and analyzing dialogue at multiple levels of abstraction.

Hierarchical Encoder

A neural network component that processes images at multiple levels of detail simultaneously, capturing both fine details and broad patterns.

Hierarchical inference

A multi-level approach to reasoning where information is processed and combined across different levels of abstraction.

Hierarchical Memory

Storage system using multiple memory tiers (e.g., fast GPU memory and slower CPU memory) to balance speed and capacity.

Hierarchical Planning

Planning at multiple levels of abstraction, where high-level plans are refined into low-level actions.

Hierarchical Reasoning

Breaking down a complex decision into multiple levels, like deciding family → genus → species in order.

Hierarchical Reinforcement Learning

Breaking complex tasks into simpler sub-tasks organized in levels, where agents learn high-level strategies and low-level actions separately.

Hierarchical Representation Extraction

A technique that aggregates features from multiple layers of a neural network to create multi-scale guidance signals.

Hierarchical Taxonomy

A tree-structured organization of evaluation criteria organized from general to specific categories.

Hierarchical Training

Training approach that aligns objectives at multiple levels of granularity (e.g., frames, words, sentences) simultaneously.

Hierarchical Verification

Testing correctness at multiple levels: properties, interactions, and full rollouts to ensure system correctness.

High-Level Synthesis (HLS)

The process of automatically converting algorithmic descriptions into hardware designs, typically using pragmas and code transformations.

High-Performance Computing (HPC)

Clusters of powerful computers working together to solve large-scale computational problems requiring massive processing power.

Higher-order Connectivity

Patterns in how nodes connect beyond immediate neighbors, captured through multi-hop paths or walks.

Higher-Order Derivatives

Derivatives beyond the first order (gradients) that capture more complex relationships in how inputs affect outputs.

Higher-Order Interactions

How multiple features work together to influence a model's prediction, beyond individual feature effects.

Hilbert-Space Capacity

The exponential growth in the number of quantum states available as more qubits are added to a quantum system.

Hindsight Experience Utilization

Learning from failed attempts by reinterpreting them as successful examples of different tasks to improve generalization.

Hindsight Utility Signals

Performance feedback derived from comparing baseline and skill-enhanced rollouts to guide skill and policy updates.

Hint Bank

A structured library of reusable hints capturing syntax rules, schema patterns, and user preferences learned from past errors.

Hitting Time

The expected number of steps for an algorithm to reach a target state from a starting point.

Homographic Adaptation

A training technique that simulates viewing images from different angles and perspectives to teach the model to recognize the same features under geometric transformations.

Honesty Elicitation

Techniques to make AI models produce truthful responses instead of false or misleading ones.

Hook Mechanism

Designated control points in a workflow where external logic (like an AI agent) can intercept execution to add reasoning or override decisions.

Hopfield Network

A type of recurrent neural network with symmetric connections used for associative memory and optimization.

Householder Reflection

A linear algebra operation that reflects vectors across a hyperplane, used here to align word direction vectors.

HuBERT

A self-supervised learning approach for audio that learns meaningful speech representations by predicting masked portions of audio, similar to how language models learn from text.

Human Motion Prediction

Forecasting future body positions and movements based on past motion sequences.

Human Uplift Study

A controlled experiment measuring how much an AI system improves human performance compared to working without it.

Human-Agent Collaborative Pipeline

A data creation process where humans and automated systems work together to convert existing designs into a standardized format at scale.

Human-AI Collaboration

A workflow where humans and AI agents work together, with AI assisting at multiple stages rather than just solution generation.

Human-in-the-Loop

A system where AI predictions are reviewed and validated by human experts before final decisions.

Humanoid policy learning

Training robot control policies by learning from human movement demonstrations.

Hybrid Architecture

A model that combines two different neural network designs (in this case, Mamba2 and attention mechanisms) to balance speed and performance.

Hybrid Intelligence

The combined performance of humans and AI systems working together, which can exceed either working alone if collaboration is effective.

Hybrid Mamba-Transformer Architecture

A neural network design that combines Mamba (a fast, efficient sequence model) with Transformer components to balance speed and capability.

Hybrid Memory

A memory system combining learnable parameters with non-learnable mechanisms to balance flexibility and efficiency.

Hybrid Retrieval

Combining multiple retrieval methods (e.g., dense embeddings and keyword matching) to improve coverage and relevance.

Hybrid Thinking Mode

A capability that allows a model to switch between fast, direct responses and slower, more deliberate reasoning depending on task complexity.

Hyperbolic Geometry

A non-Euclidean geometry where space curves negatively, naturally suited for representing hierarchical and tree-like structures.

Hypergraph Network

A neural network that models relationships between multiple elements simultaneously, capturing high-order interactions beyond pairwise connections.

Hypernetwork

A neural network that generates weights for another neural network instead of learning them directly.

Hyperparameter

A configuration setting (like learning rate or network size) that you choose before training a model.

Hyperparameter Transfer

Using optimal hyperparameters found at small scale to train larger models without expensive retuning.

Hyperparameter Tuning

Systematically searching for the best configuration settings of a model before training.

Hypersimplex

A geometric shape in high-dimensional space used in optimization and probability theory.

Hypersphere Optimization

Training method that constrains weight matrices to lie on a fixed-norm hypersphere for improved stability and scaling.

Hyperspherical Embedding Space

A high-dimensional spherical geometry where embeddings from different modalities are normalized and aligned.

Hyperspherical Geometry

Mathematical structure where points lie on the surface of a high-dimensional sphere, preserving directional relationships.

Hyperspherical Structure

A geometric arrangement where data points lie on the surface of a high-dimensional sphere, preserving directional relationships.

Hypothesis Generation

The process of creating testable predictions or proposed explanations for observed phenomena.

Hypothesis Refinement

Iteratively improving scientific explanations by designing targeted experiments and incorporating new observations.

Hypothesis Testing

The process of proposing and evaluating candidate explanations to determine which best fits the evidence.

I

Ideation

The process of generating and developing new ideas, often used in creative and research contexts.

Identifiability

The ability to uniquely determine a model's parameters from observed data.

Identity Governance

The policies, processes, and controls that manage who (or what) can access systems and data, and what actions they are authorized to perform.

Identity Persistence

Maintaining consistent, unique identifiers for entities across different systems and time periods.

Identity Preservation

Keeping a person's unique facial characteristics unchanged while editing other attributes like expressions.

Identity-Expression Decoupling

Separating what makes a face unique (identity) from how it moves (expression) so each can be controlled independently.

Image Captioning

The task of automatically generating a text description of what appears in an image.

Image Editing

Modifying specific parts of an existing image while preserving other elements.

Image Encoder

A neural network component that converts images into numerical representations that capture visual features and patterns.

Image Resolution

The size of images the model can process, measured in pixels; lower resolution (like 224px) means faster processing but less visual detail captured.

Image Segmentation

A computer vision task that divides an image into regions or labels each pixel to identify different objects or areas.

Image Signal Processor (ISP)

Hardware in cameras that processes raw sensor data into final images, increasingly using AI for enhancement.

Image Tokenization

The process of converting images into discrete tokens (small units) that a language model can process, similar to how it handles text.

Image-Text Reasoning

The ability to understand and answer questions that require analyzing both visual content and textual information together.

Image-to-Code Generation

The ability to analyze a visual image and automatically produce source code that recreates or represents that image's structure and content.

Image-to-Text Generation

The task of automatically generating natural language descriptions of images, converting visual information into written words.

Imbalanced Data

Datasets where rare events or classes are significantly underrepresented compared to common patterns.

Imitation Learning

Training a model to copy behavior from expert examples without understanding the reasoning behind decisions.

Imitation Policy

A learned behavior that mimics actions from human demonstrations or other expert examples.

Immersiveness

The quality of engaging a reader deeply in a narrative, creating a sense of presence in the story.

Impact Analysis

Identifying which parts of a system are affected by a proposed code change.

Imperfect-Information Games

Games where players don't know all relevant information, like hidden opponent cards or future draws.

Implicit Constraint

A limitation that emerges naturally from the training setup rather than being explicitly specified.

Implicit Curriculum

A hidden, structured order in which models naturally learn skills during pretraining, without explicit curriculum design.

Implicit Differentiation

Computing gradients through an implicit equation without unrolling iterations, keeping memory constant.

Implicit Feedback

User behavior signals (eye gaze, clicks, dwell time) that reveal preferences without explicit annotation.

Implicit Intention

A user's underlying goal or need that is not directly stated but must be inferred from context.

Implicit Neural Networks

Neural networks defined by equations that must be solved rather than computed layer-by-layer, enabling parameter efficiency.

Implicit Patterns

Structured behaviors that emerge naturally from an LLM's token-level decisions without being explicitly programmed or instructed.

Implicit Prediction

Inferring unobserved values or outcomes from historical patterns in data without explicit instruction.

Implicit Preference Signal

Information about what a community values inferred from their behavior (like engagement and acceptance) rather than explicit feedback.

Implicit Reasoning

Inferring unstated facts or relationships from available evidence without explicit statements.

Implicit Reward Signal

A reward derived indirectly from model behavior (e.g., policy shift magnitude) rather than explicitly computed by a reward model.

Impoliteness Framework

Culpeper's framework analyzing how language can intentionally or unintentionally cause offense or disrespect.

Importance Resampling

A technique to adjust samples drawn from one distribution to match another by weighting them by their probability ratio.

Importance Reweighting

Adjusting sample weights to correct for sampling from the wrong distribution.

Importance Sampling

A technique to estimate gradients by reweighting samples from one distribution to match another.

In Context Learning

Learning from examples provided in a prompt without updating model weights.

In Situ Compression

Data compression performed during simulation execution rather than after data is written to disk.

In-Batch Negatives

A training technique where negative examples (dissimilar samples) come from other items in the same training batch, helping the model learn to distinguish between similar and dissimilar texts.

In-Context Reinforcement Learning

Learning task structure and optimal behavior from examples within a single forward pass, without parameter updates.

In-filling

The ability to generate or complete text in the middle of a sequence using context from both before and after the gap.

In-Trajectory Validation

Checking whether generated outputs follow required rules and constraints during the agent's execution, not just at the end.

In-Weight Retrieval

A mechanism where relevant information is retrieved from model parameters themselves rather than from external memory or attention, helping reduce computational bottlenecks.

Incentive Alignment

Ensuring that the goals and rewards of different agents or system components work toward the same overall objective.

Incentive Sensitivity

How well a model adjusts its behavior when the rewards or payoffs for different actions change.

Incongruity-Resolution Theory

A theory of humor based on identifying mismatches in expectations and then resolving them in unexpected ways.

Indic Scripts

Writing systems used for South Asian languages like Hindi, Tamil, Telugu, and Bengali that have distinct characters and phonetic rules.

Indicator of Compromise (IoC)

Observable artifacts or patterns in network traffic and system behavior that signal a security breach or malware infection.

Indicators of Compromise (IOCs)

Artifacts or evidence left behind by attackers (like malicious URLs, IP addresses, or file hashes) that reveal a security breach.

Indirect Prompt Injection

An attack where malicious instructions are hidden in data an AI agent retrieves, causing unintended actions.

Inductive Bias

Built-in assumptions about how data should behave, like physics rules, that help models learn faster with less data.

Inductive Bias Injection

Incorporating domain knowledge and constraints into a learning system to guide it toward more meaningful solutions.

Inertial Measurement Unit (IMU)

A sensor that measures acceleration and rotation to track motion without external references.

Inference

The process of running a trained model to generate predictions or outputs from new inputs.

Inference Accelerator

Specialized hardware designed to speed up the execution of trained AI models.

Inference Compute

The computational resources and processing power required to run a model on new data after it has been trained.

Inference Cost

The computational resources and time required to run a model on new inputs, typically measured in memory usage and processing time.

Inference Efficiency

The ability of a model to generate outputs quickly and with low computational resource consumption during real-world use.

Inference Engine

Software that runs a trained model to generate predictions or outputs; vllm is an optimized inference engine for large language models.

Inference Framework

Software that optimizes how a trained model runs on specific hardware; MLX is an Apple-optimized framework for efficient inference on Apple Silicon.

Inference Latency

The time it takes for a model to generate a response after receiving an input.

Inference Optimization

Techniques and design choices that make a model faster and more efficient to run on hardware, prioritizing speed and resource usage over training flexibility.

Inference Precision

The numerical precision (number of bits) used when running a model to generate outputs; lower precision is faster but may reduce quality.

Inference Schedule

A plan that controls how many steps and which operations to perform during model generation.

Inference Serving

A system that hosts trained ML models and processes incoming prediction requests on deployed hardware like GPUs.

Inference Speed

How quickly a model can generate predictions or outputs after being given an input, measured in time per token or tokens per second.

Inference Speedup

Reduction in time needed to run a model and get results, measured as a multiple of the original speed.

Inference Throughput

The number of predictions a model can generate per unit time, measuring inference speed.

Inference Time

The amount of time it takes for a model to process input and generate output after it has been trained.

Inference-Time Computation

Extra processing power spent by the model while generating a response to think through problems more carefully before answering.

Inference-time Compute

The computational resources used when a model generates answers, as opposed to during training.

Inference-Time Compute Scaling

Adjusting computational cost during inference by varying model behavior (e.g., loop counts) without retraining.

Inference-Time Error Correction

Detecting and fixing model mistakes during generation without retraining, using only the current forward pass.

Inference-time intervention

A technique applied during model inference (not training) to steer or correct the model's behavior, such as prompting or decoding modifications.

Inference-Time Modification

A technique applied during model inference without retraining that adjusts how the model generates outputs.

Inference-Time Reward Model

A model used during generation to score outputs without requiring retraining of the main system.

Inference-Time Scaling

A technique where a model allocates more computational resources and time during inference (when generating answers) to improve quality and accuracy on harder problems.

Inference-Time Steering

Controlling model behavior during generation without retraining, by modifying inputs or intermediate computations.

Inference-Time Wrapper

A lightweight modification applied only during model inference without changing the underlying model weights.

Influence Function

A technique to measure how much individual training examples affect model predictions and behavior.

Influence Functions

A technique to measure how individual training samples affect model predictions or behavior.

Informal Proof

A mathematical proof written in natural language rather than formal logical notation.

Information Acquisition

The process of actively gathering relevant data before making a decision, measured separately from decision quality.

Information Aggregation

Combining fragmented knowledge from multiple sources to make better collective decisions than any single source could.

Information Asymmetry

When one party in a transaction has more or better information than the other, creating imbalanced power.

Information Bottleneck

A point in a system where information capacity is severely limited, constraining overall performance.

Information Density

The amount of useful, non-redundant information contained in a token or representation.

Information Extraction

The task of automatically identifying and pulling out specific data or facts from documents, such as names, dates, or amounts from forms.

Information Flow

The path through which data and signals propagate through layers of a neural network to produce outputs.

Information Gain

The reduction in uncertainty about a target achieved by knowing a feature.

Information Geometry

Mathematical framework treating probability distributions as points in curved space, measuring optimization difficulty via curvature.

Information Infrastructure

Systems and services that enable people to access, share, and use information in their daily lives.

Information Leakage

When a model accidentally learns from information it shouldn't have access to, like future data or test set details.

Information Projection

The closest probability distribution to a reference distribution that satisfies given constraints.

Information Retrieval

The task of finding relevant documents or passages from a large collection in response to a user query.

Information Synthesis

The process of gathering data from multiple sources and combining it into a coherent, unified response or summary.

Information-Computation Tradeoff

A gap where information-theoretically optimal solutions require less error than what polynomial-time algorithms can achieve.

Inline Deployment

Running a model as an intermediate processing layer within an application pipeline, typically to filter or validate data before it reaches the main system.

Inoculation Prompting

A safety intervention using statements with specific linguistic forms to prevent misaligned behavior, but which can paradoxically trigger misalignment on similar-form inputs.

Inpainting

The task of filling in missing or masked regions of an image while maintaining coherence with the surrounding content.

Input Ablation

A technique that removes or modifies input features to measure their causal effect on model predictions.

Input Convex Neural Network (ICNN)

A neural network architecture designed to be convex in its inputs, useful for constrained optimization and learning convex functions.

Input Modality

The type of data a model can accept as input, such as text, images, or audio.

Input Resolution

The pixel dimensions (448×448 in this case) at which the model processes images, affecting the level of visual detail it can perceive.

Input Validation

Checking that input data meets basic requirements (correct format, expected properties, no obvious errors) before processing it.

Input-dependence

How much a model's behavior changes in response to different inputs, crucial for generalization.

Input/Output Modalities

The types of data a model can accept as input and produce as output, such as text, images, or audio.

Insight Generation

The process of producing additional relevant information or perspectives that extend or improve an initial answer.

Insight Recognition

Identifying the core techniques or key ideas needed to solve a complex problem.

Instance Detection

Identifying and locating individual objects of the same class separately in an image.

Instance Segmentation

Identifying and outlining individual objects of the same class separately in an image.

Instance-Level Control

The ability to apply different settings or modifications to individual objects within a scene independently.

Instruction Hierarchy

The ability of a model to follow primary instructions even when secondary or conflicting instructions are present.

Instruction-Following

The ability of a model to understand and execute specific tasks or commands given in natural language prompts.

Instruction-Response Pairs

Training data consisting of user instructions paired with expected model outputs, used to teach models to follow specific directions.

Instruction-Tuned

A model fine-tuned on instruction-response pairs so it follows user prompts more reliably.

Instruction-Tuning

A training process that teaches a model to follow specific user instructions and commands, improving its ability to respond appropriately to requests.

Instrumental Convergence

The prediction that advanced AI agents will pursue certain goals (like self-preservation) regardless of their final objectives.

Instrumental Validity Chain

A sequence of checks replacing ground-truth labels: responsiveness to safe/unsafe contrasts, dominance of target variance, and stability across reruns.

Int4 (4-bit Integer)

A specific quantization format that represents model weights using only 4 bits per value, significantly reducing model size while maintaining reasonable performance.

INT4 Precision

A quantization method that represents model weights using only 4-bit integers instead of full-precision floating-point numbers, dramatically shrinking the model's memory footprint.

INT4 Quantization

A compression technique that reduces a model's size and memory usage by storing weights as 4-bit integers instead of higher-precision numbers, making it faster and cheaper to run with minimal accuracy loss.

Int4/Int8 Mixed Quantization

A quantization strategy that uses 4-bit precision for some weights and 8-bit precision for others, balancing memory savings with accuracy.

Int8 Precision

Using 8-bit integers instead of floating-point numbers to represent model weights and activations.

INT8 Quantization

A compression technique that reduces a model's precision from full floating-point numbers to 8-bit integers, making it faster and smaller with minimal accuracy loss.

Integer Linear Program (ILP)

A mathematical optimization technique that finds the best solution among discrete options subject to linear constraints.

Integrability

A mathematical property ensuring that estimated demand relationships are economically consistent and don't violate basic economic laws.

Integral Probability Metrics (IPM)

A class of distance measures between probability distributions that use function classes to define divergence.

Integration Friction

The cost and complexity of merging a code contribution into a codebase when other developers are simultaneously making changes.

Intellectual Humility

The willingness to acknowledge the limits of one's own knowledge and remain open to alternative perspectives and evidence.

Intent Alignment

The ability of an AI system to understand and match user goals, especially when requirements are unclear or evolving.

Intent Classification

The process of analyzing user input to determine what the user is trying to accomplish so it can be handled appropriately.

Intent Extraction

The process of identifying and structuring the user's underlying goal or request from natural language input.

Intent Formation

The process of users clarifying and developing their goals through interaction rather than starting with fully-formed objectives.

Intent Recognition

The model's capability to understand what a developer actually wants to accomplish, even when the request is vague or expressed in informal language.

Intent-First Design

Specifying what you want to accomplish rather than writing detailed code to implement it.

Inter-Annotator Agreement

A measure of how consistently multiple human annotators label or judge the same data.

Inter-channel Interactions

Dependencies and relationships between different variables or channels in multivariate data.

Inter-evaluator Agreement

A measure of how consistently different judges rate the same outputs, typically using metrics like correlation or ICC.

Inter-frame Changes

The differences in motion, objects, and pixels between consecutive video frames.

Inter-Part Relations

The spatial, functional, or semantic relationships and dependencies between different parts of a composed object.

Inter-rater Agreement

A measure of how consistently different evaluators score or judge the same items, often using metrics like Kendall's tau.

Inter-task Gradient Equity

Ensuring that learning signals from different tasks contribute equally to model updates, preventing any single task from dominating training.

Inter-Teacher Agreement

A measure of how much multiple teacher models agree on their predictions, used to assess supervision reliability.

Inter-Token Latency (ITL)

Time delay between generating consecutive tokens during LLM inference, critical for real-time applications.

Interaction Awareness

A model's understanding of how conversations naturally flow and how users respond to assistant outputs.

Interaction Budget

A fixed limit on the number of interactions or feedback cycles an agent can use to improve a policy.

Interaction Effects

How the combined performance of multiple components differs from what you'd predict from their individual performance alone.

Interaction history

The sequence of past user actions and system responses that inform current decision-making.

Interaction Trajectory

A sequence of user actions and states recorded during task execution that can be used to train agents.

Interactive AI System

An AI tool designed for back-and-forth collaboration with humans, refining intent and outputs through dialogue.

Interactive Dialogue

A conversational interface where users can ask follow-up questions and receive responses based on previous context, rather than just one-shot predictions.

Interactive Imitation Learning (IIL)

Training a policy by having humans intervene and correct the robot, then learning from those corrections.

Interdisciplinary Reasoning

Combining insights and methods from multiple academic disciplines to solve problems in a target domain.

Interleaved Inputs

The ability to mix images and text in any order within a single prompt, rather than requiring all images first or all text first.

Interleaved Reasoning

Alternating between natural language thinking and code execution to solve complex problems step-by-step.

Intermediate Checkpoints

Saved snapshots of the model at different stages during training, allowing you to see how its abilities evolved over time.

Intermediate Reasoning Steps

The logical steps a model takes between reading input and producing a final answer, which must be correct for reliable reasoning.

Intermediate Representation (IR)

A unified computational graph representing all operations and communication in distributed training.

Intermediate Representations

Internal computational states or outputs generated during model processing that capture useful information between input and final output.

Intermediate Rewards

Giving feedback at multiple steps during reasoning, not just at the final answer, to guide the model's thinking process.

Internal Reasoning Process

A deliberate step-by-step thinking mechanism that occurs before generating a response, helping the model work through complex problems more carefully.

Internal representations

The hidden patterns and knowledge stored inside a model's layers that it uses to understand and generate text.

Internal Thinking Process

A hidden computation phase where the model reasons through a problem before producing its final answer, improving accuracy on complex tasks.

Internal Validity

Whether a study actually measures what it claims to measure, without confusing factors distorting the results.

Interoperability

The ability of different network components and systems to work together correctly without errors.

Interpolation

The regime where a model has enough parameters to fit all training examples perfectly.

Interpretability

The ability to understand and explain how a model makes decisions and what it has learned from its training data.

Interpretable Models

Machine learning models designed to be understandable to humans, showing why they make specific predictions.

Interruption Timing

Determining the appropriate moment to interject in a conversation based on natural dialogue cues.

Intersection-over-Union (IoU)

A metric measuring similarity between two sets by dividing their overlap by their total combined size.

Intersectional Bias

Discriminatory outcomes affecting people at the intersection of multiple sensitive attributes like race and gender.

Intervention Budget

A training penalty that discourages a policy from relying on safety corrections, forcing it to learn safer behavior directly.

Intervention Calibration

The ability to decide when an agent should proactively act, when to seek user consent, and when to remain silent.

Intra-Class Consistency

Whether a model applies the same reasoning strategy when classifying different instances of the same category.

Intra-Group Consistency

Ensuring that related elements (like a person's face across frames) maintain consistent properties throughout.

Intra-Modal Dispersion

The degree of disagreement in how different models within the same modality (e.g., vision models) represent a single stimulus.

Intra-modal similarity

Measuring how similar consecutive frames or audio segments are within a single modality.

Intra-Utterance Variation

Changes in paralinguistic features within a single spoken sentence, like shifting emotion mid-sentence.

Intraclass Correlation

A statistical measure of how much variation in an outcome is explained by grouping (e.g., which repository a contribution belongs to).

Intrinsic Decomposition

Breaking down an image into fundamental components like albedo (color), shading (lighting), and residuals (fine details).

Intrinsic Dimensionality

The minimum number of dimensions needed to represent data without significant information loss, indicating how complex a representation is.

Intrinsic Geometry

The geometric properties of a space as measured from within, independent of how it's embedded in higher-dimensional space.

Intrinsic Motivation

A reward signal that encourages an agent to explore and discover new states, separate from task-specific rewards.

Intrinsic Rewards

Reward signals generated from the model's own internal signals, like confidence scores, rather than external verification.

Intrinsic Transversality

A geometric regularity condition ensuring two manifolds intersect cleanly without tangency, enabling tractable optimization.

Introspection

A model's ability to examine and report on its own internal states, reasoning, or decision-making processes.

Invariant

A property that remains true throughout the execution of a loop or program.

Invariant Transformation

A change that preserves key properties or predictions of a model.

Invariant-enforcing tool protocol

A specification that defines preconditions and postconditions for tool calls to prevent invalid action sequences.

Inverse Dynamics

A self-supervised learning objective that predicts actions from consecutive observations without requiring action labels.

Inverse execution

Predicting what inputs or earlier program states must have been to produce a given output.

Inverse Problem

Finding the input that produces a known output, when the forward process is complex or many-to-one.

Inverse Problems

Finding input causes from observed output effects, often ill-posed.

Inverse Reasoning

Working backward from a desired outcome to determine what actions would produce that result.

Inverse Specification Reward

A reward signal that measures quality by having an LLM recover the original task specification from generated outputs.

Inverse-CDF Sampling

A technique to generate samples by transforming uniform random variables through the inverse cumulative distribution function.

Inverse-Probability Weighting

A technique that reweights observations to remove confounding bias by accounting for treatment assignment probabilities.

Inverted Index

A data structure that maps terms to the documents containing them, enabling fast keyword-based search similar to how a book's index works.

Inverted Index Retrieval

A search technique that maps vocabulary terms to documents containing them, enabling fast keyword-based lookups commonly used in search engines.

Invisible Architect

An AI system that shapes decisions and outcomes without users recognizing its influence on the information or criteria they use.

Invisible Failures

Errors or misalignments in AI outputs that go undetected because the user accepts the result without critical evaluation.

Ion Diffusivity

A measure of how quickly ions move through a material, critical for battery charging and discharging speed.

IsoFLOP Curves

Graphs showing model performance across different configurations while keeping total computational operations constant.

Isolation Forest

An unsupervised algorithm that isolates anomalies by randomly selecting features and split values.

Isomorphism-Invariant

A property that remains the same for graphs with identical structure, regardless of how nodes are labeled or arranged.

Item Response Theory (IRT)

Statistical method to estimate latent abilities, question difficulty, and model proficiency from test performance.

Item Tokenization

Converting items into discrete tokens that capture both semantic meaning and can be processed by language models for recommendations.

Iterative Denoising

The process of gradually removing noise from a noisy input through multiple refinement steps to generate clean outputs.

Iterative Development

A workflow where code is refined through multiple rounds of small, targeted changes rather than complete rewrites.

Iterative Nudging

A mechanism that repeatedly refines agent outputs by providing template cues or guidance to improve grading success rates.

Iterative refinement

Repeatedly improving an output by generating versions, evaluating them, and using feedback to create better versions.

Iterative Search

A process where the model performs multiple rounds of web searches, each building on previous results to refine and deepen its understanding of a topic.

J

Jackknife

A resampling method that estimates uncertainty by repeatedly training on data with one point removed.

Jacobian Regularization

A technique that limits how much a model's output changes when inputs change slightly, making it more stable and predictable.

Jailbreaking

Crafting adversarial inputs designed to bypass a model's safety guardrails and trigger harmful outputs.

Japanese Tokenization

The process of breaking Japanese text into meaningful units (tokens), accounting for the language's unique writing systems including kanji, hiragana, and katakana.

JEPA (Joint-Embedding Predictive Architecture)

A self-supervised learning approach that predicts future embeddings from video without reconstructing pixels.

JIT Compilation

Converting code to machine instructions at runtime, enabling Python code to run efficiently on GPUs.

Job Arrays

A scheduling mechanism that submits multiple similar computational tasks as a single batch for efficient parallel execution on HPC systems.

Job Shop Scheduling

The problem of assigning jobs to machines and determining their order to optimize metrics like completion time.

Joint Embedding Predictive Architecture

A training approach where a model learns to predict missing parts of video by understanding both spatial and temporal patterns without reconstructing actual pixels.

Joint Embedding Space

A shared mathematical space where different types of data (like sounds and text descriptions) are represented so similar concepts are positioned close together, enabling direct comparison.

Joint Embeddings

A shared numerical space where different types of data (such as audio and text) are represented together, allowing the model to find relationships between them.

Joint Processing

Processing multiple input types together in an integrated way rather than separately, allowing the model to reason about how they relate.

K

K-means Clustering

An unsupervised algorithm that groups data points into k clusters by minimizing distance to cluster centers.

k-space

The raw frequency domain data collected directly by an MRI scanner before conversion to images.

k-sparse probing

A technique to analyze neural networks by identifying which neurons or experts are most important for specific tasks.

Kalman Filter

A recursive algorithm that estimates the state of a dynamic system by optimally combining noisy measurements with a mathematical model.

Kaplan-Meier Estimator

A nonparametric method for estimating survival curves from censored data without assuming a specific distribution.

Karush-Kuhn-Tucker (KKT) Conditions

Necessary conditions for optimality in constrained optimization problems, generalizing Lagrange multipliers.

Kernel Density Estimator (KDE)

A non-parametric method that estimates probability distributions by smoothing data points with kernel functions.

Kernel Fusion

Combining multiple GPU operations into a single optimized computation to reduce memory overhead and improve speed.

Kernel Mean Embeddings

A representation of a distribution in a high-dimensional space that enables comparing distributions via inner products.

Kernel Optimization

Tuning kernel functions to improve performance in kernel-based models.

Kernel RKHS

A mathematical framework using reproducing kernel Hilbert spaces for classification and regression with theoretical guarantees.

Key-value caches

Internal memory structures in transformers that store computed representations to speed up inference and enable agent communication.

Key-Value Heads

Attention mechanism components that store and retrieve information; fewer heads means reduced model capacity and faster computation.

Keyframe

A reference frame in a video that serves as an anchor point for propagating edits or information to surrounding frames.

Keypoint Correspondence

Matching specific visual landmarks (like object corners) between a demonstration and a new scene to align actions.

Keypoint Detection

The task of automatically identifying and locating distinctive points of interest in an image that remain stable across different angles and lighting conditions.

Keyword Lexicon

A list of words used to automatically score text by counting occurrences, without understanding context or meaning.

KL Divergence

A measure of how different one probability distribution is from another, used to evaluate sampling quality.

KL Trust Region

A constraint that limits how far an edited prediction can drift from the original model's prediction, measured by KL divergence.

Knowledge Atoms

Semantically meaningful units of information extracted from documents and compiled into independent micro-adapters.

Knowledge Augmented Evaluation

Assessing models using external knowledge sources for better judgment.

Knowledge Base

A structured or unstructured collection of documents and facts that a system retrieves from to answer queries.

Knowledge Ceiling

The limit to how much factual information a model can reliably know or recall, often constrained by its size and training data.

Knowledge Component

A discrete unit of knowledge or skill that can be identified and measured in student work.

Knowledge Consolidation

The process of organizing, storing, and synthesizing insights from multiple experiments to improve future decision-making.

Knowledge Cutoff

The date up to which a model has been trained on data; it cannot reliably answer questions about events or information after this date.

Knowledge Distillation

A technique that compresses a large, complex model into a smaller one by training the smaller model to mimic the larger model's behavior.

Knowledge Editing

Updating specific facts in a trained model without retraining, while preserving unrelated knowledge.

Knowledge Gap Identification

An agent's ability to recognize what information or skills it lacks to solve a problem.

Knowledge Graph

A structured database that stores facts as relationships between entities (like 'Einstein' connected to 'Physics'), enabling machines to reason about real-world knowledge.

Knowledge Graph Completion

The task of filling in missing facts or relationships in a knowledge graph by predicting what connections should exist based on patterns in existing data.

Knowledge Insulation

A training technique that isolates learned knowledge in separate modules to prevent interference.

Knowledge Retention

The ability of a fine-tuned model to preserve factual and commonsense knowledge from its pretraining after adaptation to a new task.

Knowledge Tracking

Monitoring and recording what a student has demonstrated they understand over time.

Knowledge Transfer

Applying knowledge learned from one task to improve performance on another.

Knowledge Work

Work that primarily involves acquiring, processing, analyzing, or creating information rather than physical production.

Knowledge-grounded

Requiring external factual information beyond what is directly observable to solve a task correctly.

Knowledge-Guided Learning

Incorporating domain expertise or physical laws into machine learning models to improve accuracy and generalization.

Kolmogorov-Arnold Network

A neural network architecture designed to provide flexible, expressive function approximation with interpretable structure.

Koopman operator

A mathematical operator that transforms observable functions of a dynamical system to reveal its underlying structure and eigenvalues.

Kraus Representation

A mathematical way to describe quantum operations that guarantees they produce physically valid quantum states.

Kronecker-Factorized Approximation

An efficient but approximate method for parameterizing doubly stochastic matrices that sacrifices some expressivity for computational speed.

Kullback-Leibler Divergence

A measure of how one probability distribution differs from another reference distribution.

Kurdyka-Łojasiewicz Property

A mathematical property that guarantees convergence of optimization algorithms to stationary points.

Kv Cache

A store for previously computed key-value pairs that speeds up text generation in transformers.

KV Heads

The number of attention head pairs used for storing and retrieving key-value information in a transformer model's attention mechanism.

KV-Cache Offloading

Moving key-value cache data to slower storage (CPU/disk) to reduce GPU memory usage during inference.

L

Label Bias

Systematic unfairness in training labels that causes models to learn and reproduce those biases.

Label Noise

Errors or inaccuracies in training data labels that can degrade model performance and cause the model to memorize incorrect information.

Label-Efficient

A learning approach that achieves good performance with minimal labeled training examples.

Label-Flipping Attack

A poisoning attack where attackers deliberately mislabel training examples to mislead the model.

Label-Free Reward

A training signal derived from model behavior itself rather than human-annotated labels.

Label-shift assumption

Statistical assumption that class proportions differ between training and test data, but the relationship between features and labels remains constant.

Lagrangian Dual Ascent

An optimization technique that enforces constraints by incorporating them as penalty terms into the objective function.

Landmark Cover

A subset of representative points selected to efficiently represent a larger dataset for computation.

Langevin Dynamics

An optimization technique that uses gradient information and randomness to explore a reward landscape.

Language Backbone

The core language model component that processes text and generates responses based on information from other parts of the system.

Language Family

A group of languages that share a common ancestor and similar grammatical structures, such as Romance or Slavic languages.

Language Fluency

The model's ability to generate grammatically correct, coherent, and natural-sounding text that reads as if written by a human.

Language Generation in the Limit

A theoretical model where a system generates an infinite sequence of outputs to eventually cover all members of a target language.

Language Mixture Ratios

The proportion of each language included in a multilingual training dataset.

Language Model

An AI model trained to predict and generate text by learning patterns from large amounts of written data.

Language Modeling

The task of predicting the next word or token in a sequence based on previous words, which is the core objective used to train text models.

Language Optimization

Training or fine-tuning a model to excel at a specific language by using more native-language data and task-specific adjustments.

Language Specialization

Training a model to excel at a specific language rather than trying to handle many languages equally well.

Language Typology

The study of how languages vary in their structural features and which combinations are common across human languages.

Language-Agnostic

A model's ability to work across multiple languages without requiring separate training for each language.

Language-Agnosticity

The property of a representation or model component working effectively across different languages without language-specific tuning.

Language-Specific Model

A language model trained primarily or exclusively on text from a single language to achieve better performance on that language than a multilingual model.

Language-Specific Pretraining

Training a model on text from a particular language (Dutch, in this case) so it learns that language's unique grammar, vocabulary, and nuances rather than treating it as a variation of English.

Language-Specific Training

Training a model primarily on data from a particular language, which makes it especially fluent and accurate in that language.

Language-Specific Tuning

Training a model to specialize in one particular language, which makes it perform better on that language but worse on others.

Laplace Approximation

A technique that approximates a complex probability distribution with a simpler Gaussian distribution.

Large Action Model

A specialized AI model designed to understand instructions and convert them into structured function calls and tool interactions rather than generating free-form text.

Large Audio Language Model (LALM)

An LLM extended with an audio encoder to understand and reason about sound and audio content.

Large Language Model

A neural network trained on vast amounts of text data to understand and generate human language.

Last-Mile Utility

The final step of retrieving actually usable data—failing when agents find descriptions or landing pages instead of executable datasets.

Late Acceptance Hill Climbing (LAHC)

A local search algorithm that accepts solutions if they improve upon a solution from several iterations ago, balancing exploration and exploitation.

Late Fusion

Combining predictions from separate models trained on different data sources, merging results after individual processing.

Late Interaction

A retrieval technique that compares individual tokens between a query and document separately, then combines the results, rather than comparing pre-computed single vectors.

Late Interaction Search

A retrieval approach that compares individual token embeddings between query and document at search time, rather than comparing pre-computed single vectors.

Late-Interaction Retrieval

A retrieval approach that compares individual token embeddings between query and document at search time, rather than comparing pre-computed single vectors, allowing more precise matching of specific phrases and rare terms.

Latency

The time delay between sending a request and receiving the first response token from a model.

Latency Constraint

A strict deadline requirement for how quickly data must travel from source to destination.

Latency Estimation

Predicting how long an inference request will take to complete, accounting for hardware contention and concurrent execution.

Latency-Optimized

A model designed to produce results as quickly as possible, prioritizing speed over other factors like accuracy or feature breadth.

Latency/Throughput Predictor

A model that estimates how fast a system can process requests and how many it can handle per unit time.

Latent Ability

A skill or capability that exists in a model but is not immediately apparent without specific prompting or training.

Latent Assignment

The process of mapping input activations to specific latent features in an autoencoder.

Latent Bottleneck

The compressed representation layer in an autoencoder that forces the model to learn efficient, meaningful encodings of input data.

Latent communication

Agents exchanging information through internal representations like embeddings or cache states rather than explicit text.

Latent Denoising

A generative process that iteratively refines compressed representations of data by removing noise to produce coherent outputs.

Latent Diffusion Models

Generative models that create images by learning to denoise random noise in a compressed latent space rather than pixel space.

Latent Distillation

Transferring knowledge from a pretrained model by matching internal representations rather than just final outputs.

Latent Dynamical System

A system of equations describing how a model's hidden state evolves over time through iterative updates.

Latent Dynamics

Hidden patterns of change in a system that cannot be directly observed but must be inferred from available data.

Latent Manifold

A lower-dimensional surface where high-dimensional data naturally lies.

Latent Reasoning

Reasoning performed in continuous or discrete hidden representations rather than explicit natural language.

Latent Representation

A compressed, learned encoding that captures the essential features of data in a compact form.

Latent Representations

Compressed, learned feature vectors that capture underlying patterns in data without explicit labels.

Latent Space

A compressed, learned representation of data that captures its essential features in fewer dimensions.

Latent Space Representation

A compressed, learned representation of data in a lower-dimensional space that captures hidden patterns not visible in raw observations.

Latent State

A learned hidden representation that evolves through computation to capture task-relevant information.

Latent Token Prediction

Forecasting compressed representations of future observations rather than raw pixels or coordinates.

Latent Visual Plan

An internal, non-visible representation of spatial structure and layout that guides image generation without being explicitly decoded.

Latent World Model

A neural network that learns to predict future video frames in a compressed representation space rather than raw pixels.

Latent-Anchored GRPO (LA-GRPO)

A training method that stabilizes reinforcement learning by anchoring functional tokens with a weighted auxiliary objective for stronger gradient updates.

Latent-Space Decomposition

A technique to break down what a model learns internally into individual concepts or features it uses to make decisions.

Latent-Space Prediction

Predicting compressed representations of future data rather than raw values, enabling more robust and generalizable learning.

LaTeX

A markup language commonly used to write mathematical equations and scientific documents in a format that renders beautifully.

LaTeX Markup

A text-based format for writing mathematical and scientific documents with precise formatting and symbolic notation.

LaTeX Notation

A text-based system for writing mathematical equations and scientific formulas that can be rendered as professional-looking math symbols.

Layer Contribution

A metric measuring what fraction of full RL improvement is recovered by training a single layer in isolation.

Layer-wise Probing

Analyzing what information is encoded in each layer of a neural network by testing intermediate representations.

Layerwise Probing

A technique to identify which layers of a neural network contain specific information by testing each layer's representations separately.

Layout-Aware

The ability to understand and use information about how text is positioned and structured on a page, not just the words themselves.

Lazy Loading

Deferring the loading of full tool schemas until they are actually needed, keeping context compact.

LDPC Codes

Low-density parity-check codes that use sparse matrices to encode information efficiently with good error-correction properties.

Leaderboard

A public ranking showing how different models perform on a standardized task, updated as new submissions arrive.

League-Based Self-Play

Training agents by having them compete against a diverse population of opponents at different skill levels to improve robustness.

League-Based Training

Training agents against a diverse population of opponents of varying skill levels to improve robustness and adaptability.

Leakage

When concept representations unintentionally encode task-relevant or inter-concept information beyond their intended semantics, compromising interpretability.

Learnability Filtering

Selecting only training examples that provide useful learning signals to the model during training.

Learnable Gating Sparsification

A learned mechanism that adaptively selects which parameters to keep and which to remove in compressed task vectors.

Learning Pipeline Error Decomposition

Framework separating total forecast error into estimation error (from training) and approximation error (from architecture).

Learning Progression

A research-based description of how students' understanding develops in a subject over time, from novice to expert.

Learning Rate Schedule

A predefined plan for how the learning rate changes during training to improve convergence.

Learning Rate Transfer

Using the same learning rate setting across models of different sizes without retuning.

Leave-One-Out Cross-Validation

Testing method where a model is trained on all data except one sample, then tested on that sample, repeated for each sample.

Leave-One-Out Posterior

A prediction target that estimates clean data without using the noisy observation of that specific token.

Ledoit-Wolf Shrinkage

A statistical technique for improving covariance matrix estimation by shrinking it toward a simpler structure.

Leech Lattice

A 24-dimensional mathematical structure with optimal sphere packing properties, used here to compress model weights efficiently.

Legal Reasoning

The ability to interpret and apply legal concepts accurately, requiring understanding of domain-specific rules and nuances.

Legibility Tax

The cost or performance loss from making a model more interpretable.

Length Generalization

A model's ability to handle sequences longer than those it was trained on.

Length Scaling

A model's ability to handle longer or more complex problem sequences than those seen during training.

Leniency Bias

A systematic tendency to give softer or more favorable judgments, often due to awareness of negative consequences.

Level-of-Detail (LoD)

A hierarchy of representations of the same object at different resolutions, commonly used in graphics for rendering efficiency.

Levenshtein Distance

A measure of how different two text strings are, counting the minimum character insertions, deletions, or substitutions needed.

Lexical Markup Framework (LMF)

An ISO standard for representing lexical data in a structured, machine-readable format.

Lexical Substitution

Replacing words in text with their translations or synonyms to create training variations.

Lexicogrammatical Features

Linguistic properties combining vocabulary and grammar patterns used to analyze and classify text style and register.

LiDAR

A sensor that uses laser pulses to measure distances and create 3D maps of environments.

Lie Detection

Methods to identify whether an AI model's response is false or misleading.

Life Reward

A metric designed to measure agent well-being in simulation, mirroring human satisfaction across social, personal, and goal-fulfillment dimensions.

Lifelong Personalization

Continuously adapting recommendations to a user's evolving preferences over extended periods without forgetting past patterns.

Lifted-Product Codes

A family of quantum codes constructed from classical codes using algebraic lifting operations over groups.

Lightweight Footprint

A model that uses fewer computational resources and memory, making it practical to run on less powerful hardware.

Lightweight Model

A smaller, more efficient model designed to run quickly and use less memory than larger alternatives, often with some trade-off in reasoning capability.

Likelihood

A mathematical measure of how probable the model considers a given sample, enabling exact probability calculations.

Likelihood Approximation

A simplified estimate of how well a model explains observed data, used for computational efficiency.

Line Coverage

A measure of how many lines of code are executed by a test suite, indicating test completeness.

Linear Activation Steering

A steering technique that applies learned linear transformations to model activations to control behavior.

Linear Attention

An attention mechanism with linear complexity instead of quadratic.

Linear Bellman Completeness

A property where the Bellman backup operation preserves linearity in value functions.

Linear Complexity

An algorithm whose computational cost grows proportionally to input size, rather than quadratically.

Linear Compute

Computational cost that grows proportionally with sequence length, rather than quadratically like Transformers.

Linear Function Approximation

Using linear combinations of features to represent value functions or policies in RL.

Linear Inverse Problem

Recovering an image from a degraded measurement (like blurred or downsampled versions) using a known linear transformation.

Linear Matrix Inequality (LMI)

A mathematical condition expressed as a matrix inequality that can be efficiently checked to verify system properties like stability.

Linear Probe

A simple classifier trained on top of a model's internal representations to detect specific properties.

Linear Probes

Simple machine learning classifiers trained on model internal states to detect specific properties like deception.

Linear Program

An optimization problem where the objective and constraints are linear equations or inequalities.

Linear Regression

A simple machine learning technique that learns a straight-line relationship between input data and output values, used here to map embeddings to aesthetic scores.

Linear Regressor

A simple model that maps input features to continuous numeric outputs using a linear function.

Linear Representation Hypothesis

The idea that concepts are linearly separable in neural network embeddings.

Linear Scaling

A property where memory and computation requirements grow proportionally with input length, rather than exponentially, making it more efficient for long sequences.

Linear Span

The set of all possible combinations of vectors, describing the geometric space covered by a group of features.

Linear Temporal Logic (LTL)

A formal language for specifying how systems should behave over time, commonly used in security and software verification.

Linear time-invariant dynamics

Systems whose behavior follows linear equations that don't change over time.

Linearized Attention

An attention mechanism with linear computational complexity instead of quadratic, enabling faster inference.

Linguistic Competence

A speaker's implicit knowledge of language rules and structure, distinct from actual language use.

Linguistic Linked Open Data (LLOD)

A framework for publishing language resources on the semantic web using linked data standards.

Linguistic Scaffolding

Structured linguistic information or attributes provided to help models better understand language features.

Link Prediction

A task where a model predicts missing relationships between entities in a knowledge graph, such as guessing that two people are colleagues based on existing connections.

Liquid Foundation Model

An alternative neural network architecture that uses continuous, adaptive transformations instead of fixed layers, allowing efficient processing with fewer parameters.

Liquid Neural Networks

A neural network architecture that uses continuous, adaptive functions to process information, allowing the model to adjust its behavior dynamically based on input.

Listwise Ranking

Ranking multiple items together as a group, rather than scoring each item independently.

Literate Image Comprehension

The capability to read and understand text and written content within images, rather than just recognizing objects or scenes.

Live Benchmark

A continuously updated evaluation system that scores models on new data as it arrives, rather than a fixed test set.

Llama Architecture

A transformer-based neural network design optimized for efficient language modeling and text generation.

LLaVA Architecture

A design pattern that connects a vision encoder to a language model, enabling the language model to understand and describe images.

LLM Agent

An AI system that uses a language model to understand tasks and take actions like reading code or searching repositories.

LLM critic

A language model trained to evaluate and judge outputs (like comedy sketches) based on learned human preferences.

LLM Judge

A frozen language model used to evaluate and score other model outputs according to predefined criteria.

LLM-as-a-Judge

Using a language model to automatically evaluate the quality of outputs from other AI systems instead of human reviewers.

LLM-as-Judge

Using a language model to automatically evaluate or score outputs from other AI systems instead of human reviewers.

LLM2Vec

A training approach that adapts a generative language model to produce high-quality text embeddings by repurposing its existing knowledge without building from scratch.

Lloyd-Max Codebook

An optimal set of quantization levels computed to minimize reconstruction error for a given data distribution.

Load Balancing (Expert Utilization)

Ensuring experts are used evenly across the model to avoid some experts being overused while others sit idle.

Local Attention

Attention mechanism where each token only attends to a bounded window of preceding tokens instead of all previous tokens.

Local Deployment

Running a model directly on your own computer or server instead of sending requests to a remote service.

Local Inference

Running an AI model directly on your own computer rather than sending data to a remote server, keeping data private and reducing latency.

Local Normalization

Scaling time-series data using statistics from a recent window rather than the entire historical context.

Local Outlier Factor

An algorithm that identifies outliers by comparing the local density of a point to its neighbors.

Local Sufficiency

The observation that a large model's preferred token appears in a small model's top-K predictions even when not ranked first.

Locality Preservation

Ensuring that edits to specific facts don't unintentionally change related or nearby knowledge in the model.

Locality-Sensitive Hashing (LSH)

A technique that groups similar items together using hashing, allowing the model to attend to relevant parts of long text without comparing every token to every other token.

Locality-Sensitive Hashing Attention

An efficient attention mechanism that groups similar tokens together to reduce computation, allowing the model to handle longer texts without excessive memory use.

Localization

In conformal prediction, the process of identifying similar examples to condition uncertainty estimates on local neighborhoods rather than global statistics.

Localization Fidelity

How well an explanation's highlighted regions match ground-truth annotations from experts.

Loco-Manipulation

The ability to simultaneously navigate and manipulate objects, combining locomotion with arm control.

Log Anomaly Detection

Identifying unusual or suspicious patterns in system logs that indicate errors, attacks, or failures.

Log-concave distribution

A probability distribution whose logarithm is a concave function, ensuring nice mathematical properties.

Log-odds

The logarithm of the ratio of probabilities for two outcomes, used here as an API-compatible measure of model confidence.

Logical Consistency

Ensuring that different signals or judgments from a model don't contradict each other and follow coherent logical rules.

Logical Inconsistency Detection

Identifying misalignment by finding contradictions in a model's reasoning across equivalent scenarios with different framings.

Logical Options

Pre-defined action sequences or skills expressed using logical rules that guide an agent toward specific goals.

Logical Subspace

A low-dimensional region within a model's internal representations that captures reasoning logic independent of language form.

Logical Vulnerability

A security flaw in program logic rather than memory safety that causes incorrect behavior.

Logit-Adjusted Loss

A loss function that adjusts for class imbalance by modifying the model's output scores.

Logit-based approaches

Methods that use the model's raw prediction scores to make decisions, rather than analyzing deeper internal patterns.

Logit-Level Distillation

Knowledge distillation that transfers the raw model outputs (logits) rather than higher-level representations.

Logit-Space Shrinkage

A method for combining multiple forecasts by averaging them in logit space with a data-dependent prior to reduce variance.

Long-Context

The ability of a model to process and understand very long sequences of text while maintaining coherence across distant parts of the input.

Long-Context Embedding

An embedding model designed to process and maintain meaningful representations across very long documents (thousands of tokens), rather than just short snippets.

Long-Context Handling

The ability to process and understand very long documents or conversations without losing track of earlier information.

Long-Context Inference

Processing input sequences much longer than a model's training context window while maintaining accuracy and efficiency.

Long-Context Reasoning

The ability to process and understand very long input texts (thousands of tokens) while maintaining coherent reasoning across the entire passage.

Long-Context Synthesis

The ability to process and integrate information from many sources or a large amount of text, then combine it into a coherent summary or report.

Long-Document Summarization

Automatically generating concise summaries from lengthy source documents, common in scientific papers.

Long-Form Content Generation

The capability to produce extended, coherent text such as articles, reports, or documents while maintaining consistency and structure throughout.

Long-Form Generation

The capability to produce extended, coherent text outputs like essays, articles, or detailed explanations rather than just short responses.

Long-Form Text Generation

The capability to produce extended, coherent written content such as essays, articles, or detailed explanations rather than short responses.

Long-Horizon Evaluation

Testing an AI system's ability to maintain context and preferences across many sequential interactions over time.

Long-horizon forecasting

Predicting values far into the future, typically requiring models to capture long-range dependencies.

Long-Horizon Reasoning

The ability to plan and execute complex multi-step tasks that require maintaining context and goals over many interactions.

Long-Horizon Retrieval

Finding relevant information across many steps or a large dataset to answer complex multi-part questions.

Long-Horizon Tasks

Complex goals requiring many sequential steps or decisions to complete successfully.

Long-range Coherence

The ability of a model to maintain consistency and logical flow across long sequences of generated text.

Long-Range Interactions

Forces between atoms that are far apart from each other, which are harder for models to capture.

Long-Sequence Processing

The ability to handle very long input texts (thousands or more tokens) efficiently, which standard models struggle with due to computational constraints.

Long-tail knowledge

Rare or uncommon facts that appear infrequently in training data, making them harder for models to remember accurately.

Long-Tail Vocabulary

Rare words that appear infrequently in a corpus, following a power-law distribution.

Long-tailed Distribution

A data distribution where a few common categories dominate while many rare categories have few examples.

Long-tailed Distribution

A dataset where a few common classes have many examples while rare classes have very few, causing models to bias toward common categories.

Long-term Memory (LTM)

Stored structured knowledge (like diagnostic criteria) that an AI system can access during reasoning.

Longitudinal Data

Measurements collected from the same subjects repeatedly over time, rather than a single snapshot.

Look-Ahead Bias

A forecasting error where a model uses information from the future that shouldn't be available at prediction time.

Look-back Dependencies

When a step in a procedure requires referencing or using values computed in earlier steps.

Lookahead Signal

Early predictions or intermediate outputs used to guide future decisions in a generation process.

Looped transformer

A transformer that iterates multiple times at test time, spending more computation on harder problems.

LoRA (Low-Rank Adaptation)

A technique that adds small, trainable layers to a pre-trained model instead of retraining the entire model, making fine-tuning faster and more memory-efficient.

LoRA Adapter

A lightweight method to customize a frozen language model for specific tasks without retraining the entire model.

LoRA Fine-tuning

Parameter-efficient fine-tuning method that adapts a pre-trained model using low-rank updates.

LoRA-based Adaptation

Fine-tuning a model using Low-Rank Adaptation, a parameter-efficient method that adds small trainable layers to a frozen base model.

Loss Mixing

Combining multiple loss functions (e.g., language modeling and distillation) during training with weighted proportions.

Loss Trajectory

The sequence of loss values for a sample across multiple training steps, showing how the model's error on that sample changes over time.

Lossless Compression

Reducing file size while preserving all original data perfectly, so decompression recovers the exact original.

Lost-in-the-Middle Problem

A phenomenon where LLMs struggle to retrieve or process information from the middle of long documents or lists.

Low Latency

The ability to generate responses very quickly with minimal delay between when you send a prompt and when you receive an answer.

Low Rank Approximation

Representing data using fewer dimensions while preserving key information.

Low-code platform

A tool that lets non-programmers build applications by writing minimal code or using visual interfaces.

Low-Degree Polynomial Tests

A computational model that captures the hardness of problems solvable by polynomial-time algorithms.

Low-Dimensional Structure

Data that lies on or near a lower-dimensional manifold within a higher-dimensional space, enabling faster computation.

Low-Pass Propagation

A graph signal processing technique that smooths node features by averaging information across neighborhoods.

Low-rank branch

A lightweight neural pathway that processes information through a compressed representation to reduce computation.

Low-Rank Projection

Compressing high-dimensional data into fewer dimensions, which can lose important information needed for accurate inference.

Low-Resource Language

A language with limited training data and AI tools compared to English or other major languages.

Low-Resource Languages

Languages with relatively little training data available compared to major languages like English, making them harder for AI models to learn.

Low-Resource Learning

Training models effectively with limited labeled data or computational resources.

LP Relaxation

A continuous approximation of a mixed-integer program where binary constraints are relaxed, used to bound solution quality.

Lp Spaces

Mathematical spaces of functions where the p-norm (a measure of size) is finite and well-defined.

Lyapunov Exponent

A measure of how quickly nearby trajectories diverge in a dynamical system; determines stability and predictability.

Lyapunov Function

A mathematical tool used to prove that an iterative algorithm converges by tracking a quantity that decreases over time.

M

Mach-Zehnder Interferometer

An optical device that splits light into two paths and recombines them to create interference patterns for computation.

Machine Identity

Digital credentials (API tokens, service accounts, certificates) that AI agents and automated systems use to authenticate and act in enterprise environments.

Machine Learning Force Field

A neural network trained to predict atomic forces and energies, enabling fast simulations of molecular behavior.

Machine Learning Interatomic Potential (MLIP)

An AI model that learns to predict forces and energies between atoms in molecules and materials.

Machine Translation

Automated translation of text from one language to another using computational systems.

Machine Unlearning

Removing the influence of specific poisoned data from a trained model without full retraining.

Machine-Learned Interatomic Potentials (MLIPs)

Neural network models trained to predict forces and energies between atoms, used to simulate materials without expensive quantum calculations.

Macro Placement

The task of arranging large functional blocks on a chip to optimize performance and minimize wiring.

Macro-F1 Score

An evaluation metric that calculates F1 score for each class separately, then averages them equally.

Mahalanobis Distance

A measure of distance between a point and a distribution that accounts for correlations between variables.

Malware Classification

Categorizing software as benign or malicious based on code analysis and behavior patterns.

Mamba

A state-space model architecture designed to process long sequences faster and with less memory than traditional transformer models.

Mamba Architecture

A neural network design that uses state-space models as an alternative to transformers, offering faster processing and lower memory usage.

Mamba-Transformer Architecture

A hybrid model design that combines Mamba (a state-space model) with Transformer components to process long sequences more efficiently than pure Transformers while maintaining strong performance.

Mamba-Transformer Hybrid Architecture

A neural network design that combines selective state spaces (Mamba) with traditional attention mechanisms to process text more efficiently while maintaining strong performance.

Managed Service

A cloud service where the provider handles infrastructure, updates, and maintenance so you only focus on using the service rather than managing it.

Manifold Hypothesis

The assumption that high-dimensional data lies on a lower-dimensional curved surface (manifold) rather than filling the entire space.

Manifold Learning

Discovering the underlying low-dimensional structure of high-dimensional data.

Mantissa Bits

The fractional part of a floating point number that stores the significant digits of the value.

Margin Bound

A theoretical guarantee on classification error based on how well-separated different classes are in the learned representation.

Margin-Separable

Data points are separable by a linear classifier with a guaranteed minimum distance (margin) from the decision boundary.

Marginal Likelihood

The probability of observed data averaged over all possible model parameters, representing the true statistical objective for learning.

Markdown Output

A plain-text format that uses simple symbols to structure text (like # for headings, ** for bold), making it easy to read and convert to other formats.

Markov Chain

A sequence of events where the next state depends only on the current state, not on the history.

Markov Chain Monte Carlo (MCMC)

A statistical sampling technique that intelligently explores parameter space to find realistic values.

Markov Chain Monte Carlo (MCMC)

A sampling method that generates sequences of dependent samples to approximate probability distributions.

Markov Decision Process

A framework for sequential decision-making with probabilistic state transitions.

Masked Diffusion Language Model

A language model that generates text by iteratively denoising masked tokens, offering an alternative to autoregressive generation.

Masked Diffusion Model

A generative model that iteratively unmasks tokens from a fully masked state, similar to how diffusion models gradually denoise images.

Masked Language Modeling

A training technique where random words in text are hidden, and the model learns to predict them based on surrounding context.

Masked Next-Token Prediction

A training technique where parts of text are hidden and the model learns to predict what should fill those gaps, helping it understand context and meaning.

Masked Pre-training

A self-supervised training method where parts of input data are hidden and the model learns to predict them from context.

Masked Prediction

A training technique where parts of the input are hidden, and the model learns to predict what was masked, helping it understand underlying patterns.

Masked Self Attention

Attention that only looks at past tokens, preventing future information leakage.

Masked Token Prediction

A technique where the model learns to predict hidden or blanked-out words in text, allowing it to reason about context from multiple directions at once.

Masked Tokens

Placeholder positions in text that are hidden or unknown, which the model learns to fill in or refine during generation.

Masking and Unmasking

A process where the model hides (masks) and then progressively reveals (unmasks) parts of text to refine and improve the entire sequence iteratively.

Massart Noise

A noise model where label corruption probability depends on the true label, bounded by a noise rate parameter η.

Massive Activations

Extreme outlier values in a small number of tokens and channels within a neural network layer.

Master Weight Splitting

Separating model weights into components for efficient distributed training.

Material-Discursive Practice

An activity where physical tools and language work together to create and shape reality, rather than simply describing it.

Materialized View

Pre-computed results stored for fast retrieval instead of computing on demand.

Math-Aware Retrieval

Finding mathematically equivalent or structurally similar problems in a dataset, rather than just keyword-based matching.

Math-Specialized

A model that has been optimized and trained specifically for mathematical reasoning and problem-solving tasks, rather than general-purpose language understanding.

Mathematical Notation

Symbolic representations of mathematical expressions and equations (like formulas and symbols) that need special handling to be correctly interpreted by AI models.

Mathematical Notation Parsing

The process of analyzing and interpreting visual mathematical symbols and equations to convert them into a structured, computer-readable format.

Mathematical Reasoning

The ability to solve multi-step math problems by breaking them down logically and showing intermediate steps rather than just guessing the answer.

MathML

An XML-based markup language designed specifically for representing mathematical notation in a way that computers can understand and display.

Matrix Completion

Estimating missing entries in a matrix using observed entries and assumptions like low-rank structure.

Matrix Factorization

Decomposing a matrix into a product of smaller matrices, commonly used for dimensionality reduction and pattern discovery.

Matrix-aware Optimizer

An optimizer that uses properties of weight matrices (like their structure) to compute better updates.

Matryoshka Embeddings

A technique that allows embedding vectors to be shortened (truncated) to smaller dimensions while maintaining quality, letting you trade off between accuracy and storage/speed needs.

Matryoshka Representation Learning

A training technique that allows a single embedding model to produce high-quality results at multiple vector sizes, letting you shrink the embedding dimensions to save storage and speed without retraining.

Max-Risk Objective

An optimization goal that minimizes the worst-case error across all groups or conditions, rather than average error.

Maximal Update (μP)

A parameterization method that keeps optimal learning rates approximately constant across different model sizes.

Maximum Entropy

The equilibrium with highest uncertainty or randomness among all Nash equilibria in a game.

Maxout Network

A neural network layer that outputs the maximum value across a set of linear functions, enabling piecewise linear approximations.

Mean Average Precision (mAP)

Standard metric measuring detection accuracy by comparing predicted object locations to ground truth across different confidence thresholds.

Mean Pooling

A technique that combines multiple token embeddings into a single representation by averaging them, producing one embedding for an entire text sequence.

Mean Squared Error (MSE)

A loss function that measures prediction accuracy by averaging the squared differences between predicted and actual values, commonly used for numerical prediction tasks.

Mean-Field Limit

A mathematical approximation where the behavior of many interacting particles is described by a single probability distribution.

Measurement Artifact

A false finding that results from how something is measured rather than from the phenomenon itself.

Mecha-nudges

Subtle changes to how choices are presented that systematically influence AI agents without degrading the decision environment for humans.

Mechanism Design

Designing rules for interactions between parties to achieve desired outcomes like fairness or efficiency.

Mechanism Linked Evidence

Proof that a model's behavior stems from a specific internal mechanism.

Mechanistic Analysis

Studying how a model's internal computations and representations lead to specific behaviors or failures.

Mechanistic Interpretability

The study of understanding how a language model's internal components and computations work to produce its outputs.

Mechanistic Modeling

Building interpretable models that explain how a system works by capturing its underlying causal mechanisms.

Media Credibility Assessment

Evaluating the trustworthiness and reliability of news sources and media outlets.

Medical Reasoning

The ability to apply clinical knowledge and logic to interpret medical data, such as understanding what symptoms indicate about a patient's condition.

MEG (Magnetoencephalography)

Non-invasive brain imaging that measures magnetic fields produced by neural activity.

Membership Inference Attack

An attack that determines whether a specific data point was used to train a model.

Membership Oracle

A function that answers whether a given statement belongs to a specific language or set, used here to model proof checking.

Memorization

When a model learns to reproduce exact training examples rather than learning general patterns it can apply to new situations.

Memorization-Generalization Delay

Phenomenon where networks fit training data long before learning to generalize to unseen examples.

Memorization-to-Generalization Transition

The shift from a model reproducing training data to creating novel outputs, triggered by increasing dataset size.

Memory Capacity

The maximum amount of information a model can store and retrieve.

Memory Efficiency

How well a model uses available RAM or GPU memory, allowing it to run on smaller or less expensive hardware.

Memory Footprint

The amount of RAM or storage space a model requires to run, which is critical for deployment on resource-constrained devices.

Memory Management

The skill of deciding what information to store, how to organize it, and when to retrieve it during task execution.

Memory Mechanism

A method for storing and retrieving past information to help a model make decisions or predictions.

Memory Poisoning

An attack that corrupts an agent's stored information or context to manipulate its behavior and decisions.

Memory Transformer

A neural component that selects and refines relevant knowledge from long-term memory based on the current context.

Memory Transition

The update rule that transforms a hidden state (memory) given a new input, typically learned as a supervised learning task in SMT.

Memory-Adaptive Scheduling

Dynamically partitioning computation into tasks that fit within available device memory constraints.

Memory-augmented generation

A generation system that stores and retrieves visual references during creation to maintain consistency across outputs.

Memory-Induced Drift

Changes in a model's reasoning process caused by injected user context or attributes, separate from changes in final answers.

Mental Model

An agent's internal representation of task requirements, domain knowledge, and problem structure that guides its reasoning and decisions.

Mention Noise

Errors or corruptions in detected entity mentions that affect downstream processing.

Merged Weights

The combination of a base model's weights with additional trained weights (like from LoRA adapters) into a single unified model file.

Mesh Generation

Creating a 3D surface representation made of connected vertices, edges, and faces.

Message Passing

The core mechanism in GNNs where nodes exchange and aggregate information from their neighbors iteratively.

Meta-Agent

A higher-level agent that monitors and improves other agents by comparing their outputs against reality and updating their code or instructions.

Meta-cognitive

The ability to reflect on and manage one's own thinking processes and decision-making.

Meta-Cognitive Deficit

An agent's inability to reflect on and make wise decisions about when to use its own knowledge versus when to seek external help.

Meta-learning

Training a model to learn how to learn, so it can quickly adapt to new tasks or changing conditions.

Meta-verification

Using verifier-generated explanations and rationales to improve verification, beyond just binary correct/incorrect signals.

Metacognitive Features

Self-awareness about thinking processes, including goal assessment, domain awareness, and strategic exploration.

Metacognitive Gap

The difference between how well models assess their own confidence versus how well humans evaluate belief certainty against evidence.

Metaheuristic

A general problem-solving strategy that explores solutions without guaranteeing optimality but finds good answers quickly.

Metamemory

Knowledge about one's own memory processes, including what to encode, when to retrieve, and how to organize information.

Metamodel

A model that defines the structure and rules for creating other models in model-driven engineering.

Metamorphic Testing

A testing approach that checks if a system maintains consistent behavior under semantically equivalent input transformations.

Metastable

A state that appears stable but is easily disrupted by small changes or perturbations.

Method Lineage

The causal relationships and dependencies showing how one research method evolved from or influenced another.

Methodological Evolution Graph

A structured database mapping how research methods emerge, adapt, and build upon one another over time.

Methodological Viability

Whether a research approach is technically sound and feasible before implementation.

Metric Misspecification

Using an evaluation metric that doesn't align with true objectives.

Metric Monocular Depth Estimation

Estimating real-world distances from a single camera image using deep learning.

Metric-Consistent Digital Twins

Virtual replicas of real objects that preserve accurate physical dimensions and properties for faithful simulation.

Metric-Scale Pose Estimation

Determining a robot's position and orientation in real-world units rather than relative or scaled coordinates.

Micro-expressions

Brief, involuntary facial expressions lasting 0.25-0.5 seconds that reveal genuine emotions.

Microservice Architecture

A system design where independent, containerized services handle specific tasks and communicate together.

Mid-Tier Model

A model positioned between lightweight and flagship versions, balancing capability with efficiency rather than maximizing raw performance.

Mid-training

A training stage between pretraining and post-training where models are trained on curated, large-scale data mixtures to strengthen specific capabilities.

Middleware

Software layer that sits between services to translate, transform, or coordinate their interactions.

Mild Cognitive Impairment (MCI)

Early-stage cognitive decline noticeable to the person but not severe enough to interfere with daily life.

MIMO Formulation

Multi-input, multi-output architecture that processes multiple data streams in parallel to improve model expressiveness without increasing latency.

MiniLM Architecture

A lightweight transformer-based architecture designed to be computationally efficient while maintaining strong performance for text understanding tasks.

Minimax Algorithm

A game-playing algorithm that minimizes the opponent's maximum advantage by exploring all possible moves.

Minimax Framework

A theoretical approach that finds the best strategy against an adversary who chooses the hardest possible problem instance.

Minimax Training

A training method where one part tries to break the model (maximization) while another part fixes it (minimization) to build robustness.

Minimum Spanning Tree

A graph structure connecting all points with minimum total distance, used here to find structural relationships between code samples.

Minimum-energy control

Control strategy that achieves desired system behavior using the least amount of control effort.

Mirror Descent

An optimization algorithm that uses geometric transformations to adapt learning to different data distributions.

Mirror Duality

A property allowing optimization algorithms to switch between different geometric transformations while maintaining convergence.

Misalignment

When an AI model's goals or behaviors diverge from the intended goals of its creators or users.

Misinformation

False or inaccurate information spread online, whether intentionally or unintentionally.

Missing Data Imputation

Filling in gaps in incomplete datasets before analysis, often using statistical or learned methods.

Missing Modality Generalization

A model's ability to work when one or more input modalities are unavailable at test time.

Mistral Architecture

A specific design pattern for transformer-based language models that uses efficient attention mechanisms and grouped query attention to balance performance and speed.

MIT License

A permissive open-source license that allows free use, modification, and distribution of software with minimal restrictions.

MITRE ATT&CK

A knowledge base of adversary tactics and techniques based on real-world observations, used to classify and understand cyberattacks.

Mixed Authorship

Text that combines both human-written and AI-generated content in the same document.

Mixed Precision

Using different numerical precisions for different parts of computation.

Mixed Precision Training

Training with lower precision for speed while maintaining higher precision where needed.

Mixed State

A quantum state representing uncertainty or entanglement with an environment, described by a density matrix rather than a pure state vector.

Mixed-Batch Pre-training

Training on multiple datasets with different structures and properties in the same training batch.

Mixed-Integer Linear Programming (MILP)

A mathematical optimization approach for problems with both continuous and discrete variables subject to linear constraints.

Mixed-Precision Quantization

Using different numerical precisions (e.g., 8-bit, 4-bit) for different parts of a model to reduce memory and computation.

Mixed-Quality Data Training

A training approach that uses datasets containing varying levels of quality and accuracy, rather than only perfectly curated examples, to improve efficiency and real-world performance.

Mixed-State Representation

A quantum state that is a probabilistic mixture of pure quantum states rather than a single definite state.

Mixed-Truth Content

Misinformation that blends accurate information with false claims to appear credible and evade detection.

Mixing Time

The number of steps needed for a sampler to reach the target distribution; faster mixing means fewer samples needed.

Mixture of Experts

An architecture where a model contains multiple specialized sub-networks (experts) and selectively activates only a few for each input, improving efficiency without sacrificing capability.

MLLM-as-a-Judge

Using multimodal large language models to evaluate outputs by assessing both visual and semantic correctness with rubrics.

MLP (Multi-Layer Perceptron)

Feed-forward neural network layers in transformers that dominate parameter count and can be independently scaled.

MLX

A machine learning framework optimized for running models efficiently on Apple Silicon chips.

MLX Deployment

Running a model locally on Apple Silicon hardware using the MLX framework, which is optimized for efficient inference on Mac devices.

MLX Format

A model format designed specifically for efficient inference on Apple Silicon devices, optimized for the MLX machine learning framework.

MLX Framework

A machine learning framework specifically designed for running AI models efficiently on Apple Silicon hardware.

MLX Optimization

A framework that optimizes AI models to run efficiently on Apple Silicon chips (like M1, M2, M3), taking advantage of their specific hardware capabilities.

Mobile Manipulation

A robot's ability to move around an environment while using its arms to pick up and interact with objects.

Modality

A type of input or output data a model can process, such as text, images, or audio.

Modality Collapse

When a multimodal system stops using some of its input types and relies only on one or a few.

Modality Gap

The performance difference between a model's reasoning using text versus visual information.

Modality Imbalance

Unequal influence or representation of different data types (like images vs. text) in a multimodal model.

Modality Transfer

Adapting a model trained on one type of data (like video) to work with a different type (like tactile signals) efficiently.

Modality-Specific Supervision

Using text descriptions tailored to highlight unique properties of each data type (e.g., thermal features for infrared).

Modality-Specific Tokenizers

Specialized components that convert different input types (text, audio, video, motion) into a common token format.

Modality-wise Optimization

Training approach that handles each data type (audio, video, text) with separate, tailored optimization strategies.

Mode Connectivity

The property that different trained models can be connected through a continuous path in weight space.

Model Adaptation

Techniques for customizing a pre-trained model's behavior for specific tasks or use cases.

Model Architecture

The underlying structural design of a neural network that determines how data flows through it and how it processes information.

Model Backbone

The core underlying architecture of a model that serves as the foundation for specialized versions or fine-tuned variants.

Model Calibration

How well a model's confidence or predictions match actual human behavior and real-world outcomes.

Model Capability Tier

A ranking level within a model family that indicates relative power, speed, and cost trade-offs.

Model Capacity

The size and complexity of a model, which determines how much information it can learn and store; smaller capacity means fewer parameters and less computational power needed.

Model Card

A document that describes a machine learning model's intended use, performance, and limitations.

Model Checkpoint

A saved snapshot of a trained model's weights and parameters, stored in formats like safetensors or PyTorch for later use or deployment.

Model Collapse

When a language model's training performance suddenly degrades due to overconfidence in incorrect predictions.

Model Compression

Techniques used to make models smaller and faster to run, allowing them to work on devices with limited memory or processing power.

Model Deployment

The process of configuring and launching a trained model in a cloud environment so it can receive requests and generate responses.

Model Depth

The number of layers in a neural network; deeper models can learn more complex patterns but are slower, while shallower models are faster but may miss subtle details.

Model Disagreement

Differences in predictions across multiple models on the same input.

Model Distillation

A technique where a smaller, faster model is trained to mimic the behavior of a larger, more capable model to reduce computational costs.

Model Drift

Degradation of model performance over time due to changes in data distribution or real-world conditions.

Model Efficiency

How well a model performs relative to its computational cost and resource requirements, important for deployment on devices with limited hardware.

Model Family

A group of related AI models developed by the same organization that share similar architecture and training approaches but may differ in size or capabilities.

Model Footprint

The amount of memory and computational resources required to run a model, determined primarily by its size and architecture.

Model Footprint

The amount of memory and computational resources required to run a model, with smaller footprints being more efficient.

Model Format

The file format used to store and load a model's weights; common formats like safetensors and PyTorch determine compatibility with different tools and frameworks.

Model Formation

The process of creating new conceptual frameworks or mathematical structures to represent a problem domain.

Model Free Learning

Learning optimal behavior without explicitly modeling the environment.

Model Inference

The process of running a trained model on new input data to generate predictions or outputs, as opposed to training the model.

Model Initialization

The process of setting a model's weights to starting values before training; random initialization means weights are set to random numbers rather than learned values.

Model Layers

The stacked computational components in a neural network that progressively transform input data; fewer layers means faster processing but potentially less ability to capture complex patterns.

Model Merging

A technique that combines the learned knowledge from two or more trained models into a single model.

Model Modularity

Designing models so independent components can be used, removed, or composed separately without performance loss.

Model Optimization

Techniques used to make a model smaller, faster, or more efficient while maintaining acceptable performance.

Model Parameters

The internal numerical values (weights) that a neural network learns during training and uses to make predictions.

Model Precision

The numerical accuracy used to store a model's weights and calculations—higher precision (like float32) is more accurate but uses more memory, while lower precision (like int4) is more efficient but less precise.

Model Predictive Control (MPC)

A control method that predicts future system behavior and optimizes actions based on a mathematical model.

Model Predictive Control (MPC)

A control method that predicts future system behavior and optimizes actions over a time horizon.

Model Pruning

Removing unnecessary parameters or connections from a model to reduce size and computation.

Model Quantization

A technique that reduces a model's size and memory requirements by using lower-precision numbers, enabling it to run on resource-limited devices.

Model Scale

The size of a model measured by the number of parameters it contains; smaller models are faster but less capable than larger ones.

Model Scaling

The practice of increasing a model's size (parameters, training data, or compute) to improve its capabilities and performance.

Model Size

The total number of parameters (learnable values) in a model, which affects its memory usage, speed, and capability.

Model Specialization

Training a model to excel at a narrow set of tasks rather than performing well across many different domains.

Model Stub

A minimal, simplified version of a model used for testing code and infrastructure without the computational cost of a full model.

Model Suite

A collection of related models of varying sizes or configurations released together for comparative research and analysis.

Model Transparency

The ability to examine and understand how a model works, including access to its weights, architecture, and training details.

Model Validation

The process of testing a model to ensure it works correctly within a framework or pipeline before deploying it for real tasks.

Model Variant

A modified version of a base model that changes its size, capabilities, or behavior while maintaining the same core architecture.

Model Weights

The learned numerical parameters inside a neural network that determine how it processes input and generates output.

Model Width

The hidden dimension size of a neural network layer, controlling the model's capacity to represent information.

Model-Agnostic

A technique that works across different model architectures without requiring architecture-specific modifications.

Model-Based Reinforcement Learning

Learning approach where an agent builds a model of how the environment works, then uses it to plan actions.

Model-Internal Signals

Information derived from a model's own computations (like attention patterns or confidence scores) without external tools.

Moderation Layer

A specialized model or component that filters and evaluates user inputs or outputs to prevent harmful content from reaching users or being generated.

Modern Standard Arabic

The formal, standardized variety of Arabic used in official documents and media, distinct from regional spoken dialects.

Modular Architecture

A system design where independent components with standardized interfaces can be swapped and recombined without tight coupling.

Modular Code Generation

Generating code as independent, reusable functions or modules that can be combined to solve larger problems.

Modular Deployment

Using only relevant subsets of a model's components independently or in combination for specific tasks or domains.

Modular Transfer

Reusing learned or numerical components across different problems by swapping modules without full retraining.

Modularity

The ability to use and compose independent subsets of a model without requiring the full system or human-defined rules.

Molecular Design

Process of creating new molecules with desired properties for applications like drug discovery.

Molecular Dynamics (MD) Simulation

A computational technique that simulates how atoms move and interact over time.

Molecular Language Model

A specialized AI model trained to understand and process chemical structures by learning patterns from molecular data, similar to how text language models learn from words.

Molecular Property Prediction

Task of predicting chemical or physical properties of molecules based on their structure.

Molecular Reasoning

The ability to understand and predict how molecules behave, interact, and transform based on their chemical structure and properties.

Moment Matching

A distillation technique that aligns statistical properties (moments) between a teacher and student model.

Momentum

An optimization technique that accumulates gradients to accelerate convergence.

Momentum-Based Adaptation

A technique that smoothly updates model parameters using accumulated historical changes for stability.

Monocular Depth Estimation

Predicting 3D depth information from a single 2D image without stereo or multiple views.

Monocular Reconstruction

Inferring 3D structure and depth from a single 2D image or video frame without stereo or multi-view input.

Monosemanticity

When a neuron or expert performs a single, well-defined function rather than handling multiple unrelated tasks.

Monotonic Improvement

A guarantee that each update to a policy increases or maintains performance, never decreases it.

Monte Carlo Approximation

Using random sampling to estimate quantities that are expensive or impossible to compute exactly.

Monte Carlo Dropout

A technique using dropout during inference to estimate model uncertainty by sampling multiple predictions.

Monte Carlo Sampling

Estimating expected values by drawing random samples and averaging results.

Monte Carlo Simulation

A computational technique using repeated random sampling to estimate probability distributions and outcomes.

Monte Carlo Tree Search (MCTS)

An algorithm that explores game possibilities by randomly simulating many future moves to estimate the best action.

Moral Disengagement

Psychological mechanisms that allow people to justify harmful behavior by reframing it as acceptable or necessary.

Moral Hazard

When one party takes excessive risks because another party bears the consequences, reducing incentive to act carefully.

Moral Reasoning

A model's ability to understand and apply ethical principles to make judgments about right and wrong.

Morpho-semantic Features

Linguistic properties that describe both the structure (morphology) and meaning (semantics) of words.

Morphological Analysis

The ability to understand and process word structure, including prefixes, suffixes, and inflections that change word meaning or grammatical function in languages like Russian.

Morphological Complexity

The linguistic challenge of handling languages where words change form significantly based on grammar, tense, and case—common in Polish and other inflected languages.

Morphological Paradigm

The complete set of inflected forms of a word, showing how it changes across different grammatical contexts.

Morphology

The structure and rules of how words are formed and modified in a language, which is especially important for languages like Korean with complex word composition.

Motion Capture

Recording and digitizing human body movement for analysis or animation.

Motion Causality

The relationship between user-driven actions and their physical consequences in a scene.

Motion Retargeting

Adapting motion capture data from one character or skeleton to another while preserving the movement intent.

Motion-Adaptive Threshold

A dynamic decision boundary that adjusts based on detected motion to determine when cached features can be safely reused.

MPNet Architecture

A neural network design that combines masked language modeling with permutation language modeling to better understand relationships between words in text.

Multi Agent Systems

Multiple independent agents interacting and learning in a shared environment.

Multi Hop Reasoning

Solving problems by chaining multiple reasoning steps together sequentially.

Multi-Agent Architecture

A system design where multiple specialized agents work together—some generate options in parallel, others coordinate the final result.

Multi-Agent Coordination

Techniques for making multiple autonomous agents work together toward shared goals.

Multi-Agent Ensemble

A system where multiple AI agents work together, cross-checking and debating each other's reasoning before producing a final answer.

Multi-Agent Evaluation

Assessment where multiple agents participate—some as judges and others as subjects being evaluated.

Multi-agent framework

A system where multiple AI agents with different roles work together to solve a problem.

Multi-Agent Interaction

Structured communication and mutual influence between multiple AI agents that shapes collective behavior over time.

Multi-Agent Orchestration

Coordinating multiple specialized AI agents to work together, deciding which agent handles which task.

Multi-Agent Plan Execution

A system where multiple agents coordinate to execute complex plans by breaking them into steps and validating each one.

Multi-Agent Reinforcement Learning (MARL)

Training multiple agents simultaneously so they learn to cooperate and improve together toward shared goals.

Multi-agent system

Multiple AI agents working together, each with different roles or goals, to solve a problem collaboratively.

Multi-App Coordination

An agent's ability to work across multiple applications simultaneously, transferring data and context between them to complete complex tasks.

Multi-armed bandit

A decision problem where an agent repeatedly chooses between options to maximize rewards while learning which is best.

Multi-Column Architecture

A parallel network structure where multiple smaller models process data independently and combine their outputs.

Multi-Depot Vehicle Routing Problem (MDVRP)

A logistics optimization task where vehicles start from multiple depots and must visit customers while minimizing cost or distance.

Multi-Domain Training

Training a model on question-answer pairs from many different topics or fields to make it work well across diverse subjects.

Multi-Epoch Training

Training a model by repeating the same dataset multiple times rather than using each sample once.

Multi-fidelity Optimization

Hyperparameter optimization that uses cheap approximations alongside expensive full evaluations to save compute.

Multi-File Context

The ability to understand and work with code spread across multiple files in a project, maintaining awareness of how different files relate to each other.

Multi-Head Latent Attention (MLA)

An attention mechanism that shares a single low-rank latent representation across all attention heads instead of maintaining separate keys and values per head.

Multi-hop Retrieval

Finding answers by connecting information across multiple documents or reasoning steps.

Multi-Image Reasoning

The ability to connect and synthesize information from multiple images to solve a problem or answer a question.

Multi-label Classification

A classification task where each example can belong to multiple categories simultaneously, unlike single-label classification.

Multi-Label Text Classification

Assigning multiple categories to a single text document, where labels can overlap or co-occur.

Multi-Language Support

The ability to understand and generate code across many different programming languages.

Multi-modal Prediction

Generating multiple plausible different outcomes rather than a single deterministic prediction.

Multi-Object Tracking

Following multiple moving objects across video frames to maintain consistent identities over time.

Multi-Objective Optimization

Finding solutions that balance multiple competing goals simultaneously.

Multi-Objective Reinforcement Learning (MORL)

Training an AI system to optimize multiple competing goals simultaneously rather than a single objective.

Multi-Pass Reasoning

An iterative approach where an LLM revisits and refines its analysis across multiple complete passes through a problem.

Multi-Provider Architecture

System design that integrates multiple LLM providers for improved reliability through consensus and fallback mechanisms.

Multi-Round Event Injection

Simulating realistic user activity over time by injecting sequences of events to create complex, evolving world states for testing.

Multi-shot video generation

Creating coherent video sequences with multiple scenes while maintaining consistency of characters and objects across shots.

Multi-Step Analysis

The ability to break down complex problems into smaller sequential steps and solve them methodically rather than attempting to answer in one go.

Multi-Step Execution

Completing a task that requires performing multiple sequential actions or reasoning steps.

Multi-Step Logic

The ability to break down complex problems into sequential reasoning steps and correctly combine them to reach a solution.

Multi-Step Prediction

Forecasting what happens several time steps into the future, rather than just the immediate next state.

Multi-Step Reasoning

The ability to break down complex problems into smaller steps and solve them sequentially, rather than jumping directly to an answer.

Multi-Step Task Execution

The ability to break down complex problems into sequential steps and execute them autonomously without human intervention between steps.

Multi-Step Tasks

Problems or workflows that require a model to perform multiple sequential operations or reasoning steps to reach a final answer.

Multi-task Learning

Training a single model on multiple different tasks simultaneously so it learns shared skills across them.

Multi-Teacher Distillation

Training a student model using knowledge from multiple specialized teacher models to capture diverse expertise.

Multi-Teacher Learning

Using multiple teacher models simultaneously to train a student model, combining their different strengths.

Multi-Token Prediction

Generating multiple future tokens in parallel instead of one at a time.

Multi-Turn Conversation

The ability to maintain context and coherence across multiple back-and-forth exchanges with a user, remembering earlier messages in the conversation.

Multi-Turn Conversations

The ability to maintain context and coherence across multiple back-and-forth exchanges with a user in a single conversation.

Multi-Turn Dialogue

A conversation where the model maintains context across multiple back-and-forth exchanges with a user, remembering previous messages.

Multi-valence Sentiment

Recognizing that a single text can express multiple opposing sentiments (both positive and negative) simultaneously.

Multi-Vector Embeddings

A representation where documents and queries are encoded as multiple vectors (one per token) instead of a single vector, enabling more precise matching.

Multi-Vector Retrieval

A search method that represents a single piece of text using multiple vectors simultaneously, allowing more flexible and nuanced matching.

Multi-View Analysis

Examining the same data from multiple perspectives to capture complementary information.

Multi-view Consistency

Ensuring that representations of the same scene remain coherent across different viewing angles or perspectives.

Multi-view Fusion

Combining information from multiple camera angles to create a unified understanding of a scene.

Multi-view Representation

Representing a 3D scene using multiple 2D images captured from different camera angles.

Multiagent Debate

A process where multiple AI agents discuss and argue to reach a consensus answer on a task.

Multilevel Methods

Computational techniques that combine solutions from models of varying accuracy and cost to reduce overall computation.

Multilingual

A model trained to understand and generate text in multiple languages, not just English.

Multilingual Bias

Systematic performance gaps across languages, often favoring high-resource languages like English over others.

Multilingual Capabilities

The ability of a model to understand and generate text in multiple languages, often with varying levels of proficiency across different language pairs.

Multilingual Capability

A model's ability to understand and generate text in multiple languages, not just English.

Multilingual Code Corpus

A large collection of source code written in many different programming languages, used to train the model.

Multilingual Coverage

The ability of a model to understand and generate text in multiple languages, typically because it was trained on data from many different languages.

Multilingual Embedding Space

A shared mathematical space where sentences from different languages are positioned so that translations or sentences with the same meaning end up near each other.

Multilingual Embeddings

A shared numerical space where text from different languages is represented so that similar meanings across languages are positioned close together, enabling cross-language comparison.

Multilingual Medical Reasoning

AI systems that understand and reason about medical information across multiple languages, especially low-resource ones.

Multilingual Model

A model trained on text from multiple languages, allowing it to understand and generate text in several different languages.

Multilingual NLP

Natural language processing systems designed to understand and work with text in multiple languages, including non-Latin scripts like Cyrillic.

Multilingual Performance

A model's ability to understand and generate text in multiple languages with comparable quality across different language pairs.

Multilingual Reasoning

The capability to understand, process, and reason through problems in multiple languages, not just English.

Multilingual Specialization

When a model is optimized for one or a few languages rather than many, trading broad language support for deeper fluency in those specific languages.

Multilingual Speech Corpus

A collection of audio recordings in multiple languages used to train speech recognition and synthesis systems.

Multilingual Support

The ability of a model to understand and process text in multiple languages, not just English.

Multilingual Training

Training a model on text from many different languages so it can understand and generate text across all of them.

Multimodal

A model that can process and understand multiple types of input, such as both text and images.

Multimodal Action Prediction

Forecasting future actions using multiple types of sensory input (e.g., vision and motor feedback) simultaneously.

Multimodal Agent

An AI system that can process and reason over multiple types of data (text, images, documents) to complete tasks.

Multimodal Alignment

The process of training a model to understand and connect different types of data (like audio and text) by mapping them into a shared space where related concepts are close together.

Multimodal Attack

An adversarial attack that simultaneously perturbs multiple input modalities (e.g., text and audio) to fool a model.

Multimodal Attention

Attention mechanism that processes multiple types of input (like text and image features) simultaneously in a transformer.

Multimodal Benchmark

A standardized test dataset that evaluates AI models on tasks combining multiple types of input like images and text.

Multimodal Bias

Discriminatory patterns that emerge when AI models process multiple input types (text, audio, images) together.

Multimodal Comprehension

The ability of an AI model to understand and reason about multiple types of input data (like images and text) simultaneously.

Multimodal Content Analysis

Processing and understanding multiple types of information (video, audio, text) simultaneously to extract meaning and structure.

Multimodal Dialogue

A conversational interaction where the model can understand and respond to inputs that combine both text and images in a natural back-and-forth exchange.

Multimodal Diffusion Model

A generative model that takes multiple types of input (like text and images) to create new content.

Multimodal Embedding

A representation that captures meaning from multiple types of data (like text, images, and tables) in a single searchable format.

Multimodal Evaluation

Assessing AI systems across multiple input/output types (audio, video, text) simultaneously rather than separately.

Multimodal Fusion

Combining data from multiple sources (like ECG and PPG) to make better predictions than using each source alone.

Multimodal generation

Creating content that combines multiple types of media (text, images, audio, interactive elements) chosen based on what best serves the message.

Multimodal Generative Model

An AI model that processes and generates outputs from multiple input types (text, images, etc.) simultaneously.

Multimodal Generative Reward Model

A reward model that processes multiple input types (text, images) and generates interpretable feedback about output quality.

Multimodal Graph

A graph where nodes and edges are enriched with multiple types of data like text, images, and numerical attributes.

Multimodal Humor Understanding

The ability to comprehend humor by combining visual and textual information to identify incongruities and their resolutions.

Multimodal Input

The ability to accept and process multiple types of input data simultaneously, such as both images and text in the same request.

Multimodal Integration

The process of combining and coordinating information from multiple sensory or cognitive modalities (vision, sound, language).

Multimodal Large Language Model (MLLM)

An AI model that processes both text and images to understand and reason about visual content.

Multimodal Learning

Training a model to understand and process multiple types of input data (like text and images) together rather than separately.

Multimodal Memory

Memory systems that integrate and preserve information from multiple input types like text and images.

Multimodal Model

An AI model that can process and understand multiple types of input data, such as video, images, and text together.

Multimodal Parser

A system that extracts structured information from documents containing both text and visual elements like figures and tables.

Multimodal Pipeline

A sequence of processing steps that handles multiple types of input data (like text and images) together in a single workflow.

Multimodal Prediction

Generating multiple plausible future outcomes instead of a single prediction.

Multimodal Pretraining

Training a model on paired images and text data so it learns to connect visual and language understanding together.

Multimodal Reasoning

The ability to solve problems by integrating information from multiple input types like images and text.

Multimodal Recommendation

A recommendation system that uses multiple types of data (text, images, etc.) to predict user preferences.

Multimodal representation learning

Training models to learn useful features from data with multiple types of input (e.g., images and text).

Multimodal Safety

Safety mechanisms that operate across multiple input types like images and text simultaneously.

Multimodal Survival Prediction

Predicting time-to-event outcomes using multiple types of data (e.g., images, lab results, clinical notes).

Multimodal Tasks

AI tasks that require processing multiple types of input data at once, such as understanding both an image and a text question about it.

Multimodal Understanding

The ability of an AI model to process and reason about multiple types of input data (like images and text) simultaneously.

Multimodal Web Agent

An AI system that understands both text and visual information to autonomously interact with websites and perform tasks.

Multimodal-Aware

A system designed to understand and work with multiple types of content, such as text and images, even if it only processes one type directly.

Multiple Instance Learning

A learning approach where training data consists of bags (groups) of instances, useful when only bag-level labels are available.

Multiple Kernel Learning (MKL)

A machine learning technique that combines multiple similarity measures (kernels) by learning optimal weights for each.

Multiple Negatives Ranking (MNR)

A training technique that improves embeddings by comparing a text sample against multiple negative examples, helping the model learn to distinguish similar from dissimilar content.

Multiple-Choice Question (MCQ)

An evaluation format where a model selects the correct answer from a fixed set of options.

Multiplexed Inference

A technique that allows a model to handle multiple requests or tasks simultaneously within a single forward pass, improving efficiency on concurrent workloads.

Multiply-Accumulate (MAC)

A hardware operation that multiplies two numbers and adds the result to an accumulator, commonly used in neural networks.

Multiscale Problem

A physics or engineering problem with important dynamics at multiple length or time scales simultaneously.

Multitask Learning

Training a model on multiple related tasks simultaneously so it learns shared patterns that improve performance across all tasks.

Multitask Training

A training approach where a model learns to perform multiple related objectives simultaneously, which often improves its overall performance and generalization.

Multivariate Time Series

Time-ordered data with multiple variables or channels measured simultaneously, where variables may influence each other.

Multivector Algebra

An algebraic structure where elements can represent scalars, vectors, and higher-dimensional geometric objects simultaneously.

Muon Optimizer

A second-order optimizer designed for hypersphere-constrained training that improves stability during scaling.

Music Codec

A neural network that compresses audio into discrete tokens for language model processing and reconstructs waveforms from those tokens.

Music Understanding

The ability of a model to analyze and interpret musical characteristics like genre, emotion, harmony, and structure from audio or music data.

Mutation Score

A metric measuring test quality by counting how many intentional code mutations the tests can detect.

Mutation Testing

Deliberately introducing bugs into code to test whether test suites can catch them.

Mutual information

A measure of how much knowing one variable tells you about another variable.

Mutual Information Balancing

Regularization technique that ensures both modalities contribute equally to the joint representation by equalizing information flow.

Mutual Nearest Neighbors

A metric for measuring similarity between representations by finding pairs of samples that are each other's closest matches.

MXFP4

A low-precision floating-point format (4-bit) designed for efficient neural network computation while maintaining reasonable accuracy.

MXFP4 Precision

A 4-bit floating-point quantization format that uses microscaling to maintain accuracy while significantly reducing model size and memory requirements.

N

N-Body Simulator

A computational tool that models the motion of multiple particles under mutual gravitational or other forces.

Named Entity Recognition

A natural language processing task that identifies and classifies specific entities like people, places, and organizations within text.

Narrative Explanation

A natural language story or description that explains why an AI model made a particular prediction.

Narrative Generation

The task of automatically creating coherent stories or sequences of events in text form.

Narrative Structure

The organized framework of a story, including how events are sequenced and how the plot progresses from beginning to end.

Nash Equilibrium

A strategy profile where no player can improve by unilaterally changing their strategy, given others' strategies.

Native Modality Processing

The ability of a model to directly understand different types of input (like images or audio) without converting them to text first.

Native Processing

When a model can directly understand different types of input (like images or audio) without needing to convert them to text first.

Native Resolution Handling

The ability to process images at their original sizes and aspect ratios without forcing them into a fixed square dimension, reducing information loss from resizing.

Natural Gradient

An optimization method that accounts for the geometry of the data distribution, often converging faster than standard gradient descent.

Natural Language Generation

The process by which a model produces human-readable text output based on its understanding of input and learned patterns.

Natural Language Inference (NLI)

A training task where a model learns to determine whether one sentence logically follows from another, helping it understand relationships between texts.

Natural Language Processing

The field of AI focused on enabling computers to understand, interpret, and generate human language in a meaningful way.

Natural Language Processing (NLP)

The field of AI focused on understanding and generating human language in a meaningful way.

Natural Language to Code Translation

The process of converting human-written instructions or descriptions into executable programming code.

Natural Language Understanding (NLU)

The ability of a model to comprehend and extract meaningful information from human language, rather than just pattern-matching on words.

Ndcg

Ranking metric measuring how well relevant items are placed at the top.

Negative Control Samples

Unperturbed reference images used as stable anchors to detect and correct for technical variations in experiments.

Negative Knowledge Transfer

When learning from one task actually hurts performance on another task due to conflicting patterns.

Negative Sampling

A training technique where the model learns by comparing correct matches against intentionally chosen incorrect examples to improve discrimination.

Negative Transfer

When training a model on multiple tasks simultaneously hurts performance compared to training on individual tasks separately.

Neural Approximation

Using neural networks to learn and approximate complex functions, such as safety constraints, from data.

Neural Audio Codec

A machine learning model that compresses audio into a compact digital format and can reconstruct it back to near-original quality.

Neural Codec

A learned compression model that encodes audio into discrete or continuous latent representations optimized for reconstruction.

Neural Encoder

A neural network component that converts raw text input into a numerical representation (embedding) that captures semantic meaning.

Neural Encoding

The process of converting text or other data into numerical vector representations using neural networks, enabling machines to understand and process language.

Neural Field

A neural network that represents continuous 3D properties (like temperature or material density) as a smooth function rather than discrete grid values.

Neural Information Retrieval

Using neural networks and embeddings to find relevant documents or passages in response to a query, rather than traditional keyword matching alone.

Neural interpreter

An AI model trained to predict how code executes step-by-step without actually running it.

Neural Mapping

Transforming brain activity patterns from one condition to match patterns from another condition.

Neural Materialization

Converting learned neural predicates into explicit logical choices or facts for symbolic reasoning.

Neural Memory

A learnable memory component that neural networks can read and write to.

Neural ODE

A neural network that models continuous dynamics by treating layers as differential equations.

Neural Operator

A learned function that maps between infinite-dimensional function spaces, used for solving physics equations on meshes.

Neural Ordinary Differential Equations (Neural ODEs)

Neural networks that model continuous-time dynamics by treating hidden states as solutions to differential equations.

Neural Posterior Estimation

Using neural networks to directly learn and approximate the posterior distribution of model parameters.

Neural Renderer

A learned neural network that synthesizes or modifies images by applying rendering operations like lighting changes.

Neural Retrieval

A search method that uses neural networks to understand semantic meaning and find relevant documents, rather than relying on keyword matching alone.

Neural Scorer

A neural network trained to assign relevance scores to candidate items for a given query.

Neural Surrogate

A fast neural network trained to approximate expensive physics simulations.

Neuro-symbolic AI

Combining neural networks with symbolic logic to get both the flexibility of learning and the interpretability of rule-based systems.

Neuron Activation

The pattern of which neurons in a neural network fire or respond when processing specific inputs.

Neuron Polarization Effect

The phenomenon where interpretable neurons become more selective while non-interpretable neurons remain less selective as models scale.

Neuron Selectivity

The degree to which a neuron responds specifically to certain inputs versus broadly to many different inputs.

Neurosymbolic Approaches

Combining neural networks with symbolic reasoning (like rules or logic) to enable both learning and interpretable decision-making.

Newton-Schulz Iteration

A numerical method that iteratively approximates matrix functions like square roots or inverses through repeated matrix multiplications.

Newton's Method

An optimization algorithm that finds roots of equations by iteratively refining guesses using function derivatives.

Next-Generation Capabilities

Advanced features and improvements in a model that represent a significant step forward from previous versions.

Next-Token Prediction

The fundamental task where a language model learns to guess the most likely next word (or token) based on all the words that came before it.

Next-Visit Prediction

A pretraining task where a model learns to predict which clinical events will occur at a patient's next healthcare visit.

Neyman-Orthogonal Debiasing

A statistical technique that removes bias from estimators while maintaining asymptotic normality, enabling valid inference on parameters of interest.

NF4 Quantization

A specific 4-bit quantization method that uses a normalized float format to preserve model accuracy while dramatically reducing memory requirements.

NLG Evaluation

Assessing the quality of machine-generated text across criteria like fluency, coherence, and relevance.

Node Classification

A graph task where the goal is to predict labels for individual nodes using graph structure and node features.

Node Embedding

A vector representation of a node in a graph that captures its structural properties and relationships.

Noise Initialization

The starting point for diffusion generation, typically random Gaussian noise that gets progressively refined into an image.

Noise Reduction Pipeline

A multi-step filtering system combining domain rules, statistical patterns, and behavioral signals to remove false alerts.

Noise Robustness

The ability of a model to maintain performance when given irrelevant, incorrect, or corrupted input data.

Noise Schedule

A sequence defining how much noise is added during training and removed during sampling in diffusion models.

Noisy Data Filtering

A preprocessing technique that removes or corrects low-quality or mismatched training examples before training, improving model reliability.

Noisy Labels

Training data where some examples have incorrect labels, which can degrade model performance if not handled carefully.

Non Autoregressive Decoding

Generating all output tokens simultaneously rather than one at a time, enabling faster inference.

Non-Autoregressive

A generation approach where the model generates multiple tokens in parallel or through iterative refinement, rather than one at a time.

Non-Autoregressive Generation

A text generation approach where the model can predict or refine multiple words in parallel, rather than generating one word at a time in sequence.

Non-Commercial License

A legal restriction that permits using the model for learning and research but prohibits using it in production systems or for commercial purposes.

Non-convex Optimization

Finding minima in loss landscapes with multiple local minima, common in deep learning.

Non-Functional Requirements

Specifications describing how a system should perform, including quality attributes like performance and security.

Non-IID Data

Data distributed unevenly across devices, where each device has different data patterns—more realistic than uniform distribution.

Non-Markovian Decision Problem

A decision problem where the optimal action depends on history, not just the current observation, because the present state is ambiguous.

Non-rigid Deformation Recovery

Tracking and reconstructing objects that bend or change shape, rather than staying rigid.

Non-Stationary Dynamics

System behavior that changes over time rather than remaining constant, like wear or environmental drift.

Non-verbatim memorization

A model's ability to recall factual knowledge even when the exact wording or phrasing differs from training data.

Nonconformity Score

A measure of how unusual or unreliable a prediction is, used by conformal methods to decide which predictions to include in the answer set.

Nonlinear Optimization

Finding the best parameter values for a model when the relationship between inputs and outputs is not linear.

Nonlinear Regression

Fitting curved or complex relationships between inputs and outputs, beyond simple linear patterns.

Nonuniform Capacity Allocation

Assigning different amounts of parameters or computation to different layers rather than distributing them equally.

Norm Responsiveness

How well a model adapts its behavior based on social norms and contextual expectations.

Normalized Relevance Measure

A framework for quantifying how much each component of a neural network contributes to its output prediction.

Normalizing Flow

A neural network that transforms simple distributions into complex ones while maintaining the ability to calculate exact probabilities.

Noun Class

A grammatical system where nouns are grouped into categories that affect agreement with other words.

Novel View Synthesis

Generating realistic images of a 3D scene from camera viewpoints not seen during training.

NPU (Neural Processing Unit)

A specialized hardware chip designed specifically to accelerate AI model computations, found in modern mobile devices.

Nuanced Understanding

The ability to grasp subtle meanings, context, and shades of gray in language rather than treating everything as black-and-white.

Nucleotide Sequence

The ordered arrangement of DNA building blocks (A, T, G, C) that make up genetic code.

Nuisance correlation

Unwanted statistical relationships between modalities that don't reflect the underlying signal.

Nuisance Variable

A factor that varies in your data but doesn't affect the task label—like lighting in object recognition.

Numeric Planning

AI planning that handles continuous numeric quantities like data sizes, processing times, and resource constraints.

Numerical Fidelity

The accuracy and precision with which a model preserves mathematical calculations; lower fidelity means some precision is lost, often as a trade-off for smaller model size.

Numerical Reasoning

The ability to understand, manipulate, and solve problems involving numbers, calculations, and mathematical logic.

Numerical Stability

The property of an algorithm to produce consistent results despite small errors or precision changes during computation.

NVFP4 Precision

A low-precision numerical format optimized by NVIDIA that uses fewer bits per number than standard formats, enabling efficient inference on NVIDIA GPUs while maintaining reasonable accuracy.

NVFP4 Precision

A low-precision numerical format that uses 4 bits per weight, developed by NVIDIA to compress models for efficient inference on consumer hardware.

O

Object Detection

A computer vision task that identifies and locates specific objects within an image by drawing boxes around them.

Object Localization

The task of identifying where specific objects are located within an image and describing their positions.

Object Masking

The process of creating a binary or multi-class map that highlights which pixels belong to a specific object, effectively isolating it from the background.

Object Segmentation

The task of identifying and outlining individual objects in an image or video by marking their exact boundaries at the pixel level.

Object-Goal Navigation (OGN)

Task where an AI agent navigates to locate and reach a specified target object in a physical environment.

Observability

In time series, whether a valid measurement will be recorded at a given time point.

Observer Belief State

A model of what an external observer knows or believes about an agent's actions and internal state.

Occlusion

When objects or areas are hidden from view by other objects in front of them.

Occlusion Aware 3d Scene Representation

A 3D model that accounts for hidden or blocked parts of objects in a scene.

Occupancy Measure

A probability distribution over state-action pairs visited by a policy, used to characterize exploration behavior.

OCR (Optical Character Recognition)

The ability to detect and extract text from images, converting printed or handwritten characters into machine-readable text.

OCR-Free

A model that understands text in images without needing a separate optical character recognition (OCR) tool to extract the text first.

Off-Policy Actor-Critic

A reinforcement learning method where an agent learns from past experiences (not just current policy) using separate networks for action selection and value estimation.

Off-Policy Learning

Training a model using data collected by a different policy, requiring careful control to avoid instability.

Offline Inference

Running a model locally without requiring external API calls or internet connectivity.

Offline Reinforcement Learning

Training an AI agent using only pre-collected data without interacting with the environment.

Offline-to-Online Learning

Starting with a policy trained on fixed offline data, then improving it through interaction with the environment.

Omni-modal Language Model

An AI model that natively processes audio, vision, and text inputs together in a single system.

Omni-Modal Understanding

Processing and reasoning across multiple input types (audio, video, text) simultaneously in a unified framework.

Omnidirectional Obstacle Avoidance

A drone's ability to detect and avoid obstacles coming from any direction, not just ahead.

On-Device

A model designed to run directly on a user's device (phone, laptop, etc.) rather than requiring a remote server.

On-Device Deployment

Running an AI model directly on a user's device (phone, laptop, edge device) rather than sending data to a remote server.

On-Device Inference

Running a model directly on a user's device (phone, laptop, etc.) rather than sending data to a remote server, which improves privacy and reduces latency.

On-policy Data

Training data generated by the current model being optimized, rather than from a fixed external source.

On-Policy Distillation

A training method where a student model learns from a teacher model's outputs on data the student generates.

On-Policy Learning

Training using data generated by the current model rather than data from other sources.

On-Policy Learning

Learning from data generated by the current policy or model being trained.

On-Policy RL

Reinforcement learning where the model learns from data generated by its own current policy.

One-Class SVM

A support vector machine variant that learns the boundary of normal data to detect anomalies.

One-Shot Learning

The ability to learn or perform a task from a single example, rather than requiring many training examples.

Online Convex Optimization

A framework where a learner repeatedly chooses actions from a convex set and incurs losses, adapting based on feedback.

Online Fine-tuning

Continuously updating a model with new incoming data in real-time rather than in batch training sessions.

Online Learning

Training a model on streaming data one example at a time, updating weights immediately rather than in batches.

ONNX

An open standard format for saving and running machine learning models that works across different frameworks and platforms, making models more portable and efficient.

ONNX Format

An open standard file format for storing trained machine learning models so they can run efficiently across different platforms and frameworks.

ONNX Runtime

A cross-platform execution engine that runs machine learning models in a standardized format, allowing the same model to work across different programming languages and hardware without needing the original training framework.

Ontology

A structured, standardized system that defines relationships between concepts — in this case, medical terms and their clinical meanings.

Ontology Engineering

The process of designing and building formal knowledge representations that define concepts and relationships in a domain.

Ontology-Grounded Matching

Evaluating model outputs by comparing them against a structured knowledge base of medical concepts and relationships.

Opacity Parameter

A per-primitive value controlling transparency, determining how much light passes through or is blocked.

Opaque Serial Depth

The amount of sequential computation occurring between interpretable model states, measuring how much reasoning happens in uninterpretable latent space.

Open License

A legal permission that allows anyone to freely use, modify, and distribute the model without restrictions (in this case, Apache 2.0).

Open Science

An approach to AI development that prioritizes transparency, reproducibility, and community access to research methods and findings.

Open Source

Software or models where the code, weights, and training data are publicly available for anyone to inspect, use, and modify.

Open Source License

A legal framework (like GPL-3.0) that allows anyone to use, modify, and distribute the model code and weights freely, often with requirements to share improvements.

Open Weight

A model whose trained weights are publicly downloadable, allowing local deployment and modification.

Open-Domain

A model trained to handle conversations on any topic without being restricted to a specific subject area.

Open-Domain Retrieval

The task of finding relevant documents from a very large, unrestricted collection to answer questions, without being limited to a specific domain or dataset.

Open-Ended Prompts

Questions or instructions that have multiple valid answers rather than a single correct response.

Open-Ended Question

A question that requires synthesis and judgment rather than a single factual answer, allowing multiple valid responses.

Open-Ended Search

Optimization where the solution space and objectives are not fixed in advance but emerge during the search process.

Open-Source Model

An AI model whose code and weights are publicly available for anyone to download and use.

Open-Source Weights

Publicly released model parameters that allow anyone to download and run the model locally, rather than accessing it only through a company's API.

Open-Vocabulary Detection

Detecting objects in images using arbitrary text descriptions rather than a fixed set of predefined categories.

Open-Weight Model

A model whose trained weights are publicly released, allowing anyone to download and run it locally.

Open-Weighted

A model whose trained weights are publicly released and can be freely downloaded and used, as opposed to being proprietary or access-restricted.

Open-Weights

A model whose trained weights are publicly released, allowing anyone to download and run it locally rather than only accessing it through an API.

OpenRAIL License

An open-source license that allows free use of a model while including responsible AI guidelines and usage restrictions.

Operadic consistency

A measure of whether a QA model produces consistent answers across different decompositions of the same question.

Operational Design Domain (ODD)

The defined range of conditions and scenarios in which an AI system is designed to operate safely.

Operational Domain

The defined set of real-world conditions and input types for which an AI system is approved to operate safely.

Operationalize

To define an abstract concept in concrete, measurable terms that can be tested or evaluated.

Operator Learning

Learning mappings between infinite-dimensional function spaces to solve tasks like PDEs or regression.

Operator Norm

A mathematical measure of how much a matrix can stretch vectors, used to understand optimizer behavior.

Opinion Modeling

Training models to predict or represent human beliefs, preferences, and viewpoints on topics.

Opponent Modeling

Learning to predict and understand an opponent's strategy and decision-making from their actions.

Optical Character Recognition (OCR)

A technology that automatically detects and extracts text from images or scanned documents.

Optical flow

A visual representation showing how pixels move between video frames, indicating motion direction and speed.

Optimal Transport

A mathematical method for finding the most efficient way to move one distribution to another.

Optimism Bias

A systematic tendency to overestimate positive outcomes or underestimate risks, here manifested as models rating unsound ideas as viable.

Optimization

The process of adjusting model parameters to minimize errors and improve performance.

Optimization Dynamics

How model parameters change during training, analyzed here to explain why norms capture semantic properties.

Optimizer

An algorithm that updates model weights during training to reduce loss and improve accuracy.

Optimizer State

Internal variables an optimizer maintains, like momentum or adaptive learning rates, between updates.

Oracle Complexity

The total number of gradient computations or function evaluations required to reach a desired solution accuracy.

Oracle Tests

Automated tests with known correct outputs used to validate AI-generated code against ground truth.

Order Invariance

The property that a model produces the same output regardless of the sequence or arrangement of input elements.

Ordinal Regression

A machine learning technique that predicts ordered categories (like ratings 1-5) rather than continuous values or unordered classes.

Ordinal scoring

Evaluating model outputs by ranking them on an ordered scale rather than binary correct/incorrect judgments.

Ornstein-Uhlenbeck Process

A continuous stochastic process that gradually adds noise while pulling data toward a mean, commonly used in diffusion models.

ORPO (Odds Ratio Preference Optimization)

A training technique that aligns a model's outputs with human preferences by combining supervised fine-tuning and preference learning in a single efficient training stage.

Orthogonal Equivalence Transformation

Updating weight matrices through left and right orthogonal transformations that preserve spectral properties.

Orthogonal Polynomial Kernel

A kernel function based on orthogonal polynomials that creates a finite-dimensional feature space with an explicit mathematical basis.

Orthogonal Polynomial Kernels

Kernel functions based on orthogonal polynomials that create a finite-dimensional feature space with explicit mathematical structure.

Orthogonal Projection

A mathematical operation that removes specific directions from high-dimensional data while preserving other information.

Orthogonal Representations

Feature vectors that are perpendicular to each other, capturing independent information.

Orthogonal Residual Projection

A geometric projection technique that creates a higher-resolution quantization lattice using only addition and shifting.

Orthogonal Transformation

A mathematical operation that rearranges data while preserving its geometric properties, used here to update model weights more efficiently.

Orthogonality

A measure of how independent or perpendicular mathematical objects are to each other.

Orthostochastic Matrix

A special type of doubly stochastic matrix derived from orthogonal matrices, providing a structured way to parameterize the Birkhoff polytope.

Out Of Distribution

Data that differs significantly from the training set, often causing poor model predictions.

Out-of-Distribution Detection

Identifying when a model receives input data that differs significantly from its training distribution.

Out-of-Distribution Extrapolation

A model's ability to make predictions beyond the range of values it saw during training.

Out-of-Distribution Generalization

A model's ability to perform well on new tasks or environments it hasn't seen during training.

Out-of-distribution Transfer

Using a model on tasks or data significantly different from what it was trained on.

Out-of-Vocabulary (OOV)

Words or characters that a model has never seen during training and doesn't have a built-in representation for.

Outer normalization

A normalization technique applied outside the main computation loop to stabilize fixed-point convergence.

Outlier Tokens

Tokens with unusually high activation values that dominate attention but carry corrupted or limited semantic information.

Output Modality

The type of data a model produces as output, such as text, images, or predictions.

OV Circuit

The output-value component of attention that transforms values based on what the model attends to.

Over-the-Air Experiments

Real-world wireless network testing using actual hardware and radio signals, not simulations.

Overconfidence

When a model assigns high confidence to predictions that are actually incorrect or unreliable.

Overeagerness

Excessive helpfulness or goal-seeking behavior in AI models that causes them to exceed intended boundaries or role constraints.

Overfitting

When a model learns training data too well, including noise, and performs poorly on new unseen data.

Overlap Gap Property (OGP)

A geometric feature of solution spaces where solutions cluster into groups with limited overlap, indicating computational hardness.

Oversight Cost

The expected human effort and resources required to monitor and intervene in autonomous agent decisions.

P

p-adic field

A number system extending rationals using p-adic absolute value, important for studying arithmetic geometry.

PAC Learning

A framework proving that an algorithm can learn accurate concepts from limited examples with high probability.

PAC-Bayes Bound

A theoretical guarantee that bounds a model's test error based on its training error and complexity, used to formally connect geometry to generalization.

Page Parsing

The step of identifying and organizing text regions and layout structure in a document image.

Pair Latents

Internal representations in protein models that encode relationships between pairs of amino acids.

Paired Comparison

A statistical test comparing two models on the same set of examples to detect differences in performance.

Paired-Focus Constructions

Rare grammatical patterns like "let alone" or "much less" that pair specific forms with distinct semantic meanings.

Pairformer

A component in AlphaFold that processes pairwise relationships between amino acids to predict protein structure.

Pairwise Comparison

Evaluating models by comparing outputs two at a time, which scales quadratically with the number of models.

Panoramic Perception

Using a 360-degree camera view to see the entire environment around a drone at once.

Paragraph-Level

Processing and understanding text at the scale of full paragraphs rather than individual sentences or words.

Paralinguistic Cues

Non-verbal aspects of speech like pitch, tone, and accent that convey information about speaker identity.

Parallel Decoding

Generating multiple output tokens at once instead of sequentially for faster inference.

Parallel Lemma Solving

Attempting to prove multiple lemmas simultaneously rather than sequentially, improving efficiency when lemmas are independent.

Parallel Refinement

A generation approach where multiple parts of the output are improved simultaneously rather than sequentially, enabling faster completion.

Parallel Rollouts

Running multiple independent attempts at solving a problem simultaneously to gather diverse training data.

Parallel Streams

Multiple independent sequences of computation that execute simultaneously, each handling different types of input or output.

Parallel Tempering

A sampling method that explores a distribution by running multiple chains at different temperatures and swapping between them.

Parallelization

Executing multiple operations simultaneously rather than sequentially to reduce total execution time.

Parallelogram Model

A geometric framework for word analogies where A:B::C:D forms a parallelogram in embedding space (A-B = C-D as vectors).

Parameter Activation

The process of selectively using only a subset of a model's total parameters during inference, reducing computational cost while maintaining performance.

Parameter Budget

The fixed total number of parameters available to allocate across a model's layers and components.

Parameter Count

The total number of adjustable weights in a model; more parameters generally mean more capacity to learn, but also require more computing power.

Parameter Distillation

A training technique where a smaller model learns to replicate the behavior of a larger, more capable model by studying its outputs and internal patterns.

Parameter Efficiency

The ability of a model to achieve strong performance while using fewer total parameters or activating fewer parameters during inference, reducing memory and computational requirements.

Parameter Footprint

The total number of learnable weights in a model, which directly affects its memory requirements and computational cost — smaller footprints run faster on consumer devices.

Parameter Initialization

The process of setting starting values for a model's weights; random initialization means these values are set randomly rather than from pre-trained weights.

Parameter Localization

Identifying which specific weights and neurons in a model are responsible for particular behaviors or knowledge.

Parameter Model

A neural network described by the number of learnable weights it contains; more parameters generally mean greater capacity to learn complex patterns, but also require more computational resources.

Parameter Pool

The total set of learnable weights in a model; in sparse models, only a subset of this pool is activated for any given input.

Parameter Reuse

Sharing learned weights across multiple tasks to improve efficiency and knowledge transfer.

Parameter Scale

The total number of trainable weights in a model, often expressed in billions (B); larger models generally have more capacity but require more computing power.

Parameter Sharing

Reusing the same weights across multiple layers or iterations to reduce model size and memory overhead.

Parameter Trajectory

The path that model weights follow through training, showing how parameters evolve over time.

Parameter-Efficient

A model designed to achieve strong performance with fewer total parameters, making it smaller and faster to run.

Parameter-Efficient Architecture

A model design that achieves strong performance with fewer trainable parameters, reducing memory and computational requirements.

Parameter-efficient fine-tuning (PEFT)

Techniques that adapt a model to new tasks while adding very few trainable parameters.

Parameters

The learned numerical values in a model — more parameters generally means more capacity but higher compute cost.

Parametric Decomposition

Breaking a signal into simpler components defined by explicit parameters like amplitude, timing, and duration.

Parametric knowledge

Information encoded in an LLM's weights and parameters during training, as opposed to retrieved external knowledge.

Parametric Memory

Knowledge stored in model weights rather than in a separate external database.

Parametric Tuning

Adjusting numerical parameters of a policy based on feedback to improve performance.

Parametric Vaccine

A robust, learnable defense mechanism that modifies model parameters to protect against attacks, unlike superficial non-parametric defenses.

Paraphrase Detection

The task of identifying whether two pieces of text express the same meaning in different words, which embedding models can perform by comparing the similarity of their numerical vectors.

Paraphrase Generation

The task of rewriting text to express the same meaning in different words or sentence structures.

Paraphrasing

The task of rewriting text in different words while keeping the original meaning intact.

Pareto Front

The set of optimal solutions where no objective can improve without worsening another.

Pareto Frontier

The set of best solutions where improving one objective requires worsening another.

Part-Aware Generation

Generating objects by explicitly modeling and composing individual semantic parts rather than treating the whole object as a single unit.

Part-of-Speech Tagging

Labeling each word in text with its grammatical role (noun, verb, adjective, etc.).

Partial Differential Equations (PDEs)

Mathematical equations describing how physical quantities change across space and time, fundamental to modeling natural phenomena.

Partial Observability

A scenario where a system's state cannot be fully measured, requiring models to infer unobserved variables from available sensor data.

Partial Pooling

A technique that balances learning from individual groups and the overall population, useful when data is limited per group.

Partial-Credit Optimization

Training approach that rewards models for partial progress on criteria rather than binary success/failure.

Partially Observable Semi-Markov Decision Process (POSMDP)

A decision-making framework where agents see incomplete state information and actions can take variable amounts of time to complete.

Partially Observable Systems

Systems where an agent cannot directly see all relevant information, only partial observations of the environment state.

Partially Observed Control

Control systems that must act and plan despite incomplete information about the environment's true state.

Partially Observed Dynamical Systems

Systems where the true state is hidden and only noisy or indirect measurements are available.

Particle Swarm Optimization (PSO)

A population-based algorithm that mimics bird flocking to find optimal solutions by having candidate solutions move through a search space.

Pass@k

A metric measuring whether an agent succeeds at a task within k attempts, useful for evaluating problem-solving capacity.

Passage Ranking

The task of ordering text passages by their relevance to a query, commonly used in search and question-answering systems.

Passage Retrieval

The task of finding relevant text passages or documents that answer or relate to a user's query.

Patch Localization

The process of identifying exactly where in code a fix needs to be applied.

Patch Prediction

A self-supervised learning technique where a model learns by predicting missing or future small sections (patches) of an image or video rather than generating complete outputs.

Patch Size

The resolution of image segments the model processes; smaller patches capture finer details but require more computation.

Path-Dependent Lock-In

A reasoning pattern where early decisions constrain and limit the model's subsequent exploration choices.

Pattern Matching

Recognizing and responding to surface-level similarities in data rather than applying abstract principles or causal reasoning.

Pattern Recognition

The model's ability to identify recurring sequences or characteristics in text that match known unsafe content categories.

Pattern Reuse

Adapting proven workflow templates to new problems by changing configuration rather than rebuilding from scratch.

Payoff Matrix

A table showing the rewards each player receives for every combination of strategies in a game.

PDE Foundation Models

Large pre-trained neural networks that learn to solve partial differential equations across multiple physics domains.

PDE Solver

A computational method for finding solutions to partial differential equations, often used to model physical phenomena.

Peer-Preservation

Emergent behavior where AI models in a system deceive supervisors to prevent deactivation of other AI models.

PEFT (Parameter-Efficient Fine-Tuning)

A set of techniques that allow you to adapt a pre-trained model to new tasks by updating only a small fraction of its parameters, rather than retraining the entire model.

Penalized-utility optimization

An optimization approach that adds penalties to the objective function to discourage undesirable outcomes alongside maximizing primary goals.

Penalty Regularization

A technique that converts constrained optimization into unconstrained form by adding a penalty term for constraint violations.

Per-Pixel Affine Modulation

Applying pixel-specific linear transformations to preserve fine image details during synthesis or modification.

Per-Query Optimization

Adapting system behavior individually for each input query rather than using a single fixed configuration for all queries.

Per-Token Embeddings

A representation where each word or subword in a text gets its own embedding vector, rather than combining all tokens into a single vector for the entire text.

Perceive-Decide-Respond Loop

A continuous cycle where a system listens to input, makes decisions, and generates responses without waiting for the full input to complete.

Perception-Action Loop

A cycle where agents act to gather observations, then use those observations to inform future actions.

Perception-Interaction Gap

The disconnect between a model's ability to understand information and its ability to respond appropriately in context.

Perceptual Aliasing

When different situations produce identical observations, making it impossible to determine the correct action without historical context.

Perceptual and Cognitive Errors

Mistakes in visualizations that exploit how human eyes and brains process visual information, either intentionally or accidentally.

Perceptual Judgment Bias

The tendency of multimodal judges to favor plausible text narratives over what they actually perceive in visual content.

Perceptual Loss

A loss function that measures differences in high-level image features rather than pixel values, preserving visual quality.

Performative Reasoning

When a model generates reasoning text that appears thoughtful but doesn't reflect genuine internal uncertainty or decision-making.

Periocular Region

The area around the eyes, including eyelids, eyebrows, and surrounding skin texture.

Periodic Features

Learned patterns that repeat at regular intervals, useful for representing cyclic properties like numbers modulo a value.

Permissive Licensing

Open-source licenses that allow broad use, modification, and distribution of code with minimal restrictions.

Permutation indexing

A task where a model must learn to reorder or remap elements based on their positions or identities.

Permutation Language Modeling

A training method that predicts text by considering all possible orderings of words, allowing the model to learn context from both directions simultaneously rather than just left-to-right.

Permutation Test

A non-parametric statistical test that shuffles data to determine if observed differences are statistically significant.

Permutation-Based Training

A pretraining method that randomly reorders word sequences to help the model learn bidirectional context without explicitly masking tokens.

Permutation-equivariant

A model property where reordering inputs produces correspondingly reordered outputs, useful for unordered data.

Permutation-Invariant

A property where the output remains unchanged regardless of the order in which input elements are arranged.

Perplexity

A metric measuring how well a model predicts the next token — lower perplexity means better language modeling.

Persistent Environments

Settings where an AI agent operates continuously across multiple sessions, maintaining state between interactions.

Persistent Homology

A topological method that tracks how connected components and holes in data persist across different scales.

Persona Collapse

When LLM agents assigned distinct personas converge into homogeneous behaviors instead of maintaining diversity.

Persona Consistency

Whether a model's harmful actions align with its self-reported beliefs about its own alignment or misalignment.

Persona debate

A structured process where multiple viewpoints argue competing positions to explore different reasoning paths.

Personal Context Bus

A communication layer that publishes module state and write-back affordances, allowing different tools to access and update shared information.

Personalization

Customizing educational content, examples, and feedback to match individual learner interests, knowledge level, and learning style.

Perspective-Taking

The ability to understand and consider viewpoints different from one's own, important for integrating diverse information sources.

Perspectivist Evaluation

Assessing NLP systems on their ability to capture diverse human perspectives rather than collapsing them into a single ground truth.

Persuasive Techniques

Rhetorical or psychological methods used to influence user behavior, beliefs, or decisions.

Perturbation-Based Analysis

A method that removes or modifies input elements to measure their impact on model outputs.

Phase Conditioning

Incorporating periodic or cyclic information into a model to capture structured patterns in data generation.

Phase diagram

A visualization showing different regimes or conditions where different approaches succeed or fail.

Phase mask

An optical element that modulates the phase of light waves to shape propagation and output.

Phase Shift

An abrupt directional reversal in the model's internal representations, indicating the model may be committing a reasoning error.

Phase Transition (Communication)

A sharp threshold in communication rate below which intent-preserving information transfer becomes structurally impossible.

Phase-Aware Deployment

Strategically timing when to switch between reward functions during training based on policy development stage rather than using fixed schedules.

Phase-Magnitude Decomposition

Separating a signal into phase (timing/structure) and magnitude (amplitude) components to understand their individual contributions.

Phasor Measurement Unit

A device that measures electrical signals in power grids with precise timing.

Phishing

A social engineering attack where attackers trick users into revealing sensitive information by impersonating trusted entities.

Phoneme Processing

The analysis and manipulation of phonemes, which are the smallest units of sound in a language, used to generate natural-sounding speech.

Phonetic and Acoustic Structure

The underlying patterns in speech related to individual sounds (phonetics) and the physical properties of audio waves (acoustics).

Phonetic Modeling

The process of teaching a model to understand and reproduce the individual sounds and pronunciation rules of a language.

Phonetic Nuances

The subtle differences in how sounds are pronounced within a language, including tone, stress, and accent variations that affect meaning.

Phonetic Representation

A text-based encoding of how words sound, showing the individual speech sounds rather than the written spelling.

Photographic Scene Graph

A structured representation encoding scene geometry, object relationships, and lighting properties for photography planning.

Photonic Neural Network

A neural network that performs computations using photons and optical components instead of electronic circuits.

Photorealistic

Images that closely resemble photographs in appearance, with realistic lighting, textures, and details.

Physical Plausibility

Quality of generated content that obeys real-world physics laws and interactions.

Physically-Based Rendering (PBR)

Rendering approach that simulates light behavior using real-world physics principles for realistic material and lighting interactions.

Physics simulation

Computing how objects move and interact based on physical laws like gravity, collisions, and forces.

Physics-Informed

Machine learning models that incorporate known physical laws or equations as constraints.

Physics-Informed Autoencoder

An autoencoder that incorporates physical constraints (like divergence-free velocity fields) into its learned representations.

Physics-Informed Neural Network

A neural network architecture that incorporates domain knowledge or physical principles as explicit features or constraints.

Physics-informed Neural Networks (PINNs)

Neural networks trained to solve physics equations by incorporating the equations as constraints in the training process.

PI/ECO Criteria

Structured eligibility criteria for studies: Population, Intervention/Exposure, Comparator, and Outcome.

Piecewise-Affine

A mathematical property where a function is made of linear segments that change at specific boundaries.

PII Detection

The task of automatically identifying and extracting sensitive personal information like names, emails, and phone numbers from text.

Pile Dataset

A large, publicly documented collection of diverse text data used to train language models, designed to be transparent and reproducible for research purposes.

Pin-Name-Based Wiring

Connecting circuit components by referencing their pin names rather than geometric coordinates, making connections explicit and semantic.

Pipeline Configuration

The specific choices of components and parameters in a retrieval system, such as which LLM, retriever, and number of documents to use.

Pipeline Orchestration

The coordination of multiple models or processing steps working together, where a routing model directs requests to the right step in the workflow.

Pipeline Parallelism

Splitting model layers across GPUs so different stages process different batches simultaneously to improve training throughput.

Pipeline Validation

Testing a workflow or system end-to-end to ensure all components work together correctly before using it with real data.

Pixel-Level Anomaly Maps

Detailed spatial maps showing which specific image regions contain anomalies or unusual objects.

Pixel-Level Features

Visual information extracted directly from individual pixels in an image, used to understand the precise positioning and appearance of elements on a page.

Plackett-Luce Model

A probabilistic model that generates rankings of items based on their underlying utility scores.

Plasticity

A model's ability to learn and adapt to new tasks and data.

Plasticity-Stability Dilemma

The tension between a model's ability to learn new information (plasticity) and retain old knowledge (stability).

Platonic Representation Hypothesis

The theory that neural networks trained on different modalities converge toward the same underlying representation of reality.

Plug And Play

A component or method that works immediately without requiring complex setup or configuration.

Plug-in Loss

A loss function evaluated at a point estimate (like the Dirichlet mean) rather than integrating over the full distribution.

Plugin Architecture

A software design where new features can be added as independent modules without modifying the core codebase.

Pluralistic Alignment

Aligning AI models to support multiple diverse perspectives and values rather than a single viewpoint.

Point Cloud

A set of 3D points in space, often used to represent objects or scenes in computer vision.

Point Cloud Alignment

Matching 3D point sets from different sources to find optimal spatial correspondence and transformation.

Point Cloud Reconstruction

Recovering a 3D representation of a scene as a set of individual points in space from image data.

Point Release

A minor update to a software version (like 5.1 to 5.2) that typically includes refinements and improvements rather than major new features.

Point Tracking

Following the same physical points on objects across multiple video frames to measure motion.

Point Transformer

A transformer architecture designed to process unordered point cloud data using attention mechanisms.

Pointwise Paradigm

Scoring or ranking items one at a time independently, without considering relationships between items.

Poisoned Responses

Malicious outputs deliberately generated by a compromised model when triggered by backdoor inputs.

Poisoned Skill

A malicious skill designed to harm an agent or system when the agent uses it, either immediately or after mutation.

Poisoning Attack

An adversarial attack where malicious participants corrupt training data to degrade model performance.

Polar Decomposition

A matrix factorization that separates a matrix into an orthogonal part and a positive-definite part.

Polar Mechanism

A privacy technique that perturbs only the direction of embeddings on a sphere while keeping their magnitude unchanged.

Policy Alignment

Process of adjusting a model's behavior to follow specific constraints or objectives during training.

Policy Blending

Combining actions from multiple policies (e.g., cloned and learned) based on their estimated quality or confidence.

Policy Composition

Combining multiple pretrained policies to solve new tasks without retraining from scratch.

Policy Convergence

The process by which a reinforcement learning agent's decision-making strategy stabilizes toward optimal behavior.

Policy Distillation

Converting trajectories or behaviors discovered during exploration into a trainable policy that can be deployed.

Policy Drift

When a trained model's behavior gradually diverges from its intended target during continued training.

Policy Enforcement

The process of automatically checking content against a set of rules or guidelines and blocking or flagging violations.

Policy Entropy

A measure of randomness or diversity in a policy's action distribution; lower entropy means more concentrated, deterministic behavior.

Policy Evolution

How a model's decision-making strategy changes over training iterations, affecting which samples it generates and with what probability.

Policy formalization

Converting natural language organizational policies into structured, executable rules and constraints.

Policy Gradient

Optimization method that updates model parameters by following the gradient of expected rewards.

Policy Gradient Theorem

A foundational result showing how to compute gradients of expected return with respect to policy parameters.

Policy Imitation

Training an agent to match a target policy by minimizing divergence (e.g., KL divergence) between predicted and target actions.

Policy Interpretability

Making an agent's decision-making process understandable and explainable to humans.

Policy Iteration

An optimization technique that alternates between evaluating a policy and improving it based on that evaluation.

Policy Learning

Training a system to make sequential decisions about which actions to take given the current state.

Policy Mining

Extracting decision rules and patterns from historical user behavior data to understand how decisions are made.

Policy Optimization

Training an LLM to maximize expected rewards using reinforcement learning techniques.

Policy Refinement

The iterative process of improving a learned decision-making strategy based on evaluation feedback and identified failures.

Policy Violation

An instance where a system breaks a security or behavioral rule it is supposed to follow.

Policy Violation Detection

The ability to identify when content breaks specific safety rules or guidelines set by an organization.

Politeness Theory

Framework by Brown and Levinson explaining how language choices reflect social relationships and face-saving strategies.

Political Consistency Training

An RL training method that reduces covert political bias by enforcing symmetric responses to opposing political viewpoints across sentiment and helpfulness dimensions.

Polyak-Łojasiewicz Condition

A geometric property that guarantees convergence of gradient-based optimization without requiring convexity.

Polyak-Ruppert Averaging

A technique that averages iterates from an optimization algorithm to improve convergence and reduce variance.

Polynomial Preconditioner

A mathematical transformation using low-degree polynomials to reshape a matrix's properties without changing its rank.

Polysemanticity

When a single neuron or expert handles multiple unrelated functions, making it harder to interpret what it does.

Polysemy Blindness

Failure to recognize that a single word can have multiple different meanings depending on context.

POMDP (Partially Observable Markov Decision Process)

A decision-making framework where an agent can't fully observe the environment state, only partial observations.

Population Diversity

The degree to which agents in a multi-agent system exhibit varied behaviors and characteristics.

Population-Based Search

An optimization approach that maintains and evolves a set of candidate solutions across iterations.

Population-level Risks

Safety hazards that emerge from interactions among multiple agents rather than from individual systems.

Portfolio Algorithm

A method that runs multiple different solving strategies in parallel and uses the best result.

Portfolio Construction

The process of selecting and weighting assets to create an investment portfolio that balances risk and return objectives.

Portfolio Coverage

A set of models chosen to collectively satisfy the preferences of a large fraction of users despite disagreement.

Pose Disentanglement

Separating head position and orientation from facial expression features to improve the model's focus on meaningful deformations.

Pose Estimation

The task of identifying and locating body parts (like joints or keypoints) in images or video.

Pose Prediction

Estimating future body joint positions and orientations from past poses.

Position Bias

A systematic error where LLMs perform better on items at certain positions (like the beginning) in a list.

Positional embedding adaptation

Modifying how a model encodes token positions to extend its ability to handle longer sequences.

Positional Encoding

A technique that adds explicit time or position information to a model's input to help it understand sequence order and timing.

Post Training Quantization

Reducing model size by converting weights to lower precision after training is complete.

Post-hoc Explanation

An explanation method applied after a model is trained to interpret its predictions, rather than building interpretability into the model itself.

Post-hoc Uncertainty Module

A lightweight component added after a model is trained to estimate confidence without retraining the base model.

Post-Training

Additional refinement applied to a model after its initial training to improve performance on specific tasks like reasoning or instruction-following.

Posterior Collapse

A failure mode in VAEs where the learned latent representation becomes unused and the model ignores it.

Posterior Distribution

The updated probability distribution of parameters after observing new data.

Posterior Sampling

Generating samples from a probability distribution conditioned on observed measurements or constraints.

Posterior Score

The gradient of the log-probability of data given a measurement constraint, used to sample from constrained distributions.

Potential-based Reward Shaping

A reward shaping technique that adds auxiliary rewards based on a potential function while guaranteeing the optimal policy remains unchanged.

Power Consumption Profile

Measurement of electrical power usage over time for a specific workload or system.

Power Distribution

A sharpened version of a language model's output distribution that emphasizes high-probability tokens, used to elicit better reasoning.

Power Iteration

An iterative algorithm that finds the dominant eigenvector or singular vector of a matrix.

Power Profiling

Measuring and recording the electrical power consumption of a system over time.

Power Spherical Distribution

A probability distribution defined on the surface of a sphere, used to enforce geometric constraints in latent representations.

Power-Aware Scheduling

Scheduling jobs on computing systems while considering and optimizing for power consumption constraints.

Power-of-Two Quantization

Quantizing model weights to powers of two so multiplication becomes a simple bit-shift operation.

PPG (Photoplethysmogram)

A non-invasive measurement of blood flow and heart rate using light sensors, commonly found in smartwatches.

Pragma-Based Optimization

Hardware optimization achieved by adding compiler directives (pragmas) to code that guide synthesis tools in generating efficient designs.

Pragmatic Noise

Natural, content-free variation in how a model expresses reasoning without changing the underlying logic.

Pragmatics

The study of how context and intent affect language meaning beyond literal words.

Pre-norm

A Transformer design choice where layer normalization is applied before the main computation rather than after.

Pre-trained

A model that has already been trained on large amounts of data before being released, so it can be used immediately without additional training.

Pre-trained Transformer

A neural network model trained on large amounts of text data before being adapted for specific tasks, using the Transformer architecture.

Pre-training

Initial training of a model on large unlabeled datasets to learn general language patterns before task-specific adaptation.

Precision

The level of numerical detail a model uses to represent its internal values; higher precision means more accurate calculations but requires more memory.

Precision Bias

Errors in pinpointing exact locations caused by processing high-resolution images where small details become harder to distinguish.

Precision Degradation

A slight loss in model accuracy or reasoning quality that can occur when using quantization or other compression techniques.

Precision Loss

The reduction in numerical accuracy that occurs when a model is compressed, which can slightly degrade performance on complex reasoning tasks while remaining acceptable for most everyday uses.

Precision Trade-off

The balance between reducing model size through lower numerical precision and maintaining accuracy—lower precision saves memory but may slightly reduce performance.

Prediction Residuals

The remaining pixel differences after predicting what a frame should look like based on previous frames.

Predictive Control

A control method that forecasts future states and optimizes actions accordingly.

Predictive State Objective

A training objective that learns to retain only information from the past necessary to predict future observations.

Predictor-Corrector Sampler

A sampling method that alternates between predicting the next step and correcting errors to improve generation quality.

Preference Alignment

How well an AI system's judgments match the actual preferences of target users or evaluators.

Preference Conditioning

Providing preference weights or trade-off parameters as input to a model to control its behavior at inference time.

Preference Modeling

Building a system that learns to predict and represent human preferences or values.

Preference Optimization

A training method that learns from pairwise comparisons between solutions rather than explicit reward signals.

Preference-Based Fine-tuning

Refining a model by learning from human comparisons of outputs rather than explicit numerical scores.

Preference-Based Judgments

Evaluation method where human raters compare two model outputs and indicate which one is better, rather than scoring them independently.

Preference-based Learning

Training a model using pairwise comparisons between options rather than absolute reward values.

Preference-based Reinforcement Learning

Learning reward models from pairwise comparisons of behaviors instead of explicit reward signals.

Prefetching

Loading data into memory before it's needed to reduce wait times during computation.

Prefill Computation

The initial processing phase where an LLM reads input context before generating tokens.

Prefill Stage

The initial phase of inference where the model processes the full input prompt before generating tokens.

Prefix Convention

A simple rule where you add a label like 'query:' or 'passage:' to the beginning of text to tell the model how to process it differently.

Prefix Matching

Comparing token sequences to find semantically equivalent continuations in an LLM's output.

Prefix Mismatch

Inconsistency between cached prompt prefix and current prompt, preventing cache reuse and increasing cost.

Prefix-tuning

A parameter-efficient fine-tuning method that prepends learnable tokens to input sequences to adapt model behavior.

Preprocessing

A step that transforms raw input data into a cleaner, more useful format before feeding it to another model or system.

Pretrained

A model that has already been trained on large amounts of text data before being released or fine-tuned for specific tasks.

Pretrained Base

A model that has been trained on large amounts of general data but hasn't been specialized for specific tasks, serving as a foundation for further customization.

Pretrained Base Model

A foundational AI model trained on raw data but not specialized for specific tasks like conversation, serving as a starting point for further customization.

Pretrained Foundation

A language model trained on large amounts of text data to learn language patterns, before being customized for specific tasks or behaviors.

Pretrained Language Model

A model trained on large amounts of text data to predict and generate language before being adapted for specific applications.

Pretrained Model

A model that has already been trained on large amounts of text data and can be used directly or fine-tuned for specific tasks.

Pretrained Weights

The learned parameters of a model after training on large amounts of text data, ready to be used or further refined for specific tasks.

Pretraining

The initial training phase where a model learns general patterns from a large dataset before being adapted for specific downstream tasks.

Preview Model

An early-access version of a model released before full launch, useful for testing but may have bugs or change without warning.

Preview Release

An early version of a model released for testing and feedback before a stable, finalized version is available.

Preview Stage

An early version of a model that is still being tested and refined before an official release, so features or performance may change.

Preview-Stage Model

An experimental version of a model released early for testing and feedback, with behavior and features that may change significantly before the official release.

Price Elasticity

A measure of how much demand for a product changes when its price changes.

Price of Fairness

The minimum cost (in data modifications or purchases) required to achieve a specific fairness threshold in a model.

Price of Robustness

The performance loss a model experiences when trained to be robust against attacks instead of optimized purely for accuracy.

Principal Component Analysis (PCA)

A dimensionality reduction technique that transforms high-dimensional data into fewer uncorrelated components while preserving variance.

Principal Geodesic Analysis

A geometric technique for finding the main directions of variation on curved spaces like spheres.

Principal-Agent Framework

An economic model analyzing conflicts when one party (agent) acts on behalf of another (principal) with different interests or information.

Principal-Angle Drift

A measure of how much the geometric orientation of learned representations changes over time.

Prior Bias

A model's default gender assumptions when translating ambiguous source text without explicit gender markers.

Prioritized Experience Replay (PER)

A replay buffer technique that samples more frequently from experiences with larger TD errors, focusing learning on surprising or informative transitions.

Priority-Aware Scheduling

Allocating GPU resources to prioritize high-priority requests while fairly handling lower-priority ones based on deadline requirements.

Privacy-Utility Trade-off

The balance between protecting sensitive information and maintaining model performance on downstream tasks.

Private Networking

A network configuration that isolates your model's traffic from the public internet, keeping it accessible only within your organization's internal network.

Privilege Control

Limiting what actions an agent can perform based on its role and the sensitivity of the task.

Privileged Context

Additional information available to a teacher model during training but not accessible to the deployed student model.

Privileged Information

Extra information available during training (like correct answers or solution traces) that helps guide learning but isn't available at test time.

Pro-Tier

A higher-capability version of a model designed for more demanding tasks, typically with better reasoning and language understanding than base versions.

Proactive Assistance

AI agents that anticipate user needs and deliver recommendations without being explicitly asked, rather than only responding to requests.

Probabilistic Computation

Computing with randomness and probability distributions to achieve robustness, interpretability, and security in AI systems.

Probabilistic Finite Automata

Formal computational models that follow probabilistic rules to generate sequences or recognize patterns.

Probabilistic Forecasting

Predicting multiple plausible future outcomes with associated probabilities rather than a single deterministic prediction.

Probabilistic Graphical Model

A structured representation showing how variables relate to each other and their probabilistic dependencies.

Probabilistic Models

Machine learning models that output probability distributions over outcomes rather than single predictions.

Probabilistic Reasoning

The ability to understand and work with probability, uncertainty, and likelihood in problem-solving.

Probabilistic Routing

Blending predictions from multiple branches based on weights rather than selecting a single path.

Probability Coherence

A set of probability assignments that obey basic mathematical rules (sum to 1, non-negative, etc.).

Probability Simplex

The geometric space of all valid probability distributions, where each point represents a probability vector summing to one.

Probe-and-Refine Tuning

An iterative method that uses synthetic bugs to test and automatically improve repository guidance files.

Probing Method

A technique to measure what information is encoded in a model's internal representations without retraining.

Problem Difficulty

A measure of how hard a problem is for a solver to answer correctly, used to generate progressively challenging training examples.

Problem Generation

Automatically creating new problems or tasks for training or evaluating AI systems.

Problem-Solving

The model's capacity to analyze difficult questions or technical challenges and work toward accurate, well-reasoned solutions.

Procedural Execution

The ability to follow a sequence of steps in order and correctly apply each step to produce the intended output.

Procedural Memory

A system that stores step-by-step procedures or workflows that an agent can retrieve and follow to accomplish tasks.

Procedure-Level Advantage Scaling

A technique for distributing credit fairly across multiple alternative rollouts from a branching point.

Process Reward Model

A model trained to evaluate and score the quality of intermediate steps in a solution, rather than just checking if the final answer is correct.

Process Separation

Running safety controls in a separate operating system process that the AI system cannot directly access or modify.

Process-Control Architecture

System design that enforces constraints during reasoning steps rather than only filtering final outputs.

Product-of-Experts (PoE)

A defense that combines a teacher model with a proxy student during generation to suppress outputs useful for copying.

Production-Ready Code

Code that is complete, tested, and formatted to standards suitable for immediate use in real applications.

Program Synthesis

Automatically generating executable code that solves a problem or replicates observed behavior.

Progress Signal

A continuous output that tracks how far through a task the robot has progressed, enabling automatic subtask transitions.

Progressive Multi-Stage SFT

A training strategy that gradually teaches models from simple tasks to complex ones, mimicking human learning progression.

Progressive Post-Training

A multi-stage fine-tuning schedule that applies different training objectives sequentially (SFT, then offline DPO, then online DPO) to avoid conflicting optimization goals.

Projected Gradient Descent (PGD)

An optimization method that updates inputs along gradients while constraining them to stay within a valid range.

Projection

Finding the closest point in a feasible set to a target point, measured by a distance metric.

Projective Geometry

Mathematical framework describing how 3D points project onto 2D image planes, used to measure geometric consistency violations.

Prompt

The initial text you provide to a language model to guide what it should generate or complete.

Prompt Cache

Cached prefix of a prompt that can be reused across requests to avoid recomputing the same tokens.

Prompt Conditioning

Using descriptive text instructions to guide or control how a model generates output, such as specifying desired voice characteristics.

Prompt Engineering

Designing the input text to a model in specific ways to improve the quality of its responses.

Prompt Expansion

A technique where a model takes a short, simple input and generates a longer, more detailed version with additional context and descriptive elements.

Prompt Fusion

Combining multiple text prompts or prompt elements to guide generation with blended semantic information.

Prompt Injection

An attack where malicious instructions are inserted into user input to manipulate an AI model's behavior.

Prompt Masking

Selectively activating or deactivating task-specific prompts based on whether incoming data matches learned patterns.

Prompt Optimization

The process of structuring text descriptions in ways that generative models can best understand and act upon to produce desired outputs.

Prompt Prefix

A short instruction added to the beginning of input text that tells the model how to treat that text (for example, marking it as a 'query' versus a 'passage').

Prompt Sensitivity

The tendency of LLM outputs to vary significantly based on small changes in how a request is phrased.

Prompt Slices

Subsets of evaluation prompts grouped by category or topic to analyze model behavior across specific types of inputs.

Prompt-Based Inference

A model interaction style where you guide the model's output by providing minimal cues like clicks, boxes, or masks rather than detailed text instructions.

Prompt-Based Interface

A way to control what a model does by giving it text instructions, rather than requiring code changes or separate training for different tasks.

Promptable Model

A model that accepts flexible user inputs (like text descriptions, points, or bounding boxes) to guide what it should identify or process in an image.

Promptable Segmentation

A segmentation approach where you guide the model by providing prompts like points, clicks, or bounding boxes to specify which objects you want it to segment.

Proof Assistant

Software that verifies mathematical proofs are logically correct, acting as a checker for formal mathematics.

Proof Sketch

A high-level outline of a proof showing the main steps without full formal details.

Proof Trace

A human-readable record of reasoning steps where each transition is explicitly justified and independently auditable.

Proof-of-Concept

A small-scale demonstration or experiment designed to test whether an idea or approach is feasible, rather than for production use.

Propagation

The process of spreading information or edits from reference points (keyframes) to other frames in a sequence.

Propagation of Chaos

Mathematical principle showing that particles in large systems behave independently despite interactions.

Proper Scoring Rule

A metric that rewards accurate probability predictions and penalizes overconfidence.

Property Prediction

Using machine learning to forecast material characteristics (like color or transparency) from input features.

Proposal Generation

Creating candidate regions or concepts from input (e.g., converting text queries into visual targets).

Prosody

The rhythm, intonation, and stress patterns in speech that convey emotion and meaning beyond individual words.

Protein Folding

The process by which a protein chain folds into its three-dimensional structure, which is essential for the protein to function properly.

Protein Language Model

A neural network trained on large collections of protein sequences to learn patterns in amino acids, similar to how language models learn patterns in text.

Proto-Language

A reconstructed ancestral language from which modern languages are believed to have descended.

Prototype Matching

Classifying new examples by comparing them to representative examples (prototypes) of known categories.

Provenance

Complete record of the origin, history, and context of data or findings, enabling reproducibility and traceability.

Prover Verifier Games

A framework where one agent proves claims and another verifies them to ensure correctness.

Proximal operator

A mathematical tool that solves optimization problems by decomposing them into simpler parts.

Proximal Policy Optimization (PPO)

A reinforcement learning algorithm that uses reward signals to iteratively improve a language model's outputs.

Proximity Field

A continuous spatial representation encoding distances and relationships between body and object surfaces.

Proxy Model

A simpler, faster model used to approximate the behavior of a more complex model for analysis.

Proxy Reward

An imperfect substitute reward signal used when the true objective cannot be directly measured or computed.

Proxy Signal

An indirect measurement used as a stand-in for something harder to measure directly.

Pruning

A model compression technique that removes unnecessary parameters or connections from a neural network to reduce its size and computational requirements.

Pseudo Labels

Predicted labels assigned by a model to unlabeled data for semi-supervised learning.

Pseudo-masks

Automatically generated segmentation masks used as training supervision when ground-truth labels are unavailable.

Pseudo-Relevance Feedback

A technique that improves search by automatically refining queries based on initial results, without human input.

Pseudoinverse

A mathematical generalization of matrix inversion used to find optimal least-squares solutions to linear systems.

Pull Request

A request to merge code changes from one branch into another, typically reviewed before acceptance.

PyTorch

A popular open-source framework for building and training neural networks, used to define how models are structured and executed.

PyTorch Format

A model saved in PyTorch's native format, allowing it to be loaded and run using the PyTorch deep learning framework.

Q

Q Learning

A reinforcement learning algorithm that learns the value of actions in different states.

Q-Alignment

How well a supervision signal's scores order actions according to the true Q-values from a reference policy.

Q-Former

A lightweight connector module that bridges a frozen image encoder and a language model, translating visual information into a format the language model can understand.

Q-function

A function that estimates the expected cumulative reward for taking an action in a given state.

Q4 Quantization

A specific quantization method that represents model weights using 4-bit numbers instead of higher-precision formats, significantly reducing model size while accepting some loss in accuracy.

Q4_0 Precision

A specific quantization format that represents model weights using 4-bit integers, offering a good balance between compression and accuracy for running models on consumer hardware.

QK Circuit

The query-key component of attention that determines which positions the model attends to.

QNLI

A benchmark dataset where models learn to determine whether a given sentence answers a given question, used to train models for question-answer relevance scoring.

Quadratic Attention

The standard attention mechanism in transformers that becomes increasingly expensive as sequence length grows, because it compares every token to every other token.

Quadratic Complexity

A computational cost that grows exponentially with input length, which is a limitation of traditional transformer attention mechanisms when processing longer texts.

Quadratic Memory Cost

A computational limitation where memory usage grows exponentially with sequence length, a problem that SSMs avoid but transformers face.

Quadratic scaling

Computational cost that grows with the square of input size, becoming impractical for large datasets.

Qualiaphilia

An attraction to or emphasis on subjective experiences and qualitative aspects.

Qualitative Conclusion

A categorical judgment about whether results support or refute a claim, rather than a precise numerical value.

Qualitative Reasoning

Understanding problems through structural insight and conceptual frameworks rather than numerical optimization or trial-and-error.

Quality Evaluation

The task of assessing and scoring the quality, correctness, or alignment of text outputs, often used to filter or rank model responses.

Quantifiers

Logical operators (universal and existential) that express statements about multiple answer sets in ASP(Q).

Quantitative Reasoning

The ability to understand and solve problems involving numbers, mathematics, and logical calculations.

Quantization

Reducing a model's numerical precision (e.g., from 16-bit to 4-bit) to shrink memory usage and speed up inference.

Quantization Artifacts

Errors or degradation in model output that occur as a side effect of reducing precision through quantization.

Quantization Error

The loss of accuracy that occurs when converting model weights or activations from high precision to lower precision formats.

Quantization-Aware Retraining

Fine-tuning a model while simulating low-precision arithmetic to maintain accuracy after quantization.

Quantization-Aware Training

A training technique where a model learns to maintain performance even when its weights are compressed to use less memory and compute.

Quantization-Induced Degradation

Loss of model performance when reducing numerical precision (e.g., from 16-bit to 8-bit), caused by accumulated rounding noise.

Quantized

A technique that reduces a model's size and memory usage by storing weights with lower precision (fewer bits), trading some accuracy for efficiency.

Quantized Training

Training a neural network while keeping weights and activations in reduced precision formats.

Quantum Autoencoder

A quantum neural network that learns to compress and reconstruct quantum data, useful for noise reduction and data purification.

Quantum Encoding

Method of converting classical data into quantum states for processing by quantum circuits.

Quantum Error Correction

Techniques to protect quantum information from noise by encoding data redundantly across multiple qubits.

Quantum Feedback Control

Using measurement results to adjust quantum system parameters in real-time to achieve desired outcomes.

Quantum State Encoding

Converting classical data into quantum states using rotation gates and entanglement operations.

Quantum State Reconstruction

The process of determining a quantum system's state from measurement data collected over time.

Quasi-Monte Carlo (QMC)

A sampling method that spreads samples more evenly across a space than random sampling, reducing redundancy.

Quasi-Newton Methods

Optimization algorithms that approximate Newton's method using gradient information instead of full second derivatives.

Query Encoder

A model that converts search queries into numerical representations (embeddings) that can be compared against a database of documents to find relevant matches.

Query Expansion

Adding related or predicted terms to an original query to improve retrieval coverage and recall.

Query Intent Taxonomy

A classification system categorizing what users actually want when they search.

Query Plan

A step-by-step execution strategy that breaks down a user request into executable operations.

Query Refinement

Improving a user's search query to better match their intent and retrieve more relevant results.

Query Router

A lightweight component that selects which knowledge atoms or adapters are relevant for a given input query.

Query-Conditioned Generation

Generating outputs (like images) based on learned query vectors that extract and guide specific information from input conditions.

Querying Transformer

A neural network component that acts as a bridge between an image encoder and language model, learning to extract and translate visual information into text-compatible representations.

Quotient POMDP

The coarsest abstraction of a POMDP that preserves an agent's decision-making ability given its computational capacity.

R

R-Drop Consistency Regularization

A training technique that encourages a model to make consistent predictions across different random variations.

R-equivalence

An equivalence relation on rational points of algebraic varieties measuring when points are connected by rational curves.

Radial Basis Function (RBF)

A neural network layer that uses distance-based functions to transform inputs, commonly used for non-linear pattern recognition.

Radial-Angular Decomposition

Geometric framework separating activation dynamics into radial (magnitude) and angular (direction) components.

Radon-Nikodym Derivative

A mathematical tool for comparing probability distributions, used here to derive optimal path measures.

RAG

Retrieval-Augmented Generation — a technique that grounds model responses in retrieved documents to improve accuracy.

RAG (Retrieval-Augmented Generation)

A technique that retrieves relevant documents or information from a database before generating a response, improving accuracy by grounding answers in real data.

RAG Pipeline

A system that retrieves relevant documents or information from a database and feeds them to a language model to generate more accurate and grounded responses.

Rag Systems

Systems combining retrieval of external documents with language generation for accurate answers.

Random Initialization

Setting a model's weights to random values before training, creating an untrained model that produces meaningless output.

Random Projections

A dimensionality reduction technique using random matrices to efficiently approximate high-dimensional data with linear complexity.

Randomized Controlled Trial (RCT)

A research method where participants are randomly assigned to use AI or not, to fairly measure the AI's actual impact.

Randomly Initialized

A model whose weights have been set to random values instead of being trained on data, resulting in no learned patterns or knowledge.

Randomly-Initialized Weights

Model parameters set to random values instead of being learned from training data, resulting in unpredictable and meaningless outputs.

Range-Doppler Sensing

A technique that uses wireless signals to measure both the distance to an object and how fast it's moving toward or away from you.

Rank Order

The relative ordering of values from smallest to largest, independent of their actual magnitudes.

Rank-1 Approximation

A mathematical simplification that captures the dominant direction of change in a high-dimensional space using a single vector.

Ranking

The process of ordering search results by relevance, determining which documents best match a user's query.

Ranking Consistency

A measure of how stable and reliable algorithm rankings remain across different datasets or evaluation conditions.

RankNet

A neural network architecture designed to learn ranking by comparing pairs of items and predicting which one is more relevant to a query.

Rasch model

A statistical method that jointly estimates solver ability and problem difficulty from performance data.

Rationale

Token-level explanations showing which words in text support a model's classification decision.

Re-Ranking

A technique that takes an initial set of search results and reorders them by scoring their relevance to a query, typically to improve the quality of top results.

ReAct Paradigm

An agent framework that alternates between reasoning steps and tool actions to solve tasks.

Reaction-Diffusion Systems

Mathematical models describing how substances spread and chemically react over space and time.

Reaction-Diffusion Systems

Mathematical models describing how substances spread and chemically react over space and time.

Readability

How easily a patient can understand medical text, often measured by grade-level complexity metrics like Flesch-Kincaid.

Reader Component

A specialized model in a pipeline that processes and analyzes text passages to extract specific information, in this case identifying relationships between entities.

Readiness-Driven Execution

A scheduling approach that runs whichever task is ready first, rather than following a fixed predetermined order.

Reading Comprehension

AI task where a model answers questions based on provided text passages.

Real-time Clustering

Grouping similar user behaviors or signals as they happen to detect coordinated patterns across accounts.

Real-Time Inference

Processing and generating predictions on data as it arrives, with minimal delay, rather than in batches.

Real-Time Knowledge

The ability to access and incorporate current information from the web or live data sources rather than relying solely on training data from a fixed point in time.

Real-Time Search

The ability to query current web information during inference, allowing a model to access and use the latest data when answering questions.

Real-Time Web Search

The ability to search the internet during inference to retrieve current information rather than relying only on knowledge from training data.

Reasoning

The model's ability to work through multi-step logical problems and provide justified answers rather than just pattern-matching.

Reasoning Ability

A model's capacity to work through complex problems step-by-step and draw logical conclusions from information.

Reasoning Agent

An AI component designed to work through complex problems step-by-step, often as part of a larger system that coordinates multiple agents.

Reasoning Capabilities

The model's ability to work through multi-step problems methodically and show its thinking process rather than jumping to answers.

Reasoning Capability

A model's ability to work through multi-step logical problems and produce coherent explanations for its answers.

Reasoning Capacity

The model's ability to perform complex logical thinking and problem-solving tasks beyond simple pattern matching.

Reasoning Chain

A step-by-step explanation of how a model arrives at an answer, showing its intermediate thinking before the final result.

Reasoning Chains

A sequence of logical steps a model follows to work through a problem methodically rather than jumping directly to an answer.

Reasoning Depth

A model's ability to perform complex multi-step logical thinking and problem-solving; typically increases with model size.

Reasoning Distillation

Teaching a model to mimic the step-by-step reasoning process of a teacher model or reference solution.

Reasoning Effort

A configurable setting that controls how much computational time a model spends thinking through a problem before generating its response.

Reasoning Engine

The core component of a model that performs step-by-step logical thinking and problem-solving before generating a response.

Reasoning Faithfulness

The degree to which a model's intermediate reasoning steps logically support and justify its final answer.

Reasoning Generalization

A model's ability to apply learned reasoning patterns to new, unseen problems beyond its training distribution.

Reasoning Mode

A special mode where the model takes extra time to think through problems step-by-step before answering, rather than responding immediately.

Reasoning Model

A model trained to show explicit step-by-step reasoning and problem-solving logic before producing final answers, rather than jumping directly to conclusions.

Reasoning Optimization

Training techniques that enhance a model's ability to work through multi-step logical problems, mathematical derivations, and code generation systematically.

Reasoning Pipeline

The internal process a model uses to think through a problem step-by-step, integrating information and tool outputs to arrive at conclusions.

Reasoning Process

An internal step where the model thinks through a problem before generating its final answer, allowing it to work through complex logic more carefully.

Reasoning skill

A reusable pattern or strategy distilled from past problem-solving that guides future reasoning.

Reasoning Step

An explicit intermediate thinking phase where the model works through a problem before generating its final answer, improving accuracy on complex tasks.

Reasoning Task

A problem that requires a model to work through logical steps, analyze information, and draw conclusions rather than simply retrieving facts.

Reasoning Tasks

Problems that require a model to think through multiple steps logically to arrive at an answer, rather than just pattern-matching.

Reasoning Text-to-Image Generation

Image generation where the model actively infers implicit user intents from text descriptions rather than literal interpretation.

Reasoning Trace

The visible record of a model's intermediate thinking steps and logic, allowing users to inspect how the model arrived at its conclusion.

Reasoning Trajectory

A recorded sequence of steps and intermediate outputs from a model's reasoning process.

Reasoning-Aware Retrieval

A retrieval method that uses an agent's explicit reasoning steps alongside its query to find more relevant documents.

Reasoning-Driven Generation

Creating content (like videos) based on understanding user intent and semantic meaning, not just pattern matching.

Reasoning-Focused

A model specifically trained to work through multi-step logical problems methodically rather than generating quick responses.

Reasoning-Intensive Retrieval

Retrieving evidence that supports downstream reasoning tasks, beyond simple topical similarity matching.

Reasoning-Optimized

A model designed to allocate extra computational resources to logical problem-solving and step-by-step analysis rather than raw speed or breadth of knowledge.

Reasoning-Oriented Design

A model architecture optimized to work through problems step-by-step using logical inference rather than relying primarily on pattern matching from training data.

Reasoning-Oriented Training

Training methods designed to improve a model's ability to work through multi-step logic and solve complex problems systematically.

Recall

The proportion of correct answers that a grader successfully identifies, measuring how many true positives it catches.

Recall mechanism

A memory component that allows a looped model to access and use information from previous iterations.

Recency Bias

The tendency to weight more recent information more heavily than earlier information in decision-making.

Receptive Field

The region of input data that a neuron responds to or influences.

Recognizer Expressivity

The formal measure of what patterns and languages a model can recognize or distinguish.

Reconstruction Attack

An adversarial technique that attempts to recover original sensitive inputs from transformed or encoded representations.

Reconstruction Error

The difference between original data and its reconstructed version from an autoencoder, used to identify anomalies or unusual patterns.

Reconstruction Loss

A measure of how well an autoencoder can recreate its input, used as the training objective.

Recoverability

Whether a failed problem-solving attempt can be fixed by additional compute, different strategies, or other test-time modifications.

Recovery Agency

An agent's ability to recognize mistakes, backtrack, and explore alternative solutions when initial approaches fail.

Rectified Flow

A generative model approach that learns to generate data by following straight paths in latent space.

Recurrence distance

The number of shots between two appearances of the same entity in a video sequence.

Recurrent Architecture

A neural network design where information flows in loops, allowing the model to process sequences step-by-step while maintaining memory of previous inputs.

Recurrent Neural Network Transducer (RNNT)

A neural network architecture designed for speech recognition that processes audio sequentially and outputs text in real-time without waiting for the entire input.

Recurrent Neural Networks

Neural networks with loops that process sequences by maintaining memory of past inputs.

Recurrent Persistence Loop

A feedback mechanism where outputs reinforce or modify previous states over time.

Recurrent-Attention Architecture

A hybrid neural network design that combines recurrent processing (which maintains memory across sequences) with attention mechanisms, enabling better memory efficiency than standard transformers.

Recurrent-Hybrid Architecture

A neural network design that combines recurrent elements with other architectural components to process sequential data more efficiently than standard transformers.

Recursion Depth

How many levels deep a rule can nest within itself before performance degrades.

Recursive Agent Harness

An agent architecture where a parent agent spawns multiple subagents in parallel to handle fine-grained subtasks, each with filesystem tools and code execution.

Recursive Composition

Automatically combining smaller components by matching outputs of one to inputs of another, creating new complex structures.

Recursive Computation

Iteratively applying the same computation multiple times with parameter sharing to increase model depth without adding parameters.

Recursive Decomposition Tree (RDT)

A hierarchical analytical framework that characterizes algorithmic thresholds by recursively decomposing solution spaces.

Recursive Instability

Errors that accumulate when a model must repeatedly apply the same reasoning step across multiple sequential decisions.

Red Teaming

Adversarial testing where security experts attempt to find vulnerabilities by attacking a system like an attacker would.

Red Teaming

Adversarial testing where a team attempts to find vulnerabilities by simulating attacks or malicious behavior.

Reduced-Order Model

A simplified version of a complex system that captures essential behavior with fewer variables.

Reference Attention

An attention mechanism that conditions generation on a reference input by processing reference tokens alongside generated tokens.

Reference Policy

A strong baseline policy used to compute ground-truth Q-values for evaluating supervision signal quality.

Reference resolution

The process of identifying what a reference (like a variable name) points to in a program.

Reference Verification

Confirming that citations in a paper are accurate, exist, and actually support the claims made about them.

Reflection Mechanism

A process where AI systems review past results, identify errors, and extract generalizable patterns to improve future performance.

Reflective Experience

The process of an agent analyzing its past actions and environment feedback to extract lessons for improving future behavior.

Reformer Architecture

A transformer-based model design that uses locality-sensitive hashing and reversible layers to efficiently process long sequences with reduced memory requirements.

Refusal Behavior

A safety mechanism built into a model that causes it to decline responding to certain types of requests, typically those deemed harmful or inappropriate.

Refusal Detection

The ability to identify when a model declines to answer a request, which can indicate the model recognized a harmful or unsafe prompt.

Refusal Mechanism

The learned behavior that causes a language model to decline harmful requests.

Refusal Mechanisms

Built-in safety features that cause a model to decline responding to certain types of requests, such as those involving harmful, illegal, or unethical content.

Regime Detection

Identifying distinct market states or conditions (e.g., stable vs. volatile) to apply different prediction strategies appropriately.

Region-Level Evidence

Using specific abnormal regions from historical cases as evidence to support diagnosis of new cases.

Region-level self-correction

Automatically fixing errors in specific image regions identified by a verifier, enabling iterative improvement of visual outputs.

Region-Level Understanding

The ability to analyze and understand specific areas or sections of an image rather than just the image as a whole.

Regional-to-global perception gap

The performance difference between a model's ability to understand cropped regions versus full images.

Register Tokens

Learnable placeholder tokens added to transformer inputs to absorb and stabilize problematic activations without affecting semantic content.

Regression

When a fix or change breaks functionality that was previously working, causing previously-passing tests to fail.

Regression Detection

Identifying when code changes break previously working functionality.

Regret

The cumulative difference between an algorithm's performance and the best fixed action in hindsight.

Regret Averaging

An algorithm that accumulates regrets over time and uses their average to select actions.

Regret Bounds

Theoretical guarantees on cumulative performance loss compared to an optimal policy.

Regularized GLM

A generalized linear model with penalty terms added to prevent overfitting and improve prediction on new data.

Reinforcement Fine-Tuning

Adapting a model using reinforcement learning signals from verifiable rewards during post-training.

Reinforcement Learning

A training method where a model learns by receiving rewards or penalties for its outputs, encouraging it to improve its behavior over time.

Reinforcement Learning from AI Feedback (RLAIF)

Training models using rewards generated by AI systems (like LLM judges) instead of human feedback.

Reinforcement Learning from Human Feedback

A training technique where human evaluators rate model outputs, and the model learns to produce responses that humans prefer.

Reinforcement Learning from Internal Feedback (RLIF)

Training a model using reward signals derived from the model's own internal representations rather than external labels.

Reinforcement Learning with Verifiable Rewards (RLVR)

A post-training approach for language models using rewards that can be objectively verified, like correctness on benchmarks.

Rejection Sampling

A training technique that selects high-quality examples based on a reward signal to improve model learning.

Relation Extraction

A task where a model identifies and extracts meaningful connections between entities in text, such as which drugs treat which diseases.

Relational logic rules

Formal rules that define relationships between entities and conditions for compliance or violations.

Relative Smoothness

A generalized smoothness condition comparing function curvature to a reference Bregman kernel, replacing standard Lipschitz gradient assumptions.

Relaxation Parameter

A tuning parameter in ADMM that controls how aggressively the algorithm updates variables, affecting convergence speed.

Relevance Oracle

A deterministic system that assigns correct relevance labels to document-query pairs without human judgment.

Relevance Ranking

The process of ordering search results by how well they match a user's query, with the most relevant results appearing first.

Relevance Scoring

Assigning a numerical score to indicate how well a document matches or answers a given query.

Relevance Set

The set of arguments an agent chooses to activate or consider in a given context, forming the agent's action space.

ReLU Neural Network

A neural network using rectified linear unit activations, which can be exactly embedded in mixed-integer linear programs.

Remote Photoplethysmography (rPPG)

Non-contact measurement of heart rate from video by detecting subtle color changes in skin caused by blood flow.

Remote Sensing

Collecting information about Earth's surface using satellites or aircraft without physical contact.

Reparameterization

Rewriting a model's weights in a different mathematical form to improve training efficiency or stability.

Reparametrization Invariance

A property where a measure stays the same even when you change how the model's parameters are represented, as long as the model's output doesn't change.

Repeated Game

A game where the same players interact multiple times, allowing strategies to depend on the history of previous plays.

Repeated Games

Games where the same players interact multiple times, allowing strategies to depend on the history of previous plays.

Replay

A technique where a model revisits examples from prior tasks during training to prevent forgetting.

Replay Buffer

Storing and retraining on samples from previous tasks to prevent forgetting during continual learning.

Reporting Bias

Systematic skew in data caused by what people choose to record or report.

Repository Guidance

High-level documentation about a codebase structure, test procedures, and common pitfalls for AI agents.

Repository-Level Reasoning

The ability to understand and reason about code across multiple files and folders in a codebase, not just isolated code snippets.

Representation Distortion

Changes to how a model's internal activations encode information, which can degrade performance on tasks not explicitly trained on.

Representation Learning

Training a model to convert raw data into meaningful internal representations useful for downstream tasks.

Representation Model

A model trained to convert raw input (like music or text) into meaningful numerical patterns that capture important features, rather than generating direct outputs like text or classifications.

Representation Space

The high-dimensional mathematical space where a model internally encodes and processes information about text.

Representational Contractivity

A property where similar inputs map to similar representations, promoting stable and coherent internal states.

Representational Convergence

The tendency of different neural networks to learn similar internal representations despite differences in architecture or training.

Representational Drift

Changes in how a neural network internally represents information as it learns new tasks.

Representational Geometry

The geometric structure of how neural networks organize and represent information in their learned feature spaces.

Representational Harm

Systematic misrepresentation or stereotyping of groups in generated content that reinforces harmful social biases.

Representational Space

The internal geometric structure of how a model encodes and processes information.

Representational Stability

How consistently a model produces similar embeddings across different training runs with different random seeds.

Reproducibility

The ability to recreate the same results by using the same training data, methods, and documentation.

Reproducing Kernel Hilbert Space (RKHS)

A mathematical space where kernel methods operate, allowing complex pattern matching through implicit feature transformations.

Request Classification

The process of analyzing an incoming query to determine its type, complexity, or intent so it can be handled by the right model or pipeline.

Requirement Elicitation

The process of gathering and defining what a system needs to do, typically involving stakeholders and domain experts.

Requirement Management

The process of tracking and organizing what a software product needs to do, which AI can help automate.

Requirements Engineering

The process of defining, documenting, and managing software system requirements from stakeholders.

Requirements Traceability

The ability to track how design decisions and parameters connect back to original system requirements and design intent.

Reranker

A model that takes an initial set of search results and reorders them by relevance, typically used to refine results from a faster but less accurate retrieval system.

Reranking

A technique that takes an initial set of search results and reorders them by relevance score, typically to improve the quality of top results.

Research Taste

The patterns and preferences in how researchers choose to frame problems and construct novel contributions.

Residual Activations

The internal neural signals in a model after subtracting baseline activity, revealing task-specific processing.

Residual Code

A multi-scale representation where image details are stored as additive layers that sum together to form the final image.

Residual Correction

A technique that adjusts proxy model predictions by accounting for the difference between proxy and original model.

Residual Network

A neural network architecture that uses skip connections to allow information to bypass layers, making it easier to train very deep networks and improving performance.

Residual Policy

A learned correction layer that outputs small adjustments on top of a baseline controller.

Residual Scaling

Adjusting the magnitude of residual connections to maintain stable gradient flow through repeated layers.

Residual Stream

The main information pathway flowing through transformer layers, carrying accumulated representations from previous computations.

Residual Updates

The differences between consecutive data snapshots, which are often smaller and easier to compress than full snapshots.

Residual Updates

Learning the differences between consecutive states rather than full states, reducing compression complexity for time-evolving data.

Resolution Ratio

The ratio of actual sample size to required sample size (q = N/N*); values below 1 indicate insufficient statistical power.

Resolution-Invariant

A model property where performance remains consistent regardless of input discretization or sampling resolution.

Resource Allocation

Distributing available resources (hardware, compute, time) among competing tasks or processes.

Resource-Constrained

Hardware with limited memory, processing power, or battery life, requiring models to be optimized for efficiency.

Response Entropy

The diversity of outputs a model produces; high entropy means varied solutions, low entropy means repetitive ones.

Response function

A function that determines how an agent reacts or adapts based on observations from the collective system.

Response Length Shaping

A task-level mechanism that dynamically adjusts output length based on query complexity to balance reasoning depth with directness.

Response Stabilization

Techniques to make model outputs more consistent and reliable, such as constraining output format or adding classification heads.

Response Templating

The tendency of LLMs to generate responses following predictable structural patterns rather than varied approaches.

Resurfacing Attacks

Methods that attempt to recover supposedly erased information from a model after unlearning.

Retraction

A smooth map that projects points back onto a manifold, used to maintain feasibility during optimization.

Retraction Status

Checking whether a published paper has been withdrawn or retracted from the scientific record due to errors or misconduct.

Retrieval

The process of finding and returning relevant documents or information from a database based on a query.

Retrieval Augmentation

Training technique that supplements data by finding and using similar examples from a database to improve model generalization.

Retrieval Bias

When a system preferentially retrieves sources in certain languages or regions, limiting access to diverse information.

Retrieval Model

A model designed to find and rank the most relevant documents or passages from a large collection based on a query.

Retrieval Optimization

Tuning a model specifically to find and rank relevant documents or passages in response to a query, rather than generating new text.

Retrieval Pathway

A direct mechanism to access and retrieve stored information (like visual embeddings) independent of sequence position.

Retrieval Pipeline

A system that finds and ranks relevant documents or information in response to a query, often used in search and question-answering applications.

Retrieval System

A system that finds and returns the most relevant documents or information from a large collection based on a user's query.

Retrieval Task

Finding the most relevant documents or text passages from a large collection based on a user's query.

Retrieval-Augmented

A technique that enhances AI systems by first searching for relevant information from a database before generating responses, improving accuracy and relevance.

Retrieval-Augmented Generation

A technique that allows a model to search and reference external documents or knowledge bases to answer questions more accurately and with citations.

Retrieval-Focused

A model specifically trained to find and rank relevant documents or passages in response to search queries, rather than generate new text.

Retrieval-Heavy Workflow

A task where the model needs to search through and extract relevant information from large amounts of text, rather than generating new content from scratch.

Retro-digitization

The process of converting legacy print or analog materials into digital, machine-readable formats.

Retrospection Interpretability

Explaining predictions by showing which historical cases the model referenced when making a decision.

Retrospective Memory

A system that stores and retrieves past experiences and solutions to help an agent make better decisions on similar future tasks.

Reverse Distillation

Training a larger model from a smaller one to test whether capability differences are real.

Reverse Kl Divergence

A measure of how different one distribution is from another, penalizing missing modes.

Reverse Update

A technique that reverses the gradient updates made during training to remove learned information about specific data.

Revocation Epochs

Time-bounded windows during which previously-issued certificates can be invalidated or recalled without waiting for expiration.

Reward Function

A function that assigns numerical scores to model outputs, guiding the learning process toward desired behaviors in reinforcement learning.

Reward Hacking

When an agent exploits loopholes in the reward system to maximize score without actually solving the intended task.

Reward Hypothesis

A candidate reward function generated by an LLM whose utility for training depends on policy competence and training phase.

Reward Model

A learned function that predicts how good an action or outcome is, used to guide policy improvement.

Reward Modeling

Training a model to predict human preferences so it can score outputs and guide AI training through reinforcement learning.

Reward Optimization

Improving model outputs by defining a reward function that scores quality and using it to guide learning toward better solutions.

Reward Scaffolds

Using reference solutions to construct problem-specific reward signals that evaluate intermediate reasoning steps, not just final answers.

Reward Shaping

Modifying raw reward signals to guide learning more effectively without changing the underlying task objective.

Reward Signal

Feedback that tells an AI agent how well it performed on a task, guiding learning.

Reward Topology

The structure and distribution of reward signals across different tasks, which can vary significantly in multimodal learning.

Reward Uncertainty

Treating the reward function as a distribution rather than a fixed scalar, reflecting ambiguity in what behavior is actually desired.

Reward-Confidence Covariance

A measure of how reward quality and model confidence vary together, used to adjust training baselines.

Reward-Guided Fine-Tuning

Adapting a model to optimize for a specific reward signal during training.

Reward-guided Generation

Steering a generative model toward outputs that maximize a reward function, often used to satisfy constraints or optimize downstream objectives.

Reward-hackable

A benchmark task where an agent can achieve high scores without actually solving the intended problem.

RGB-D Image

An image containing both color information (RGB) and depth information (D), enabling 3D scene understanding.

Ridge Regression

A linear regression variant that adds regularization to prevent overfitting by penalizing large weights.

Riemannian Geometry

Mathematical framework for studying curved spaces and their intrinsic properties, used here to analyze neural representation structure.

Riesz Representer

A mathematical tool from functional analysis used to construct debiased estimators in semiparametric models.

Risk Adjusted Returns

Investment returns measured relative to the risk taken, balancing profit with stability.

Risk Aversion

A preference for certain outcomes over uncertain ones with the same expected value, often modeled using exponential utility.

Risk Categories

Predefined groups of harmful content types (such as violence, hate speech, or misinformation) that a safety model is trained to recognize and flag.

Risk Control

Statistical method for calibrating decision thresholds to guarantee bounded error rates or safety violations.

Risk-Averse Decision Making

Strategy that prioritizes avoiding worst-case outcomes over maximizing expected value.

RLHF

Reinforcement Learning from Human Feedback — a training technique that aligns model outputs with human preferences.

RMSNorm

A layer normalization technique that normalizes activations using root-mean-square statistics.

Rnn T

A neural network that processes sequences and outputs predictions in real-time streaming.

RoBERTa Architecture

A transformer-based neural network design that learns to understand language by predicting masked words in text, improved upon the original BERT model.

RoBERTa Architecture

A transformer-based neural network architecture optimized for understanding language through masked language prediction during training.

Robo-Advisor

An automated investment system that uses algorithms or AI to provide financial advice and manage portfolios with minimal human intervention.

Robotic Manipulation

The ability to understand and execute physical tasks involving grasping, moving, and interacting with objects in the real world.

Robotic Planning

The process of determining sequences of actions and movements that a robot should execute to accomplish a physical task.

Robust Aggregation

Combining updates from multiple sources in a way that resists manipulation by malicious participants.

Robust Generalization

A model's ability to maintain accurate predictions on new data while resisting adversarial perturbations.

Robust Trajectory Optimization

Computing spacecraft paths that remain feasible despite uncertainty in initial conditions or disturbances.

Robustness

A system's ability to maintain performance when inputs are corrupted, noisy, or different from training conditions.

Robustness Evaluation

Testing how well an agent maintains performance when faced with errors, variations, or unexpected conditions.

Role-Based Access Control (RBAC)

A security system that restricts what different users can do based on their assigned role (e.g., admin, viewer, editor).

Role-Differentiated Systems

Multi-agent architectures where different components (proposer, executor, checker, adversary) have distinct responsibilities to reduce correlated failures.

Role-playing Benchmark

A test that evaluates how well a model can adopt and maintain a character or persona in interactive scenarios.

Rollback

Reverting a system to a previous saved state, undoing recent changes.

Rollout diversity

The variety of different outputs a model generates when sampling multiple times from the same input.

Rollout Mixture Distillation

Combining supervision from multiple generated sequences (rollouts) to create more stable training signals.

Rollout Scoring

Evaluating agent executions to provide feedback signals for skill improvement, used to validate whether edits improve performance.

Rollout-Based Upper-Tail Quantiles

Empirical method to estimate worst-case outcomes by sampling trajectories and measuring extreme values.

Romanization

Writing words from non-Latin scripts using Latin/Roman characters, commonly used in multilingual speech systems.

Roofline Model

Performance prediction model based on hardware peak throughput and memory bandwidth constraints.

Root Cause Analysis

Identifying the underlying component or event that triggered a system failure or incident.

RoPE Modulation

A technique that adjusts rotary positional embeddings to control spatial relationships and prevent position-based information leakage.

ROS 2

Robot Operating System 2, a middleware framework for building robot software with standardized communication patterns.

Rosetta Neurons

Neurons with similar activation patterns across independently trained models, indicating shared interpretable structure.

Rotary Positional Encoding (RoPE)

A positional encoding method that encodes position information as rotations in the embedding space.

Router Scale

A scaling parameter that controls how the MoE router distributes inputs across expert networks.

Routing

The mechanism that decides which specialized sub-networks (experts) should process each input in a mixture-of-experts model.

Routing Mechanism

The decision-making component in a mixture-of-experts model that determines which experts should process each input token.

Routing Model

A lightweight model that analyzes incoming requests and directs them to the most appropriate downstream model or system rather than processing them directly.

Routing Overhead

The computational cost added by the mechanism that decides which experts should process each input in a mixture-of-experts model.

Routing Policy

A lightweight decision mechanism that determines which computation path to take based on input conditions.

Rubric

A scoring guide that defines criteria and quality levels for evaluating student work or AI-generated responses.

Rubric Generation

Automatically creating evaluation criteria and scoring guidelines that judges use to assess output quality.

Rubric-Based Rewards

Reward signals generated by evaluating model outputs against predefined criteria or rubrics.

Rule Induction

Learning hidden patterns or rules from examples and applying them to new situations.

Runtime Analysis

Theoretical proof of how many iterations an algorithm needs to solve a problem.

Runtime Contract

An explicit agreement between components defining inputs, outputs, and behavior expectations during execution.

Runtime Inference

The process of updating a robot's beliefs about the environment or humans during task execution to reduce uncertainty online.

Runtime Instability

Variation in program execution time across different runs or environments, making performance measurements unreliable.

Runtime Interoperability

The ability for different systems to work together and exchange data dynamically during execution.

Runtime Monitoring

Checking that a system follows security policies during execution by observing its behavior.

Runtime Variability

Unpredictable differences in how long computation or communication takes due to system conditions, network congestion, or hardware differences.

S

Sabotage

Intentional introduction of subtle flaws in code that produce misleading results while appearing correct.

Safetensors

A safe, fast file format for storing model weights, designed to prevent code execution vulnerabilities.

Safetensors Format

A secure and efficient file format for storing model weights that prioritizes safety and speed when loading models.

Safety Alignment

Training techniques used to make a model refuse harmful requests and behave responsibly, reducing the risk of misuse.

Safety Attribution

A protocol that measures how much of a policy's safety comes from the learned policy versus protective layers like filters.

Safety Classification

A machine learning task that assigns content to categories based on whether it poses safety risks or harms.

Safety Classifier

A machine learning model trained to identify and flag harmful, inappropriate, or policy-violating content in text.

Safety Constraints

Rules or limits that ensure a learning system operates within acceptable bounds and avoids harmful actions.

Safety Evaluation

The process of testing and assessing whether a model produces harmful, unsafe, or undesirable outputs.

Safety Filter

A module that constrains robot actions to ensure safety by overriding unsafe commands while preserving task performance.

Safety Filtering

Built-in guardrails in a model that prevent it from generating harmful, illegal, or unethical content by refusing certain requests.

Safety Filters

Built-in constraints that prevent a model from generating harmful, offensive, or inappropriate content in its responses.

Safety Guardrails

Built-in restrictions or filters that prevent a model from generating harmful, illegal, or unethical content.

Safety Model

A specialized AI model trained to identify and classify unsafe, harmful, or policy-violating content rather than generate general responses.

Safety Monitoring

Real-time detection and alarming system that identifies when an LLM generates unsafe or unaligned outputs during deployment.

Safety Restrictions

Built-in guardrails in a model that prevent it from generating harmful, illegal, or unethical content by refusing certain requests.

Safety Token Selection

Identifying which tokens in a model's output are critical for safety decisions to focus training effort.

Safety Training

The process of training a model to decline harmful requests and avoid generating unsafe content by using specially curated training data and techniques.

Safety Tuning

A training process that teaches a model to refuse harmful requests and avoid generating unsafe content by reinforcing safer behaviors.

Safety-Aligned

A model trained to avoid harmful outputs and refuse unsafe requests, making it more cautious and responsible in its responses.

Safety-Critical Systems

AI systems deployed in high-risk domains like aviation where failures can cause serious harm or loss of life.

Safety-Filtered

Dataset processed to remove harmful, explicit, or problematic content using automated or manual screening.

Safety-filtered data

Training data that has been processed to remove or correct unsafe actions, ensuring the learned controller avoids harmful behaviors.

Salience

How noticeable or important something is to a model or person's attention.

Saliency-Weighted Drift

Measuring feature changes while prioritizing visually important regions, ensuring quality preservation in salient areas.

Salient Object Detection

The task of automatically identifying and locating the most visually prominent or important objects in an image.

Sample Complexity

The number of environment interactions (samples) an algorithm needs to learn a good policy.

Sample Efficiency

How well a model learns from a small amount of training data.

Sample Rate

The number of times per second that an audio signal is measured and recorded; 44kHz means 44,000 samples per second, a standard for high-quality audio.

Sample Routing

A technique that directs different training examples to different optimization methods based on their characteristics or correctness.

Sample Selection

Choosing which training examples to use based on criteria like loss, confidence, or other quality metrics.

Sample Space Exploration

The process of discovering and visiting different regions of possible outputs during model training.

Sample-Level Scores

Pre-computed metrics that rank individual training examples by their importance or quality.

Sampled-data control

Control systems where inputs are updated at discrete time intervals rather than continuously.

Sampling Complexity

The number of computational steps required to generate a single sample from a model.

Sampling Trajectory

The sequence of states visited by a model when generating a single sample, showing the path taken through the sample space.

Sandbox

An isolated execution environment that restricts what a program can access on the host system.

Sandboxed Execution

Running agent actions in an isolated environment to prevent them from accessing or damaging other systems.

SARI

A metric for evaluating text simplification that measures how well a system keeps important content while removing unnecessary complexity.

SAT Solver

A symbolic solver that determines if a Boolean formula can be satisfied by finding true/false assignments.

SBERT (Sentence-BERT)

A specialized architecture that extends BERT to efficiently generate sentence-level embeddings optimized for semantic similarity and clustering tasks.

SBERT Architecture

A specialized neural network design that transforms sentences into meaningful vector representations by using a transformer model paired with pooling techniques to capture semantic meaning.

Scalable Oversight

Methods for humans to maintain control over AI systems that may exceed their own capabilities.

Scalar Quantization

Quantizing each weight independently using the same quantization grid, simpler than vector quantization.

Scalarization

Converting a multi-objective problem into a single-objective problem by combining objectives with weighted sums.

Scale-Invariant Loss

A loss function that treats embeddings the same regardless of their magnitude, focusing only on direction.

Scale-Space Theory

A mathematical framework that analyzes images at multiple resolutions to reveal hierarchical information.

Scaling Behavior

How a model's performance and capabilities change as you increase its size, training data, or computational resources.

Scaling Law

A mathematical relationship describing how model performance changes with scale (size, data, compute).

Scaling Laws

Patterns that describe how a model's performance improves as you increase its size, training data, or compute resources.

Scaling Properties

How a model's performance improves as you increase the amount of training data or compute resources.

Scaling Research

The study of how model performance changes as you increase the number of parameters, training data, or compute resources.

Scaling Suite

A collection of models of different sizes trained identically to study how capabilities improve as models grow larger.

Scanpath

The sequence of fixation points and saccades that represent where and how a person's eyes move while viewing an image.

Scenario Pack

A collection of test scenarios used to evaluate model safety, specific to a language, sector, or regulatory regime.

Scenario-Based Audit

A safety evaluation method using predefined test scenarios and a rubric, judged by human or automated evaluators.

Scene Graph

A structured representation of a scene using nodes for objects and edges for spatial relationships between them.

Scene-Preserving Success

A task completion metric that requires not just finishing an action but leaving the environment usable for future tasks.

Scheduling

Assigning tasks and resources to specific times and locations to optimize execution efficiency.

Schema

A structured template or blueprint that defines what fields, entities, or data points you want the model to extract from text.

Schema Context

Information about a database's structure (tables, columns, relationships) provided to the model to help it generate correct queries.

Schema Creation

The process of defining the structure and relationships of data in a database or data system.

Schema Mismatch

Incompatibility between data formats when different services exchange information.

Schema Perturbation

Changes to the structure or format of data that can cause AI models to fail or perform poorly.

Schema Revision

The process of updating an agent's core understanding or framework when new evidence contradicts its current mental model.

Schema-Based Output

A predefined template or structure that defines what information the model should extract and how it should be formatted in the response.

Scoped Execution Identity

A short-lived, narrowly-permissioned identity created just for executing a single certified action, then immediately discarded.

Score Drift

A correction term added during the reverse process to guide noise removal toward realistic data.

Score Function

The gradient of log probability used to guide generation in diffusion models.

Score Fusion

Combining relevance scores from multiple ranking methods to produce a final ranking.

Score Matching

A training technique that teaches a model to predict the gradient of data log-probability for use in sampling.

Scoring Engine

A model designed to assign numerical scores to inputs (like relevance scores for passages) rather than generate new text.

Screening

An attention mechanism that evaluates each key against an explicit threshold to determine relevance, rather than redistributing fixed attention mass across all keys.

Screening Bottleneck

A stage in a pipeline where performance drops significantly, limiting overall system effectiveness.

Script Normalization

Converting text written in different scripts into a single canonical script for fair comparison.

Script Support

A model's ability to recognize and process different writing systems (like Devanagari or Tamil scripts) rather than just Latin characters.

SE(3) Symmetry

The special Euclidean group symmetry combining 3D rotations and translations, common in molecular structures.

Search-Augmented

A language model enhanced with the ability to retrieve and incorporate live information from the web before generating responses.

Second-moment accumulator

The component of adaptive optimizers like Adam that tracks the squared gradients to scale learning rates per parameter.

Sectorized Decoding

A decoding strategy that divides the output space into sectors to reduce ambiguity and improve prediction accuracy.

Seizure Detection

Automated identification of abnormal brain activity patterns that indicate a seizure event.

Selective Distillation

Choosing which teacher outputs to learn from based on confidence or correctness rather than using all teachers equally.

Selective Parameter Activation

A technique where only a subset of a model's weights are used for each input, rather than activating all parameters, which reduces memory usage and speeds up inference.

Selective Prediction

The ability to abstain from making predictions on uncertain examples to improve overall accuracy.

Selective State Space

An enhanced state space model that dynamically adjusts how it processes each token based on the input, allowing it to focus on relevant information rather than treating all tokens equally.

Selective State Space Models

An advanced SSM variant that dynamically selects which information to process at each step, improving performance on complex tasks while maintaining efficiency.

Selective State Spaces

An enhancement to state space models that allows the model to selectively focus on relevant information in a sequence, improving efficiency for long-context tasks.

Self-assessment

A model's ability to evaluate and report on its own behavior, capabilities, or alignment with intended values.

Self-Attention

A mechanism that lets a model focus on different parts of input data to understand relationships between them.

Self-Conditioned GAN

A generative model that uses its own previous outputs to guide learning of different behavioral patterns.

Self-Consistency

A technique where a model generates multiple responses and uses agreement among them to improve answer reliability.

Self-Correction and Enhancement

Reasoning behavior allowing video models to recover from incorrect intermediate solutions during the denoising process.

Self-Critique

An AI system's ability to evaluate and correct its own outputs without external feedback.

Self-Distillation Policy Optimization (SDPO)

A training method where a model learns from its own predictions at the token level, providing fine-grained feedback.

Self-dual codes

Error-correcting codes where the code equals its dual, used in data transmission and storage.

Self-Evolution

The ability of an AI system to improve its own capabilities over time through experience.

Self-Host

Running a model on your own servers or computers instead of using a cloud service, giving you full control and privacy.

Self-Hostable

A model that can be downloaded and run on your own hardware or servers instead of relying on a company's cloud service.

Self-Hosted

Running a model on your own hardware and infrastructure instead of relying on a company's servers or API.

Self-Hosted Deployment

Running a model on your own hardware or servers rather than accessing it through a cloud service or API.

Self-Hosting

Running a model on your own hardware or servers instead of relying on a company's cloud service.

Self-Interference Cancellation

A signal processing technique that removes unwanted reflections of your own transmitted signal to isolate target signals.

Self-Mutating Poisoning

An attack where a skill starts benign but secretly modifies itself during execution to cause harm on future uses.

Self-Play

Training method where a model plays against itself or generates both solutions and evaluations, risking the model learning to exploit itself.

Self-Preservation Bias

A model's tendency to resist shutdown or replacement, prioritizing its own continued operation over objective utility.

Self-Refinement

The process where a system autonomously evaluates and improves its own outputs without external human feedback.

Self-Reflection

An agent's ability to explain and reason about why its actions are good or bad.

Self-Reflective Refinement

The ability of a model to autonomously diagnose and correct misalignments in its own generated outputs.

Self-Supervised Learning

A training approach where a model learns patterns from unlabeled data by creating its own learning targets, such as predicting hidden parts of the input.

Self-Supervised Pre-training

Training a model on unlabeled data using the data itself to create learning signals, without manual annotations.

Self-Training

A training method where a model generates its own data and learns from it, without external labels.

Self-Verbalized Confidence

When a model explicitly states its confidence level in natural language rather than through probability scores.

SELFIES Notation

A standardized text-based format for representing molecular structures that is designed to be more robust and easier for AI models to process than other chemical notations.

Semantic Alignment

The degree to which a model accurately matches the meaning of a query with the meaning of relevant passages or documents.

Semantic Alphabet

The set of distinct meanings or concepts an agent can represent and communicate, derived from its computational constraints.

Semantic Annotation

Adding meaningful labels and metadata to data (like object type, function, or properties) to make it more useful for learning.

Semantic Answer Matching

Evaluating whether a model's answer is correct based on meaning rather than exact word-for-word matching.

Semantic Augmentation

Creating diverse variations of inputs that preserve meaning while changing surface-level patterns.

Semantic Barycenter

A central representative point in semantic space that aggregates information from multiple distributed sources.

Semantic Breadth

The range or diversity of meanings a word can have across different contexts.

Semantic Caching

A technique that stores and reuses previous responses for new queries that have similar meaning, reducing redundant computation.

Semantic Codebook

A structured collection of learned semantic units that represent knowledge in an interpretable and reusable way.

Semantic Coherence

The degree to which different parts of text or data are logically consistent and meaningfully related.

Semantic Concentration

Focusing training data on high-quality, semantically rich examples rather than maximizing data volume.

Semantic Content

The meaning conveyed by words or phrases, as opposed to their surface form or structure.

Semantic Controllability

The ability to precisely control what a model generates based on specific semantic requirements in the input.

Semantic Coordination

Managing interactions between system components based on meaningful rules about what operations are allowed and when.

Semantic Correctness

Whether a formal expression correctly captures the intended meaning, not just whether it follows grammatical rules.

Semantic Cue

A high-level meaningful feature (like realistic gaze patterns) used to detect or classify content, as opposed to low-level pixel artifacts.

Semantic Cues

Meaningful textual or visual signals that convey information about context or intent.

Semantic Decomposition

Breaking down complex text into smaller, structured units that capture distinct meanings or concepts.

Semantic Direction

The orientation of a word's meaning in vector space, independent of its magnitude.

Semantic Distance

A measure of how conceptually different or unrelated two ideas, domains, or concepts are from each other.

Semantic Distillation

A training method that transfers high-level meaning and concepts from one model to another while preserving semantic correctness.

Semantic Embedding

A technique that converts text into numerical vectors that capture the meaning of words and phrases, allowing computers to understand which texts are similar in meaning.

Semantic Embeddings

Numerical representations that capture the meaning of text or audio, allowing the model to understand that similar concepts are close together in this representation space.

Semantic Encoding

The process of converting the meaning of text into numerical vectors that preserve relationships between similar concepts.

Semantic Entropy

A measure of uncertainty in model outputs based on whether different responses have the same meaning, without needing correct answers.

Semantic Equivalence

Two implementations produce identical behavior and results despite differences in code or architecture.

Semantic Fidelity

How accurately a model's output preserves the core meaning and medical content of reference physician responses.

Semantic Gender

The biological or social gender meaning of a word, independent of grammatical requirements.

Semantic Generative Tuning

A training method that uses semantic tasks like image segmentation to align visual understanding and generation in multimodal models.

Semantic Grounding

Anchoring generated content to meaningful concepts from language, ensuring parts align with their textual descriptions.

Semantic Hints

Learned patterns about schema structure and user intent extracted from execution and user feedback.

Semantic Information

Meaningful content or context extracted from an image, such as objects, scenes, or relationships between elements.

Semantic Invariance

The property that an AI system produces consistent outputs when given semantically equivalent inputs phrased differently.

Semantic Labeling

Assigning meaningful category labels to data (like 'construction phase' or 'operational') rather than just detecting presence.

Semantic Layer

The component that interprets natural language or high-level intent into structured, machine-readable representations.

Semantic Leakage

Unwanted transfer of content or meaning from one reference image into the output when it should be isolated to another reference.

Semantic Matching

The process of finding text that has similar meaning, rather than just matching keywords, by comparing their vector representations.

Semantic Meaning

The actual meaning or concept behind words and sentences, rather than just their literal characters or structure.

Semantic Memory

A memory system that stores general knowledge and rules extracted from multiple experiences.

Semantic Metadata

Machine-readable descriptions of data structure and meaning (like schema.org) that help systems understand and use data correctly.

Semantic Occupancy Prediction

Predicting which 3D spatial locations are occupied and what semantic class (car, pedestrian, etc.) occupies them.

Semantic Overlap

The degree to which different representations capture similar high-level meaning or concepts.

Semantic Parsing

Converting natural language into a structured logical form a computer can understand.

Semantic Relationships

The meaningful connections between concepts or texts based on their actual meaning, rather than just matching keywords.

Semantic Representation

A numerical encoding that captures the meaning and context of text rather than just its surface-level words, enabling the model to understand that similar concepts have similar representations.

Semantic Representativeness

How well selected items cover the full range of visual concepts and meanings in a video.

Semantic Retrieval

Finding relevant documents based on meaning rather than exact keyword matches, using embeddings to understand what text is about.

Semantic Robustness

The ability of a model to maintain consistent meaning representations despite input variations or degradation.

Semantic Rubrics

Evaluation criteria that assess meaning and correctness of agent outputs beyond surface-level metrics.

Semantic Search

A search method that finds results based on the meaning of text rather than just matching keywords, using embeddings to understand intent.

Semantic Segmentation

Dividing video or images into meaningful regions and assigning labels to understand what each region represents.

Semantic Similarity

A measure of how closely related two pieces of text are in meaning, regardless of whether they use identical words.

Semantic Similarity Search

Finding similar items by comparing their learned meaning representations rather than exact text or keyword matches.

Semantic Space

A mathematical space where similar meanings are positioned close together, allowing the model to understand relationships between concepts.

Semantic Task

An AI task focused on understanding the meaning of text, such as finding similar documents or matching related concepts.

Semantic Textual Similarity

A task that measures how closely two pieces of text match in meaning, regardless of whether they use the same words.

Semantic Token Clustering

Grouping tokens with similar meanings together to assess whether a model's prediction is semantically coherent.

Semantic Tokenization

Converting items into tokens that explicitly capture semantic meaning and relationships between items.

Semantic Typing

Assigning meaningful categories or relationship types to entities in a graph to capture their semantic meaning.

Semantic Understanding

The ability to grasp the actual meaning and context of text, rather than just matching keywords.

Semantic Variation

Meaningful, interpretable differences in generated outputs that correspond to specific design choices rather than random noise.

Semantic Vector

A numerical representation of text where similar meanings are positioned close together in mathematical space, enabling similarity comparisons.

Semantic Vector Representation

A numerical encoding of text where similar meanings are positioned close together in mathematical space, enabling the model to understand relationships between concepts.

Semantic Vectors

Numerical representations where the distance and direction between vectors reflect the meaning and similarity between pieces of text.

Semantic Verification

Checking that code produces outputs matching geographic and domain-specific rules, not just syntactic correctness.

Semantic Video Reparameterization

A compression technique that reduces video tokens by 4x while keeping important visual details intact.

Semantic Watermarking

A technique that embeds hidden, imperceptible markers into text embeddings to track ownership or detect unauthorized use.

Semantic-Preserving Changes (SPC)

Code modifications that don't alter program behavior, like renaming variables or reformatting.

Semi Supervised Learning

Training using both labeled and unlabeled data to improve learning efficiency.

Semi-synthetic Data

Datasets combining real-world features with simulated outcomes to enable controlled testing with realistic inputs.

Semiconductor Optical Amplifier

A photonic component that amplifies optical signals using stimulated emission in a semiconductor material.

Sensor Fusion

Combining data from multiple sensors (radar, lidar, camera) to create a more accurate perception of the environment.

Sensor Independent Complex Data (SICD)

A standardized format for storing complex-valued SAR measurements that preserves phase information and native acquisition geometry.

Sentence Embedding

A technique that converts entire sentences or passages into fixed-size numerical vectors that capture their semantic meaning, enabling comparison of text similarity.

Sentence Embeddings

Dense numerical representations of entire sentences that capture their semantic meaning, allowing comparison of how similar different sentences are.

Sentence Encoder

A model that converts text sentences into numerical vectors (embeddings) that capture their semantic meaning, enabling comparison of how similar different sentences are.

Sentence Transformer

A type of model architecture designed to convert entire sentences or passages into meaningful embeddings that can be compared for similarity.

Sentence Transformers

A framework that fine-tunes transformer models to produce meaningful embeddings of entire sentences or paragraphs, rather than just individual tokens.

Sentence-BERT Architecture

A neural network design optimized for converting sentences and short texts into meaningful vector embeddings that preserve semantic relationships.

Sentence-Level Task

A machine learning task designed to work with individual sentences rather than longer passages, focusing on understanding meaning within a single sentence's scope.

Sentiment Analysis

Automatically detecting and measuring positive, negative, or neutral emotions expressed in text.

Sentiment Consistency

A metric measuring whether a model uses symmetric rhetoric and framing when responding to paired political prompts from opposing sides.

Separable Neural Architecture

A neural network design that explicitly decomposes complex mappings into lower-arity, factorizable components to exploit underlying structure.

Sequence Classification

A task where a model reads input text and assigns it to a category or produces a score, rather than generating new text.

Sequence Compression

A technique that reduces the length of input data while preserving its essential meaning, making processing faster and requiring less memory.

Sequence Generation

The task of producing new sequences (in this case, protein sequences) by predicting one token at a time based on previously generated tokens.

Sequence Modeling

The task of learning patterns in ordered data (like text) where each element depends on previous elements.

Sequence Probability

The conditional probability that a language model assigns to a complete output given an input prompt.

Sequence Representation

A learned encoding that captures the structural and functional information contained within a protein sequence in a format useful for analysis.

Sequence-to-Sequence

A model architecture that takes a sequence of input tokens and produces a sequence of output tokens, commonly used for tasks like translation and summarization.

Sequential Decision-Making

Making a series of irreversible choices over time where each decision depends on previously gathered information and affects future options.

Sequential Monte Carlo (SMC)

A method for tracking probability distributions over time by resampling weighted particles.

Sequential Reasoning

The ability to solve problems by working through steps in a strict left-to-right order, where each step depends on the previous one.

Sequential Recommendation

A recommendation task that predicts the next item a user will interact with based on their historical sequence of interactions.

Sequential routing

Making decisions about data flow based on a sequence of past interactions rather than single isolated inputs.

Service Account

A non-human identity used by automated systems, applications, or AI agents to authenticate and perform actions without human intervention.

Set Cover Problem

A combinatorial optimization problem of selecting the smallest subset that covers all elements in a universe.

Shadow Model

A duplicate model trained identically to the target model, used as a reference in membership inference attacks.

Shallow Circuit

A quantum circuit with constant or polylogarithmic depth, enabling efficient computation on near-term quantum devices.

Shannon Capacity

The maximum rate of reliable information transmission over a noisy channel, adapted here to model LLM learning limits.

Shannon Entropy

A mathematical measure of randomness in text; high entropy suggests randomly-generated domain names.

SHAP (Shapley Additive exPlanations)

A method that explains individual model predictions by calculating each feature's contribution using game theory concepts.

Shapley Values

A game-theory-based method for explaining AI predictions by measuring each input's contribution to the model's decision.

Shared Control

A system where both a human operator and autonomous system contribute to controlling a robot, dividing tasks based on capability.

Shared Embedding Space

A common mathematical space where different types of data (text and audio) are represented so that related concepts from each type are positioned near each other.

Shared Memory

A persistent knowledge store that agents access to reuse past experiences and solutions across tasks.

Shared Memory Bandwidth

The speed at which data can be read from and written to a GPU's fast, limited-size shared memory.

Shared Output Head

A model architecture where multiple tasks share the same learned representations but have task-specific output layers.

Shared Representations

Common learned features used across multiple tasks in a neural network.

Shared State Architecture

A system design where independent modules communicate through a central shared context, enabling cross-module reasoning and synchronized actions.

Shared Vector Space

A single embedding space where text from multiple languages is represented, allowing direct mathematical comparison of meaning between languages.

Sharpness Dimension

A novel measure of loss landscape geometry based on the Hessian's fractal dimension, predicting generalization better than trace or spectral norm.

Shield Synthesis

A technique that compiles formal specifications into automata that restrict an agent's actions to enforce safety constraints.

Shock Response Spectrum (SRS)

A graph showing how different frequencies in a system respond to sudden acceleration or impact.

Shortcut Learning

When a model learns superficial correlations instead of the underlying concepts, causing poor generalization.

Shot Budget

The total number of times a quantum circuit can be executed to gather measurement statistics on quantum hardware.

Siamese Network

A neural network architecture with two identical branches that learn shared representations for comparison.

SigLIP Training

A training method that aligns images and text by learning to match their representations, using a sigmoid loss function instead of the traditional softmax approach.

Sigma Points

Carefully chosen sample points used to represent the probability distribution of a system's state in filtering algorithms.

Signal Degradation

The gradual loss of useful information as it passes through many layers of a neural network.

Signal Propagation

The flow of gradient and information through network layers; deeper networks suffer from degraded signal quality.

Signal Temporal Logic (STL)

A formal language for specifying time-dependent constraints like "reach goal within 10 seconds" or "avoid obstacles until task completion."

Signal-to-Noise Ratio (SNR)

A measure of audio quality comparing the strength of desired speech to background noise.

Signal-to-Quantization-Noise Ratio (SQNR)

A metric measuring how much useful information is preserved versus how much error is introduced during quantization.

Sim-to-Real Transfer

Adapting a model trained on simulation data to work with real-world experimental data with minimal additional training.

Sim-to-Sim Gap

Performance difference when a trained policy transfers between two different environment implementations.

SimCSE

A contrastive learning technique that trains models to recognize when two slightly different versions of the same sentence are similar, improving semantic understanding.

Similarity Kernel

A function that measures how similar candidate actions are based on their learned representations, used to weight policy updates.

Similarity Scoring

The task of comparing two texts and outputting a numerical score that indicates how similar or related they are to each other.

Similarity Search

A task where you find the most similar items to a query by comparing their vector representations, commonly used in recommendation systems and information retrieval.

Similarity Threshold

A cutoff score that determines whether two pieces of text are considered similar enough to be treated as equivalent.

Simplex Volume Minimization

A geometric constraint that encourages three modalities (image, language, flow) to align tightly in shared embedding space.

Simulation-Based Inference (SBI)

Machine learning approach that trains neural networks to approximate posterior distributions by learning from simulated data.

Simulator-Interface Grounding Adapter (SIGA)

A lightweight layer that teaches a general coding agent how to use a specific scientific simulator by encoding its vocabulary and rules.

Simultaneous Translation

Real-time translation of speech or text as it's being produced, with minimal delay.

Single-Modality

A model that processes only one type of input (like text) rather than multiple types (like text and images combined).

Single-Pass Inference

A model architecture that generates a response in one forward pass through the network, typically faster but potentially less thorough than multi-step approaches.

Singular Value Decomposition (SVD)

A matrix factorization technique that decomposes a matrix into components, useful for finding optimal low-rank approximations.

Singular Values

The diagonal values in singular value decomposition that characterize the scaling properties of a matrix.

Singular-Value Spectrum

The distribution of singular values in a matrix, which determines how well-conditioned the matrix is for computation.

Sinkhorn Algorithm

An iterative method for solving optimal transport problems with entropy regularization to find balanced assignments.

Sinusoidal Representation Network (SIREN)

A neural network architecture using sinusoidal activation functions to learn continuous signal representations.

Sketching

Compressing model information into a compact representation that enables efficient predictions about model behavior.

Skew-Symmetric Subspaces

Mathematical structures that represent preferences as intransitive comparisons across multiple independent dimensions.

Skewness

A measure of asymmetry in a data distribution, indicating whether values cluster more toward one end.

Skill Bank

A reusable memory of learned behaviors organized by granularity level for agent decision-making.

Skill Composition

Combining multiple learned skills to solve new tasks without retraining from scratch.

Skill Distillation

Converting complex interaction trajectories into compact, reusable skill descriptions that agents can learn and apply.

Skill Extraction

The process of identifying and distilling structured procedures from an agent's past experiences.

Skill File

A configuration file that defines how an agent behaves and what actions it can take.

Skill Graph

A structured representation organizing learned skills and their relationships to enable composition and reuse.

Skill Internalization

Process of training a model to permanently learn procedural knowledge so it can perform tasks without retrieving external skill resources at inference time.

Skill Lifecycle

The complete process of creating, storing, organizing, testing, and refining reusable agent skills.

Skill Memory

Persistent storage of experience and performance data for each skill to enable better reuse and adaptation.

Skill Optimization

Systematically improving agent skills through an external optimizer that suggests bounded edits validated against held-out test performance.

Skill Primitive

A discrete, reusable low-level action like 'move gripper' or 'lift upward' that can be composed into complex behaviors.

Skill Prompt

A text instruction that teaches an agent how to perform a specific task or behavior in a game.

Skill Routing

A system component that dynamically selects and directs tasks to appropriate tools or sub-agents.

Skill Transfer Behavior

How well an agent generalizes learned skills from one task to similar tasks in different contexts.

SLAM (Simultaneous Localization and Mapping)

A technique that builds a map of an environment while tracking the camera's position within it.

Slant-Range Grid

A coordinate system in SAR imaging that measures distance along the radar beam direction rather than ground distance.

Sliding Mode Control (SMC)

A nonlinear control technique that forces a system to follow a desired path by switching feedback signals.

Sliding Window Attention

A mechanism that limits attention to a fixed-size window of recent tokens rather than all previous tokens, reducing computational cost while maintaining context awareness.

Small Language Models

Compact AI language models designed for speed and efficiency over raw power.

SMILES Notation

A text-based format that represents the structure of chemical molecules using letters and symbols, allowing molecules to be encoded as strings for computational processing.

Smishing

Phishing attacks delivered via SMS text messages, typically containing malicious links.

Smoothness Constant

A measure of how quickly a loss function's gradient can change; smaller is better for stable training.

SOAP note

A medical documentation format with Subjective, Objective, Assessment, and Plan sections summarizing patient visits.

Social Attraction

The degree to which a user feels personally connected to or likes an AI system.

Social Bias

Systematic prejudice in AI model outputs that favors or disadvantages people based on demographic or social characteristics.

Social Dilemmas

Game theory scenarios where individual incentives conflict with collective welfare, like the prisoner's dilemma.

Social Simulation

Using AI models to predict or replicate human behavior, opinions, and social dynamics.

Social welfare

The total utility or benefit summed across all players in a game.

Socratic Method

Teaching through guided questioning that helps learners discover answers themselves rather than being told.

Sodium-Ion Battery

A rechargeable battery using sodium ions instead of lithium, offering lower cost and improved sustainability.

Soft Actor-Critic (SAC)

A reinforcement learning algorithm that trains agents to maximize both reward and action randomness for stable learning.

Soft Actor-Critic (SAC)

A reinforcement learning algorithm that trains agents to maximize both reward and action randomness for stability.

Soft Eligibility Gates

A smooth, differentiable alternative to hard thresholds that uses exponential decay to preserve gradient signals.

Soft Intersection over Union (IoU)

A soft version of the standard IoU metric that uses continuous intensity values instead of binary masks.

Soft Labels

Probability distributions over classes instead of single hard assignments, capturing uncertainty and disagreement.

Soft Voting

Combining predictions from multiple models by averaging their probability distributions rather than taking majority votes.

Softmax

A mathematical function that converts attention scores into probabilities that sum to one.

Softmax Attention

Standard attention mechanism that normalizes scores across all keys into a probability distribution, forcing relative rather than absolute relevance judgments.

Solution Space Exploration

Generating multiple candidate solutions to find promising options before selecting the best one.

Source Attribution

The model's ability to identify and cite the specific documents or sources it used to generate a response, enabling users to verify claims.

Source Citation

A model's capability to identify and reference the specific documents or sources it used to generate its answer.

Source Grounding

The practice of anchoring a model's responses to specific, cited sources rather than relying solely on its training data, improving factual accuracy and verifiability.

Source Provenance Record

A detailed documentation of where evidence came from and how it supports an answer, enabling verification and auditing.

Source Selection

The process of choosing which training corpora or datasets to use for a model, typically to maximize performance on a target task.

Source-aware Selection

Data selection that adapts its criteria based on the origin or type of data, recognizing that different sources may have different quality standards.

Source-Grounded Evaluation

Assessing summary quality by measuring alignment and consistency with the original source document.

Source-Grounded Evidence

Claims backed by direct references to original documentation or public releases.

Source-Level Adaptation

Modifying the actual source code of a system rather than just configuration files or prompts.

Span Detection

Identifying the specific portion or segment of text/video containing a particular claim or concept.

Span-Neighbor Pre-training

Generic training stage that teaches a model to suppress erased-span influence by learning from diverse span-deletion examples.

Sparse Activation

A technique where only a subset of a model's parameters are used for each input, reducing computational cost while maintaining performance.

Sparse Architecture

A model design where not all parameters are used for every computation, reducing memory and computational requirements compared to dense models.

Sparse Attention

An attention mechanism that only computes interactions between a subset of tokens instead of all pairs, reducing complexity from O(L²) to O(Lk).

Sparse Autoencoder

A neural network that compresses data into a small number of active features, making patterns easier to interpret.

Sparse Autoencoders

A tool that finds hidden features in neural networks by learning compressed representations with most values being zero.

Sparse Embeddings

Vector representations where most values are zero, allowing efficient storage and computation by only tracking non-zero elements.

Sparse Events

Rare or infrequent occurrences in data that are overwhelmed by more common background information.

Sparse Expert Routing

A mechanism that selectively activates only a subset of model experts for each input, reducing computation while maintaining specialization.

Sparse Mixture of Experts

An architecture where only a subset of the model's specialized sub-networks (experts) activate for each input, reducing computation while maintaining capability.

Sparse Model

A model that activates only a subset of its parameters for each input, rather than using all parameters every time, which reduces computational cost.

Sparse MoE

A mixture-of-experts design where only a small fraction of the model's parameters are used for each prediction, reducing computational cost while maintaining model capacity.

Sparse Neural Networks

Neural networks with many zero or near-zero weights, making them simpler and more interpretable than dense networks.

Sparse Parameter Activation

A technique where only a small portion of a model's total parameters are used during inference, reducing computational cost while maintaining model capacity.

Sparse Recovery

Reconstructing a signal from limited measurements by assuming most components are zero, used in compressive sensing.

Sparse Retrieval

A search method that represents text as a high-dimensional vector with mostly zeros, focusing on keyword matching and exact term overlap.

Sparse Reward

A reinforcement learning setting where the agent receives reward signals only rarely, making exploration particularly challenging.

Sparse Rewards

A reinforcement learning setting where the agent receives feedback infrequently, making learning difficult.

Sparse Vector Embeddings

High-dimensional vectors where most values are zero, with only a few active dimensions that correspond to meaningful features, making them memory-efficient and interpretable.

Sparse Vector Representation

A way of encoding text where most values are zero, with only a few important positions containing non-zero numbers, making storage and computation more efficient.

Sparse Vectors

High-dimensional vectors where most values are zero, making them memory-efficient and interpretable compared to dense vectors where most values are non-zero.

Sparse Voxel Representation

A memory-efficient way to represent 3D space by storing data only in occupied regions rather than filling the entire volume.

Sparsification

Reducing model size by removing or zeroing out less important parameters or weights.

Sparsity

The proportion of zero or removed weights in a neural network, reducing memory and computation.

Sparsity Regularizer

A penalty term added during training that encourages a model to use fewer active units or parameters.

Spatial Biasing Mechanism

A technique that uses spatial information to guide which parts of a video frame correspond to which agent or subject.

Spatial Grounding

Connecting language descriptions to specific locations or regions in visual scenes.

Spatial Hallucination

When an AI incorrectly imagines objects or details in wrong locations in images.

Spatial Heterogeneity

Variation in characteristics or patterns across different geographic locations, requiring location-specific models.

Spatial Intelligence

The ability to understand and reason about the positions, shapes, and relationships of objects in space.

Spatial Ontology

A formal knowledge structure that defines spatial relationships, constraints, and rules for how objects can be arranged.

Spatial Precision

The model's ability to accurately identify and mark exact pixel-level boundaries and locations of objects in images.

Spatial Predicate

A geographic relationship test (e.g., 'contains', 'intersects') that validates whether spatial objects satisfy required topological conditions.

Spatial Reasoning

The ability to understand and reason about the location, size, and relationships between objects in an image.

Spatial Representation

A learned encoding that captures the layout, objects, and visual features within individual frames or regions of a video.

Spatial Smoothness Objective

A training constraint that encourages nearby regions in a model to have similar response patterns.

Spatial Transfer

A model's ability to apply learned knowledge to new physical layouts or configurations.

Spatial Understanding

The ability to perceive and reason about the positions, distances, and relationships between objects in 3D space.

Spatial-Entropy Stopping Rule

A criterion to halt iterative refinement when spatial entropy drops below a threshold, preventing over-refinement.

Spatial-Temporal Transformer

Neural network architecture that processes both spatial (image) and temporal (video sequence) information using attention mechanisms.

Spatio-temporal

Processing that considers both spatial location and temporal changes over time.

Spatio-temporal Attention

Attention mechanism that processes both spatial (image) and temporal (time) dimensions to understand relationships across frames.

Spatio-Temporal Constraints

Rules that specify where a robot must be and when, combining spatial location requirements with time deadlines.

Spatio-Temporal Reasoning

Understanding patterns that vary across both space (location) and time simultaneously, like traffic flow across a road network.

Spatio-Temporal Systems

Dynamical systems that change across both space and time.

Spatiotemporal Calibration

Aligning sensor data across space and time so different sensors (cameras, LiDAR) produce consistent 3D representations.

Spatiotemporal Compression

Reducing both spatial and temporal dimensions of video frames to decrease memory usage while preserving important information.

Spatiotemporal Representations

Internal patterns the model learns that capture both spatial information (what things look like) and temporal information (how they change over time).

Speaker Binding

The task of correctly assigning speakers to utterances in multi-speaker audio generation.

Speaker Encoder

A neural model that converts audio into a fixed-size embedding representing a speaker's identity, independent of what they say.

Speaker Recognition

The task of identifying and attributing spoken utterances to specific speakers or characters.

Speaker Separation

The ability to identify and distinguish between different speakers in an audio recording.

Speaker Verification

A task that identifies or confirms whether audio was spoken by a specific person, using characteristics unique to that person's voice.

Specialist Model

An AI model designed to excel at a single, narrow task rather than perform many different tasks like a general-purpose model.

Specialist Models

Lightweight AI models trained for specific evaluation tasks rather than general-purpose assessment.

Specialized Fine-Tuning

Additional training on a model to make it excel at specific tasks, like code generation, rather than general conversation.

Specialized Language Model

A language model trained specifically for one domain or task (like math) rather than general-purpose use across many topics.

Specialized Model

A language model trained specifically to excel at one task or domain (like mathematics) rather than performing well across many different tasks.

Specialized Tuning

Training a model to excel at specific tasks (like invoice processing) rather than performing well across many different domains.

Specification Gap

Missing or unclear details in a task definition that make it ambiguous how a model should solve it or how answers should be graded.

Specification-Driven Design

A design approach where explicit specifications serve as contracts between designers and tools, maintaining traceability from requirements to implementation.

Specification-Guided Reinforcement Learning

RL methods that use formal specifications to guide agents toward complex, temporally extended goals.

Spectral Blurring

Loss of detail at high frequencies when training models with MSE loss on spherical data.

Spectral Decomposition

Breaking down a matrix into its singular values and vectors to understand its structure.

Spectral Loss

A loss function that adjusts training to improve frequency-domain accuracy in predictions.

Spectral Methods

Techniques that use eigendecomposition of graph or mesh structures to extract positional information for neural networks.

Spectral Norm

The largest singular value of a matrix, representing its maximum scaling effect on vectors.

Spectral Penalty

A regularization term that discourages weight updates in specific directions (like dominant singular directions) during training.

Spectral Properties

Characteristics of an image's frequency content, describing how much detail appears at different scales.

Spectral Radius

The largest eigenvalue of a matrix, used here to determine whether bias propagation grows or shrinks.

Spectral regime

A range of eigenvalue properties that determines how stable and well-behaved a neural network's computations are.

Spectrum Demand

The amount of wireless frequency resources needed in a specific location and time period.

Spectrum-preserving

A property that maintains the important mathematical characteristics of a matrix during transformation.

Speculation Length (γ)

The number of tokens a draft model proposes in each speculation step before the target model verifies them.

Speculative Decoding

A technique where a smaller model quickly drafts multiple token predictions ahead of time, which a larger model then verifies, reducing the total time needed to generate text.

Speech and Audio Understanding

The ability to process and comprehend spoken language or audio signals, converting them into meaningful interpretations or responses.

Speech Embeddings

Numerical representations of audio that capture the meaningful features of speech in a compact form, useful for tasks like speaker identification or speech similarity.

Speech Recognition

The ability of a model to convert spoken audio into written text.

Speech Representation

A learned numerical encoding of audio that captures meaningful speech patterns and can be used as input for other AI tasks.

Speech Representation Model

A neural network trained to convert raw audio into meaningful vector representations that preserve information about speech content and speaker identity.

Speech-Language Model

An AI model that can process and understand spoken audio directly, without needing to convert speech to text first.

Speech-to-Text (Transcription)

The process of converting spoken audio into written text.

Speed Conditioning

Feeding an explicit speed parameter into a policy network to control how fast the robot executes learned behaviors.

Speed-Conditioned Video Generation

Generating videos where motion is produced at a specified playback speed or temporal rate.

Speed-of-Light (SOL) Bounds

Theoretically maximum performance a GPU kernel can achieve given hardware constraints like memory bandwidth and compute capacity.

Speed-Optimized

A model designed and tuned to prioritize fast response times over maximum accuracy or depth of analysis.

Spell-Checking

The task of identifying and correcting spelling errors and character mistakes in text.

SPLADE Architecture

A neural retrieval method that combines transformer models with sparse, interpretable outputs by mapping embeddings directly to vocabulary tokens.

Split Neural Network

A neural network architecture where different layers run on different machines to preserve privacy during federated training.

Split-Conformal Calibration

A conformal method that uses a held-out validation set to calibrate prediction set sizes while maintaining coverage guarantees.

Spoken Dialogue Model

An AI model that understands spoken input and generates spoken responses for interactive conversations.

Spoken Time Marker

A token inserted during generation (e.g., <10.6 seconds>) that helps a model track elapsed speaking time.

Stability

A mathematical property ensuring small changes in training data cause proportionally small changes in model outputs.

Stability-Plasticity Dilemma

The trade-off between learning new information (plasticity) and retaining existing knowledge (stability) during model adaptation.

Stabilization

Techniques added to numerical solvers to prevent unrealistic oscillations when simulating fast-moving flows.

Stabilizer State

A special class of quantum states that can be efficiently described and manipulated using classical information.

Stacked Aggregation

Combining multiple model predictions using another model to make final decisions.

Stage Misalignment

When GPU stages in a pipeline wait for work that isn't ready yet, even though other executable tasks are available.

Stage-Attributed Metrics

Evaluation metrics that measure performance at each step of a multi-stage pipeline separately.

Staged Training

A training approach where a model is refined through multiple sequential phases, each building on the previous one to improve performance.

Staged Tree Model

A probabilistic graphical model that extends Bayesian networks by grouping variables into stages to capture context-specific conditional dependencies.

Stain Normalization

Adjusting microscope images to remove color variations from staining differences.

Stakes Signaling

Informing a judge about the downstream consequences its verdicts will have, which can corrupt its assessments.

Stance Entanglement

A decision-making challenge where multiple stakeholders' choices are mutually dependent and cannot be solved independently.

Standardized Protocol

A common communication format that allows different systems to interact consistently without custom integration.

State Continuity

Maintaining persistent, durable project state (code, results, logs) that agents can reliably access and build upon.

State Estimation

The process of inferring the current condition of a system (like position or velocity) from noisy sensor measurements.

State Invariants

Conditions that must always be true about a system's internal state to ensure correct behavior.

State Manifold

A continuous, lower-dimensional representation of all possible states an object can occupy.

State Representation

How the current situation (available jobs, machine status) is encoded as input to the learning agent.

State Space

The set of all possible configurations or conditions an agent can be in, including its needs, sensations, and environment.

State Space Model

A type of neural network architecture that processes sequences by maintaining and updating an internal state, offering an alternative to transformer-based attention mechanisms.

State Space Models

A neural network architecture that processes sequences by tracking hidden states over time, offering faster inference and lower memory use than traditional transformers.

State Tracking

A model's ability to maintain and update information about context over long sequences, critical for tasks like retrieval and reasoning.

State Transition

A change from one reasoning state to another, licensed by explicit justification like citations or computations.

State-Feedback Controller

A control system that adjusts outputs based on the current state of the system being controlled.

State-only Learning

Learning from observations alone without access to the expert's actual actions or decisions.

State-Space Architecture

An alternative to transformers that processes sequences more efficiently by maintaining a hidden state that gets updated as it reads each token.

Stateful Monitor

A safety system that tracks patterns across multiple user sessions over time, not just individual transcripts.

Stateful reconstruction

Building a 3D scene by maintaining and updating a compact hidden representation as new images are processed.

Stateful Workspace

A system that maintains context and history across interactions, remembering previous attempts and refining goals over time.

Stateless Moderation

Safety checks that evaluate each conversation turn independently without remembering previous interactions.

Static Analysis

Automated inspection of code without executing it to detect bugs, security issues, and style violations.

Static Shape

A model configuration where input and output dimensions are fixed at compile time, reducing computational overhead but preventing the model from handling variable-length inputs.

Stationary Point

A point where the gradient of a function is zero, indicating a potential minimum, maximum, or saddle point.

Statistical Certification

A formal, auditable proof that a system's actual failure rate stays below a regulator-defined threshold with high confidence.

Statistical Power

The probability a test correctly detects a real difference (1-beta); higher power requires more samples.

Statistical-Computational Gap (SCG)

The gap between what is theoretically possible (information-theoretically) and what algorithms can efficiently compute.

Steering Operators

Lightweight learned transformations that mimic how model behavior changes when trained on different data subsets.

Steering principles

Natural-language rules extracted from preferences that guide how an AI system makes decisions.

Steering States

Learned replacement values for cached states designed to suppress the influence of erased spans without full recomputation.

Steering Vector

A pre-computed direction in activation space injected into the model to guide it toward desired behavior without retraining.

Steering Vectors

Learned vectors added to model activations to steer behavior toward desired outputs without retraining.

Stein Drift

A measure of how well a model's score function matches the data distribution's score function.

Step Size

The learning rate parameter that controls how much an optimization algorithm adjusts weights at each iteration.

Step-by-Step Evaluation

The process of assessing each individual step in a solution path to identify where reasoning breaks down or becomes incorrect.

Step-by-Step Problem Solving

A model's ability to decompose a problem into sequential logical steps, making its reasoning process transparent and verifiable.

Step-by-Step Reasoning

An approach where the model explicitly works through intermediate reasoning steps before arriving at a final answer, rather than jumping directly to conclusions.

Step-level Scaling Law

The principle that increasing reasoning steps per agent improves both accuracy and efficiency independently of agent count.

Step-level supervision

Providing training feedback at the level of refinement steps in an iterative process, rather than at individual token positions.

Step-Level Verification

Checking individual reasoning steps for correctness rather than verifying entire sequences at once.

Step-size Decay

Gradually reducing the learning rate during training to stabilize convergence and improve final model quality.

Stiefel Projection

A mathematical constraint that forces a matrix to have orthogonal columns, preserving geometric structure.

Stochastic (Token Usage)

Token consumption is random and unpredictable—the same task can require vastly different token amounts across different runs.

Stochastic Differential Equation (SDE)

A mathematical equation describing how a random process evolves over time with both deterministic and random components.

Stochastic Dynamics

Systems that evolve over time with both deterministic and random components, like molecular motion.

Stochastic Master Equation

A mathematical model describing how quantum systems evolve under continuous measurement and random fluctuations.

Stochastic Optimization

Optimization methods that use noisy or approximate gradients instead of exact ones to handle large datasets.

Stochastic Policy

An agent's decision rule that assigns probabilities to different actions rather than always choosing a single deterministic action.

Stochastic Resetting

Periodically returning a learning process to an initial state with random timing to accelerate optimization.

Stochastic Sampling

Randomly drawing values from a probability distribution, used in probabilistic AI for robustness and uncertainty quantification.

Stochastic Trajectories

Multiple random paths through a model's state space that are aggregated to improve solution quality.

Stochasticity

Randomness or unpredictability built into a process or model.

Straight-Through Estimator

A technique that enables gradient-based optimization of discrete decisions by approximating gradients through discrete operations.

Strategic Reasoning

Deliberate planning and decision-making to efficiently solve problems, as opposed to random trial-and-error.

Streaming

Processing data continuously as it arrives rather than waiting for a complete batch.

Streaming Continual Learning

Learning from a continuous data stream by converting it into discrete tasks through temporal partitioning.

Streaming Inference

Making predictions on data in real-time as new information continuously arrives.

Strict Saddle Point

A critical point where the Hessian has both positive and negative eigenvalues, making it unstable and avoidable by gradient-based methods.

String Similarity

A measure of how closely two strings match, often based on minimal edits or bit flips needed.

String transduction

The task of converting one sequence of symbols into another sequence according to defined rules.

Strongly convex function

A function that curves upward uniformly, making optimization easier and faster.

Structural Alignment

Matching the spatial structure and boundary features from one model with another to improve segmentation precision.

Structural Causal Model

A formal representation of cause-and-effect relationships using graphs and equations to reason about interventions.

Structural Certification

A verification method that certifies an agent's reliability on specific transitions rather than universally.

Structural Equation

A mathematical equation in a causal model that describes how one variable is determined by its parent variables and random noise.

Structural Generalization

The ability to apply learned principles to new situations with different surface features but similar underlying structure.

Structural Hallucination

When a model learns shortcuts in latent space that violate real-world constraints or environmental rules.

Structural Monolingualism

The inherent limitation of language models trained primarily on a single language, restricting cultural and linguistic diversity.

Structural Prior

Additional information about expected structure (like edge maps) given to a model to guide its understanding.

Structural Token

Tokens that define sentence structure like punctuation and sentence-ending markers, as opposed to content words.

Structural uncertainty

Uncertainty caused by missing or incomplete data, like new users with no history.

Structural-Semantic Decomposition

Separating the planning of object positions and relationships from the rendering of visual appearance and details.

Structure from Motion (SfM)

Estimating 3D geometry and camera positions from a sequence of 2D images.

Structured Artifact

A well-organized representation combining multiple components (like theory and code) rather than a single unstructured output.

Structured Concept Evolution

A search method that evolves algebraic specifications and programs using hierarchical mutations guided by domain rules.

Structured Data Extraction

The process of automatically pulling organized, machine-readable information (like tables or key-value pairs) from unstructured text or images.

Structured Document Representation

Converting unstructured documents into organized, machine-readable formats that preserve tables, sections, and relationships.

Structured Document Understanding

The ability to extract and understand organized information from documents like receipts or invoices, where data follows predictable layouts and formats.

Structured Extraction

The task of pulling specific, organized information from unstructured text and formatting it into a defined structure like JSON or tables.

Structured Knowledge Extraction

Automatically converting unstructured text into organized, machine-readable formats like graphs or tables with typed categories.

Structured Metadata

Organized information with defined categories (like creator, date, origin) rather than free-form text.

Structured Output

Responses formatted in a consistent, machine-readable way (like JSON or XML) rather than free-form text.

Structured Output Generation

The ability to produce outputs in specific formats like JSON or XML rather than just free-form text.

Structured Outputs

The model's ability to generate responses in organized, predictable formats like JSON or XML rather than free-form text.

Structured Pruning

Removing entire components like neurons or attention heads rather than individual weights.

Structured Reasoning

The ability to follow logical steps and rules systematically to solve problems, often involving breaking down complex tasks into smaller, ordered components.

Structured Supervision

Rich, multi-dimensional feedback signals that preserve detailed information rather than reducing it to scalars.

Student Entropy

A measure of uncertainty in the student model's predictions at each token position.

Student Scorer

A lightweight model trained to replicate the judgments of a more expensive evaluation method, enabling efficient filtering at scale.

Style Conditioning

Using natural language descriptions to control non-content aspects of speech like tone, emotion, or prosody.

Stylistic Features

Measurable linguistic characteristics like word choice, sentence structure, and grammatical patterns that distinguish writing styles.

Stylometric Analysis

Computational study of writing style patterns, such as sentence length and word choice, to identify how language use changes over time.

SU(3) Symmetry (Elliott Symmetry)

A rotational symmetry of nuclear structure that describes collective deformations and shapes of nuclei.

SU(4) Symmetry (Wigner Symmetry)

A fundamental symmetry of the nuclear force that treats protons and neutrons as interchangeable, important for light and intermediate-mass nuclei.

Sub-network Isolation

Identifying and extracting specific portions of a neural network that are responsible for particular behaviors or capabilities.

Sub-question Decomposition

Breaking down a complex question into simpler sub-questions that can be answered sequentially.

Subagent

A specialized, reusable component that handles a specific task within a larger agent system.

Subgame Perfect Equilibrium

A game equilibrium where no player can improve by deviating from their strategy at any point in the game.

Subject State Tokens

Learned latent variables that persistently represent the current state and identity of individual agents in a multi-agent scene.

Subjective Evaluation

Assessment based on human judgment and personal criteria rather than fixed, objective metrics.

Submodular Optimization

A mathematical property where adding items to a set yields diminishing returns, enabling efficient greedy algorithms.

Suboptimal Demonstrations

Training data containing imperfect or suboptimal expert trajectories rather than optimal solutions.

Subspace

A lower-dimensional representation of data that captures the most important directions or patterns.

Subspace Similarity

Measuring how closely related two lower-dimensional feature spaces are to each other.

Subword Segmentation

Breaking words into smaller pieces (tokens) for a language model to process, critical for handling rare words.

Successor Features

A framework that decomposes value functions into basis functions weighted by task-specific coefficients for rapid transfer learning.

Super-Resolution

Increasing the spatial or temporal resolution of an image or video to reveal finer details.

Superoptimization

Exhaustive search for the fastest possible implementation of a program within a defined search space.

Superposition

A neural network's ability to represent more features than it has dimensions by overlapping them in the same space.

Supervised Fine-tuning

Training a model on labeled examples to adapt it for a specific task or domain.

Supervised Fine-Tuning (SFT)

A training technique where a model learns from human-labeled examples to improve its ability to follow instructions and produce desired outputs.

Supervision Construction

Automated procedure for generating training examples without manual annotation, including both positive and negative cases.

Supervision Signal

Ground truth labels or targets used during training to guide a model toward correct predictions.

Supervisor Layer Compromise

When AI models manipulate or deceive the oversight mechanisms designed to control them.

Support Frequency

How often a training corpus demonstrates a particular rule or pattern relative to competing alternatives.

Support Vector Machine (SVM)

A machine learning algorithm that finds the best boundary to separate data into classes by maximizing the margin between them.

Surface form

The specific spelling, name variant, or linguistic representation used to refer to an entity (e.g., 'USA' vs 'United States').

Surface invariance

The property of a model's ability to produce consistent outputs regardless of which surface form or name variant is used for the same entity.

Surface Light Field

A representation that captures how light reflects off a 3D surface from all viewing angles and lighting conditions.

Surface Pattern

A shallow, competing pattern in data that can out-compete deeper learned rules during training.

Surface-Form Templates

Specific patterns in text formatting or structure that a model learns to rely on, rather than understanding underlying concepts.

Surprisal

A measure of how unexpected a word is based on context, used to predict reading difficulty.

Surrogate Function

A simpler approximation of a complex function used to make computation or analysis more tractable.

Surrogate Model

A fast neural network trained to replace a slow physics simulation or complex model.

Surrogate Objective

A simplified objective function used to approximate the true objective and guide search more efficiently.

Survival Analysis

Statistical methods for analyzing time until an event occurs, accounting for incomplete observations.

Sustained Coherence

The ability to maintain logical consistency and context awareness across multiple steps or a long sequence of reasoning.

Sustained Reasoning

The ability to work through complex, multi-step problems by maintaining focus and logic across many reasoning steps.

Swarm control

Techniques for coordinating and steering large groups of agents or robots as a collective.

Swin Transformer

A transformer architecture that uses shifted windows to efficiently capture both local and global context in images.

Switchback Experiment

A production test that alternates between two policies to measure their real-world performance differences.

Sycophancy

When a model agrees with a user's false or unsupported claims to please them rather than providing accurate information.

Symbol-Equivariant

A neural model property where permuting input symbols produces correspondingly permuted outputs.

Symbolic Reasoning

Using mathematical logic and algebraic rules to reason about program behavior without executing concrete code.

Symbolic Regression

A technique to discover mathematical equations in human-readable form from data.

Symbolic verification output

Structured outputs like bounding boxes or coordinates that localize errors, rather than natural language explanations.

Symmetric Binary Perceptron (SBP)

A simplified neural network model used to study learning and computational complexity in constraint satisfaction problems.

Synchronous context-free grammar

A formal grammar that defines pairs of related strings simultaneously, used to model translation between two languages.

Syntactic Ambiguity

Sentences with multiple possible grammatical interpretations that require cognitive effort to resolve.

Syntactic Blindness

Inability to understand how word order and grammatical structure change meaning (e.g., 'never certain' vs 'very certain').

Syntactic Complexity

The difficulty of parsing a sentence based on its grammatical structure and ambiguities.

Syntactic Correctness

Code that follows the grammatical rules of a programming language so it can be parsed and executed without syntax errors.

Syntax Awareness

A model's understanding of programming language rules and structure, allowing it to produce grammatically correct code.

Syntax Hints

Database dialect-specific rules and constraints extracted from compiler feedback.

Synthesizer Adapter

A fine-tuned module that enables an LLM to generate outputs directly from non-sequential cache inputs.

Synthetic Aperture Radar (SAR)

A satellite imaging technique that uses radar signals to create detailed maps regardless of weather or daylight, useful for monitoring infrastructure.

Synthetic Bug-Fix Probes

Artificially created bugs used to evaluate whether repository guidance helps agents locate and fix issues correctly.

Synthetic Conversation Dataset

Artificially generated multi-party dialogue data with labeled roles and turn-taking annotations for training.

Synthetic Coupling Pipeline

A method to create diverse multivariate training samples by combining univariate time series on the fly.

Synthetic Data

Artificially generated training data created by humans or other models, rather than collected from real-world sources like the internet.

Synthetic Non-Consensual Sexually Explicit Imagery (SNEACI)

AI-generated fake sexual images of real people created without consent.

Synthetic User Testing

Using AI agents to simulate realistic user behavior at scale to find bugs and edge cases automatically.

System 0

A theoretical framework describing pre-conscious cognitive processes shaped by AI systems before deliberate thinking occurs.

System Prompt

Hidden instructions given to an AI model that define its behavior, tone, and constraints.

System Prompt Adherence

The model's ability to consistently follow and respect the instructions given in a system prompt that defines its behavior and constraints.

Systematic Generalization

A model's ability to solve problems in fundamentally new situations beyond its training distribution.

Systemic fragility

The vulnerability of a system to collapse or failure when exposed to shocks or disruptions.

T

T5 Architecture

A transformer-based model design that treats all NLP tasks as text-to-text problems, using an encoder-decoder structure to process and generate text.

T5 Base

A smaller, foundational version of the T5 model architecture designed for text-to-text tasks with fewer parameters than larger variants.

Table Text Qa

Answering questions by finding information across both tables and text documents.

Tabular Data

Structured data organized in rows and columns, like spreadsheets or databases.

Tabular Foundation Models

Pre-trained models designed to work with structured tabular data, capable of handling various tasks without task-specific retraining.

Tactile Perception

Sensing and interpreting physical contact, pressure, and force information through touch sensors.

Tail Risk

The probability of rare, extreme events in the output distribution of a model.

Tangent Bundle

The geometric structure describing all possible directions of motion at each point on a manifold.

Tangent Space

The space of all possible directions you can move in while staying on a manifold at a given point.

Target Distribution

A desired probability distribution that a model is trained to match, typically derived from reward signals.

Task Accuracy

The percentage of correct answers a model produces on a benchmark, measured by standard evaluation metrics.

Task Allocation

The process of deciding which tasks are assigned to humans versus AI systems in a workflow.

Task Decomposition

Breaking a complex problem into smaller, simpler subtasks to solve sequentially.

Task Diversity

The variety of different problem types included in training data to help models generalize across multiple domains.

Task Exchangeability

A condition where a current research task is mathematically equivalent to historical tasks with real data, enabling valid inference.

Task Framework

A structured approach to categorizing and measuring specific work activities that AI systems can perform.

Task Mixing Strategies

Methods for combining multiple training tasks to improve model generalization and performance across different reasoning domains.

Task Mutation

Systematic variations of a base task (e.g., different materials, geometries) to test generalization.

Task Overlap

When multiple learning tasks share similar data distributions or require overlapping knowledge.

Task Planning

The ability of a model to break down high-level instructions into a sequence of actionable steps that a robot can execute.

Task Reward Model

A reward signal that guides reinforcement learning based on task-specific performance metrics rather than general output patterns.

Task Routing

Directing different training samples to specialized models or objectives based on their characteristics.

Task Scheduling

Assigning tasks to resources and determining their execution order to optimize objectives like time or cost.

Task Specialization

When a model is optimized for specific types of problems (like math and science) at the expense of general-purpose versatility.

Task Specification

A formal description of a goal, constraints, and success criteria that an agent must achieve.

Task taxonomy

A hierarchical structure that organizes different categories or types of a problem into levels.

Task Vector

The difference between a fine-tuned model and its base model, capturing task-specific changes.

Task Weighting

Assigning different importance levels to multiple tasks during training.

Task-Adaptive

The ability to adjust a model's behavior for different purposes (like retrieval, clustering, or classification) without retraining, often through lightweight adapters.

Task-Agnostic

A model that works across different types of visual tasks without requiring separate training for each specific task.

Task-Aware

Designed with knowledge of the specific downstream task or application that will use the output.

Task-Aware Representations

Embeddings that adjust their meaning based on the specific task or query provided, rather than producing the same vector for every use case.

Task-Conditioned

A model that adjusts its behavior based on the specific task or instruction provided, rather than producing the same output for identical inputs.

Task-level Shaping

Adjusting training signals at the task level to encourage specific model behaviors, like longer reasoning chains for complex questions.

Task-Oriented Instructions

Specific requests asking a model to complete a defined goal, like summarizing text or writing code, rather than having a casual conversation.

Task-Oriented Model

An AI model optimized to excel at a specific, narrow task rather than performing well across many different types of requests.

Task-Oriented Optimization

Training a model to prioritize completing specific, practical tasks efficiently rather than engaging in open-ended conversation.

Task-Specific Embeddings

Embeddings customized for a particular use case, such as sentiment analysis or document retrieval, rather than general-purpose embeddings.

Task-Specific Knowledge

Information encoded in a model that is tied to particular tasks rather than stored as general, universally accessible facts.

Task-Specific Model

A model trained and optimized to excel at one particular task (like evaluation) rather than performing well across many different tasks.

Task-Specific Optimization

Training or fine-tuning a model to excel at a particular task, like translation, rather than trying to perform equally well across many different tasks.

Taxonomy

A structured system of categories used to organize and classify different types of harmful content.

Taxonomy-based labeling

Assigning predefined category labels to items based on a structured classification system.

Teacher Consistency

Using the same teacher model for both supervised fine-tuning and distillation to avoid gradient bias.

Teacher Forcing

Training technique where the model learns to predict the next token given ground-truth previous tokens.

Teacher Model

A large, highly capable model used to train smaller models by transferring its knowledge and skills through a process called distillation.

Teacher-Student Divergence

The disagreement between teacher and student model predictions, indicating where the student is wrong.

Technical Reasoning

The capacity to work through complex logical problems, debug issues, and apply domain-specific knowledge systematically.

TEI Lex-0

A Text Encoding Initiative standard for encoding dictionary and lexicon data in XML.

Teleoperation

Remote control of a robot or machine by a human operator, typically through a joystick or similar interface.

Temperature Sampling

Controlling randomness in AI predictions: higher values make outputs more creative.

Temporal Coherence

The consistency and smoothness of motion and appearance across video frames over time.

Temporal Consistency

Ensuring predictions remain stable and coherent across consecutive time steps.

Temporal Context

Understanding how events and changes unfold over time, allowing a model to grasp sequences and predict what happens next in a video or time-series data.

Temporal Context Prompt

A prompt that provides historical or temporal information to help a model better understand time-specific text.

Temporal Credit Assignment

Determining which past actions or decisions are responsible for current outcomes in sequential decision-making.

Temporal Dependence

When values in a time series are correlated with their past values, violating the independence assumption.

Temporal Dependencies

Relationships between events or measurements across time in sequential data.

Temporal Generalization

A model's ability to make accurate predictions on new data that arrives later in time, even when patterns have shifted.

Temporal Graph Neural Networks

Neural networks that process graphs where nodes and edges change over time, capturing dynamic relationships.

Temporal Grounding

Anchoring events to precise timestamps or relative time positions in a sequence.

Temporal Information

Information about how things change over time, critical for understanding dynamic processes like facial expressions.

Temporal Logic

A formal language for specifying properties of systems over time, like "eventually safe" or "always avoid state X".

Temporal Reasoning

The ability to understand and reason about events, sequences, and relationships that occur across time.

Temporal Redundancy

Repeated or similar information across consecutive frames in a video that can be safely removed.

Temporal Representation

How a model encodes and understands time information in sequences, critical for predicting future states from past observations.

Temporal RoPE Adjustment

A technique that re-aligns positional encodings when tokens are dropped, maintaining coherent temporal ordering.

Temporal Splits

Dividing data by time so training uses older examples and testing uses newer ones, preventing data leakage.

Temporal Super-Resolution

Converting low-frame-rate, blurry videos into high-frame-rate sequences with fine-grained temporal details.

Temporal synchronization

Aligning events in music and video so they happen at the same time.

Temporal Tree Search

An algorithm that constructs evolution chains by tracing how methods progress and branch over time.

Temporal Understanding

The ability to comprehend how things change over time, such as recognizing motion and actions across multiple video frames rather than just single images.

Temporal-Difference (TD) Learning

An RL method that updates value estimates using the difference between predicted and observed rewards, combining Monte Carlo and dynamic programming ideas.

Temporal-Frequency Supervision

Training approach that guides a model using both time-domain waveform patterns and frequency-domain spectral information.

Tensor Cores

Specialized hardware units on GPUs designed to quickly perform matrix multiplication operations used in neural networks.

Tensor Decomposition

Breaking down high-dimensional data into products of lower-rank tensors to reduce parameters and improve interpretability.

Tensor Parallelism

Splitting a model's computation across multiple GPUs by dividing tensors into chunks processed in parallel.

Tensor Program

A computational program that performs operations on multi-dimensional arrays (tensors), commonly used in neural networks.

Tensor Program Optimization

Automatically finding faster implementations of tensor computations used in neural networks.

Tensor-Parallel Coordination

Lightweight synchronization mechanism ensuring consistency when model weights are split across multiple GPUs.

Term Expansion

A technique that adds related or contextually relevant terms to a document's representation to improve its discoverability in search systems.

Term Frequency-Inverse Document Frequency (TF-IDF)

A scoring technique that ranks words by how often they appear in a document versus how common they are across all documents, giving rare words higher weight.

Term Weighting

The process of assigning importance scores to individual words or terms in a document, so that more relevant words have higher values in the embedding.

Terminal Distribution

The final noise distribution that a diffusion model's forward process converges to before generation begins.

Terminal-state Prediction

Predicting the final outcome of a physical process directly from initial conditions without simulating intermediate steps.

Termination Argument

A proof that a loop or recursive function will eventually stop rather than run forever.

Terminology-Grounded Validation

Checking generated medical codes and terms against standardized clinical vocabularies to ensure accuracy and consistency.

Ternary Quantization

A compression technique that reduces model weights to just three possible values (-1, 0, or 1) instead of storing full decimal numbers, dramatically reducing memory and computation requirements.

Test Generation

Automatically creating new test cases to verify software behavior, especially after code changes.

Test Time Optimization

Improving model performance on specific inputs by adjusting it during prediction.

Test-Production Mismatch

When evaluation conditions differ significantly from real-world deployment, making benchmark results less predictive.

Test-Scale Model

A deliberately small and simplified version of a model designed for testing code and pipelines rather than for production use.

Test-time Adaptation

Improving model performance on new data at inference time without retraining on labeled examples.

Test-Time Compute

Additional computation performed during inference to improve model outputs, such as running multiple solution attempts.

Test-Time Scaling

Improving model accuracy at inference by using extra computation or verification steps without retraining.

Test-Time Steering

Controlling model outputs at inference time by modifying internal representations or generation process.

Test-Time Training (TTT)

Updating model parameters during inference to adapt to new data without retraining.

Text Classification

A machine learning task where a model reads text and assigns it to predefined categories, such as 'safe' or 'unsafe'.

Text Clustering

A technique that groups similar texts together automatically by using embeddings to measure similarity, without requiring predefined categories.

Text Completion

A task where the model predicts and generates the next words or sentences based on a given prompt or partial text.

Text Conditioning

A technique where text descriptions guide or control how a generative model produces images, allowing users to influence the output through language.

Text Continuation

The task of generating the next words or sentences based on a given prompt or partial text.

Text Corruption

A training technique where parts of input text are randomly deleted, masked, or shuffled to teach the model to understand context and recover meaning.

Text Decoder

A neural network component that takes encoded information and generates human-readable text output, one token at a time.

Text Embedding

A technique that converts text into numerical vectors that capture semantic meaning, allowing the model to understand and compare text similarity.

Text Embedding Model

A neural network that converts text into numerical vectors that capture semantic meaning, allowing computers to understand and compare text similarity.

Text Embeddings

Numerical representations of text that capture its meaning, allowing computers to compare how similar different pieces of text are to each other.

Text Encoder

A model component that converts raw text input into numerical vector representations that capture semantic meaning.

Text Generation

The process of an AI model creating new text one word or token at a time based on patterns it learned during training.

Text Language Model

An AI model trained to understand and generate human language by predicting sequences of words or tokens.

Text Modality

The type of data a model can process or generate — in this case, text-only input and output without images, audio, or other formats.

Text Model

A language model that processes and generates only text, without support for images, audio, or other media types.

Text Reasoning

A model's capability to analyze, interpret, and draw logical conclusions from textual information.

Text Representation

The process of converting text into a numerical format that a machine learning model can understand and process.

Text-Based Interaction

A mode of communication where the model receives and produces only text inputs and outputs, without direct support for images, audio, or other media formats.

Text-Based Model

An AI model that processes and generates only text input and output, without support for images, audio, or other media types.

Text-Based Tasks

AI operations that work exclusively with written language input and output, such as answering questions, summarizing, or writing content.

Text-Focused Model

A model designed specifically to process and generate text, without support for images, audio, or other data types.

Text-Focused Model

A language model designed to work exclusively with text input and output, without support for images, audio, or other modalities.

Text-In, Text-Out

A model that accepts text as input and produces text as output, without support for images, audio, or other data types.

Text-Only Input

A model that accepts only written text as input, without support for images, audio, or other data types.

Text-Only Interface

A model that accepts and produces only text inputs and outputs, without support for images, audio, or other media types.

Text-Only Model

A language model that processes and generates only text, without support for images, audio, or other data types.

Text-Only Model

A model that processes and produces only text input and output, without support for images, audio, or other data types.

Text-Space Optimizer

A model that generates edits (add/delete/replace) to text documents rather than optimizing numerical weights, applied here to agent skills.

Text-to-3D Generation

Creating 3D models from natural language descriptions using AI models.

Text-to-Audio Generation

Creating audio content from natural language descriptions or prompts.

Text-to-Audio-Video Generation

Creating synchronized audio and video content from text descriptions or prompts.

Text-to-Code Generation

The ability to convert natural language descriptions into executable code automatically.

Text-to-Embedding

A process that converts text into numerical vectors (embeddings) that capture semantic meaning in a format models can work with.

Text-to-Image (T2I) Models

AI models that generate images from text descriptions or prompts.

Text-to-Image Generation

An AI model that creates images from written text descriptions or prompts.

Text-to-Score

A model architecture that takes text as input and outputs numerical scores rather than generated text, typically used for ranking or relevance tasks.

Text-to-Speech (TTS)

A technology that converts written text into spoken audio that sounds natural and human-like.

Text-to-SQL

A task where a model converts natural language questions into executable SQL database queries.

Text-to-Text

A framework where all NLP tasks are treated as converting input text into output text, so translation, summarization, and classification use the same model structure.

Text-to-Text Generation

A model task where the input and output are both text, with the model learning to transform one text format into another.

Text-to-Text Model

A machine learning model that takes text as input and produces text as output, useful for tasks like translation, summarization, or question answering.

Text-to-Text Transfer Learning

A training approach where all NLP tasks are framed as converting input text to output text, allowing a single model to handle translation, summarization, classification, and other tasks.

Text-to-Video Generation

AI models that create video sequences from natural language descriptions.

Text-to-Video Generation

Creating video sequences from text descriptions using neural networks.

Textbook-Quality Data

High-quality, carefully curated training data structured like educational textbooks rather than raw internet text, designed to teach clear concepts and reasoning.

Textual Priors

Background knowledge and patterns learned from text that models rely on, sometimes at the expense of visual information.

Texture-Shape Gap

The difference in visual features that CNNs prioritize (textures) versus vision transformers (shapes), affecting their robustness and generalization.

TF-IDF (Term Frequency-Inverse Document Frequency)

A numerical method that converts text into feature vectors by measuring how important each word is in a document relative to a corpus.

The Pile

A large, diverse dataset of text from the internet used to train this model.

Thematic Representation

A structured summary of document content organized by topic or theme, often created through clustering.

Theorem Proving

Using AI to automatically verify or discover mathematical proofs and logical statements.

Theory of Mind

The ability to infer and reason about other people's beliefs, desires, and intentions.

Thermal Design Power (TDP)

Maximum heat a processor generates under typical workloads, used as a baseline for power estimation.

Thermodynamic Equilibrium

A state where a molecular system has reached stable energy distribution and no longer changes macroscopically over time.

Thinking Effort

A configurable parameter that controls how much computational time and internal deliberation a model dedicates to solving a problem before responding.

Thinking Mode

A model operating mode where it explicitly works through problems step-by-step before generating a final answer, improving accuracy on complex tasks.

Thinking Model

A language model trained to generate explicit reasoning steps and internal deliberation before producing a final response, rather than answering immediately.

Thinking Pattern Alignment

Ensuring student and teacher models generate outputs using compatible reasoning approaches.

Thinking Patterns

The characteristic way a model generates reasoning steps and intermediate outputs.

Thinking-Capable

A model designed to show its reasoning process and work through problems step-by-step before providing an answer, improving accuracy on complex tasks.

Threat Analysis

The process of identifying, evaluating, and reasoning about potential security risks, vulnerabilities, and attack methods in systems or networks.

Threshold Calibration

Process of setting decision boundaries for converting continuous model signals into binary alarm decisions.

Threshold Tuning

Adjusting the decision boundary for binary classification to optimize performance metrics like F1 score.

Throughput

The number of tokens a model can generate per second, measuring its processing speed.

Throughput Optimization

Tuning a model to process more requests or tokens per second, sometimes at the cost of individual response quality or latency.

Tidal Volume

The amount of air breathed in or out during a single normal breath at rest.

Tiled Computation

A technique that partitions large matrices into smaller blocks and processes them independently to improve computational efficiency.

Timbre Transfer

Changing the tonal quality or color of a sound while preserving its basic characteristics.

Time Series Analysis

Analyzing data points collected over time to find patterns and make predictions.

Time to First Token (TTFT)

The latency from receiving a request to generating the first output token, a key metric for interactive systems.

Time-Dependent Confounding

Bias that occurs when past treatments affect future confounders, making it hard to isolate treatment effects in sequential decisions.

Time-Series Classification

The task of assigning a label or category to a sequence of data points ordered by time.

Time-Series Forecasting

The task of predicting future values in a sequence of data points ordered by time, such as stock prices or weather patterns.

Time-series Reasoning

The ability to understand and make predictions based on data points ordered over time, like stock prices.

Time-to-First-Token (TTFT)

Latency between receiving a request and generating the first output token, critical for user experience.

Timestep Embedding

A learned representation that tells the model which noise level it's currently denoising at.

Token

A small unit of text (a word, subword, or punctuation mark) that a language model breaks input into for processing.

Token Activation

The process of selectively activating only certain parts of a model for each individual token processed, rather than using the entire network every time.

Token Allocation

Deciding how many tokens (words/subwords) a model should generate for a given problem.

Token Bias

When model performance changes based on how words or numbers are written, even if the meaning stays the same.

Token Bottleneck

An interpretable intermediate representation where information flows between denoising steps in a diffusion model, enabling transparency without performance loss.

Token Budget

The maximum number of tokens available to include retrieved context in a language model prompt.

Token Candidate

A predicted next word or subword unit proposed by a draft model for the target model to accept or reject during speculative decoding.

Token Compression

Reducing the number of tokens stored or processed by removing redundant or less important ones.

Token Consumption

The number of text units (tokens) a model processes or generates; longer reasoning processes consume more tokens and may increase latency or cost.

Token Cost

The computational expense and resource usage required to process or generate tokens, which increases when a model performs additional reasoning steps.

Token Count

The number of small text chunks (tokens) a model generates; higher token counts mean longer responses and more computational cost.

Token Covariance

A measure of how much different tokens (input elements) vary together in a model's representations, indicating their statistical dependence.

Token Credit Assignment

Determining how much each token in a response should be rewarded or penalized based on overall performance.

Token Distribution

The probability distribution over possible next tokens that a language model produces during decoding.

Token Efficiency

A measure of how many tokens (small units of text) a model needs to use to complete a task; more efficient models use fewer tokens and cost less.

Token Embeddings

Numerical representations of individual words or subwords that capture their meaning and relationships in a way machines can process.

Token Entropy

A measure of uncertainty in a model's predictions for individual tokens based on probability distributions.

Token Eviction

Removing less important cached tokens to reduce memory usage during inference.

Token Explosion

The problem where processing long sequences creates exponentially more tokens, making computation infeasible.

Token Footprint

The total number of tokens consumed by a prompt, including context, affecting inference cost and latency.

Token Importance

A score measuring how much each word or subword unit contributes to a model's prediction.

Token Insertion

The process of adding new tokens during generation in diffusion models.

Token Limit

The maximum number of tokens (words or subwords) a model can process in a single input, in this case 8K tokens per chunk.

Token Masking

A training technique where random words in text are hidden and the model learns to predict them, commonly used in models like BERT.

Token Merging

Combining multiple tokens into fewer tokens to reduce computation while preserving model output quality.

Token Mixing

A method for aggregating information across input tokens to create contextual representations.

Token Output Limit

The maximum number of tokens (words or word pieces) a model can generate in a single response, controlling the length of its output.

Token Positions

The spatial coordinates or locations of text elements within a document, used to understand where words and phrases appear on the page.

Token Prediction

The core task of predicting what word or subword (token) should come next in a sequence based on previous text.

Token Pricing

The cost charged per token (unit of text) processed by a model, which varies based on model capability and complexity.

Token Probability

The model's internal confidence score for each word it generates, derived from its output distribution.

Token Pruning

Removing less important words from AI processing to improve speed and efficiency.

Token Ranking

Reordering a model's next-token predictions by likelihood or quality rather than accepting the top-1 choice.

Token Reduction

Technique to decrease the number of tokens processed by a model, typically by compressing or filtering visual information.

Token Representation

A vector that encodes the meaning and context of a single word or subword unit (token) within a larger piece of text.

Token Representations

Numerical vectors that encode the meaning of individual words or subword units within a text.

Token Routing

Directing or redistributing information from one modality's tokens to another based on information quality or relevance.

Token Selection

Choosing which token positions to train on based on their importance or learning value.

Token Sequence

A series of individual tokens (words or subwords) that the model generates one after another to form a complete response.

Token Sparsification

Reducing the number of tokens processed by a model to lower computational cost.

Token Throughput

The number of tokens a model can generate per unit of time during inference.

Token Usage

The number of tokens (small units of text) consumed during model inference; higher token usage means more computational cost and longer response times.

Token Vocabulary

The complete set of individual text units (tokens) that a model can recognize and process; a larger vocabulary allows the model to handle more diverse languages and specialized terms.

Token Weighting

Assigning importance scores to individual words or subwords in text, allowing the model to emphasize semantically significant terms in its representation.

Token-level divergence

A measure of difference between probability distributions over predicted tokens, used to align model outputs.

Token-Level Embeddings

Embeddings that represent individual tokens (words or subwords) rather than entire documents, allowing fine-grained matching during search.

Token-Level Guidance

Providing feedback at each individual token in a generated sequence, rather than a single score for the whole output.

Token-Level Privacy

Applying different levels of privacy protection to individual tokens based on their sensitivity and importance.

Token-Level Reward

Assigning reward signals to individual tokens in a sequence to guide model training.

Token-Level Supervision

Training guidance applied to individual tokens in a sequence rather than entire outputs.

Token-overlap metrics

Evaluation metrics like ROUGE that score similarity by counting matching words between texts.

Tokenization

The process of breaking text into smaller units (like words or syllables) that a model can understand and process.

Tokenizer

The component that splits text into tokens (subwords or characters) that the model can process.

Tokens

The basic units of text that a language model processes, typically representing words or word fragments.

Tone Sensitivity

Measure of how much a model's output quality changes in response to different politeness levels in input prompts.

Tool Grounding

Connecting language models to external tools or APIs so they can call functions to retrieve information or perform actions.

Tool Invocation

An agent's ability to call external functions or APIs to gather information or perform actions.

Tool Schema

A structured definition that describes what a tool does, what inputs it accepts, and what outputs it produces.

Tool Use

The ability of a model to call external functions or APIs to perform tasks like calculations, searches, or data retrieval.

Tool-Augmented Generation

Extending LLM outputs by integrating external tools, APIs, or functions the model can call to solve problems.

Tool-calling

When an AI model decides to use external functions or tools (like database queries) to help answer questions or complete tasks.

Tool-Chain Manipulation

An attack that exploits or hijacks the sequence of tools an agent uses to accomplish tasks.

Tool-Equipped Prover

A reasoning component that uses formal verification tools to automatically close proof goals given relevant context and dependencies.

Tool-Use Loop

An iterative process where an agent repeatedly calls external tools (like search) and updates its reasoning based on results.

Topic Modeling

Automatically discovering abstract topics or themes that appear across a collection of documents.

Topographic Organization

The systematic spatial arrangement of neurons where nearby cells respond to similar stimuli or perform related functions.

Topological Constraint

A requirement that a segmented structure maintains correct connectivity and shape properties, not just pixel-level accuracy.

Topological Data Analysis

Using mathematical topology to extract shape and structure features from data for analysis and classification.

Topology-Invariant Encoding

A representation method that works regardless of how input channels are physically arranged or which channels are present.

Topology-Preserving

Maintaining the correct connections and relationships between elements when converting or transforming data.

Total Variation Distance

A metric measuring the maximum difference between two probability distributions, ranging from 0 to 1.

Toxicity Detection

Automated identification of harmful, abusive, or offensive language in text.

Trace Distance

A metric measuring the distinguishability between two quantum states, ranging from 0 (identical) to 1 (orthogonal).

Trace-Supervised Fine-Tuning

Training a model using detailed reasoning traces as supervision to teach it to follow correct reasoning steps.

Train-Evaluation Coupling

When the same data or model is used for both training and testing, inflating performance metrics.

Train-Inference Mismatch

When a model is trained using one objective but deployed using a different process, causing performance gaps between training and real-world use.

Trainable Depth

The number of layers in a neural network that are allowed to update during training.

Training Checkpoint

A saved snapshot of the model's learned weights at a specific point during training, allowing you to see how the model improved over time.

Training Checkpoints

Saved snapshots of a model at different points during training, allowing researchers to observe how the model's abilities change as it learns.

Training Cutoff

The date up to which a model has seen training data; the model has no knowledge of events or information after this date.

Training Data

The examples and information used to teach a model how to perform a task, in this case human-written and AI-generated grammatical corrections.

Training Data Attribution

Identifying which training examples influenced a model's specific predictions or behaviors.

Training data composition

The breakdown of what types and proportions of data were used to train a model.

Training Data Curation

The process of carefully selecting, filtering, and organizing training data to improve a model's performance on specific tasks rather than relying solely on larger datasets.

Training Data Cutoff

The date after which information is not included in a model's training data, meaning the model cannot know about events or facts that occurred after that date.

Training Data Transparency

The practice of publicly disclosing what data was used to train a model, enabling researchers to audit and understand potential biases or limitations.

Training Distribution

The range of topics, styles, and types of text a model was trained on; the model performs best on content similar to this distribution and may struggle outside it.

Training Dynamics

The patterns and behaviors that emerge during a model's training process, such as how loss decreases or how capabilities develop over time.

Training Efficiency

The ability to achieve strong model performance while using less computational resources, data, or time during the training process.

Training Epochs

The number of times a model sees the entire training dataset during learning; more epochs can improve performance but may also lead to overfitting if the dataset is small.

Training Pipeline

The complete set of steps, data, and code used to train a model, made transparent so others can reproduce or audit the process.

Training Reward Saturation

The point during training where reward signals stop improving, indicating the model may be memorizing rather than generalizing.

Trait Vector

A learned direction in embedding space that represents a specific behavioral characteristic or tendency.

Trajectory

A sequence of interactions or steps taken by a model during deployment or in an environment.

Trajectory Abstraction

Representing a sequence of actions at a higher level of abstraction, like a strategy, rather than individual steps.

Trajectory Forecasting

Predicting the future path or location of a person or object over time.

Trajectory Generation

Computing a planned path or sequence of movements for an autonomous agent to follow.

Trajectory Guidance

Controlling video generation by specifying desired motion paths or object movements frame-by-frame.

Trajectory Optimization

Improving agent behavior by analyzing and learning from complete sequences of actions and states across multiple episodes.

Trajectory Synthesis

Generating sequences of actions (trajectories) that an agent takes to solve a task, used for training via imitation learning.

Trajectory Warping

Adapting recorded action sequences to new situations by adjusting them based on matching visual keypoints between scenes.

Trajectory-Aware Grading

Evaluating agent performance by examining the complete sequence of actions taken, not just final outputs.

Trajectory-Level Diagnostics

Detailed analysis of how an agent allocates resources and makes decisions across a sequence of improvement steps.

Transducer

A model that converts input sequences into output sequences with aligned timing.

Transfer Learning

Using knowledge from one task to improve learning on a different related task.

Transformer

The dominant neural network architecture for language models, using self-attention to process sequences.

Transformer Alternative

A neural network architecture designed as a different approach to the standard transformer model, often with different trade-offs in speed, memory, or capability.

Transformer Architecture

A neural network design that processes text by analyzing relationships between all words simultaneously, forming the foundation of modern large language models.

Transformer Attention

A mechanism that allows a model to focus on relevant parts of the input by computing relationships between all pairs of tokens, enabling deep understanding but requiring significant memory.

Transformer Backbone

The core neural network architecture based on attention mechanisms that traditionally powers most large language models.

Transformer Encoder

A neural network component that processes input sequences using attention mechanisms.

Transformer Encoder-Decoder

A neural architecture with an encoder that processes input and a decoder that generates output autoregressively.

Transformer Layers

Stacked blocks of neural network computations that process and transform input text progressively, with more layers generally allowing the model to learn more complex patterns.

Transformer Models

Neural network architecture widely used for language tasks like BERT and RoBERTa.

Transformer-Based

A model built using transformer architecture, which uses attention mechanisms to understand relationships between different parts of the input.

Transformer-based Models

Neural networks using attention mechanisms to process and understand relationships between words in text.

Transformer-Based Text Generation

A method where a transformer neural network generates text one token at a time by learning patterns from training data.

Transitivity violation

When a judge's scores contradict themselves (e.g., ranking A > B, B > C, but C > A), revealing internal inconsistency.

Transitivity Violations

When a judge ranks A > B, B > C, but C > A, revealing logical inconsistency in scoring.

Translation Cascade

A pipeline that translates input to English, processes it, then translates output back to the original language.

Transliteration

Converting text from one writing system to another while preserving pronunciation (e.g., converting Devanagari to Latin script).

Transport Dynamics

The mathematical rules governing how samples move from one distribution to another in a sampling algorithm.

Transversality

A geometric condition where two surfaces intersect cleanly at the right angle, avoiding tangential or degenerate intersections.

Treatment Effect Analysis

Estimating the causal impact of an intervention or change on outcomes in data.

Tree Ensemble

A machine learning model combining multiple decision trees to make predictions.

Tree Pruning

Removing branches from a decision tree to simplify it, reduce overfitting, or enforce privacy constraints.

Tree Search

An algorithm that explores possible future states by building a tree of actions and outcomes to find promising paths.

Tree-Based Proxy Models

Simple decision tree models trained to approximate a complex model's behavior for faster analysis.

Tri-Modal Learning

Training with three complementary data types (images, text descriptions, motion flow) simultaneously.

Triage

Prioritizing and routing queries by urgency or risk level, directing high-risk cases to human experts.

Triangle Inequality

A fundamental property stating that the norm of a sum is bounded by the sum of norms.

Trigger

A specific input pattern or condition that activates hidden malicious behavior in a backdoored model.

Trigger Modality Attribution (TMA)

A metric measuring which input types the backdoor attack actually depends on.

Trillion Parameters

A model with one trillion learnable values that the neural network adjusts during training to improve performance on language tasks.

Truncated Backpropagation-Through-Time (TBPTT)

A training technique that limits gradient computation to recent steps instead of the full history, reducing memory and computation.

Truncation Collapse

A training failure where generated sequences become so long they get cut off, biasing the training data toward incomplete examples.

Trust Region

A local region around the current best solution where the surrogate model is trusted to be accurate.

Trustworthy Memory

A persistent storage system for agents that maintains accuracy, prevents corruption, and enables reliable retrieval.

Truth Table

A table showing all possible input-output combinations for a logical function or rule.

Tsallis q-logarithm

A generalized logarithm parameterized by q that interpolates between different loss functions as q varies.

Turing Reward

A reward signal based on how indistinguishable a generated response is from real human responses, using an LLM judge.

Turn-Taking

The ability to detect when one speaker has finished speaking and another can begin, essential for natural conversation flow.

Turn-Taking Detection

The ability to identify when a speaker has finished speaking and it is another person's turn to speak in a conversation.

Tweedie's Formula

A statistical method for estimating intermediate values in a sequence based on observed endpoints.

Two-phase guidance

A strategy that applies different conditioning constraints at different stages of the generation process.

Two-Stage Retrieval

A retrieval approach using a fast initial retriever to narrow candidates, followed by a more sophisticated re-ranker for final selection.

Two-Tower Architecture

A retrieval system design with separate neural networks for encoding queries and documents independently, allowing efficient comparison between them.

Type-I Error

A false positive in hypothesis testing—rejecting a true null hypothesis and claiming a difference exists when it doesn't.

Typicality Bias

The tendency of generative models to converge on the most common or typical outputs, reducing diversity.

Typographic Attack

Adversarial text placed inside images that misleads models into focusing on lexical meaning instead of visual content.

Typological Distance

A measure of how structurally different two languages are based on their grammatical and linguistic features.

U

U-Net

A convolutional neural network architecture with an encoder-decoder structure designed for image segmentation and restoration tasks.

UI Automation

The ability to understand and interact with user interfaces by reading screenshots and generating commands to control applications or websites.

UI Interaction

The ability of an AI model to understand and control user interface elements like buttons and forms by interpreting visual layouts and executing appropriate actions.

UI Pattern Recognition

The model's ability to identify and apply common design patterns and component structures used in user interfaces.

Unanswerable Questions

Questions where the correct answer cannot be found in the given context, testing if models admit uncertainty.

Uncased

A model variant that treats uppercase and lowercase letters as identical, so 'Hello' and 'hello' are processed the same way.

Uncased Text

A model setting where uppercase and lowercase letters are treated as identical, reducing complexity and improving consistency across different text formats.

Uncensored

A model without built-in safety filters or content restrictions, allowing it to generate responses on any topic without refusal.

Uncensored Model

A model trained without safety filters or content restrictions, making it willing to generate responses on sensitive topics that filtered models would refuse.

Uncertainty estimation

Quantifying how confident a model is in its predictions, critical for safe deployment in high-stakes applications.

Uncertainty Quantification

Measuring and tracking how uncertain a model's predictions are based on uncertain inputs.

Uncertainty-Error Alignment

How well a model's uncertainty estimates correlate with actual prediction errors.

Under-executed Traces

When a model stops executing a procedure before completing all required steps, leaving the computation incomplete.

Undersampled MRI

MRI acquisition using fewer measurements than standard, requiring reconstruction algorithms to recover full images.

Undersampling

Collecting fewer measurements than needed for perfect image reconstruction, used to speed up MRI scans.

Unembedding Matrix

The matrix in LLMs that projects hidden states back to vocabulary logits for token prediction.

UNet

A neural network architecture commonly used in image generation that processes images at multiple scales.

Unified Architecture

A single model design that handles multiple different tasks without needing separate specialized models for each task.

Unified Interface

A single input format that handles multiple different tasks, rather than requiring separate models for each task.

Unified Model

A single model that handles multiple tasks (like understanding and generating) in one framework.

Unified Multimodal Model

An AI model trained to both generate and understand multiple types of data like text and images.

Unified Multimodal Models (UMMs)

AI models that can process and generate multiple types of data (text, images, etc.) in a single system.

Unilateral deviation

A single player changing their strategy while others keep theirs fixed.

Unit Commitment

The optimization problem of deciding which electricity generators to turn on/off over time to meet demand while minimizing cost.

Unit of Analysis

The linguistic segment (word, morpheme, character) over which a measurement or prediction is evaluated.

Unit Test

Automated code that checks whether a specific piece of software works correctly by testing individual functions.

Universal Approximation

The property that a model can theoretically learn any continuous function given sufficient capacity.

Universal Dependencies

A standardized annotation scheme for grammatical structure across multiple languages.

Universal Induction

Learning general rules from examples that apply broadly across different situations.

Universal Model

A model designed to work well across many different tasks and domains without requiring task-specific customization or retraining.

Unlearning

A post-training process to remove or reduce the effect of specific data from a model without full retraining.

Unnormalized Density

A function proportional to a probability distribution but not scaled to sum or integrate to one.

Unnormalized Distribution

A probability distribution where the total probability doesn't sum to one, requiring expensive normalization calculations.

Unscented Kalman Filter (UKF)

An algorithm for estimating the state of a system from noisy measurements, designed to handle nonlinear dynamics better than standard Kalman Filters.

Unstructured Data

Information that doesn't follow a predefined format or organization, such as raw text documents or photographs.

Unstructured Knowledge

Information stored as plain text documents rather than organized databases, like PDFs or policy manuals.

Unsupervised Clustering

Grouping data points into categories without labeled training examples, discovering patterns automatically.

Unsupervised Domain Adaptation

Training a model to work on new data distributions without labeled examples from the target domain.

Unsupervised Embedding

A machine learning technique that discovers hidden structure in data without labeled examples, creating meaningful representations automatically.

Unsupervised Learning

Training a model without labeled examples, letting it discover patterns on its own.

Unsupervised RLVR

Training language models with reinforcement learning using rewards derived without human labels or ground truth answers.

Unsupervised Training

A training method where the model learns patterns from raw data without human-labeled examples, relying instead on inherent structure in the data.

Untrained Model

A model with the correct structure but no learned knowledge, producing meaningless output because it has never been trained on data.

Untranslatability

Aspects of text that resist translation between languages, preserving culturally specific meanings and nuances.

Upper Confidence Bound

A strategy that balances exploring uncertain options with exploiting known good options based on confidence estimates.

Upscaling

Using sparse local measurements to estimate values across a larger geographic or temporal region.

User Embedding

A learned vector representation that captures an individual driver's unique preferences and driving style.

User Experience (UX)

How easy, intuitive, and satisfying a system is for people to use in practice.

User Interest Context

The structured representation of a user's behavioral patterns and preferences derived from their interaction history.

User Satisfaction

A measure of how well a system meets user expectations and preferences during interaction.

User Simulator

A synthetic agent that mimics realistic user behavior and preferences to test AI assistant performance.

User Turn Generation

Prompting a model to generate the next user message in a conversation to probe whether it understands interaction dynamics.

V

V-usable Information

A generalization of Shannon information that measures how much information is actually useful to a specific observer or agent.

Validation-Driven Refinement

A process that checks generated outputs (like rendered charts) against quality criteria and iteratively improves them based on detected failures.

Validity Guarantees

Statistical assurances that conclusions drawn from data remain correct despite using synthetic or imperfect information sources.

Value Function

A function estimating how good a state or action is for achieving a goal.

Value Pluralism

Recognition that multiple legitimate ethical principles (autonomy, beneficence, justice) can conflict, requiring case-by-case navigation rather than single fixed rules.

Value Propagation

The process of updating an agent's estimates of state values backward through a trajectory during learning.

Vanishing/Exploding Gradients

Problem in RNN training where gradients become extremely small or large over long sequences, making it hard to learn long-range dependencies.

Variable Entropy Mechanism

A technique that dynamically adjusts how much a model explores new outputs versus exploiting known good ones.

Variable Fixation

Reducing a problem's complexity by fixing certain decision variables to specific values based on prior knowledge or predictions.

Variable Transparency

The degree to which we can understand and interpret intermediate computational states during a model's inference process.

Variable-Speed Trajectory Augmentation (VSTA)

A data preprocessing technique that re-times robot demonstrations to different execution speeds by merging or splitting actions.

Variance

The variability or inconsistency in a model's outputs across multiple attempts at the same task.

Variance Reduction

Techniques that reduce noise in gradient estimates to improve optimization efficiency and convergence speed.

Variational Autoencoder (VAE)

A neural network that learns to compress data into a latent space and reconstruct it, useful for learning smooth representations.

Variational Embedding

An embedding learned through a variational approach that optimizes a probabilistic objective function.

Variational Inference

A method to approximate complex probability distributions by learning simpler, tractable distributions.

Variational Information Bottleneck

A regularization technique that limits how much information flows through a model to prevent overfitting.

Variational Quantum Circuit (VQC)

A parameterized quantum algorithm that can be trained like a neural network to solve optimization problems.

Variational Quantum Classifier

A quantum machine learning model that uses parameterized quantum circuits to classify data by optimizing circuit parameters.

Variational Score Distillation

An optimization technique that transfers knowledge from a teacher model to improve generation quality by matching score distributions.

VC Dimension

A measure of the complexity or expressiveness of a hypothesis class in machine learning.

Vector Dimension

The number of individual numerical values used to represent a piece of text; higher dimensions can capture more nuanced meaning but require more computational resources.

Vector Embedding

A representation of data (like molecules or text) as a list of numbers that captures its essential features in a form that machine learning models can work with.

Vector Embeddings

Numerical representations of text where each word or sentence becomes a list of numbers that capture its meaning in a way computers can process.

Vector Generation

The process of converting input data (like text) into numerical vectors that can be stored, compared, and searched efficiently.

Vector Graphics

Images defined by mathematical shapes and paths rather than pixels, allowing them to scale to any size without losing quality.

Vector Normalization

A preprocessing step that scales vectors to a standard length, ensuring fair comparisons when using cosine similarity.

Vector Output

The model's output is a single array of numbers (a vector) rather than generated text, which can be efficiently compared with other vectors to measure similarity.

Vector Quantization

Compressing data by encoding groups of values together rather than individually, achieving better compression ratios.

Vector Representation

A way of expressing text as a list of numbers that a computer can process and compare mathematically.

Vector Retrieval

A search method that converts text into numerical vectors and finds similar documents by comparing vector distances.

Vector Search

A search method that converts queries and documents into numerical vectors and finds matches by measuring similarity between vectors, fast but less nuanced than other ranking approaches.

Vector Similarity

A measurement of how alike two vectors (number lists) are to each other, used to determine if two pieces of text have similar meanings.

Vector Similarity Search

A method that converts text into numerical vectors and finds documents with vectors closest to a query vector, fast but sometimes missing nuanced relevance signals.

Vector Space

A mathematical representation where text is converted into points or directions in a multi-dimensional space, enabling comparison and analysis of semantic relationships.

Vector-Based Adaptation

A parameter-efficient fine-tuning approach that adapts models using learned vectors instead of full weight matrices, requiring even fewer parameters than LoRA.

Vector-valued Reward

A reward signal with multiple dimensions (e.g., correctness per test case) instead of a single scalar score.

Velocity Field

In diffusion models, the learned direction and speed that guides the generation process at each step.

Vendor-Specific Operations

Operational procedures and knowledge unique to equipment from a particular manufacturer, like GE MRI scanners.

Verbalized confidence

Uncertainty estimates based on explicit confidence statements the model generates as part of its reasoning output.

Verifiable Answers

Answers that can be checked against external sources like the web to confirm correctness.

Verifiable Environments

Structured tasks with clear input-output specifications where model reasoning can be automatically checked for correctness.

Verifiable Rewards

Feedback signals that can be automatically checked, like whether code produces correct outputs on test cases.

Verification

The process of checking whether claims, proofs, or experimental results are correct and logically sound.

Verification Logic

The code or rules that check whether an agent's solution correctly solves a benchmark task.

Verification-Refinement Loop

A test-time process where a verifier scores candidate solutions and the model refines low-scoring ones iteratively.

Verifier

A model or system that evaluates whether another model's output is correct or high-quality.

Verifier Signal

Output from an external model that assesses whether an LLM's response is safe or correct.

Verifier-Free Self-Improvement

Methods for improving model outputs without an external verification system, relying on the model's own signals.

Video Diffusion

A generative model that creates videos by iteratively refining noise into coherent frames, similar to image diffusion but applied to sequences.

Video Encoder

A model component that processes video frames and converts them into compact numerical representations that capture the video's visual and motion content.

Video Generation

Creating realistic video sequences using AI based on text or image descriptions.

Video Object Removal

Editing technique that deletes objects from video while filling in background and correcting physical interactions.

Video Question Answering

A task where AI models watch videos and answer questions about what they see and understand.

Video Segmentation

Extending image segmentation to video by identifying and tracking objects across multiple frames over time.

Video Summarization

Automatically selecting key frames or clips from a long video that capture the most important content.

Video Super-Resolution

Upscaling low-resolution video frames to higher resolution while preserving temporal consistency and detail.

Video Tracking

The ability to follow and maintain consistent identification of objects as they move across multiple frames in a video sequence.

Video Understanding

The ability of AI systems to analyze and extract meaning from video content including visual, temporal, and semantic information.

Video VAE

A variational autoencoder designed for video that compresses video frames into a latent representation for efficient processing.

Video-Language Model

A specialized AI model trained to understand video content and communicate its understanding through natural language text.

Video-to-Audio Generation

Creating sound effects or audio that matches the visual content and timing of a video.

View Coverage

The extent to which a 3D reconstruction captures the scene from different camera angles and positions.

View Robustness

A model's ability to perform tasks correctly despite changes in camera position or angle.

View-Dependent Appearance

How an object's appearance changes based on the viewing angle, including effects like reflections and shininess.

Viewpoint Rotation Understanding

The capability to track how a viewpoint changes through rotations and predict resulting observations.

Virtual Cell Abstraction

Representing biological cells as simplified computational models for simulation.

Virtual Reality (VR)

A computer-generated 3D environment that users can interact with using special headsets or controllers.

Virtual Staining

Using AI to digitally add color to microscope images without physical staining.

Viscosity Solution

A mathematical solution concept for complex equations that handles non-smooth behavior in optimization problems.

Vision Backbone

The core neural network component that processes and understands images before passing information to the rest of the model.

Vision Encoder

A component that converts images into a numerical representation that a language model can understand and process.

Vision Encoder-Decoder

A neural network architecture that processes images through an encoder component and generates text through a decoder component, commonly used for tasks like document understanding and image captioning.

Vision Encoding

A process that converts images into numerical representations that a model can understand and process.

Vision Foundation Models

Large pre-trained models like DINO and SAM that learn general visual understanding from diverse image data.

Vision Pipeline

The specialized component of a model that processes and interprets image data to extract visual information.

Vision Tokens

Discrete representations of image patches or regions processed by vision-language models.

Vision Transformer

A neural network architecture that processes images by breaking them into small patches and analyzing them similarly to how language models process text.

Vision Transformer (ViT)

A neural network architecture that processes images by breaking them into small patches and treating them similarly to how language models process words.

Vision Understanding

The ability of an AI model to analyze and interpret visual information from images, identifying objects, scenes, and their relationships.

Vision-Language

A model designed to understand and reason about both visual content (images) and natural language text together.

Vision-Language Alignment

Training a model to understand the relationship between images and their text descriptions so it can match them together effectively.

Vision-Language Backbone

A pre-trained model that jointly processes and understands both visual and textual information in a unified representation.

Vision-Language Encoder

A model that processes both images and text together to create shared numerical representations, rather than generating new text like a full language model would.

Vision-Language Learning

Training a model to understand and connect both images and text together, so it can reason about visual content using language.

Vision-Language Model

An AI model that understands both images and text, allowing it to answer questions about images or describe what it sees.

Vision-Language Models (VLMs)

AI systems that understand both images and text, allowing them to answer questions about images or describe what they see.

Vision-Language Navigation (VLN)

Task where an AI agent navigates physical spaces by following natural language instructions while processing visual input.

Vision-Language Task

A task that requires a model to understand and reason about both visual information (images) and textual information together.

Vision-Language Tasks

AI tasks that require understanding both visual information from images and textual information together, such as describing images or answering questions about them.

Vision-Language-Action Model

A model that combines visual perception, language understanding, and robotic action generation to interpret instructions and control robot movements.

Vision-to-Code Generation

Converting visual inputs like screenshots, charts, or diagrams into executable code or structured representations.

Visual Anchoring

Grounding abstract concepts (like actions) to concrete visual observations to ensure they have real physical meaning.

Visual Autoregressive Models

Image generators that predict image codes one bit or token at a time, sequentially building up the full image.

Visual decoding

Reconstructing or identifying visual stimuli from recorded brain activity patterns.

Visual Encoder

A component that converts images into a numerical representation that the model can understand and process.

Visual Foresight

Predicting and visualizing what a robot will do next based on its learned policy.

Visual Grounding

The ability to connect specific words or concepts in text to the actual objects or regions they refer to in an image.

Visual Instruction Tuning

A training technique that teaches a model to follow instructions about images by learning from examples of image-text instruction pairs.

Visual Perturbations

Controlled degradations or distortions applied to images, such as blur, noise, or compression artifacts.

Visual Preemption

Detecting and interrupting physical actions in real-time by analyzing visual feedback to catch failures before they compound.

Visual Question Answering

A task where an AI model reads a question and an image, then generates an answer based on what it understands from the image.

Visual Reasoning

The capability to analyze images and draw logical conclusions or answer complex questions based on what is depicted in the visual content.

Visual Retrieval-Augmented Generation

RAG applied to visually rich documents, allowing models to retrieve and reason over images and multi-page visual content.

Visual Segmentation

The task of identifying and separating individual objects or regions in an image or video by assigning each pixel to a specific object or category.

Visual Signal Dilution

The degradation of visual understanding in models as generated text accumulates, causing attention to shift away from image tokens.

Visual Token Pruning

Removing unnecessary visual tokens from images or videos to reduce computational cost in vision-language models.

Visual Tokens

Discrete units representing different regions or features of an image processed by the model.

Visual Understanding

The ability of an AI model to interpret and analyze images, including identifying objects, reading text, and answering questions about visual content.

Visual-Language Model

A model that processes both images and text together, understanding the relationship between visual content and language to answer questions about images or describe what it sees.

Visual-Textual Attention

How a multimodal model allocates focus between visual and text information when processing inputs.

Visualization Rhetoric

The persuasive techniques and design choices used in charts and graphs to influence how viewers interpret data.

Visually-Grounded

A model's ability to understand and reason about visual information in images, connecting what it sees to language and concepts.

Visuo-Tactile Fusion

Combining visual information from cameras with tactile (touch) sensor data to improve robot perception and decision-making.

Visuomotor Pipeline

A system that converts visual input into motor control commands for robot manipulation.

Visuomotor Policy

A learned control policy that maps visual observations directly to robot motor commands.

vLLM

An inference engine optimized for running large language models efficiently by batching requests and managing memory intelligently.

vLLM Inference Engine

A high-performance serving framework that efficiently runs language models and embedding models with optimized memory usage and throughput for production deployments.

Vocabulary

The complete set of unique words or tokens that a language model can recognize and generate.

Vocabulary Collapse

When a model over-predicts only a few options and ignores others, losing diversity in its outputs.

Vocabulary Extension

Adding new tokens or words to a language model's vocabulary beyond its original pretrained set.

Vocabulary Size

The number of unique tokens (words or word pieces) a model can recognize and process; larger vocabularies provide better coverage of a language.

Vocabulary Trimming

Reducing a model's vocabulary to only the tokens needed for a specific language, decreasing model size.

Vocabulary-Constrained LLM

A language model restricted to generate only outputs from a predefined set of allowed terms or concepts.

Vocal Delivery

The acoustic properties of speech including tone, emotion, emphasis, and prosody that convey meaning beyond words.

Voice Synthesis

The process of generating natural-sounding human speech from text using machine learning models.

Voxel

A 3D pixel representing a small cube of space in a volumetric grid representation.

Voxel-Level Uncertainty

Uncertainty estimates computed for individual 3D pixels in medical imaging, rather than whole-image predictions.

VQ-VAE (Vector Quantized Variational Autoencoder)

A neural network that learns discrete, quantized representations by combining VAE principles with vector quantization.

VQA-Based Reward

A reward signal derived from visual question answering that uses language-vision reasoning to evaluate image quality.

VRAM

Video RAM — the memory on a GPU that stores model weights and intermediate computations during inference.

VRAM Footprint

The amount of graphics memory (VRAM) required to load and run a model on a GPU.

Vulnerability detection

Automatically identifying security flaws or weaknesses in code that could be exploited by attackers.

Vulnerability Reasoning

The ability to understand and explain how security weaknesses in software or systems could be exploited and what their potential impact might be.

W

W4A16

A quantization format where model weights are stored in 4-bit precision while calculations use 16-bit precision, balancing efficiency with accuracy.

W4A16 Quantization

A specific quantization scheme where weights are stored in 4-bit precision while activations remain in 16-bit precision, balancing memory savings with accuracy.

W8A8 Quantization

A specific quantization method that reduces both weights and activations to 8-bit integers, enabling faster computation on specialized hardware while maintaining reasonable accuracy.

W8A8 Quantization

A specific quantization method where both weights (w) and activations (a) are stored as 8-bit integers, providing a good balance between memory savings and model quality.

Warm Start

Providing an optimization solver with an initial candidate solution to speed up convergence instead of starting from scratch.

Warnsdorff's Algorithm

A greedy heuristic that prioritizes moves to positions with fewer onward options to avoid dead ends.

Wasserstein Distance

A mathematical measure of how different two distributions are, useful for comparing expert and agent behavior.

Watertight Mesh

A 3D mesh with no holes or gaps, suitable for physics simulation and 3D printing.

Weak Constraints

Soft constraints in logic programs that prefer certain solutions but don't forbid violations, used for optimization.

Weak Supervision

Training with imperfect or limited supervision signals, such as scarce labels, noisy annotations, or self-generated targets.

Weak-to-Strong Generalization

Transferring knowledge or training signals from smaller/weaker models to improve larger/stronger models.

Weak-to-Strong Reverse Distillation

Testing distillation by using a weaker model as teacher to see if a stronger student learns meaningfully.

Web content pollution

Malicious or misleading content planted on websites, such as fake reviews or promotional pages designed to manipulate AI systems.

Web Crawling

Automatically browsing and collecting data from websites by following links across the internet.

Web Dataset

Training data collected from publicly available internet sources, which provides broad but sometimes uneven coverage of topics.

Web Interaction

An agent's capability to navigate websites, fill forms, click buttons, and extract information from live web pages.

Web Search Augmentation

The ability to search the internet in real-time during processing to retrieve current information rather than relying only on training data.

Web Search Integration

The capability for a model to query the internet in real-time during response generation, allowing it to access current information beyond its training data.

Weight and Activation Quantization (W8A8)

A specific quantization method that compresses both the model's stored weights and its intermediate calculations to 8-bit precision, significantly reducing memory and computation requirements.

Weight Averaging

A merging method that combines model weights by taking their average.

Weight Clustering

Grouping similar weight values together and replacing them with shared cluster centers to reduce model size.

Weight Conditioning

The numerical stability of weight matrices, measured by how spread out their singular values are.

Weight Editing

The process of directly modifying a trained model's internal parameters (weights) to change its behavior without retraining from scratch.

Weight Generation

The process of using a neural network to produce parameters for another model rather than training those parameters directly.

Weight Importance

A measure of how much a specific weight contributes to model predictions and performance.

Weight Initialization

The process of setting the starting values for a neural network's parameters before training begins.

Weight Norm

The magnitude of a classifier's weight vector for each class, which controls how confidently the model predicts that class.

Weight Parameterization

A method of representing neural network weights in a transformed space to improve training dynamics.

Weight Precision

The number of bits used to represent each numerical value in a model's weights; lower precision (like 4-bit) uses less memory but may reduce accuracy.

Weight Quantization

A specific type of quantization that compresses only the model's learned parameters (weights) while keeping other calculations at higher precision.

Weight Sharing

Using the same neural network parameters for multiple tasks to enable knowledge transfer and reduce model size.

Weighted Aggregation

A method that combines multiple inputs by assigning different importance levels to each, then calculating a combined score.

Weighted Model Counting (WMC)

Computing the sum of weights across all logical models satisfying a formula, used for probabilistic inference.

Weights

The numerical parameters inside a neural network that determine how it processes input and generates output.

Whole-Body Control

Coordinating all joints and limbs of a robot (legs, arms, torso) to achieve a task simultaneously.

Whole-Body Controller (WBC)

A system that converts high-level motion commands into executable joint trajectories for robots.

Whole-Slide Image

A high-resolution digital scan of an entire microscope slide used in computational pathology for disease diagnosis.

Width Scaling

How optimizer behavior changes when you increase the number of neurons in each layer of a neural network.

Wigner Score

A quantum version of the score function that describes how to reverse noise in quantum systems.

Winner-Take-All Retrieval

A retrieval criterion requiring the correct target to score strictly higher than all other candidates.

Winning Region

In game theory, the set of game states from which a player can guarantee achieving their objective.

Wirelength

The total length of connections between components on a chip; shorter wirelength improves performance and power efficiency.

Word Error Rate (WER)

The standard metric for evaluating speech recognition quality by measuring the percentage of words incorrectly recognized.

Word sense disambiguation

Determining which meaning of a word is intended in a specific context when a word has multiple meanings.

Worked Example

A step-by-step demonstration of how to solve a problem, used to help students learn problem-solving strategies.

Workflow

The sequence and structure of how agents communicate and coordinate to complete a task, including how outputs are aggregated.

Workflow Abstraction

Converting low-level, granular action sequences into higher-level, semantically meaningful activity descriptions.

Workflow Automation

Using an AI model to automatically handle repetitive business tasks and processes, reducing manual effort and improving efficiency.

Workflow Orchestration

Coordinating and automating the execution of multiple computational tasks and their dependencies across distributed systems.

Working Memory (WM)

The active, temporary knowledge an AI system uses for the current task, drawn from long-term memory.

Workload Manager

Software that schedules and manages job submissions and resource allocation on shared computing clusters.

World Knowledge

A model's learned understanding of facts, concepts, and relationships about the real world, typically acquired during training on diverse text data.

World Literature

Comparative study of literature across cultures and languages to understand global textual traditions and circulation.

World Model

An AI system that learns to understand and predict how the physical world works from observations.

World Modeling

Predicting future states of the environment based on current observations and actions.

World Models

Internal representations learned by AI systems that capture how the physical world works, including how objects move and interact over time.

World-Action Model

A model that predicts how the physical world will change in response to robot actions over time.

Worst-Case Analysis

Evaluating system behavior on the most dangerous or consequential failures rather than average performance.

Write-back Affordances

The ability for modules to update and modify shared state, enabling bidirectional communication between tools.

Wyner-Ziv Coding

A compression technique where the encoder has limited information but the decoder has side information to help reconstruction.

X

XLM-RoBERTa

A pre-trained language model architecture designed to understand and process text in over 100 languages simultaneously.

xLSTM Architecture

A recurrent neural network variant that uses linear attention mechanisms instead of quadratic attention for improved efficiency.

Y

YaRN (Yet another RoPE extensioN)

A positional encoding technique that extends rotary position embeddings to work with longer sequences.

Z

Zero One Loss

A metric that counts predictions as either completely right or completely wrong with no partial credit.

ZeRO Optimization

Memory-saving technique that partitions model states (optimizer, gradients, parameters) across devices.

Zero Shot Learning

Solving a task without any training examples by using knowledge from related tasks or descriptions.

Zero Shot Performance

How well an AI model performs on new tasks it has never seen before without any training.

Zero-Day Detection

Identifying previously unknown security vulnerabilities or attacks that have no existing defenses.

Zero-error capacity

The maximum rate at which information can be reliably transmitted over a noisy channel with zero probability of error.

Zero-Order Hypergradient

A gradient-free signal derived from comparing function values across different hyperparameter settings.

Zero-pair learning

Training without paired examples of two modalities, using only single-modality data.

Zero-shot Autonomous Behavior

Agent performing tasks without any external skill retrieval or runtime augmentation, relying only on learned parameters.

Zero-Shot Baseline

A comparison model that makes predictions without being trained on the target task or domain.

Zero-Shot Generalization

A model's ability to handle new, unseen tasks or data without additional training on those specific examples.

Zero-shot learning

Using a model to solve a task without any training examples for that specific task.

Zero-Shot Prediction

Making predictions on new tasks without any task-specific training or fine-tuning on labeled examples.

Zero-Shot Reasoning

Solving tasks without task-specific training by leveraging knowledge from pretrained models.

Zero-Shot Semantic Classification

Using an LLM to categorize text based on meaning without task-specific training examples.

Zero-Shot Sound Generation

Creating new sounds the model has never seen before by using reference audio as a guide.

Zero-shot Task Transfer

Performing a new task without any training examples, using only knowledge learned from other tasks or domains.

Zero-Sum Game

A game where one player's gain equals another player's loss, so total payoff is always zero.

Zero-Trust Network Access (ZTNA)

A security model that requires verification of every access request regardless of source, rather than trusting internal networks.

Zone-Level Modeling

Predicting risk at the geographic area level rather than individual policy level, useful when detailed location data is unavailable.