Results for "reasoning + action"
A system that perceives state, selects actions, and pursues goals—often combining LLM reasoning with tools and memory.
Capabilities that appear only beyond certain model sizes.
Structured graph encoding facts as entity–relation–entity triples.
Software pipeline converting raw sensor data into structured representations.
Enables external computation or lookup.
Understanding objects exist when unseen.
Human-like understanding of physical behavior.
Mathematical guarantees of system behavior.
AI supporting legal research, drafting, and analysis.
Legal right to fair treatment.
Predicting case success probabilities.
AI proposing scientific hypotheses.
System-level design for general intelligence.
A learning paradigm where an agent interacts with an environment and learns to choose actions to maximize cumulative reward.
A high-priority instruction layer setting overarching behavior constraints for a chat model.
Combines value estimation (critic) with policy learning (actor).
Separates planning from execution in agent architectures.
RL using learned or known environment models.
Directly optimizing control policies.
Optimizing continuous action sequences.
Reward only given upon task completion.
Imagined future trajectories.
Control shared between human and agent.