Topics

Browse by topic

Browse every AI research topic on Research Papers — language models, diffusion, agents, world models, embodied AI, and more.

Language Models · 63 Models trained to understand, generate, and transform natural language at scale. AI Agents · 62 LLM-driven systems that plan, act, use tools, and carry skills across tasks. LLM Reasoning · 62 Eliciting and improving step-by-step reasoning in large language models. Multimodal Models · 54 Foundation models that combine language with images, audio, video, or other signals. Efficient AI · 47 Algorithms and systems that reduce memory, compute, or latency for large models. Diffusion Models · 40 Generative models that synthesize data through iterative denoising. Reinforcement Learning · 28 Training language models and agents from reward — RLHF, RLVR, GRPO, and verifiable-reward methods that drive reasoning gains. Vision Foundation Models · 28 Large visual representation models that transfer across recognition, localization, and perception tasks. Text-to-Image · 25 Models that generate or edit images from natural-language prompts. Robotics · 22 Learning and control for physical robots. Transformers · 22 Attention-based architectures that became the backbone of modern language and multimodal models. World Models · 21 Generative models that simulate consistent, controllable environments over time. Fine-Tuning & Adaptation · 19 Adapting pretrained models to new tasks cheaply, including parameter-efficient methods like LoRA. Alignment · 18 Methods for steering models toward preferred, safer, or more useful behavior. Retrieval-Augmented Generation · 18 Grounding language model outputs in retrieved documents to improve factuality and freshness. Video Generation · 17 Models that synthesize video from text or other conditions, including streaming and autoregressive diffusion approaches. Long Context · 16 Models and evaluations for reasoning over very large text, audio, video, or code contexts. Vision-Language-Action · 16 Models that map perception and language directly to robot actions. Open Models · 15 Open-weight model releases and the training recipes behind them. AI for Science · 14 Machine learning applied to scientific discovery — biology, chemistry, physics, and materials, from protein structure to new materials. Agent Memory · 13 How AI agents store, retrieve, and update long-term memory across tasks and sessions — beyond the context window. Code Generation · 11 Models and systems that synthesize, complete, or reason about programs. Sequence Modeling · 9 Architectures for modeling long ordered data such as text, audio, code, and genomics. Self-Supervised Learning · 8 Training methods that learn useful representations from data without task-specific labels. Mixture of Experts · 7 Sparsely activating subsets of parameters so model capacity grows without proportional compute. Speech Synthesis · 7 Text-to-speech and voice generation models, including zero-shot, expressive, and dialogue synthesis. Theorem Proving · 7 Neural, symbolic, and hybrid systems for mathematical proof search. Biomolecular Modeling · 6 AI systems for protein structures, molecular interactions, and computational biology. Brain Decoding · 6 Using machine learning to read, model, and causally probe how the brain represents perception. Diffusion Language Models · 6 Text generation by iterative denoising instead of left-to-right decoding — parallel, non-autoregressive language models. Segmentation · 6 Promptable and automatic systems for separating objects in images and videos. Small Language Models · 6 Compact, on-device, and edge-deployable models — strong capability per parameter for local and low-cost inference. Text Embeddings · 6 Methods for turning text into dense vectors for retrieval, similarity, and search, including using LLMs as encoders. Interpretability · 5 Reverse-engineering what neural networks compute inside — features, circuits, and the mechanisms behind model behavior. Speech Recognition · 5 Models for transcribing, translating, and understanding spoken audio.