All 162 seed papers. Seed status means candidacy, not canonical status. Papers with harvested evidence link to their breakdown; the rest are an honestly-declared coverage gap, not a zero.
| # | Paper | Year | Venue | Evidence |
|---|---|---|---|---|
| 0001 | A Logical Calculus of the Ideas Immanent in Nervous Activity | 1943 | Bulletin of Mathematical Biophysics | scored |
| 0002 | As We May Think | 1945 | The Atlantic | no evidence yet |
| 0003 | A Mathematical Theory of Communication | 1948 | Bell System Technical Journal | scored |
| 0004 | Computing Machinery and Intelligence | 1950 | Mind | scored |
| 0005 | A Proposal for the Dartmouth Summer Research Project on Artificial Intelligence | 1955 | Proposal (Dartmouth) | no evidence yet |
| 0006 | The Perceptron: A Probabilistic Model for Information Storage and Organization in the Brain | 1958 | Psychological Review | scored |
| 0007 | Some Studies in Machine Learning Using the Game of Checkers | 1959 | IBM Journal | scored |
| 0008 | Programs with Common Sense | 1959 | Mechanisation of Thought Processes | no evidence yet |
| 0009 | Man-Computer Symbiosis | 1960 | IRE Transactions on Human Factors | scored |
| 0010 | Steps Toward Artificial Intelligence | 1961 | Proceedings of the IRE | no evidence yet |
| 0011 | GPS: A Program That Simulates Human Thought | 1961 | Lernende Automaten / RAND | scored |
| 0012 | Fuzzy Sets | 1965 | Information and Control | scored |
| 0013 | ELIZA: A Computer Program for the Study of Natural Language Communication | 1966 | Communications of the ACM | no evidence yet |
| 0014 | Some Philosophical Problems from the Standpoint of Artificial Intelligence | 1969 | Machine Intelligence 4 | scored |
| 0015 | STRIPS: A New Approach to the Application of Theorem Proving to Problem Solving | 1971 | Artificial Intelligence | no evidence yet |
| 0016 | Adaptation in Natural and Artificial Systems (foundational monograph) | 1975 | University of Michigan Press | no evidence yet |
| 0017 | Maximum Likelihood from Incomplete Data via the EM Algorithm | 1977 | Journal of the Royal Statistical Society B | scored |
| 0018 | Minds, Brains, and Programs | 1980 | Behavioral and Brain Sciences | scored |
| 0019 | Neocognitron: A Self-Organizing Neural Network Model | 1980 | Biological Cybernetics | no evidence yet |
| 0020 | Neural Networks and Physical Systems with Emergent Collective Computational Abilities | 1982 | PNAS | no evidence yet |
| 0021 | Self-Organized Formation of Topologically Correct Feature Maps | 1982 | Biological Cybernetics | scored |
| 0022 | A Learning Algorithm for Boltzmann Machines | 1985 | Cognitive Science | scored |
| 0023 | A Robust Layered Control System for a Mobile Robot | 1986 | IEEE Journal of Robotics and Automation | scored |
| 0024 | Learning Representations by Back-Propagating Errors | 1986 | Nature | scored |
| 0025 | Fusion, Propagation, and Structuring in Belief Networks | 1986 | Artificial Intelligence | scored |
| 0026 | Learning to Predict by the Methods of Temporal Differences | 1988 | Machine Learning | scored |
| 0027 | Multilayer Feedforward Networks Are Universal Approximators | 1989 | Neural Networks | scored |
| 0028 | A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition | 1989 | Proceedings of the IEEE | no evidence yet |
| 0029 | The Symbol Grounding Problem | 1990 | Physica D | scored |
| 0030 | Finding Structure in Time | 1990 | Cognitive Science | scored |
| 0031 | Intelligence Without Representation | 1991 | Artificial Intelligence | scored |
| 0032 | Q-Learning | 1992 | Machine Learning | scored |
| 0033 | Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning | 1992 | Machine Learning | no evidence yet |
| 0034 | Causal Diagrams for Empirical Research | 1995 | Biometrika | no evidence yet |
| 0035 | Support-Vector Networks | 1995 | Machine Learning | scored |
| 0036 | Temporal Difference Learning and TD-Gammon | 1995 | Communications of the ACM | no evidence yet |
| 0037 | Regression Shrinkage and Selection via the Lasso | 1996 | Journal of the Royal Statistical Society B | scored |
| 0038 | Long Short-Term Memory | 1997 | Neural Computation | scored |
| 0039 | A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting | 1997 | Journal of Computer and System Sciences | scored |
| 0040 | No Free Lunch Theorems for Optimization | 1997 | IEEE Transactions on Evolutionary Computation | scored |
| 0041 | Gradient-Based Learning Applied to Document Recognition | 1998 | Proceedings of the IEEE | scored |
| 0042 | The PageRank Citation Ranking: Bringing Order to the Web | 1998 | Stanford Technical Report | scored |
| 0043 | Random Forests | 2001 | Machine Learning | scored |
| 0044 | Statistical Modeling: The Two Cultures | 2001 | Statistical Science | scored |
| 0045 | Greedy Function Approximation: A Gradient Boosting Machine | 2001 | Annals of Statistics | scored |
| 0046 | BLEU: A Method for Automatic Evaluation of Machine Translation | 2002 | ACL | no evidence yet |
| 0047 | Latent Dirichlet Allocation | 2003 | Journal of Machine Learning Research | scored |
| 0048 | A Neural Probabilistic Language Model | 2003 | Journal of Machine Learning Research | scored |
| 0049 | A Fast Learning Algorithm for Deep Belief Nets | 2006 | Neural Computation | scored |
| 0050 | Reducing the Dimensionality of Data with Neural Networks | 2006 | Science | scored |
| 0051 | Universal Intelligence: A Definition of Machine Intelligence | 2007 | Minds and Machines | no evidence yet |
| 0052 | The Basic AI Drives | 2008 | AGI Conference | no evidence yet |
| 0053 | Matrix Factorization Techniques for Recommender Systems | 2009 | IEEE Computer | scored |
| 0054 | ImageNet: A Large-Scale Hierarchical Image Database | 2009 | CVPR | scored |
| 0055 | The Unreasonable Effectiveness of Data | 2009 | IEEE Intelligent Systems | scored |
| 0056 | ImageNet Classification with Deep Convolutional Neural Networks | 2012 | NeurIPS | no evidence yet |
| 0057 | Deep Neural Networks for Acoustic Modeling in Speech Recognition | 2012 | IEEE Signal Processing Magazine | scored |
| 0058 | Fairness Through Awareness | 2012 | ITCS | scored |
| 0059 | A Few Useful Things to Know About Machine Learning | 2012 | Communications of the ACM | scored |
| 0060 | Efficient Estimation of Word Representations in Vector Space | 2013 | ICLR Workshop | scored |
| 0061 | Dropout: A Simple Way to Prevent Neural Networks from Overfitting | 2014 | Journal of Machine Learning Research | scored |
| 0062 | Generative Adversarial Networks | 2014 | NeurIPS | scored |
| 0063 | Auto-Encoding Variational Bayes | 2014 | ICLR | scored |
| 0064 | Intriguing Properties of Neural Networks | 2014 | ICLR | scored |
| 0065 | GloVe: Global Vectors for Word Representation | 2014 | EMNLP | scored |
| 0066 | Sequence to Sequence Learning with Neural Networks | 2014 | NeurIPS | scored |
| 0067 | Very Deep Convolutional Networks for Large-Scale Image Recognition | 2015 | ICLR | no evidence yet |
| 0068 | Going Deeper with Convolutions | 2015 | CVPR | scored |
| 0069 | Batch Normalization: Accelerating Deep Network Training | 2015 | ICML | no evidence yet |
| 0070 | Adam: A Method for Stochastic Optimization | 2015 | ICLR | scored |
| 0071 | Deep Learning (review) | 2015 | Nature | no evidence yet |
| 0072 | U-Net: Convolutional Networks for Biomedical Image Segmentation | 2015 | MICCAI | no evidence yet |
| 0073 | Distilling the Knowledge in a Neural Network | 2015 | NeurIPS Workshop | scored |
| 0074 | Explaining and Harnessing Adversarial Examples | 2015 | ICLR | scored |
| 0075 | Neural Machine Translation by Jointly Learning to Align and Translate | 2015 | ICLR | scored |
| 0076 | Human-Level Control Through Deep Reinforcement Learning | 2015 | Nature | scored |
| 0077 | Hidden Technical Debt in Machine Learning Systems | 2015 | NeurIPS | no evidence yet |
| 0078 | XGBoost: A Scalable Tree Boosting System | 2016 | KDD | no evidence yet |
| 0079 | Deep Residual Learning for Image Recognition | 2016 | CVPR | scored |
| 0080 | You Only Look Once: Unified, Real-Time Object Detection | 2016 | CVPR | scored |
| 0081 | WaveNet: A Generative Model for Raw Audio | 2016 | arXiv | no evidence yet |
| 0082 | Mastering the Game of Go with Deep Neural Networks and Tree Search | 2016 | Nature | scored |
| 0083 | End-to-End Training of Deep Visuomotor Policies | 2016 | Journal of Machine Learning Research | no evidence yet |
| 0084 | Concrete Problems in AI Safety | 2016 | arXiv | no evidence yet |
| 0085 | Why Should I Trust You? Explaining the Predictions of Any Classifier | 2016 | KDD | no evidence yet |
| 0086 | Big Data's Disparate Impact | 2016 | California Law Review | no evidence yet |
| 0087 | Machine Bias | 2016 | ProPublica | no evidence yet |
| 0088 | Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings | 2016 | NeurIPS | no evidence yet |
| 0089 | Equality of Opportunity in Supervised Learning | 2016 | NeurIPS | no evidence yet |
| 0090 | The Ethics of Algorithms: Mapping the Debate | 2016 | Big Data & Society | no evidence yet |
| 0091 | Deep Learning with Differential Privacy | 2016 | CCS | scored |
| 0092 | Semi-Supervised Classification with Graph Convolutional Networks | 2017 | ICLR | scored |
| 0093 | Attention Is All You Need | 2017 | NeurIPS | scored |
| 0094 | Mastering the Game of Go Without Human Knowledge | 2017 | Nature | scored |
| 0095 | Proximal Policy Optimization Algorithms | 2017 | arXiv | no evidence yet |
| 0096 | Deep Reinforcement Learning from Human Preferences | 2017 | NeurIPS | no evidence yet |
| 0097 | A Unified Approach to Interpreting Model Predictions | 2017 | NeurIPS | scored |
| 0098 | Inherent Trade-Offs in the Fair Determination of Risk Scores | 2017 | ITCS | no evidence yet |
| 0099 | Semantics Derived Automatically from Language Corpora Contain Human-Like Biases | 2017 | Science | scored |
| 0100 | Membership Inference Attacks Against Machine Learning Models | 2017 | IEEE S&P | no evidence yet |
| 0101 | Communication-Efficient Learning of Deep Networks from Decentralized Data | 2017 | AISTATS | no evidence yet |
| 0102 | World Models | 2018 | NeurIPS | no evidence yet |
| 0103 | Improving Language Understanding by Generative Pre-Training | 2018 | OpenAI Technical Report | no evidence yet |
| 0104 | A General Reinforcement Learning Algorithm That Masters Chess, Shogi, and Go Through Self-Play | 2018 | Science | no evidence yet |
| 0105 | The Mythos of Model Interpretability | 2018 | ACM Queue / CACM | no evidence yet |
| 0106 | Gender Shades: Intersectional Accuracy Disparities in Commercial Gender Classification | 2018 | FAT* | no evidence yet |
| 0107 | Counterfactual Explanations Without Opening the Black Box | 2018 | Harvard Journal of Law & Technology | no evidence yet |
| 0108 | The Malicious Use of Artificial Intelligence: Forecasting, Prevention, and Mitigation | 2018 | arXiv | no evidence yet |
| 0109 | BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding | 2019 | NAACL | no evidence yet |
| 0110 | Language Models are Unsupervised Multitask Learners | 2019 | OpenAI Technical Report | no evidence yet |
| 0111 | Grandmaster Level in StarCraft II Using Multi-Agent Reinforcement Learning | 2019 | Nature | no evidence yet |
| 0112 | Risks from Learned Optimization in Advanced Machine Learning Systems | 2019 | arXiv | no evidence yet |
| 0113 | Stop Explaining Black Box Machine Learning Models for High Stakes Decisions | 2019 | Nature Machine Intelligence | no evidence yet |
| 0114 | Model Cards for Model Reporting | 2019 | FAT* | no evidence yet |
| 0115 | Dissecting Racial Bias in an Algorithm Used to Manage the Health of Populations | 2019 | Science | no evidence yet |
| 0116 | Energy and Policy Considerations for Deep Learning in NLP | 2019 | ACL | no evidence yet |
| 0117 | The Global Landscape of AI Ethics Guidelines | 2019 | Nature Machine Intelligence | no evidence yet |
| 0118 | The Bitter Lesson | 2019 | Essay (incompleteideas.net) | no evidence yet |
| 0119 | On the Measure of Intelligence | 2019 | arXiv | no evidence yet |
| 0120 | Denoising Diffusion Probabilistic Models | 2020 | NeurIPS | no evidence yet |
| 0121 | Neural Radiance Fields (NeRF): Representing Scenes for View Synthesis | 2020 | ECCV | no evidence yet |
| 0122 | Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer | 2020 | Journal of Machine Learning Research | no evidence yet |
| 0123 | Language Models are Few-Shot Learners | 2020 | NeurIPS | no evidence yet |
| 0124 | Scaling Laws for Neural Language Models | 2020 | arXiv | no evidence yet |
| 0125 | Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks | 2020 | NeurIPS | no evidence yet |
| 0126 | Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model | 2020 | Nature | no evidence yet |
| 0127 | Zoom In: An Introduction to Circuits | 2020 | Distill | no evidence yet |
| 0128 | Closing the AI Accountability Gap: Defining an End-to-End Framework for Internal Algorithmic Auditing | 2020 | FAT* | no evidence yet |
| 0129 | An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale | 2021 | ICLR | no evidence yet |
| 0130 | Learning Transferable Visual Models From Natural Language Supervision | 2021 | ICML | no evidence yet |
| 0131 | Evaluating Large Language Models Trained on Code | 2021 | arXiv | no evidence yet |
| 0132 | Unsolved Problems in ML Safety | 2021 | arXiv | no evidence yet |
| 0133 | Datasheets for Datasets | 2021 | Communications of the ACM | no evidence yet |
| 0134 | On the Dangers of Stochastic Parrots: Can Language Models Be Too Big? | 2021 | FAccT | no evidence yet |
| 0135 | Extracting Training Data from Large Language Models | 2021 | USENIX Security | no evidence yet |
| 0136 | Highly Accurate Protein Structure Prediction with AlphaFold | 2021 | Nature | no evidence yet |
| 0137 | The Hardware Lottery | 2021 | Communications of the ACM | no evidence yet |
| 0138 | High-Resolution Image Synthesis with Latent Diffusion Models | 2022 | CVPR | no evidence yet |
| 0139 | Training Compute-Optimal Large Language Models | 2022 | NeurIPS | no evidence yet |
| 0140 | LoRA: Low-Rank Adaptation of Large Language Models | 2022 | ICLR | no evidence yet |
| 0141 | Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity | 2022 | Journal of Machine Learning Research | no evidence yet |
| 0142 | Chain-of-Thought Prompting Elicits Reasoning in Large Language Models | 2022 | NeurIPS | no evidence yet |
| 0143 | Emergent Abilities of Large Language Models | 2022 | TMLR | no evidence yet |
| 0144 | Training Language Models to Follow Instructions with Human Feedback | 2022 | NeurIPS | no evidence yet |
| 0145 | Constitutional AI: Harmlessness from AI Feedback | 2022 | arXiv | no evidence yet |
| 0146 | Toy Models of Superposition | 2022 | Anthropic / Transformer Circuits | no evidence yet |
| 0147 | Discovering Faster Matrix Multiplication Algorithms with Reinforcement Learning | 2022 | Nature | no evidence yet |
| 0148 | Segment Anything | 2023 | ICCV | no evidence yet |
| 0149 | ReAct: Synergizing Reasoning and Acting in Language Models | 2023 | ICLR | no evidence yet |
| 0150 | Toolformer: Language Models Can Teach Themselves to Use Tools | 2023 | NeurIPS | no evidence yet |
| 0151 | LLaMA: Open and Efficient Foundation Language Models | 2023 | arXiv | no evidence yet |
| 0152 | GPT-4 Technical Report | 2023 | arXiv | no evidence yet |
| 0153 | Sparks of Artificial General Intelligence: Early Experiments with GPT-4 | 2023 | arXiv | no evidence yet |
| 0154 | Mamba: Linear-Time Sequence Modeling with Selective State Spaces | 2023 | arXiv | no evidence yet |
| 0155 | Universal and Transferable Adversarial Attacks on Aligned Language Models | 2023 | arXiv | no evidence yet |
| 0156 | Towards Monosemanticity: Decomposing Language Models with Dictionary Learning | 2023 | Anthropic / Transformer Circuits | no evidence yet |
| 0157 | Frontier AI Regulation: Managing Emerging Risks to Public Safety | 2023 | arXiv | no evidence yet |
| 0158 | GPTs are GPTs: An Early Look at the Labor Market Impact Potential of Large Language Models | 2023 | arXiv (later Science) | no evidence yet |
| 0159 | Sleeper Agents: Training Deceptive LLMs That Persist Through Safety Training | 2024 | arXiv | no evidence yet |
| 0160 | Managing Extreme AI Risks Amid Rapid Progress | 2024 | Science | no evidence yet |
| 0161 | Accurate Structure Prediction of Biomolecular Interactions with AlphaFold 3 | 2024 | Nature | no evidence yet |
| 0162 | DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning | 2025 | arXiv | no evidence yet |