Index of /Papers/


../
1810.04805v2.pdf                                   05-Apr-2025 02:13    757K
2006.11477v3.pdf                                   29-Apr-2025 03:19    723K
2010.11929v2.pdf                                   29-Apr-2025 00:51      4M
2210.02186v3.pdf                                   29-Apr-2025 03:50      4M
2305.17026v4.pdf                                   29-Apr-2025 00:15    459K
24-038_51f8444f-502c-4139-8bf2-56eb4b65c58a(1).pdf 12-Apr-2025 04:54    925K
2404.06654v3.pdf                                   06-Apr-2025 23:13    667K
2503.01996v1.pdf                                   06-Apr-2025 23:41     10M
2504.06214v1.pdf                                   14-Apr-2025 11:50    473K
2506.08872v1.pdf                                   19-Jun-2025 05:31     35M
A Few Useful Things to Know About Machine Learn..> 05-Apr-2025 02:13    156K
A Survey on Mixture of Experts.pdf                 06-Apr-2025 00:40      1M
Attention Is All You Need.pdf                      04-Apr-2025 04:42    556K
Attention is Not Explanation.pdf                   05-Apr-2025 02:13      1M
Auto-Encoding Variational Bayes (VAE).pdf          05-Apr-2025 02:21      4M
Batch Normalization: Accelerating Deep Network ..> 05-Apr-2025 02:13    169K
Concrete Problems in AI Safety (Revisited).pdf     05-Apr-2025 02:13    105K
Concrete Problems in AI Safety.pdf                 05-Apr-2025 02:13    469K
DALL·E: Zero-Shot Text-to-Image Generation.pdf    05-Apr-2025 02:21     10M
Deep Residual Learning for Image Recognition (R..> 05-Apr-2025 02:13    800K
DeepSeek-R1: Incentivizing Reasoning Capability..> 05-Apr-2025 02:16      1M
Determining_Chess_Piece_Values_Using_Machine_Le..> 16-Jul-2025 13:44      2M
Diffusion Models Beat GANs on Image Synthesis.pdf  05-Apr-2025 02:21     38M
Explainable AI_ Visualizing Attention in Transf..> 05-Apr-2025 14:30      8M
GPT-2_ Language Models are Unsupervised Multita..> 05-Apr-2025 02:13    569K
GPT-3_ Language Models are Few-Shot Learners.pdf   05-Apr-2025 02:13      6M
GPT_ Improving Language Understanding by Genera..> 05-Apr-2025 02:13    528K
Generative Adversarial Nets (GANs).pdf             05-Apr-2025 02:21    518K
How Not To Sort By Average.pdf                     01-Jun-2025 04:44    606K
ImageNet Classification with Deep CNNs (AlexNet..> 05-Apr-2025 02:13      1M
Imagen_ Photorealistic Text-to-Image Generation..> 05-Apr-2025 02:21     11M
LoRA_ Low-Rank Adaptation of LLMs.pdf              05-Apr-2025 02:13      2M
Mastering the Game of Go with Deep Neural Netwo..> 05-Apr-2025 02:13      2M
Mixture-of-Recursions.pdf                          20-Jul-2025 19:42      2M
Mixtures of Experts Models.pdf                     06-Apr-2025 00:40      1M
NoLiMa Long-Context Evaluation Beyond Litera..> 04-Apr-2025 04:41    658K
Pattern Recognition and Machine Learning.pdf       05-Apr-2025 02:13      9M
Playing Atari with Deep Reinforcement Learning ..> 05-Apr-2025 02:13    472K
Reward is Enough.pdf                               05-Apr-2025 02:13    780K
SPEECH-TRANSFORMER A NO-RECURRENCE SEQUENCE-TO-..> 29-Apr-2025 02:19    641K
Scaling Laws for Neural Language Models.pdf        05-Apr-2025 02:21      2M
Sequence to Sequence Learning with Neural Netwo..> 05-Apr-2025 02:19    109K
T5_ Exploring the Limits of Transfer Learning w..> 05-Apr-2025 02:13      1M
TIME SERIES IS WORTH 64 WORDS.pdf                  16-Apr-2025 03:38      4M
The Bitter Lesson.pdf                              05-Apr-2025 02:13     53K
The Elements of Statistical Learning.pdf           05-Apr-2025 02:13     13M
Thompson_1984_ReflectionsonTrustingTrust.pdf       02-May-2025 15:26    220K
Training Compute-Optimal Large Language Models ..> 05-Apr-2025 02:13      6M
Understanding Deep Learning Requires Rethinking..> 05-Apr-2025 02:13    394K
best-friend-passed-need-ex-removed-from-photo-v..> 24-Apr-2025 16:23    158K
icon.png                                           10-Aug-2025 05:15    1156
lee_2025_ai_critical_thinking_survey.pdf           09-Apr-2025 01:09    995K
the-illusion-of-thinking.pdf                       10-Jun-2025 03:02     13M