| Topic | Replies | Views | Date |
|---|---|---|---|
| Algorithmic progress vs Compute progress for ML | 1 | 390 | December 14, 2022 |
| Build images for arm32 | 2 | 400 | December 11, 2022 |
| Docker image size has doubled after v1.3.0 | 0 | 346 | December 11, 2022 |
| Competitive programming with AlphaCode | 0 | 305 | December 9, 2022 |
| A Walk in the Park: Learning to Walk in 20 Minutes With RL | 2 | 334 | December 6, 2022 |
| Pretrained Catalan OpenNMT models | 6 | 1175 | December 6, 2022 |
| NLLB-200 with CTranslate2 | 1 | 1014 | December 4, 2022 |
| GPT-3 Based Chatbot by OpenAI | 3 | 1066 | December 2, 2022 |
| Decentralized Training of Foundation Models in Heterogeneous Environments | 3 | 403 | December 1, 2022 |
| MineDojo - A Minecraft environment for AI agents | 1 | 455 | December 1, 2022 |
| T5 Language Model | 0 | 351 | November 24, 2022 |
| Human-level play in the game of Diplomacy by combining language models with strategic reasoning | 0 | 506 | November 22, 2022 |
| Improving Transliteration in LibreTranslate | 2 | 1143 | November 16, 2022 |
| Few-shot Translation with Multilingual Language Models | 1 | 1198 | April 26, 2022 |
| GPT-4 Rumors - Has the Turing Test Been Passed? | 1 | 632 | November 15, 2022 |
| Generating Human-level Text with Contrastive Search in Transformers 🤗 | 1 | 351 | November 11, 2022 |
| OpenNMT-py v3.0 | 2 | 616 | November 8, 2022 |
| BLOOM - A large multilingual open source language model | 1 | 541 | November 5, 2022 |
| Explainpaper.com | 1 | 760 | November 5, 2022 |
| Public Datasets | 6 | 1491 | November 2, 2022 |
| The Stack - a 3TB dataset of permissively licensed source code | 0 | 590 | November 2, 2022 |
| Lila: A Unified Benchmark for Mathematical Reasoning | 2 | 539 | November 2, 2022 |
| Is Encoder-Decoder Redundant for Neural Machine Translation? | 1 | 461 | November 1, 2022 |
| Text to speech in opennmt-tf | 1 | 661 | October 30, 2022 |
| Vision Transformers (ViT) | 1 | 504 | October 30, 2022 |
| Yuhuai Wu \| Memorizing Transformers | 1 | 597 | October 30, 2022 |
| SGConv - Structural Global Convolution Networks | 1 | 1149 | October 30, 2022 |
| Convolutional Neural Networks (CNN) | 0 | 337 | October 30, 2022 |
| Using FairSeq models with CTranslate2 | 1 | 643 | October 26, 2022 |
| MetalTranslate: Customizable machine translation in C++ | 0 | 576 | June 25, 2022 |