NATURAL LANGUAGE PROCESSING (NLP) WEEKLY NEWSLETTER

The NLP Cypher | 12.12.21

Magnus

Ricky Costa

6 min readDec 12, 2021

Is Moore’s Law Finito?

Moore's Law, AI, and the pace of progress - LessWrong

It seems to be a minority view nowadays to believe in Moore's Law, the routine doubling of transistor density roughly…

www.lesswrong.com

NeurIPS Research Papers by Institution

Here’s a collection of papers by your favorite big tech and educational institutions.

Carnegie Mellon University at NeurIPS 2021

Carnegie Mellon University is proud to present 92 papers in the main conference and 9 papers in the datasets and…

blog.ml.cmu.edu

Google at NeurIPS 2021

This week marks the beginning of the 35 th annual Conference on Neural Information Processing Systems (NeurIPS 2021)…

ai.googleblog.com

Stanford AI Lab Papers and Talks at NeurIPS 2021

The thirty-fifth Conference on Neural Information Processing Systems (NeurIPS) 2021 is being hosted virtually from Dec…

ai.stanford.edu

Meta AI research at NeurIPS 2021: Embodied agents, unsupervised speech recognition, and more

We're excited to share that Meta AI researchers will be presenting 83 papers at NeurIPS 2021, including eight as…

ai.facebook.com

A New and Blazing Fast WordPiece Tokenizer

A Fast WordPiece Tokenization System

Tokenization is a fundamental pre-processing step for most natural language processing (NLP) applications. It involves…

ai.googleblog.com

GLaM| 1.2 Trillion Param Sparse Model

More Efficient In-Context Learning with GLaM

Large language models (e.g., GPT-3) have many significant capabilities, such as performing few-shot learning across a…

ai.googleblog.com

“The Generalist Language Model (GLaM), a trillion weight model that can be trained and served efficiently (in terms of computation and energy use) thanks to sparsity, and achieves competitive performance on multiple few-shot learning tasks. GLaM’s performance compares favorably to a dense language model, GPT-3 (175B) with significantly improved learning efficiency across 29 public NLP benchmarks in seven categories, spanning language completion, open-domain question answering, and natural language inference tasks.”

Glam vs. GPT-3 on NLG and NLU Tasks

Awesome Take Away:

This large sparse model is competitive with dense counterparts while training on much less data and consuming less energy.

Information Extraction from Scanned Receipts: Fine-tuning LayoutLM on SROIE

An OCR demo with LayoutLM fine-tuned for information extraction on receipts data.

Information Extraction from Scanned Receipts: Fine-tuning LayoutLM on SROIE

An OCR demo with LayoutLM fine-tuned for information extraction on receipts data. Made by Eric Bunch using Weights &…

wandb.ai

AI Predictions Survey

http://www.pwc.com/us/en/tech-effect/ai-analytics/ai-predictions.html

Improving GitHub Search

Improving GitHub code search | The GitHub Blog

Today, we are rolling out a technology preview for substantial improvements to searching code on GitHub. We want to…

github.blog

Gopher — Deepmind’s Language Model

Language modelling at scale

Language modelling at scale: Gopher, ethical considerations, and retrieval Language, and its role in demonstrating and…

deepmind.com

GauGAN2 | Photorealistic Text 2 Image

NVIDIA Research's GauGAN AI Art Demo Responds to Words | NVIDIA Blog

A picture worth a thousand words now takes just three or four words to create, thanks to GauGAN2, the latest version of…

blogs.nvidia.com

Transformers From Scratch

“I procrastinated a deep dive into transformers for a few years. Finally the discomfort of not knowing what makes them tick grew too great for me. Here is that …”

https://e2eml.school/transformers.html

PyTorch | Julia (but not exactly like Julia)

“When trying to predict how PyTorch would itself get disrupted, we used to joke a bit about the next version of PyTorch being written in Julia. This was not very serious: a huge factor in moving PyTorch from Lua to Python was to tap into Python’s immense ecosystem (an ecosystem that shows no signs of going away) and even today it is still hard to imagine how a new language can overcome the network effects of Python.”

Where we are headed and why it looks a lot like Julia (but not exactly like Julia)

When trying to predict how PyTorch would itself get disrupted, we used to joke a bit about the next version of PyTorch…

dev-discuss.pytorch.org

Decoding Text Generation Tutorial Top-K and Top-P

One of the most intuitive tutorials out there.

API Documentation | Cohere AI

The method of picking output tokens is a key concept in text generation with language models. There are several methods…

docs.cohere.ai

Punctuation Model

Attention Neural Networks Slides

Slides

https://www.dropbox.com/s/rahrg6s7w4vud9f/lecture12_attention_neural_networks.pdf?dl=0

Code

CS5242_2021/codes/labs_lecture12 at main · xbresson/CS5242_2021

Neural Networks and Deep Learning, NUS CS5242, 2021 - CS5242_2021/codes/labs_lecture12 at main · xbresson/CS5242_2021

github.com

Lemmatize spaCy

spaCy’s new lemmatizer is super accurate and blows XLM-RoBERTa out of the water! This blog post presents inner workings, benchmarks and quick start snippets. 😎

Neural edit-tree lemmatization for spaCy · Explosion

We are happy to introduce a new, experimental, machine learning-based lemmatizer that posts accuracies above 95% for…

explosion.ai

Awesome Papers 📚

https://arxiv.org/pdf/2111.10952.pdf

https://arxiv.org/pdf/2112.01989.pdf

https://arxiv.org/pdf/2112.03572.pdf

Repo Cypher 👨‍💻

A collection of recently released repos that caught our 👁

Coqui TTS

TTS is a library for advanced text-to-speech generation. TTS comes with pretrained models, tools for measuring dataset quality and already used in 20+ languages.

GitHub - coqui-ai/TTS: 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research…

🐸TTS is a library for advanced Text-to-Speech generation. It's built on the latest research, was designed to achieve…

github.com

Connected Papers 📈

Causal Distillation for Language Models

Distillation library that uses a third objective that encourages the student to imitate the causal computation process of the teacher through interchange intervention training (IIT).

GitHub - frankaging/Causal-Distill: The Codebase for Causal Distillation for Language Models.

Zhengxuan Wu,Atticus Geiger, Josh Rozner, Elisa Kreiss, Hanson Lu, Thomas Icard, Christopher Potts, Noah D. Goodman…

github.com

Connected Papers 📈

NL-Augmenter

[NL-Augmenter] augments text datasets in several ways, includes: randomizing names and numbers, changing style/syntax, paraphrasing, and KB-based paraphrasing.

GitHub - GEM-benchmark/NL-Augmenter: NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural…

The NL-Augmenter is a collaborative effort intended to add transformations of datasets dealing with natural language…

github.com

Google Colaboratory

Edit description

colab.research.google.com

Connected Papers 📈

Deepparse

Deepparse is a state-of-the-art library for parsing multinational street addresses using deep learning.

GitHub - GRAAL-Research/deepparse: Deepparse is a state-of-the-art library for parsing…

Deepparse is a state-of-the-art library for parsing multinational street addresses using deep learning. Use deepparse…

github.com

Connected Papers 📈

CALVIN — A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks

A simulated benchmark to learn long-horizon language-conditioned tasks. The aim is to make it possible to develop agents that can solve many robotic manipulation tasks over a long horizon, from onboard sensors, and specified only via human language.

GitHub - mees/calvin: CALVIN - A benchmark for Language-Conditioned Policy Learning for…

CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks Oier Mees…

github.com

Connected Papers 📈

Hashformers

Library for the hashtag segmentation task which automatically inserts missing spaces between words in a hashtag.

GitHub - ruanchaves/hashformers: Hashformers is a framework for hashtag segmentation with…

Hashtag segmentation is the task of automatically inserting the missing spaces between the words in a hashtag…

github.com

Google Colaboratory

Edit description

colab.research.google.com

Connected Papers 📈

Quantum Stat

NATURAL LANGUAGE PROCESSING (NLP) WEEKLY NEWSLETTER

The NLP Cypher | 12.12.21

Magnus

Is Moore’s Law Finito?

Moore's Law, AI, and the pace of progress - LessWrong

It seems to be a minority view nowadays to believe in Moore's Law, the routine doubling of transistor density roughly…

NeurIPS Research Papers by Institution

Carnegie Mellon University at NeurIPS 2021

Carnegie Mellon University is proud to present 92 papers in the main conference and 9 papers in the datasets and…

Google at NeurIPS 2021

This week marks the beginning of the 35 th annual Conference on Neural Information Processing Systems (NeurIPS 2021)…

Stanford AI Lab Papers and Talks at NeurIPS 2021

The thirty-fifth Conference on Neural Information Processing Systems (NeurIPS) 2021 is being hosted virtually from Dec…

Meta AI research at NeurIPS 2021: Embodied agents, unsupervised speech recognition, and more

We're excited to share that Meta AI researchers will be presenting 83 papers at NeurIPS 2021, including eight as…

A New and Blazing Fast WordPiece Tokenizer

A Fast WordPiece Tokenization System

Tokenization is a fundamental pre-processing step for most natural language processing (NLP) applications. It involves…

GLaM| 1.2 Trillion Param Sparse Model

More Efficient In-Context Learning with GLaM

Large language models (e.g., GPT-3) have many significant capabilities, such as performing few-shot learning across a…

Glam vs. GPT-3 on NLG and NLU Tasks

Awesome Take Away:

Information Extraction from Scanned Receipts: Fine-tuning LayoutLM on SROIE

Information Extraction from Scanned Receipts: Fine-tuning LayoutLM on SROIE

An OCR demo with LayoutLM fine-tuned for information extraction on receipts data. Made by Eric Bunch using Weights &…

AI Predictions Survey

Improving GitHub Search

Improving GitHub code search | The GitHub Blog

Today, we are rolling out a technology preview for substantial improvements to searching code on GitHub. We want to…

Gopher — Deepmind’s Language Model

Language modelling at scale

Language modelling at scale: Gopher, ethical considerations, and retrieval Language, and its role in demonstrating and…

GauGAN2 | Photorealistic Text 2 Image

NVIDIA Research's GauGAN AI Art Demo Responds to Words | NVIDIA Blog

A picture worth a thousand words now takes just three or four words to create, thanks to GauGAN2, the latest version of…

Transformers From Scratch

PyTorch | Julia (but not exactly like Julia)

Where we are headed and why it looks a lot like Julia (but not exactly like Julia)

When trying to predict how PyTorch would itself get disrupted, we used to joke a bit about the next version of PyTorch…

Decoding Text Generation Tutorial Top-K and Top-P

API Documentation | Cohere AI

The method of picking output tokens is a key concept in text generation with language models. There are several methods…

Punctuation Model

Attention Neural Networks Slides

CS5242_2021/codes/labs_lecture12 at main · xbresson/CS5242_2021

Neural Networks and Deep Learning, NUS CS5242, 2021 - CS5242_2021/codes/labs_lecture12 at main · xbresson/CS5242_2021

Lemmatize spaCy

Neural edit-tree lemmatization for spaCy · Explosion

We are happy to introduce a new, experimental, machine learning-based lemmatizer that posts accuracies above 95% for…

Awesome Papers 📚

Repo Cypher 👨‍💻

A collection of recently released repos that caught our 👁

Coqui TTS

GitHub - coqui-ai/TTS: 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research…

🐸TTS is a library for advanced Text-to-Speech generation. It's built on the latest research, was designed to achieve…

Causal Distillation for Language Models

GitHub - frankaging/Causal-Distill: The Codebase for Causal Distillation for Language Models.

Zhengxuan Wu*,Atticus Geiger*, Josh Rozner, Elisa Kreiss, Hanson Lu, Thomas Icard, Christopher Potts, Noah D. Goodman…

NL-Augmenter

GitHub - GEM-benchmark/NL-Augmenter: NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural…

The NL-Augmenter is a collaborative effort intended to add transformations of datasets dealing with natural language…

Google Colaboratory

Edit description

Deepparse

GitHub - GRAAL-Research/deepparse: Deepparse is a state-of-the-art library for parsing…

Deepparse is a state-of-the-art library for parsing multinational street addresses using deep learning. Use deepparse…

CALVIN — A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks

GitHub - mees/calvin: CALVIN - A benchmark for Language-Conditioned Policy Learning for…

CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks Oier Mees…

Hashformers

GitHub - ruanchaves/hashformers: Hashformers is a framework for hashtag segmentation with…

Hashtag segmentation is the task of automatically inserting the missing spaces between the words in a hashtag…

Google Colaboratory

Edit description

Written by Ricky Costa

Zhengxuan Wu,Atticus Geiger, Josh Rozner, Elisa Kreiss, Hanson Lu, Thomas Icard, Christopher Potts, Noah D. Goodman…