NATURAL LANGUAGE PROCESSING (NLP) WEEKLY NEWSLETTER
The NLP Cypher | 04.17.22
An updated chart w/r/t the last newsletter. This is the DeepSparse Engine and how it performs against ONNX runtime (deets above). With each new DeepSparse version release; inference performance just keeps on improving while you drink your Mai Tai. 🍾
Sad Tales from the Dark Web
Dread our favorite dark site continues to deal with active DDoS attacks.
A PGP message…
Dread message 4/11 - Pastebin.com
Not a member of Pastebin yet? Sign Up , it unlocks many cool features! Someone doesn't want us up. April 11 DDOS…
OH … ever had anyone post a file you didn’t trust? Dangerzone comes in handy, but as usual, I wouldn’t know anything about this. 😉
Dangerzone works like this: You give it a document that you don't know if you can trust (for example, an email…
Business is hard bruh
Elon got bored and decided to buy Twitter this past week. He claims to want to make Twitter’s algo open-source. The first flex:
Elon applied further pressure, by initially joining and then deciding not to join Twitter’s board, then offering 43Billi to get the whole pie … Twitter CEO explains 🤣👇
We can expect further trolls from Elon in the near future.
… the source code: Elon’s Twitter Buyout Doc
… Substack throwing up gang signs already 😭:
Dall-E 2 vs. Latent-Diffusion | Thoughts?
Colab of the Week 🏆Latent-Diffusion Notebook
Megatron-DeepSpeed Hacks from BigScience
GitHub - microsoft/Megatron-DeepSpeed: Ongoing research training transformer language models at…
Megatron ( 1 and 2) is a large, powerful transformer developed by the Applied Deep Learning Research team at NVIDIA…
Quick start script for Megatron-DeepSpeed:
Megatron-DeepSpeed/start_fast.md at main · bigscience-workshop/Megatron-DeepSpeed
This quick instructions document contains 3 steps: installing software preparing data running the script This is useful…
A gallery for benchmarking Graph Neural Networks (GNNs) based on pure PyTorch backend.
GitHub - EdisonLeeeee/GraphGallery: GraphGallery is a gallery for benchmarking Graph Neural…
PyTorch is all you need! GraphGallery is a gallery for benchmarking Graph Neural Networks (GNNs) based on pure PyTorch…
HTTPie: Human-Friendly CLI HTTP Client
GitHub - httpie/httpie: As easy as /aitch-tee-tee-pie/ 🥧 Modern, user-friendly command-line HTTP…
As easy as /aitch-tee-tee-pie/ 🥧 Modern, user-friendly command-line HTTP client for the API era. JSON support, colors…
UFOs and OSINT | A Repo (Yes, this is real)
UFO research is so lit, it’s now expanding into GitHub. 😭
GitHub - richgel999/uap_resources: Key OSINT UAP Related Materials
At this point in the UAP Disclosure (or "Scheduled Dissemination" - Ramirez) Process, what's been missing is a…
ok, ok, last thing on UFOs (promise)
❤️Thanks for the big up Francesco!
KeyBART & KBIR
Keyphrase Boundary Infilling with Replacement (KBIR) achieves SOTA performance for the task of keyphrase extraction.
KBART model achieves SOTA performance on the task of keyphrase generation.
bloomberg (Bloomberg Finance LP)
We're on a journey to advance and democratize artificial intelligence through open source and open science.
NLP Models | A Timeline
Knowledge Graphs ICLR 2021
ICLR 2021 papers summary.
Knowledge Graph Papers @ ICLR 2021
Hi! 👋 Today we are going to have a look at ICLR 2021 papers focusing on knowledge graphs (KGs), particularly in areas…
Knowledge Distillation with Haystack
Hey Deepset, if you’re reading this, let’s parlay homies!😍⭐
Knowledge Distillation with Haystack | deepset
Modern-day natural language processing (NLP) relies largely on big and powerful Transformer models. Language models…
Question Answering from UKP et al
Question Answering (QA) platform to enable users to easily implement, manage and share their custom QA pipelines.
It’s got a UI too.
GitHub - UKP-SQuARE/square-core: SQuARE: Software for question answering research.
Flexible and Extensible Question Answering Platform SQuARE is a flexible and extensible Question Answering (QA)…
Papers to Read 📚
A collection of recently released repos that caught our 👁
Code switching in the context of English/Spanish conversations for the task of speech translation (ST), generating and evaluating both transcript and translation.
An encoder evaluation framework for comparing the performance of SOTA pre-trained representations on the task of low-resource NER.
A unified generative model that can perform program synthesis (via left-to-right generation) as well as editing (via infilling). The model is the first generative model that is able to directly perform zero-shot code infilling.
A generative language model BioBART that adapts BART to the biomedical domain.
A corpus of English crude oil news for event extraction.
Repo for processing medical texts in electronic health records.