·
October 2022

Meta released ESMFold: A LLM to predict protein structures

biorxiv.org
Oct 31
·
biorxiv.org

Lex Friedman interviews Andrej Karpathy (formerly AI Lead at Tesla)

youtube.com
Oct 29
·
youtube.com

Interactive storybook demo powered by Stable Diffusion

twitter.com
Oct 27
·
twitter.com

BigCode releases The Stack: the largest code dataset (notably, all permissively licensed)

The Stack includes 3TB of code across 30 programming languages and is 3x bigger in size than the next-largest public code dataset. BigCode only includes code that has permissive software licenses (MIT, Apache 2.0, etc) and provides an opt-out process for developers to remove their code from the dataset.

huggingface.co
Oct 27
·
huggingface.co

Shutterstock announces DALL·E integration and fund to compensate contributors

theverge.com
Oct 25
·
theverge.com

DeepMind proposes a method for in-context RL with transformers.

The paper shows that transformers can improve themselves autonomously through trial and error without ever updating their weights. No prompting, no finetuning. A single transformer simply collects its own data and maximizes rewards on new tasks.

arxiv.org
Oct 25
·
arxiv.org

Stable Diffusion v1.5 unofficially released by Runway

huggingface.co
Oct 20
·
huggingface.co

Google announces U-PaLM 540B

arxiv.org
Oct 20
·
arxiv.org

Google releases open-source LLM Flan-T5

arxiv.org
Oct 20
·
arxiv.org

CarperAI announces plans for the first open-source GPT·3 like model

CarperAI, a new research lab within the EleutherAI research collective, is releasing an "instruction-tuned" large language model trained using Reinforcement Learning from Human Feedback (RHLF). In effect, releasing an open source equivalent of GPT·3.

carper.ai
Oct 20
·
carper.ai

Replit releases mobile app with codegen built-in

blog.replit.com
Oct 19
·
blog.replit.com

Meta releases first speech-to-speech translation system for Hokkien

Meta's Universal Speech Translator project makes it possible to train AI models on languages that are primarily oral and do not have a standard or widely used writing system. Meta built and shared an AI translation system for a primarily oral language, Hokkien.

ai.facebook.com
Oct 19
·
ai.facebook.com

Potential GitHub Copilot lawsuit

githubcopilotinvestigation.com
Oct 17
·
githubcopilotinvestigation.com

Microsoft announces Designer: A DALL-E powered design tool

designer.microsoft.com
Oct 12
·
designer.microsoft.com

Joe Rogan interviews Steve Jobs generated by AI

podcast.ai
Oct 11
·
podcast.ai

State of AI publishes their 2022 report

stateof.ai
Oct 11
·
stateof.ai

Stable Diffusion meets virtual reality

twitter.com
Oct 11
·
twitter.com

Google releases Audio LM: a LLM for audio generation

ai.googleblog.com
Oct 06
·
ai.googleblog.com

A working implementation of text-to-3D dreamfusion, powered by stable diffusion.

github.com
Oct 06
·
github.com

Google announces Phenaki: a model to generate videos from text

Google releases a model for generating videos from text, with prompts that can change over time, and videos that can be as long as multiple minutes.

phenaki.github.io
Oct 05
·
phenaki.github.io

US White House releases a "Blueprint for an AI Bill of Rights"

The blueprint is intended to "help guide the design, use, and deployment of automated systems to protect the American Public.” They are currently non-regulatory, non-binding, and not yet enforceable.

whitehouse.gov
Oct 04
·
whitehouse.gov
September 2022

Google introduces DreamFusion: Text-to-3D using 2D Diffusion

dreamfusion3d.github.io
Sep 29
·
dreamfusion3d.github.io

A human motion diffusion model

guytevet.github.io
Sep 29
·
guytevet.github.io

Meta introduces Make-A-Video to generate videos from text

Meta releases a paper for text-to-video generation using an improved model design to 1) accelerate training 2) not require paired text-video data, and 3) generated videos have greater possibilities and vastness than before.

makeavideo.studio
Sep 29
·
makeavideo.studio

DALL·E now available without waitlist

openai.com
Sep 28
·
openai.com

Stable Diffusion Photoshop plugin

twitter.com
Sep 26
·
twitter.com

Stable Diffusion running on iPhone

twitter.com
Sep 24
·
twitter.com

Reddit user claims GPT3 got them straight A's in school

reddit.com
Sep 23
·
reddit.com

NVIDIA releases GET3D, a generative 3D object model

nv-tlabs.github.io
Sep 23
·
nv-tlabs.github.io

DeepMind announces Sparrow: a safe dialogue agent

deepmind.com
Sep 22
·
deepmind.com

Getty Images bans AI-generated content

theverge.com
Sep 21
·
theverge.com

Whisper, OpenAI's near human level English speech recognition model

Whisper is an automatic speech recognition (ASR) system trained on 680,000 hours of multilingual and multitask supervised data collected from the web. It is effective with accented speech, background noise, and technical language. It works in multiple languages and can translate those languages into English.

openai.com
Sep 21
·
openai.com

NVIDIA announces BioNeMo: a service to train LLMs for bio

blogs.nvidia.com
Sep 20
·
blogs.nvidia.com

Character.AI opens public beta for their advanced chatbots

twitter.com
Sep 16
·
twitter.com

Google announces PaLI: A Jointly-Scaled Multilingual Language-Image Model

arxiv.org
Sep 16
·
arxiv.org

ACT-1: An LLM to automate manual tasks on software tools

Adept Labs announces Action Transformer 1 (ACT-1), a model that can control software from human requests. (For example, search Zillow or add new records to Salesforce.)

twitter.com
Sep 14
·
twitter.com

Action Transformer (ACT-1) announcement, natural language to computer action

adept.ai
Sep 14
·
adept.ai

A demo of GPT·3 armed with a python interpreter

twitter.com
Sep 12
·
twitter.com