TOOLS AND MODELS

INSTRUMENTATION, PRODUCTION, SOUND ENGINEERING

DATASETS AND MODELS

ML ENGINES AND DEVELOPMENT TOOLS

LYRICS AND TEXT

Composition

MIDI creation based on detailed and specific musical prompts

Anticipatory Transformer

Symbolic (notation or midi output)

AI powered royalty free music for content creators.

Boomy is a generative AI music platform where people create, publish, and monetize original songs - over 18 million to date.

Dance Diffusion

Dance Diffusion is the first in a suite of generative audio tools for producers and musicians to be released by Harmonai. Unconditional random audio sample generation; Audio sample regeneration/style transfer using a single audio file or recording; Audio interpolation between two audio files

Deep learning driven jazz generation using Keras & Theano!

Use your own music for training the models and creating new music

Musical Accompaniment Co-creation via Latent Diffusion Models

AI voice models and products powering millions of developers, creators, and enterprises.

Endel is an AI sound wellness company specializing in creating functional soundscapes to help people focus, relax, and sleep.

Fadr Stem Separation

SynthGPT, Stems, and Remixing tools

Infinite Album creates real-time generative AI music for games.

Ethically-trained. High-fidelity. Text-to-Music generative audio. World-class AI tech & research.

Lemonaide Music is an artist-focused music technology company that helps musicians find new ways to write, produce, and create, while keeping the artist front and center.

LifeScore is an AI-powered music generation technology that transforms original recorded music into high quality, endlessly varying remixes that retain artists’ unique musical fingerprint at scale.

By following strict ethical AI guidelines, Loudly guarantees that its proprietary music dataset has been carefully developed via consent, transparency and copyright compliance.

Ableton Live Plugin. Magenta is a research project from Google focused on using machine learning to generate music and art.

Magenta Studio AI Music Composition Assistant

Magenta Studio is a MIDI plugin for Ableton Live. It contains 5 tools: Continue, Groove, Generate, Drumify, and Interpolate, which let you apply Magenta models to your MIDI files.

MIDI melody generator

AI Test Kitchen is a place where people can experience and give feedback on some of Google's latest AI technologies. Our goal is to learn, improve, and innovate responsibly on AI together.

Symbolic (notation or midi output)

An exciting addition to the vibrant landscape of Multimodal Large Language Models designed for controlled music generation.

Symbolic (notation or midi output)

GPT3-based Multi-Instrumental MIDI Music AI Implementation

This is code for providing an augmented piano playing experience. When run, this code will provide computer accompaniment that learns in real-time from the human host pianist. When the host pianist stops playing for a given amount of time, the computer AI will then improvise in the space using the style learned from the host.

Realtime Audio Variational autoEncoder

Rightsify is a music licensing company pioneering the future of music creation and copyright management.

Soundful AI helps users generate distinctive studio-quality music. The platform is built for content creators, artists, music producers, brands, and creative agencies looking for the highest creative and sonic quality.

Generate up to 3-minute high-quality audio that you can use commercially.

Stable Audio Open

Stable Audio Open 1.0 generates variable-length (up to 47s) stereo audio at 44.1kHz from text prompts. It comprises three components: an autoencoder that compresses waveforms into a manageable sequence length, a T5-based text embedding for text conditioning, and a transformer-based diffusion (DiT) model that operates in the latent space of the autoencoder.

Tuney builds AI tools that make music more accessible and fun for all creatives.

LyricStudio and MelodyStudio

Lyrics and Text

POE is an interface through which the users can prompt a variety of models from the same site.

Jarvis Songwriter AI Companion

Jarvis is a songwriting companion that helps overcome writer's block. It generates lyrics suggestions based on any artist, genre, title or/and lyrics prompt.

NLTK is a leading platform for building Python programs to work with human language data. It provides easy-to-use interfaces to over 50 corpora and lexical resources such as WordNet, along with a suite of text processing libraries for classification, tokenization, stemming, tagging, parsing, and semantic reasoning, wrappers for industrial-strength NLP libraries, and an active discussion forum.

Industrial-Strength Natural Language Processing

Visuals

Disco Diffusion v5.7

Image Generation model. The diffusion model in use is Katherine Crowson's fine-tuned 512x512 model.

Video and Image generator used for Music Videos and Animations

Video generator (lipsync)

Image generation service.

Video generator

Video generator

Create production-quality visual assets for your projects with unprecedented quality, speed, and style-consistency.

Music Videos, Animations

Microsoft Designer

Video generator

Video generator

Instrumentation
Production
Sound Engineering

AI-Audio Enhancer

Generate effects from text, describing the desired audio

Bandlab Mastering

Instantly master your tracks with the world’s leading online mastering service. Hear the difference mastering can make with the fastest, best sounding, and free artist-driven Mastering tool.

CataRT AI agent max patch

The concatenative real-time sound synthesis system CataRT, created in 2005, plays grains from a large corpus of segmented and descriptor-analysed sounds according to proximity to a target position in the descriptor space. This can be seen as a content-based extension to granular synthesis providing direct access to specific sound characteristics.

DDSP-VST morphs audio into a range of different instruments. Unlike MIDI notes, DDSP preserves the nuances of pitch and dynamics for expressive neural synthesis.

An online mastering engine that’s fast, easy to use, and sounds incredible. Made by Grammy-winning engineers, powered by AI

HARD Audio Remixer

HArmony-Rhythm Disentanglement audio remixer plugin.

LANDR AI Mastering

AI audio mastering tool.

Moises AI Stem Separation

Play your music in any key, at any speed. Remove vocals and instruments in any song. Discover the ultimate immersive practice experience powered by AI.

"Neutone Morpho, a realtime tone morphing plugin. Our cutting-edge machine learning technology can transform any sound into something new and inspiring. Neutone Morpho directly processes audio, capturing even the subtlest details from your input."

AI-powered DAW (stem splits, noise reduction, instrument substitution, etc.)

Generating samples

Soundtrap is a cloud-based digital audio workstation (DAW) that is reimagining how music and podcasts are created. Made by music producers, songwriters, and audio experts, our goal is to help you unlock your creative potential.

Sound Design, VST, Sound Generation

Datasets and Models

Dadabots DadaGP dataset

Encoder/decoder converts GuitarPro songs to/from a token-sequence format for generative language models like GPT2, TransformerXL, etc.

Making the world a better place through AI-generated beats & grooves

Clean foundation models for enterprise use.

Lakh MIDI Dataset

The Lakh MIDI dataset is a collection of 176,581 unique MIDI files, 45,129 of which have been matched and aligned to entries in the Million Song Dataset. Its goal is to facilitate large-scale music information retrieval, both symbolic (using the MIDI files alone) and audio content-based (using information extracted from the MIDI files as annotations for the matched audio files). Around 10% of all MIDI files include timestamped lyrics events with lyrics are often transcribed at the word, syllable or character level.

MUSIC FADERNETS

Controllable music generation based on high-level features via low-level feature modelling

The official implementation of NeuCoSVC, a versatile model for any-to-any singing voice conversion.

Symbolic music generation conditioned on continuous-valued emotions

Generates multi-instrument symbolic music (MIDI), based on user-provided emotions from valence-arousal plane.

Yating Music Transformer-based model

Yating Music Transformer-based model

ML Engines and Development Tools

Azure Machine Learning

ML as a Service. Use an enterprise-grade AI service for the end-to-end machine learning (ML) lifecycle.

The purpose of Keras is to give an unfair advantage to any developer looking to ship Machine Learning-powered apps. Keras focuses on debugging speed, code elegance & conciseness, maintainability, and deployability. When you choose Keras, your codebase is smaller, more readable, easier to iterate on.

librosa is a python package for music and audio analysis. It provides the building blocks necessary to create music information retrieval systems.

A visual programming language for realtime DSP synthesis. Mostly used to make music. Also used to do realtime graphics, video, and interactive art.

The PyTorch Foundation is a neutral home for the deep learning community to collaborate on the open source PyTorch framework and ecosystem. The PyTorch Foundation is supported by leading contributors to the PyTorch open source project.

Custom AI model training, generation & attribution for record labels, publishers, artists, producers, songwriters, distributors, digital instrument makers, sonic branding agencies and enterprise music companies.

An end-to-end platform for machine learning

Community for efficient hold and distribution of GPU

Voice

Unleash the unlimited potential of your voice

Kits is an AI voice platform that streamlines and improves producer workflows with AI audio tools built for music. Kits’ voice and instrument models are Fairly Trained certified.

Use AI to create music with your voice or other voices and make music like never before.

Ultra realistic Text to Speech(TTS) voice. Leading AI Voice Generator. Free Unlimited downloads. Most Fluent & Conversational AI voices

VITS-based Voice Conversion focused on simplicity, quality, and performance

Voicebanks + DAW

Voicebanks + DAW

Voice Clone - based on model by Coqui

Voice cloning tool

Real-time AI voice transformation and interactive audio that enables users to switch their voices as easily as they do skins.

Fantasy language generator