TOOLS AND MODELS
Composition
AIVA
MIDI creation based on detailed and specific musical prompts
Anticipatory Transformer
Symbolic (notation or MIDI output)
Beatoven.ai
AI-powered, royalty-free music for content creators.
Boomy
Boomy is a generative AI music platform where people create, publish, and monetize original songs - over 18 million to date.
Dance Diffusion
Dance Diffusion is the first in a suite of generative audio tools for producers and musicians to be released by Harmonai. It offers unconditional random audio sample generation, audio sample regeneration or style transfer from a single audio file or recording, and audio interpolation between two audio files.
deepjazz LSTM
Deep learning driven jazz generation using Keras & Theano!
Delphos
Use your own music for training the models and creating new music
DIFF-A-RIFF
Musical Accompaniment Co-creation via Latent Diffusion Models
Eleven Labs
AI voice models and products powering millions of developers, creators, and enterprises.
Endel
Endel is an AI sound wellness company specializing in creating functional soundscapes to help people focus, relax, and sleep.
Fadr Stem Separation
SynthGPT, Stems, and Remixing tools
Infinite Album
Infinite Album creates real-time generative AI music for games.
JenMusic
Ethically-trained. High-fidelity. Text-to-Music generative audio. World-class AI tech & research.
Lemonaide
Lemonaide Music is an artist-focused music technology company that helps musicians find new ways to write, produce, and create, while keeping the artist front and center.
LifeScore
LifeScore is an AI-powered music generation technology that transforms original recorded music into high quality, endlessly varying remixes that retain artists’ unique musical fingerprint at scale.
Loudly
By following strict ethical AI guidelines, Loudly guarantees that its proprietary music dataset has been carefully developed via consent, transparency and copyright compliance.
Magenta
Ableton Live Plugin. Magenta is a research project from Google focused on using machine learning to generate music and art.
Magenta Studio AI Music Composition Assistant
Magenta Studio is a MIDI plugin for Ableton Live. It contains 5 tools: Continue, Groove, Generate, Drumify, and Interpolate, which let you apply Magenta models to your MIDI files.
Melody Sauce 2
MIDI melody generator
MusicFX
Google's text-to-music tool, offered through AI Test Kitchen, a place where people can experience and give feedback on some of Google's latest AI technologies.
MusicLang
Symbolic (notation or MIDI output)
Mustango
An exciting addition to the vibrant landscape of Multimodal Large Language Models designed for controlled music generation.
Muzik
Symbolic (notation or MIDI output)
optimus GPT3
GPT3-based Multi-Instrumental MIDI Music AI Implementation
PIanoAI
This is code for providing an augmented piano playing experience. When run, this code will provide computer accompaniment that learns in real-time from the human host pianist. When the host pianist stops playing for a given amount of time, the computer AI will then improvise in the space using the style learned from the host.
RAVE
Realtime Audio Variational autoEncoder
Rightsify
Rightsify is a music licensing company pioneering the future of music creation and copyright management.
Soundful
Soundful AI helps users generate distinctive studio-quality music. The platform is built for content creators, artists, music producers, brands, and creative agencies looking for the highest creative and sonic quality.
Stable Audio
Generate high-quality audio up to 3 minutes long that you can use commercially.
Stable Audio Open
Stable Audio Open 1.0 generates variable-length (up to 47 s) stereo audio at 44.1 kHz from text prompts. It comprises three components: an autoencoder that compresses waveforms into a manageable sequence length, a T5-based text embedding for text conditioning, and a transformer-based diffusion (DiT) model that operates in the latent space of the autoencoder.
Tuney
Tuney builds AI tools that make music more accessible and fun for all creatives.
WaveAI
LyricStudio and MelodyStudio
Lyrics and Text
Claude
Lyrics and Prompt Writing
Gemini
Google's LLM, Lyrics and Prompt Writing
Poe
Poe is an interface through which users can prompt a variety of models from a single site.
Jarvis Songwriter AI Companion
Jarvis is a songwriting companion that helps overcome writer's block. It generates lyric suggestions based on any artist, genre, title, and/or lyrics prompt.
NLTK
NLTK is a leading platform for building Python programs to work with human language data. It provides easy-to-use interfaces to over 50 corpora and lexical resources such as WordNet, along with a suite of text processing libraries for classification, tokenization, stemming, tagging, parsing, and semantic reasoning, wrappers for industrial-strength NLP libraries, and an active discussion forum.
Perplexity
Search and NLP
Song Maker GPT-3
ChatGPT model built to help with making music
spaCy
Industrial-Strength Natural Language Processing
Visuals
Disco Diffusion v5.7
Image Generation model. The diffusion model in use is Katherine Crowson's fine-tuned 512x512 model.
Haiper
Video and image generator used for music videos and animations
Hedra
Video generator (lipsync)
Ideogram
Image generation service.
Katalist
Video generator
KREA
Video generator
Leonardo
Create production-quality visual assets for your projects with unprecedented quality, speed, and style-consistency.
Luma
Music Videos, Animations
Microsoft Designer
Album Art
Noisee
Video generator
RunwayML
Video generator
Stable Diffusion XL Base
SDXL consists of an ensemble of experts pipeline for latent diffusion: In a first step, the base model is used to generate (noisy) latents, which are then further processed with a refinement model specialized for the final denoising steps. Note that the base model can be used as a standalone module.
Instrumentation
Production
Sound Engineering
Adobe Podcast
AI audio enhancer
AudioLDM
Generate sound effects from a text description of the desired audio
Bandlab Mastering
Instantly master your tracks with the world’s leading online mastering service. Hear the difference mastering can make with the fastest, best sounding, and free artist-driven Mastering tool.
CataRT AI agent max patch
The concatenative real-time sound synthesis system CataRT, created in 2005, plays grains from a large corpus of segmented and descriptor-analysed sounds according to proximity to a target position in the descriptor space. This can be seen as a content-based extension to granular synthesis providing direct access to specific sound characteristics.
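The selection step described above (pick the grains closest to a target position in descriptor space) can be illustrated with a minimal sketch. This is not CataRT's own code: the corpus layout, the two normalized descriptors, and the function names select_grain and nearest_grains are hypothetical, and plain Euclidean distance is assumed as the proximity measure.

```python
import math

# Hypothetical corpus: each grain carries a position in descriptor space.
# The two descriptors are assumed to be normalized to the 0..1 range
# (for example, brightness and loudness) so that distances are comparable.
corpus = [
    {"id": "grain_a", "descriptors": (0.10, 0.80)},
    {"id": "grain_b", "descriptors": (0.55, 0.40)},
    {"id": "grain_c", "descriptors": (0.90, 0.20)},
]


def select_grain(corpus, target):
    """Return the grain whose descriptors are closest to the target point."""
    return min(corpus, key=lambda g: math.dist(g["descriptors"], target))


def nearest_grains(corpus, target, k):
    """Return the k grains closest to the target, nearest first."""
    ranked = sorted(corpus, key=lambda g: math.dist(g["descriptors"], target))
    return ranked[:k]


target = (0.60, 0.35)
best = select_grain(corpus, target)
neighbours = nearest_grains(corpus, target, k=2)
print(best["id"])
print([g["id"] for g in neighbours])
```

Moving the target point through descriptor space then changes which grains are played, which is what gives direct access to specific sound characteristics. A real corpus would hold many more grains and descriptors, and would likely use a spatial index rather than a linear scan.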
DDSP
DDSP-VST morphs audio into a range of different instruments. Unlike MIDI notes, DDSP preserves the nuances of pitch and dynamics for expressive neural synthesis.
eMastered
An online mastering engine that’s fast, easy to use, and sounds incredible. Made by Grammy-winning engineers, powered by AI
HARD Audio Remixer
HArmony-Rhythm Disentanglement audio remixer plugin.
LANDR AI Mastering
AI audio mastering tool.
Moises AI Stem Separation
Play your music in any key, at any speed. Remove vocals and instruments in any song. Discover the ultimate immersive practice experience powered by AI.
Neutone
Neutone Morpho is a realtime tone-morphing plugin. Our cutting-edge machine learning technology can transform any sound into something new and inspiring. Neutone Morpho directly processes audio, capturing even the subtlest details from your input.
RipX
AI-powered DAW (stem splits, noise reduction, instrument substitution, etc.)
SampLab
Generating samples
Soundtrap
Soundtrap is a cloud-based digital audio workstation (DAW) that is reimagining how music and podcasts are created. Made by music producers, songwriters, and audio experts, our goal is to help you unlock your creative potential.
Synplant 2
Sound Design, VST, Sound Generation
Datasets and Models
Dadabots DadaGP dataset
Encoder/decoder converts GuitarPro songs to/from a token-sequence format for generative language models like GPT2, TransformerXL, etc.
DeepDrummer
Making the world a better place through AI-generated beats & grooves
KL3M
Clean foundation models for enterprise use.
Lakh MIDI Dataset
The Lakh MIDI dataset is a collection of 176,581 unique MIDI files, 45,129 of which have been matched and aligned to entries in the Million Song Dataset. Its goal is to facilitate large-scale music information retrieval, both symbolic (using the MIDI files alone) and audio content-based (using information extracted from the MIDI files as annotations for the matched audio files). Around 10% of all MIDI files include timestamped lyrics events, with lyrics often transcribed at the word, syllable, or character level.
MUSIC FADERNETS
Controllable music generation based on high-level features via low-level feature modelling
NeuCoSVC
The official implementation of NeuCoSVC, a versatile model for any-to-any singing voice conversion.
Symbolic music generation conditioned on continuous-valued emotions
Generates multi-instrument symbolic music (MIDI) based on user-provided emotions from the valence-arousal plane.
Yating Music Transformer-based model
Yating Music Transformer-based model
ML Engines and Development Tools
Azure Machine Learning
ML as a Service. Use an enterprise-grade AI service for the end-to-end machine learning (ML) lifecycle.
Keras
The purpose of Keras is to give an unfair advantage to any developer looking to ship Machine Learning-powered apps. Keras focuses on debugging speed, code elegance & conciseness, maintainability, and deployability. When you choose Keras, your codebase is smaller, more readable, easier to iterate on.
LibROSA
librosa is a Python package for music and audio analysis. It provides the building blocks necessary to create music information retrieval systems.
Purr Data
A visual programming language for realtime DSP synthesis. Mostly used to make music. Also used to do realtime graphics, video, and interactive art.
PyTorch
The PyTorch Foundation is a neutral home for the deep learning community to collaborate on the open source PyTorch framework and ecosystem. The PyTorch Foundation is supported by leading contributors to the PyTorch open source project.
Somms.ai
Custom AI model training, generation & attribution for record labels, publishers, artists, producers, songwriters, distributors, digital instrument makers, sonic branding agencies and enterprise music companies.
TensorFlow
An end-to-end platform for machine learning
ZeroGPU Spaces
Shared infrastructure on Hugging Face Spaces for efficiently allocating and releasing GPUs on demand
Voice
Controlla
Unleash the unlimited potential of your voice
Kits
Kits is an AI voice platform that streamlines and improves producer workflows with AI audio tools built for music. Kits’ voice and instrument models are Fairly Trained certified.
Musicfy
Use AI to create music with your voice or other voices and make music like never before.
PlayHT
Ultra-realistic text-to-speech (TTS) voices. Leading AI voice generator. Free unlimited downloads. Most fluent and conversational AI voices.
RVC
VITS-based Voice Conversion focused on simplicity, quality, and performance
Synthesizer V
Voicebanks + DAW
Vocaloid
Voicebanks + DAW
Voice Clone - based on model by Coqui
Voice cloning tool
Voicemod
Real-time AI voice transformation and interactive audio that enables users to switch their voices as easily as they do skins.
Vulgarlang
Fantasy language generator