Tag: Audio processing

WAAS

Categories: AI Developer Tools, Open Source AI Models

GUI and API for OpenAI Whisper with queuing and notification features.

Applio

Categories: AI Voice Changer, AI Voice Cloning, AI Voice Generator

Applio is a simple, high-quality VITS-based voice conversion tool.

Sumo

Categories: AI Speech-to-Text, Audio To Text AI

Extracts the essence of lengthy audio files effortlessly using AI.

GoodListen

Categories: Long Video To Short Video AI, AI Podcast, AI Podcast Clip Generator

AI-powered podcast tool for discovery, summarization, and content repurposing.

VocalReplica

Categories: AI Tiktok, AI Text-to-Speech, AI Voice Cloning, AI Voice Generator

Voice cloning and vocal/instrumental isolation from YouTube videos and audio files.

Coqui

Categories: Other

Coqui was a platform focused on freeing speech, now shutting down.

NirvanaAI extension

Categories: AI Assistant, AI Productivity Tools, AI Speech-to-Text

AI-powered extension for spelling, transcription, diarization, and email integration.

MMAudio pro

Categories: AI Sound Effect Generator, AI Dubbing, AI Text-to-Speech, AI Voice Over

AI-powered tool for video-to-audio synthesis and text-to-audio conversion.

Nendo

Categories: AI Music Generator, AI Song Remixer, AI Stems Splitter

AI platform to generate unique sample packs from any music library.

Polymath

Categories: AI Midi Generator, AI Song Remixer, AI Stems Splitter, Open Source AI Models

ML-powered tool to convert music libraries into searchable sample libraries for music production.

Sarv NC Lite

Categories: AI Audio Enhancer, AI Noise Cancellation, AI Voice Enhancer

Sarv NS removes background noise for clearer calls.

AdutorAI

Categories: AI Translate, AI Notes Generator, AI Speech-to-Text, AI Transcriber, AI Transcription, Audio To Text AI

AI tool to convert speech to clear, structured text with style customization.

FiFi.ai

Categories: AI Content Detector, AI API, AI OCR, AI Crop Image, AI Image Enhancer, AI Image Generator, AI Image Upscaler, AI QR Code Generator, AI Music Generator, AI Productivity Tools, AI Models, Large Language Models (LLMs), AI Video Generator, AI Speech-to-Text, AI Text-to-Speech

FiFi.ai is an AI cloud platform for business growth with smart tools and custom models.

Label Studio

Categories: AI Developer Tools, Large Language Models (LLMs), Open Source AI Models

Open-source data labeling tool for various data types, with ML integration.

only one ai

Categories: AI Code Generator, AI Image Generator, AI Music Generator, AI Video Generator

A comprehensive directory of over 30,000 AI tools for various applications.

Supametas.AI

Categories: AI API, AI Developer Tools, AI Web Scraping, AI Knowledge Base, AI Document Extraction, AI Files, Large Language Models (LLMs), AI Transcription

Platform converting unstructured data to LLM RAG-ready structured data for knowledge bases.

ittybit

Categories: AI API, AI Developer Tools, AI Image Recognition, AI Transcriber

Scalable media APIs for video, audio, and image processing and automation.

302.AI

Categories: AI Chatbot, AI Tools Directory, Large Language Models (LLMs)

Self-service platform for top global AI models, pay-as-you-go.

GPT-4o click to start

Categories: AI Chatbot, AI API, AI Image Recognition, AI Assistant, AI Speech-to-Text, AI Text-to-Speech

OpenAI's GPT-4o is an advanced AI model with real-time multimodal processing and emotion detection.

GPT 4o

Categories: AI Chatbot, AI Code Generator, AI Translate, AI Image Recognition, Large Language Models (LLMs), AI Speech Recognition, AI Text-to-Speech, AI Voice Assistants

Free access to GPT-4o with real-time multimodal capabilities.

Maximo AI

Categories: AI Chatbot, AI Image Generator, AI Audio Editing, AI Assistant, AI Documents Generator, AI PDF, AI PPT Maker, AI Research Tool, AI Video Editor

All-in-one AI solution for trading, content creation, automation, and social media management.

Tila AI

Categories: AI Workflow, AI Image Generator, AI Agent, AI Copilot, Large Language Models (LLMs), AI Video Generator

Visual AI workspace for multimodal projects and workflow automation.

reccloud.cn

Categories: AI Cartoon Generator, AI API, AI Video Translator, AI Vocal Remover, AI Animated Video, AI Animation Generator, AI Video Editor, AI Video Generator, AI Video Summarizer, Text to Video, AI Dubbing, AI Speech-to-Text, AI Text-to-Speech, AI Voice Generator, Audio To Text AI

AI audio and video processing platform with tools for transcription, translation, and editing.