Cool Artificial intelligence tools

Transcribe Audio

Local

Online

  • Turboscribe: https://turboscribe.ai
    Tried to feed it a file with 2 people, one speaking with a Jordanian accent, and another with a Saudi accent, the results were 8/10, the system seems to allow 3 files 30 minutes each for free users.

Text To Speech

Local

Tortoise and Bark for Voice Synthesis

Creating images with AI (Programming)

So, I will start by dropping a few keywords here, and what they are about, that will probably help you start, I am compiling this as a guideline on where to start

  • VQGAN (Vector Quantized Generative Adversarial Network / neural network) : The software that generates the image
  • CLIP (Contrastive Language-Image Pre-training / neural network) : Software to influence a generated image based on input text (User prompt)
  • VQGAN+CLIP : Two neural network pieces of software that work in tandem.
  • CLIP-Guided-Diffusion: A technique for doing text-to-image synthesis cheaply using pre-trained CLIP and diffusion models.
  • Google colab notebook: A tool made by google where you can run python code and utilize google’s GPUs, both paid and free exist