Generative AI-Powered Audio/Video Processing: Whisper's Python Adventure What Is the Purpose of This Application? An application that displays the ability of Generative AI to process and analyze audio/video files, and then output the lyrics for that audio or video file. Sample code: import streamlit as st from pytube import YouTube import os import torch from transformers import AutoModelForSpeechSeq2Seq, AutoProcessor, pipeline def get_mp3(url): yt = YouTube(str(url)) audio = yt.streams.filter(only_audio = True ).first() destination = '.' out_file = audio.download(output_path=destination) base, ext = os.path.splitext(out_file) new_file = base + '.mp3' os.rename(out_file, new_file) return new_file def get_transcript(audio_file): device = "cuda:0" if torch.cuda.is_available() else "cpu" #If you have GPU else it will use cpu ...
A blog about my testing stories where I pen down my thoughts about test automation covering primarily Selenium, Playwright, Java, JS/TS, Rest Assured, Karate, Maven, TestNG, Postman, newman, Jenkins, Git, Azure DevOps, etc.