User Tools

Site Tools


misc:av

Audio/Video Processing and Podcasts

SoX - Sound eXchange

FFmpeg

FFmpeg provides many audio processing filters:

latest prebuilt (static) binaries:

FFmpeg is utilized in several tools, e.g.:

YouTube Downloader

Downloading audio and video files is interesting for archiving - but also preprocessing for later watching or listening. Many platforms - other than YouTube - are supported.

As described in yt-dlp Wiki on 2022-09-19, there are 3 active forks for the command line:

There are a multitude of Downloader-GUIs, here a very short selection:

Podcast programs & apps

Podcast apps (automatically) download episodes from RSS feeds for (offline and mobile) listening.

Providing an own podcast feed

Podsync can download videos, convert to audio and publish them as an RSS feed - suitable for subscription with podcasts apps. Tools like yt-dlp and ffmpeg are utilized. The only downside: Podsync requires a public server.

Changing speed - not pitch

Listening consumes time - of course. Despite skipping (silent) sections, many episodes can be perceived at (much) higher speed - depending on the speaker(s); thus saving time. The following points to some programs/algorithms.

TDHS, see https://en.wikipedia.org/wiki/Time-domain_harmonic_scaling
implementation: https://github.com/hayguen/audio-stretch
and it's origin https://github.com/dbry/audio-stretch

Browser Extension: skip-silence
https://github.com/vantezzen/skip-silence/issues/36
https://chrome.google.com/webstore/detail/skip-silence/fhdmkhbefcbhakffdihhceaklaigdllh?hl=en

FFmpeg uses atempo:
https://trac.ffmpeg.org/wiki/How%20to%20speed%20up%20/%20slow%20down%20a%20video

libavfilter/af_atempo.c in git://source.ffmpeg.org/ffmpeg

it is based on https://sourceforge.net/projects/apprenticevideo/
with code at https://github.com/pkoshevoy/aeyae
in yae/ffmpeg/yae_audio_tempo_filter.h

theory at http://www.surina.net/article/time-and-pitch-scaling.html

Loudness

misc/av.txt · Last modified: 2023/01/01 (external edit)