Files
videobeaux/docs/_site/programs/utilities/silence_extraction.html
2025-12-07 22:04:44 -05:00

155 lines
7.1 KiB
HTML
Raw Permalink Blame History

This file contains ambiguous Unicode characters
This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.
<!DOCTYPE html>
<html lang="en-US">
<head>
<meta charset="UTF-8">
<meta http-equiv="X-UA-Compatible" content="IE=edge">
<meta name="viewport" content="width=device-width, initial-scale=1">
<!-- Begin Jekyll SEO tag v2.8.0 -->
<title>silence_extraction</title>
<meta name="generator" content="Jekyll v3.10.0" />
<meta property="og:title" content="silence_extraction" />
<meta property="og:locale" content="en_US" />
<meta name="description" content="The friendly multilateral video toolkit built for artists by artists." />
<meta property="og:description" content="The friendly multilateral video toolkit built for artists by artists." />
<link rel="canonical" href="http://localhost:4000/videobeaux/programs/utilities/silence_extraction.html" />
<meta property="og:url" content="http://localhost:4000/videobeaux/programs/utilities/silence_extraction.html" />
<meta property="og:type" content="website" />
<meta name="twitter:card" content="summary" />
<meta property="twitter:title" content="silence_extraction" />
<script type="application/ld+json">
{"@context":"https://schema.org","@type":"WebPage","description":"The friendly multilateral video toolkit built for artists by artists.","headline":"silence_extraction","publisher":{"@type":"Organization","logo":{"@type":"ImageObject","url":"http://localhost:4000/videobeaux/assets/img/videobeaux.png"}},"url":"http://localhost:4000/videobeaux/programs/utilities/silence_extraction.html"}</script>
<!-- End Jekyll SEO tag -->
<link rel="stylesheet" href="/videobeaux/assets/css/style.css?v=5e23701ed3967d38bab12937d79f95fae74b2a53">
<!--[if lt IE 9]>
<script src="https://cdnjs.cloudflare.com/ajax/libs/html5shiv/3.7.3/html5shiv.min.js"></script>
<![endif]-->
<!-- start custom head snippets, customize with your own _includes/head-custom.html file -->
<!-- Setup Google Analytics -->
<!-- You can set your favicon here -->
<!-- link rel="shortcut icon" type="image/x-icon" href="/videobeaux/favicon.ico" -->
<!-- end custom head snippets -->
</head>
<body>
<div class="wrapper">
<header>
<h1><a href="http://localhost:4000/videobeaux/">videobeaux</a></h1>
<img src="/videobeaux/assets/img/videobeaux.png" alt="Logo" />
<p>The friendly multilateral video toolkit built for artists by artists.</p>
<p class="view"><a href="https://github.com/schwwaaa/videobeaux">View the Project on GitHub <small>schwwaaa/videobeaux</small></a></p>
</header>
<section>
<h1 id="silence_extraction">silence_extraction</h1>
<h2 id="description">Description</h2>
<p>Extracts sections of silence from a videos audio track based on duration thresholds.<br />
Useful for identifying dead air, isolating non-dialogue segments, or preparing silence-aware edits and analysis.</p>
<h2 id="purpose">Purpose</h2>
<p>The <code class="language-plaintext highlighter-rouge">silence_extraction</code> program is designed to detect, isolate, or extract moments of silence within a videos audio.<br />
This is useful for:</p>
<ul>
<li>cutting silent gaps out of recordings,</li>
<li>analyzing pacing or speech density,</li>
<li>preparing regions for time compression,</li>
<li>generating metadata for editors or automation pipelines.</li>
</ul>
<h2 id="how-it-works">How It Works</h2>
<ol>
<li><strong>Silence Detection</strong><br />
FFmpegs silence detection logic identifies quiet sections based on amplitude thresholds.</li>
<li><strong>Duration Filtering</strong>
<ul>
<li><code class="language-plaintext highlighter-rouge">min_d</code> defines the minimum silence duration to be considered meaningful.</li>
<li><code class="language-plaintext highlighter-rouge">max_d</code> defines the longest segment to extract or label.</li>
</ul>
</li>
<li><strong>Adjuster Logic</strong>
The <code class="language-plaintext highlighter-rouge">adjuster</code> parameter allows tuning how tolerant the detection should be, adjusting thresholds or trimming surrounding audio depending on implementation.</li>
<li><strong>Output Behavior</strong><br />
Extracted silence segments may be exported individually, compiled, or used to generate metadata depending on how videobeaux handles downstream processing.</li>
</ol>
<h2 id="program-template">Program Template</h2>
<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>videobeaux -P silence_extraction \
-i input.mp4 \
-o output.mp4 \
--min_d VALUE \
--max_d VALUE \
--adjuster VALUE
</code></pre></div></div>
<h2 id="arguments">Arguments</h2>
<ul>
<li><strong>min_d</strong> — Minimum silence duration (in seconds) to count as a silence event.</li>
<li><strong>max_d</strong> — Maximum silence duration to extract or annotate.</li>
<li><strong>adjuster</strong> — Fine-tuning parameter for silence threshold sensitivity or trimming behavior.</li>
</ul>
<h2 id="real-world-example">Real World Example</h2>
<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>videobeaux -P silence_extraction \
-i myvideo.mp4 \
-o silence_extraction_styled.mp4 \
--min_d 1.5 \
--max_d 12.0 \
--adjuster medium
</code></pre></div></div>
<h2 id="technical-notes">Technical Notes</h2>
<ul>
<li>Silence detection is typically amplitude-based using FFmpeg filters (e.g., <code class="language-plaintext highlighter-rouge">silencedetect</code>).</li>
<li><code class="language-plaintext highlighter-rouge">min_d</code> is useful for ignoring tiny pauses or breath sounds.</li>
<li>Very large <code class="language-plaintext highlighter-rouge">max_d</code> values may capture irrelevant long stretches; tune for your content.</li>
<li><code class="language-plaintext highlighter-rouge">adjuster</code> may influence thresholding; examples include “strict,” “medium,” or “loose” depending on your implementation.</li>
</ul>
<h2 id="recommended-usage">Recommended Usage</h2>
<ul>
<li>Removing silent gaps in interviews or podcasts.</li>
<li>Locating pauses in lectures for automatic chaptering.</li>
<li>Creating pacing analytics (speech vs silence ratio).</li>
<li>Identifying dead air in archival footage.</li>
</ul>
<h2 id="quality-tips">Quality Tips</h2>
<ul>
<li>Use smaller <code class="language-plaintext highlighter-rouge">min_d</code> values (0.30.7s) for fast speech.</li>
<li>Use larger <code class="language-plaintext highlighter-rouge">min_d</code> (12s) for natural conversations or interviews.</li>
<li>Fine-tune <code class="language-plaintext highlighter-rouge">adjuster</code> to avoid misclassifying quiet music or soft ambience as silence.</li>
<li>Always review extracted segments before batch processing removal or compression.</li>
</ul>
</section>
<footer>
<p>This project is maintained by <a href="https://github.com/schwwaaa">schwwaaa</a></p>
<p><small>Hosted on GitHub Pages &mdash; Theme by <a href="https://github.com/orderedlist">orderedlist</a></small></p>
</footer>
</div>
<script src="/videobeaux/assets/js/scale.fix.js"></script>
</body>
</html>