{"id":2270,"date":"2026-06-15T22:31:41","date_gmt":"2026-06-15T22:31:41","guid":{"rendered":"https:\/\/portfolio.arnoldbiffna.com\/?p=2270"},"modified":"2026-06-16T03:02:33","modified_gmt":"2026-06-16T03:02:33","slug":"voice-cloning","status":"publish","type":"post","link":"https:\/\/portfolio.arnoldbiffna.com\/index.php\/2026\/06\/15\/voice-cloning\/","title":{"rendered":"Voice Cloning"},"content":{"rendered":"\n<figure class=\"wp-block-video\"><video controls poster=\"https:\/\/d3od4vl78dd97d.cloudfront.net\/blogpreviews\/voicecloning.jpg\" src=\"https:\/\/d3od4vl78dd97d.cloudfront.net\/blogvideo\/churchill.mp4\"><\/video><\/figure>\n\n\n\n<p>I am completely amazed at how well voice cloning works locally on my machine.<\/p>\n\n\n\n<p>I already had a local MLX Whisper model running, which I use to generate subtitles from video. I thought that was all I needed. Then I asked Codex to build me a voice cloning tool in Python, and it went much deeper than I expected.<\/p>\n\n\n\n<p>The tool ended up using:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Python<\/li>\n\n\n\n<li>FFmpeg and FFprobe<\/li>\n\n\n\n<li>MLX Audio text-to-speech<\/li>\n\n\n\n<li>MLX Whisper transcription<\/li>\n\n\n\n<li>Hugging Face model snapshots<\/li>\n<\/ul>\n\n\n\n<p>A quick note: cloning someone\u2019s voice can get you into trouble if they object or decide to sue. That said, Winston Churchill is no longer around to complain.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">What the Tool Does<\/h2>\n\n\n\n<ol class=\"wp-block-list\">\n<li>Lets me choose a reference audio or video file<\/li>\n\n\n\n<li>Lets me choose a speech text file<\/li>\n\n\n\n<li>Detects and skips leading silence (<strong><em>FFmpeg<\/em><\/strong>)<\/li>\n\n\n\n<li>Extracts and cleans a reference voice sample (<em><strong>FF<\/strong>mpeg<\/em>)<\/li>\n\n\n\n<li>Transcribes the cleaned reference using an MLX Whisper model (<strong><em>whisper-large-v3-mlx<\/em><\/strong>)<\/li>\n\n\n\n<li>Generates new speech from text using a local MLX Audio model (<strong><em>higgs-audio-v2-3B-mlx-q8<\/em><\/strong>)<\/li>\n<\/ol>\n\n\n\n<p>The wild part is that all of this runs locally. No cloud service. No subscription API. Just my Mac, some open-source tools, and a Python script that suddenly feels a little <em>too<\/em> powerful.<\/p>\n\n\n\n<p><\/p>\n","protected":false},"excerpt":{"rendered":"<p>I am completely amazed at how well voice cloning works locally on my machine. I already had a local MLX Whisper model running, which I use to generate subtitles from video. I thought that was all I needed. Then I asked Codex to build me a voice cloning tool in Python, and it went much &hellip; <a href=\"https:\/\/portfolio.arnoldbiffna.com\/index.php\/2026\/06\/15\/voice-cloning\/\" class=\"more-link\">Continue reading<span class=\"screen-reader-text\"> &#8220;Voice Cloning&#8221;<\/span><\/a><\/p>\n","protected":false},"author":2,"featured_media":2279,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[48,57,56],"tags":[],"class_list":["post-2270","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-artificial-intelligence","category-local-ai","category-python"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.8 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Voice Cloning - Arnold Biffna Portfolio<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/portfolio.arnoldbiffna.com\/index.php\/2026\/06\/15\/voice-cloning\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Voice Cloning - Arnold Biffna Portfolio\" \/>\n<meta property=\"og:description\" content=\"I am completely amazed at how well voice cloning works locally on my machine. I already had a local MLX Whisper model running, which I use to generate subtitles from video. I thought that was all I needed. Then I asked Codex to build me a voice cloning tool in Python, and it went much &hellip; Continue reading &quot;Voice Cloning&quot;\" \/>\n<meta property=\"og:url\" content=\"https:\/\/portfolio.arnoldbiffna.com\/index.php\/2026\/06\/15\/voice-cloning\/\" \/>\n<meta property=\"og:site_name\" content=\"Arnold Biffna Portfolio\" \/>\n<meta property=\"article:published_time\" content=\"2026-06-15T22:31:41+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2026-06-16T03:02:33+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/portfolio.arnoldbiffna.com\/wp-content\/uploads\/2026\/06\/voicecloning.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"200\" \/>\n\t<meta property=\"og:image:height\" content=\"200\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Arnold Biffna\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Arnold Biffna\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"2 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/portfolio.arnoldbiffna.com\\\/index.php\\\/2026\\\/06\\\/15\\\/voice-cloning\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/portfolio.arnoldbiffna.com\\\/index.php\\\/2026\\\/06\\\/15\\\/voice-cloning\\\/\"},\"author\":{\"name\":\"Arnold Biffna\",\"@id\":\"https:\\\/\\\/portfolio.arnoldbiffna.com\\\/#\\\/schema\\\/person\\\/248c0dc4cc332f12f120d4734ca0ae8f\"},\"headline\":\"Voice Cloning\",\"datePublished\":\"2026-06-15T22:31:41+00:00\",\"dateModified\":\"2026-06-16T03:02:33+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/portfolio.arnoldbiffna.com\\\/index.php\\\/2026\\\/06\\\/15\\\/voice-cloning\\\/\"},\"wordCount\":201,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\\\/\\\/portfolio.arnoldbiffna.com\\\/#\\\/schema\\\/person\\\/248c0dc4cc332f12f120d4734ca0ae8f\"},\"image\":{\"@id\":\"https:\\\/\\\/portfolio.arnoldbiffna.com\\\/index.php\\\/2026\\\/06\\\/15\\\/voice-cloning\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/portfolio.arnoldbiffna.com\\\/wp-content\\\/uploads\\\/2026\\\/06\\\/voicecloning.jpg\",\"articleSection\":[\"Artificial Intelligence\",\"Local AI\",\"Python\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\\\/\\\/portfolio.arnoldbiffna.com\\\/index.php\\\/2026\\\/06\\\/15\\\/voice-cloning\\\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/portfolio.arnoldbiffna.com\\\/index.php\\\/2026\\\/06\\\/15\\\/voice-cloning\\\/\",\"url\":\"https:\\\/\\\/portfolio.arnoldbiffna.com\\\/index.php\\\/2026\\\/06\\\/15\\\/voice-cloning\\\/\",\"name\":\"Voice Cloning - Arnold Biffna Portfolio\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/portfolio.arnoldbiffna.com\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/portfolio.arnoldbiffna.com\\\/index.php\\\/2026\\\/06\\\/15\\\/voice-cloning\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/portfolio.arnoldbiffna.com\\\/index.php\\\/2026\\\/06\\\/15\\\/voice-cloning\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/portfolio.arnoldbiffna.com\\\/wp-content\\\/uploads\\\/2026\\\/06\\\/voicecloning.jpg\",\"datePublished\":\"2026-06-15T22:31:41+00:00\",\"dateModified\":\"2026-06-16T03:02:33+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/portfolio.arnoldbiffna.com\\\/index.php\\\/2026\\\/06\\\/15\\\/voice-cloning\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/portfolio.arnoldbiffna.com\\\/index.php\\\/2026\\\/06\\\/15\\\/voice-cloning\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/portfolio.arnoldbiffna.com\\\/index.php\\\/2026\\\/06\\\/15\\\/voice-cloning\\\/#primaryimage\",\"url\":\"https:\\\/\\\/portfolio.arnoldbiffna.com\\\/wp-content\\\/uploads\\\/2026\\\/06\\\/voicecloning.jpg\",\"contentUrl\":\"https:\\\/\\\/portfolio.arnoldbiffna.com\\\/wp-content\\\/uploads\\\/2026\\\/06\\\/voicecloning.jpg\",\"width\":200,\"height\":200},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/portfolio.arnoldbiffna.com\\\/index.php\\\/2026\\\/06\\\/15\\\/voice-cloning\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/portfolio.arnoldbiffna.com\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Voice Cloning\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/portfolio.arnoldbiffna.com\\\/#website\",\"url\":\"https:\\\/\\\/portfolio.arnoldbiffna.com\\\/\",\"name\":\"Arnold Biffna Portfolio\",\"description\":\"\",\"publisher\":{\"@id\":\"https:\\\/\\\/portfolio.arnoldbiffna.com\\\/#\\\/schema\\\/person\\\/248c0dc4cc332f12f120d4734ca0ae8f\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/portfolio.arnoldbiffna.com\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":[\"Person\",\"Organization\"],\"@id\":\"https:\\\/\\\/portfolio.arnoldbiffna.com\\\/#\\\/schema\\\/person\\\/248c0dc4cc332f12f120d4734ca0ae8f\",\"name\":\"Arnold Biffna\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/portfolio.arnoldbiffna.com\\\/wp-content\\\/uploads\\\/2025\\\/04\\\/profile.jpg\",\"url\":\"https:\\\/\\\/portfolio.arnoldbiffna.com\\\/wp-content\\\/uploads\\\/2025\\\/04\\\/profile.jpg\",\"contentUrl\":\"https:\\\/\\\/portfolio.arnoldbiffna.com\\\/wp-content\\\/uploads\\\/2025\\\/04\\\/profile.jpg\",\"width\":960,\"height\":960,\"caption\":\"Arnold Biffna\"},\"logo\":{\"@id\":\"https:\\\/\\\/portfolio.arnoldbiffna.com\\\/wp-content\\\/uploads\\\/2025\\\/04\\\/profile.jpg\"},\"sameAs\":[\"http:\\\/\\\/arnoldbiffna.com\"],\"url\":\"https:\\\/\\\/portfolio.arnoldbiffna.com\\\/index.php\\\/author\\\/arnoldbiffna-2\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Voice Cloning - Arnold Biffna Portfolio","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/portfolio.arnoldbiffna.com\/index.php\/2026\/06\/15\/voice-cloning\/","og_locale":"en_US","og_type":"article","og_title":"Voice Cloning - Arnold Biffna Portfolio","og_description":"I am completely amazed at how well voice cloning works locally on my machine. I already had a local MLX Whisper model running, which I use to generate subtitles from video. I thought that was all I needed. Then I asked Codex to build me a voice cloning tool in Python, and it went much &hellip; Continue reading \"Voice Cloning\"","og_url":"https:\/\/portfolio.arnoldbiffna.com\/index.php\/2026\/06\/15\/voice-cloning\/","og_site_name":"Arnold Biffna Portfolio","article_published_time":"2026-06-15T22:31:41+00:00","article_modified_time":"2026-06-16T03:02:33+00:00","og_image":[{"width":200,"height":200,"url":"https:\/\/portfolio.arnoldbiffna.com\/wp-content\/uploads\/2026\/06\/voicecloning.jpg","type":"image\/jpeg"}],"author":"Arnold Biffna","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Arnold Biffna","Est. reading time":"2 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/portfolio.arnoldbiffna.com\/index.php\/2026\/06\/15\/voice-cloning\/#article","isPartOf":{"@id":"https:\/\/portfolio.arnoldbiffna.com\/index.php\/2026\/06\/15\/voice-cloning\/"},"author":{"name":"Arnold Biffna","@id":"https:\/\/portfolio.arnoldbiffna.com\/#\/schema\/person\/248c0dc4cc332f12f120d4734ca0ae8f"},"headline":"Voice Cloning","datePublished":"2026-06-15T22:31:41+00:00","dateModified":"2026-06-16T03:02:33+00:00","mainEntityOfPage":{"@id":"https:\/\/portfolio.arnoldbiffna.com\/index.php\/2026\/06\/15\/voice-cloning\/"},"wordCount":201,"commentCount":0,"publisher":{"@id":"https:\/\/portfolio.arnoldbiffna.com\/#\/schema\/person\/248c0dc4cc332f12f120d4734ca0ae8f"},"image":{"@id":"https:\/\/portfolio.arnoldbiffna.com\/index.php\/2026\/06\/15\/voice-cloning\/#primaryimage"},"thumbnailUrl":"https:\/\/portfolio.arnoldbiffna.com\/wp-content\/uploads\/2026\/06\/voicecloning.jpg","articleSection":["Artificial Intelligence","Local AI","Python"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/portfolio.arnoldbiffna.com\/index.php\/2026\/06\/15\/voice-cloning\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/portfolio.arnoldbiffna.com\/index.php\/2026\/06\/15\/voice-cloning\/","url":"https:\/\/portfolio.arnoldbiffna.com\/index.php\/2026\/06\/15\/voice-cloning\/","name":"Voice Cloning - Arnold Biffna Portfolio","isPartOf":{"@id":"https:\/\/portfolio.arnoldbiffna.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/portfolio.arnoldbiffna.com\/index.php\/2026\/06\/15\/voice-cloning\/#primaryimage"},"image":{"@id":"https:\/\/portfolio.arnoldbiffna.com\/index.php\/2026\/06\/15\/voice-cloning\/#primaryimage"},"thumbnailUrl":"https:\/\/portfolio.arnoldbiffna.com\/wp-content\/uploads\/2026\/06\/voicecloning.jpg","datePublished":"2026-06-15T22:31:41+00:00","dateModified":"2026-06-16T03:02:33+00:00","breadcrumb":{"@id":"https:\/\/portfolio.arnoldbiffna.com\/index.php\/2026\/06\/15\/voice-cloning\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/portfolio.arnoldbiffna.com\/index.php\/2026\/06\/15\/voice-cloning\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/portfolio.arnoldbiffna.com\/index.php\/2026\/06\/15\/voice-cloning\/#primaryimage","url":"https:\/\/portfolio.arnoldbiffna.com\/wp-content\/uploads\/2026\/06\/voicecloning.jpg","contentUrl":"https:\/\/portfolio.arnoldbiffna.com\/wp-content\/uploads\/2026\/06\/voicecloning.jpg","width":200,"height":200},{"@type":"BreadcrumbList","@id":"https:\/\/portfolio.arnoldbiffna.com\/index.php\/2026\/06\/15\/voice-cloning\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/portfolio.arnoldbiffna.com\/"},{"@type":"ListItem","position":2,"name":"Voice Cloning"}]},{"@type":"WebSite","@id":"https:\/\/portfolio.arnoldbiffna.com\/#website","url":"https:\/\/portfolio.arnoldbiffna.com\/","name":"Arnold Biffna Portfolio","description":"","publisher":{"@id":"https:\/\/portfolio.arnoldbiffna.com\/#\/schema\/person\/248c0dc4cc332f12f120d4734ca0ae8f"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/portfolio.arnoldbiffna.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":["Person","Organization"],"@id":"https:\/\/portfolio.arnoldbiffna.com\/#\/schema\/person\/248c0dc4cc332f12f120d4734ca0ae8f","name":"Arnold Biffna","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/portfolio.arnoldbiffna.com\/wp-content\/uploads\/2025\/04\/profile.jpg","url":"https:\/\/portfolio.arnoldbiffna.com\/wp-content\/uploads\/2025\/04\/profile.jpg","contentUrl":"https:\/\/portfolio.arnoldbiffna.com\/wp-content\/uploads\/2025\/04\/profile.jpg","width":960,"height":960,"caption":"Arnold Biffna"},"logo":{"@id":"https:\/\/portfolio.arnoldbiffna.com\/wp-content\/uploads\/2025\/04\/profile.jpg"},"sameAs":["http:\/\/arnoldbiffna.com"],"url":"https:\/\/portfolio.arnoldbiffna.com\/index.php\/author\/arnoldbiffna-2\/"}]}},"_links":{"self":[{"href":"https:\/\/portfolio.arnoldbiffna.com\/index.php\/wp-json\/wp\/v2\/posts\/2270","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/portfolio.arnoldbiffna.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/portfolio.arnoldbiffna.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/portfolio.arnoldbiffna.com\/index.php\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/portfolio.arnoldbiffna.com\/index.php\/wp-json\/wp\/v2\/comments?post=2270"}],"version-history":[{"count":12,"href":"https:\/\/portfolio.arnoldbiffna.com\/index.php\/wp-json\/wp\/v2\/posts\/2270\/revisions"}],"predecessor-version":[{"id":2289,"href":"https:\/\/portfolio.arnoldbiffna.com\/index.php\/wp-json\/wp\/v2\/posts\/2270\/revisions\/2289"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/portfolio.arnoldbiffna.com\/index.php\/wp-json\/wp\/v2\/media\/2279"}],"wp:attachment":[{"href":"https:\/\/portfolio.arnoldbiffna.com\/index.php\/wp-json\/wp\/v2\/media?parent=2270"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/portfolio.arnoldbiffna.com\/index.php\/wp-json\/wp\/v2\/categories?post=2270"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/portfolio.arnoldbiffna.com\/index.php\/wp-json\/wp\/v2\/tags?post=2270"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}