<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/">
    <channel>
        <title>Hatty~de秘密基地</title>
        <link>https://notion-next-ochre-one-47.vercel.app//</link>
        <description>这是一个由NotionNext生成的站点</description>
        <lastBuildDate>Sat, 18 Apr 2026 02:19:52 GMT</lastBuildDate>
        <docs>https://validator.w3.org/feed/docs/rss2.html</docs>
        <generator>https://github.com/jpmonette/feed</generator>
        <language>zh-CN</language>
        <copyright>All rights reserved 2026, Hatty</copyright>
        <item>
            <title><![CDATA[Action Chunk Transformation(ACT)]]></title>
            <link>https://notion-next-ochre-one-47.vercel.app//article/2a258186-cead-80ff-a11d-f6d0e5df315d</link>
            <guid>https://notion-next-ochre-one-47.vercel.app//article/2a258186-cead-80ff-a11d-f6d0e5df315d</guid>
            <pubDate>Wed, 05 Nov 2025 00:00:00 GMT</pubDate>
            <content:encoded><![CDATA[<div id="notion-article" class="mx-auto overflow-hidden "><main class="notion light-mode notion-page notion-block-2a258186cead80ffa11df6d0e5df315d"><div class="notion-viewport"></div><div class="notion-collection-page-properties"></div><h2 class="notion-h notion-h1 notion-h-indent-0 notion-block-2a258186cead805193f6ea8040db70f0" data-id="2a258186cead805193f6ea8040db70f0"><span><div id="2a258186cead805193f6ea8040db70f0" class="notion-header-anchor"></div><a class="notion-hash-link" href="#2a258186cead805193f6ea8040db70f0" title="VAE"><svg viewBox="0 0 16 16" width="16" height="16"><path fill-rule="evenodd" d="M7.775 3.275a.75.75 0 001.06 1.06l1.25-1.25a2 2 0 112.83 2.83l-2.5 2.5a2 2 0 01-2.83 0 .75.75 0 00-1.06 1.06 3.5 3.5 0 004.95 0l2.5-2.5a3.5 3.5 0 00-4.95-4.95l-1.25 1.25zm-4.69 9.64a2 2 0 010-2.83l2.5-2.5a2 2 0 012.83 0 .75.75 0 001.06-1.06 3.5 3.5 0 00-4.95 0l-2.5 2.5a3.5 3.5 0 004.95 4.95l1.25-1.25a.75.75 0 00-1.06-1.06l-1.25 1.25a2 2 0 01-2.83 0z"></path></svg></a><span class="notion-h-title">VAE</span></span></h2><div class="notion-text notion-block-2a258186cead80b8a25ce888e884b993">a type of<b> generative model</b> used for unsupervised learning</div><div class="notion-text notion-block-2a258186cead8013ab49fd0467ff9593">learns a <b>probabilistic mapping</b> from data to a latent space and is composed of two main parts:</div><ol start="1" class="notion-list notion-list-numbered notion-block-2a258186cead809fa174c10ae9b3cd86" style="list-style-type:decimal"><li><b>Encoder</b>: Maps input data to a <b>latent variable</b> space.</li></ol><ol start="2" class="notion-list notion-list-numbered notion-block-2a258186cead80b2b104f8cfcd488a98" style="list-style-type:decimal"><li><b>Decoder</b>: Reconstructs data from the latent variable.</li></ol><h2 class="notion-h notion-h1 notion-h-indent-0 notion-block-2a258186cead8074be04f52c7962d3f1" data-id="2a258186cead8074be04f52c7962d3f1"><span><div id="2a258186cead8074be04f52c7962d3f1" class="notion-header-anchor"></div><a class="notion-hash-link" href="#2a258186cead8074be04f52c7962d3f1" title="CVAE"><svg viewBox="0 0 16 16" width="16" height="16"><path fill-rule="evenodd" d="M7.775 3.275a.75.75 0 001.06 1.06l1.25-1.25a2 2 0 112.83 2.83l-2.5 2.5a2 2 0 01-2.83 0 .75.75 0 00-1.06 1.06 3.5 3.5 0 004.95 0l2.5-2.5a3.5 3.5 0 00-4.95-4.95l-1.25 1.25zm-4.69 9.64a2 2 0 010-2.83l2.5-2.5a2 2 0 012.83 0 .75.75 0 001.06-1.06 3.5 3.5 0 00-4.95 0l-2.5 2.5a3.5 3.5 0 004.95 4.95l1.25-1.25a.75.75 0 00-1.06-1.06l-1.25 1.25a2 2 0 01-2.83 0z"></path></svg></a><span class="notion-h-title">CVAE</span></span></h2><div class="notion-text notion-block-2a258186cead809d8864c7fb88f45f94">extends the VAE by adding conditional information to both the <b>encoder</b> and <b>decoder</b>.</div><h4 class="notion-h notion-h3 notion-h-indent-1 notion-block-2a258186cead80548668e46321dc9ab1" data-id="2a258186cead80548668e46321dc9ab1"><span><div id="2a258186cead80548668e46321dc9ab1" class="notion-header-anchor"></div><a class="notion-hash-link" href="#2a258186cead80548668e46321dc9ab1" title="Structure of a CVAE"><svg viewBox="0 0 16 16" width="16" height="16"><path fill-rule="evenodd" d="M7.775 3.275a.75.75 0 001.06 1.06l1.25-1.25a2 2 0 112.83 2.83l-2.5 2.5a2 2 0 01-2.83 0 .75.75 0 00-1.06 1.06 3.5 3.5 0 004.95 0l2.5-2.5a3.5 3.5 0 00-4.95-4.95l-1.25 1.25zm-4.69 9.64a2 2 0 010-2.83l2.5-2.5a2 2 0 012.83 0 .75.75 0 001.06-1.06 3.5 3.5 0 00-4.95 0l-2.5 2.5a3.5 3.5 0 004.95 4.95l1.25-1.25a.75.75 0 00-1.06-1.06l-1.25 1.25a2 2 0 01-2.83 0z"></path></svg></a><span class="notion-h-title">Structure of a CVAE</span></span></h4><ol start="1" class="notion-list notion-list-numbered notion-block-2a258186cead80d8bfcbc17f8dad612b" style="list-style-type:decimal"><li><b>Encoder</b>:</li><ol class="notion-list notion-list-numbered notion-block-2a258186cead80d8bfcbc17f8dad612b" style="list-style-type:lower-alpha"><ul class="notion-list notion-list-disc notion-block-2a258186cead804d81c6c8f9c5312e7b"><li>The encoder network <!-- --> maps the input to a distribution in the latent space</li><ul class="notion-list notion-list-disc notion-block-2a258186cead804d81c6c8f9c5312e7b"><li>input: the data point x and the condition y (such as a label) </li><li>output: a <b>mean</b> and <b>variance</b> for the distribution (typically Gaussian)</li><div class="notion-text notion-block-2b358186cead809eb53bf052f390d3a4"><em>training stage</em>:</div><div class="notion-text notion-block-2b358186cead801aba9fc2130ff4e4ab"><em>evaluating stage</em>:</div></ul></ul></ol></ol><ol start="2" class="notion-list notion-list-numbered notion-block-2a258186cead80d8a62ffe7b62694347" style="list-style-type:decimal"><li><b>Latent Variable</b>:</li><ol class="notion-list notion-list-numbered notion-block-2a258186cead80d8a62ffe7b62694347" style="list-style-type:lower-alpha"><ul class="notion-list notion-list-disc notion-block-2a258186cead80afa4c7cf257f4d7cdf"><li>The latent variable z is sampled from the distribution parameterized by the encoder. This latent variable captures the underlying structure of the data.</li></ul><ul class="notion-list notion-list-disc notion-block-2a258186cead80efb8c2fa884f5273a0"><li>To allow for backpropagation through the stochastic sampling process, VAEs use the <b>reparameterization trick</b>: <!-- -->, where μ and σ are the parameters learned by the encoder, and <!-- --> is a noise term.</li></ul></ol></ol><ol start="3" class="notion-list notion-list-numbered notion-block-2a258186cead80e99b1eca58e1fa29a3" style="list-style-type:decimal"><li><b>Decoder</b>:</li><ol class="notion-list notion-list-numbered notion-block-2a258186cead80e99b1eca58e1fa29a3" style="list-style-type:lower-alpha"><ul class="notion-list notion-list-disc notion-block-2a258186cead80dd82b1ccc34a60ec2a"><li>The decoder network <!-- --> tries to generate data that is similar to the original input, conditioned on both the latent code and the label y.</li><ul class="notion-list notion-list-disc notion-block-2a258186cead80dd82b1ccc34a60ec2a"><li>input: latent variable z and the condition y </li><li>output: the reconstructed data x</li></ul></ul></ol></ol><ol start="4" class="notion-list notion-list-numbered notion-block-2a258186cead80d1b7defccaf79b675e" style="list-style-type:decimal"><li><b>Loss Function</b>:</li><ol class="notion-list notion-list-numbered notion-block-2a258186cead80d1b7defccaf79b675e" style="list-style-type:lower-alpha"><div class="notion-text notion-block-2a258186cead801088d4dfd2da724b6c"></div><ul class="notion-list notion-list-disc notion-block-2a258186cead80f1ac15ef1fd4fd73c4"><li>The objective of CVAE is to maximize the <b>variational lower bound</b> (ELBO), which can be decomposed into two main components:</li><ul class="notion-list notion-list-disc notion-block-2a258186cead80f1ac15ef1fd4fd73c4"><ol start="1" class="notion-list notion-list-numbered notion-block-2a258186cead800b9715c70f4d7c3e09" style="list-style-type:decimal"><li><b>Reconstruction Loss</b>: Measures how well the model reconstructs the input data, typically using <em>binary cross-entropy</em> or <em>mean squared error</em> for continuous data.</li></ol><ol start="2" class="notion-list notion-list-numbered notion-block-2a258186cead807890c3f2fd13af6bfe" style="list-style-type:decimal"><li><b>KL Divergence</b>: Measures the difference between the learned latent distribution <!-- -->  and the prior distribution p(z), typically a standard Gaussian. This regularizes the model by <b>encouraging the learned latent space to be similar to a known prior distribution</b>.</li></ol></ul></ul></ol></ol><div class="notion-blank notion-block-2a258186cead800ab342fef647857c57"> </div><h2 class="notion-h notion-h1 notion-h-indent-0 notion-block-2a258186cead80a3a388ed478a96ccf5" data-id="2a258186cead80a3a388ed478a96ccf5"><span><div id="2a258186cead80a3a388ed478a96ccf5" class="notion-header-anchor"></div><a class="notion-hash-link" href="#2a258186cead80a3a388ed478a96ccf5" title="positional embedding"><svg viewBox="0 0 16 16" width="16" height="16"><path fill-rule="evenodd" d="M7.775 3.275a.75.75 0 001.06 1.06l1.25-1.25a2 2 0 112.83 2.83l-2.5 2.5a2 2 0 01-2.83 0 .75.75 0 00-1.06 1.06 3.5 3.5 0 004.95 0l2.5-2.5a3.5 3.5 0 00-4.95-4.95l-1.25 1.25zm-4.69 9.64a2 2 0 010-2.83l2.5-2.5a2 2 0 012.83 0 .75.75 0 001.06-1.06 3.5 3.5 0 00-4.95 0l-2.5 2.5a3.5 3.5 0 004.95 4.95l1.25-1.25a.75.75 0 00-1.06-1.06l-1.25 1.25a2 2 0 01-2.83 0z"></path></svg></a><span class="notion-h-title">positional embedding</span></span></h2><ol start="1" class="notion-list notion-list-numbered notion-block-2a258186cead8042a9fdfd13f88d5321" style="list-style-type:decimal"><li>PositionEmbeddingSine</li></ol><div class="notion-text notion-block-2a258186cead80d28a4fd672db9f20e6">generates position encodings by applying sine and cosine functions with different frequencies to the normalized x and y coordinates</div><ol start="2" class="notion-list notion-list-numbered notion-block-2a258186cead8015b988fe36202d6f77" style="list-style-type:decimal"><li>PositionEmbeddingLearned</li></ol><div class="notion-text notion-block-2a258186cead80d9b6ccfa25558db246">concatenate the x (column) and y (row) embeddings for each position</div><div class="notion-blank notion-block-2a258186cead807c8d09fadd9802ad93"> </div></main></div>]]></content:encoded>
        </item>
        <item>
            <title><![CDATA[北京实习小记]]></title>
            <link>https://notion-next-ochre-one-47.vercel.app//article/diary-1</link>
            <guid>https://notion-next-ochre-one-47.vercel.app//article/diary-1</guid>
            <pubDate>Sun, 21 Sep 2025 00:00:00 GMT</pubDate>
        </item>
    </channel>
</rss>