<?xml-stylesheet type="text/xsl" href="http://coco-lab.org/Elgg/timo/rss/rssstyles.xsl"?>
<rss version='2.0'   xmlns:dc='http://purl.org/dc/elements/1.1/'>
    <channel xml:base='http://coco-lab.org/Elgg/timo/'>
        <title><![CDATA[Timo Baumann : Aktivität]]></title>
        <description><![CDATA[Aktivität für Timo Baumann, gehostet auf Coco-Lab Weblog.]]></description>
        <generator>Elgg</generator>
        <link>http://coco-lab.org/Elgg/timo/</link>        
        <item>
            <title><![CDATA[poetry generation using ASR and TTS]]></title>
            <link>http://coco-lab.org/Elgg/timo/weblog/134.html</link>
            <guid isPermaLink="true">http://coco-lab.org/Elgg/timo/weblog/134.html</guid>
            <pubDate>Thu, 04 Nov 2010 09:38:51 GMT</pubDate>
		<dc:subject><![CDATA[poetry generation]]></dc:subject>
		<dc:subject><![CDATA[TTS]]></dc:subject>
		<dc:subject><![CDATA[Gedichtgenerator]]></dc:subject>
		<dc:subject><![CDATA[Echo DM]]></dc:subject>
		<dc:subject><![CDATA[silly idea]]></dc:subject>
		<dc:subject><![CDATA[singing]]></dc:subject>
		<dc:subject><![CDATA[ASR]]></dc:subject>
            <description><![CDATA[<p>how about the following: you feed the first verse of your favourite poem/song lyrics into an ASR and have a TTS read back what the ASR understood. It is very likely to rhyme! If you also read back a nice chorus, people will love it. Now, you just need your TTS to do decent singing and you&#39;re set:</p><p>U: &quot;Ick heff mol in Hamburg een Veermaster sehn&quot;<br />S: &quot;gegen $l aber hin &lt;sil&gt; der hach&lt;sil&gt; also eben&quot;<br />S (canned): &quot;to my hooday, to my hooday&quot;<br />U: &quot;De masten so scheep as den Schipper sien Been&quot;<br />S: &quot;hm mach &lt;sil&gt; nach $t es in der &lt;sil&gt; warm $w&quot;<br />S (canned): &quot;to my hooday, hooday ho.&quot;</p><p>By the way, this is how singing works in a soccer stadium and the principle behind Chinese whispers. </p>]]></description>
        </item>
                
        <item>
            <title><![CDATA[aggregated audio duration]]></title>
            <link>http://coco-lab.org/Elgg/timo/weblog/82.html</link>
            <guid isPermaLink="true">http://coco-lab.org/Elgg/timo/weblog/82.html</guid>
            <pubDate>Sat, 17 Apr 2010 12:55:31 GMT</pubDate>
		<dc:subject><![CDATA[perl]]></dc:subject>
		<dc:subject><![CDATA[audio]]></dc:subject>
            <description><![CDATA[<p>I&#39;ve finally written the one script that was missing from the interwebs and that I have longed to have for so long: </p> <div style="background-color: lightgray; font-family: monospace; line-height: 110%"><p>#!/usr/bin/perl<br /># Copyright (C) 2010 Timo Baumann<br /># This program is free software; you can redistribute it and/or modify it <br /># under the terms of the GNU General Public License as published by the <br /># Free Software Foundation; either version 2 of the License, <br /># or (at your option) any later version.<br /># <br /># This program is distributed in the hope that it will be useful, <br /># but WITHOUT ANY WARRANTY; without even the implied warranty of <br /># MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. <br /># See the GNU General Public License for more details.<br /># <br /># You should have received a copy of the GNU General Public License <br /># along with this program; if not, see &lt;<a href="http://www.gnu.org/licenses/">http://www.gnu.org/licenses/</a>&gt;.</p><p>use strict;<br />use warnings;<br />use Audio::Wav;<br />use Audio::Wav::Read;<br /><br />#usage: audio-duration.pl path-or-file1 path-or-file2 ...<br /><br />my @files;<br />for my $arg (@ARGV) {<br />&nbsp;&nbsp; &nbsp;my $findresult = `find $arg`;<br />&nbsp;&nbsp; &nbsp;push @files, grep /.wav$/, split &quot; &quot;, $findresult;<br />}<br />#print join &quot; &quot;, @files;<br />my $duration = 0.0;<br />my $wav = new Audio::Wav;<br />for my $file (@files) {<br />&nbsp;&nbsp; &nbsp;my $read = $wav-&gt;read($file);<br />&nbsp;&nbsp; &nbsp;$duration += $read-&gt;length_seconds();<br />}<br /># convert to something readable<br />my $readableDuration = &quot;&quot;;<br />if ($duration &gt; 600) {<br />&nbsp;&nbsp; &nbsp;my $seconds = int($duration + .5);<br />&nbsp;&nbsp; &nbsp;my $minutes = int($duration / 60);<br />&nbsp;&nbsp; &nbsp;$seconds -= $minutes * 60;<br />&nbsp;&nbsp; &nbsp;my $hours = int($minutes / 60);<br />&nbsp;&nbsp; &nbsp;$minutes -= $hours * 60;<br />&nbsp;&nbsp; &nbsp;$readableDuration = &quot;(&quot; . ($hours &gt; 0 ? &quot;$hours:&quot; : &quot;&quot;) . &quot;$minutes&#39;$seconds&quot;) &quot;;<br />}<br />print &quot;$duration seconds &quot;, $readableDuration, &quot;in &quot;, ($#files + 1), &quot; wave files.n&quot;;</p></div> <p>Running this in any directory wil yield the duration of audio (only .wav files) of all the files in this directory. If you supply arguments, it will look into the given directories (or files) and tell you the summed duration.</p><p>A must-have for any corpus-linguist dealing with loads of audio files! </p>]]></description>
        </item>
                
        <item>
            <title><![CDATA[Update]]></title>
            <link>http://coco-lab.org/Elgg/timo/weblog/81.html</link>
            <guid isPermaLink="true">http://coco-lab.org/Elgg/timo/weblog/81.html</guid>
            <pubDate>Sun, 11 Oct 2009 19:40:57 GMT</pubDate>
		<dc:subject><![CDATA[update]]></dc:subject>
            <description><![CDATA[I&#39;ve been back from Stockholm for a while now, gone to Interspeech, SIGdial and YRRSDS and am now back at work in Golm. I&#39;m now again (professionally) centered around two things: Getting our next prototype of an incremental SDS up and running in the next few months and continue to work on my PhD thesis, which I hope to finish some time next year.]]></description>
        </item>
                
        <item>
            <title><![CDATA[Great new Sphinx Extensions]]></title>
            <link>http://coco-lab.org/Elgg/timo/weblog/80.html</link>
            <guid isPermaLink="true">http://coco-lab.org/Elgg/timo/weblog/80.html</guid>
            <pubDate>Wed, 01 Jul 2009 09:09:47 GMT</pubDate>
		<dc:subject><![CDATA[ASR]]></dc:subject>
		<dc:subject><![CDATA[incremental]]></dc:subject>
		<dc:subject><![CDATA[pitch]]></dc:subject>
		<dc:subject><![CDATA[Sphinx]]></dc:subject>
            <description><![CDATA[I finally got around to package the pitch tracker and some of our incremental result filtering (which was the reason for me travelling to Boulder, USA) as add-ons for Sphinx. Find them on my <a href="http://www.ling.uni-potsdam.de/~timo/code/">website</a>.]]></description>
        </item>
                
        <item>
            <title><![CDATA[Here and there]]></title>
            <link>http://coco-lab.org/Elgg/timo/weblog/79.html</link>
            <guid isPermaLink="true">http://coco-lab.org/Elgg/timo/weblog/79.html</guid>
            <pubDate>Wed, 20 May 2009 08:43:05 GMT</pubDate>
		<dc:subject><![CDATA[travel]]></dc:subject>
            <description><![CDATA[I have safely arrived in Stockholm, where I will be visiting researcher at KTH&#39;s <a href="http://www.speech.kth.se/speech/">speech lab</a> for the summer. Also, I will be travelling to <a href="http://www.naaclhlt2009.org/">NAACL-HLT 2009</a> in Boulder to present our paper and a smaller <a href="http://www.sfb673.org/component/option,com_eventcal/task,event/date,1244444400/eventid,73/Itemid,53/catid,/lang,en/">workshop</a> in Bielefeld. Also, I will participate in <a href="http://www.diaholmia.org/">Dialholmia</a> as a student volunteer. Many chances to meet and greet!]]></description>
        </item>
                
        <item>
            <title><![CDATA[Speed-Up Boot under Ubuntu with VMware]]></title>
            <link>http://coco-lab.org/Elgg/timo/weblog/78.html</link>
            <guid isPermaLink="true">http://coco-lab.org/Elgg/timo/weblog/78.html</guid>
            <pubDate>Fri, 30 Jan 2009 09:50:02 GMT</pubDate>
		<dc:subject><![CDATA[bootchart]]></dc:subject>
		<dc:subject><![CDATA[howto]]></dc:subject>
		<dc:subject><![CDATA[ubuntu]]></dc:subject>
		<dc:subject><![CDATA[vmware]]></dc:subject>
            <description><![CDATA[<p>I still need Windows for one piece of software that I use occasionally. So, since I&#39;ve moved to Ubuntu, I&#39;ve been using VMware for this (as it was the only solution at that time).</p><p>My VMware stopped working under Hardy. Luckily, I never needed my Windows-App for half a year. In Intrepid, I was able to just install vmware from their webpage and it restored my windows session from a year ago. Probably a record-breaking uptime for windows...</p><p>So, yesterday I played around with <a href="http://www.bootchart.org/">bootchart</a> and found out, that the VMware-services took 4 precious seconds of my (and my battery&#39;s) lifetime on every boot. Not really worth it, as I&#39;m unlikely to use my Windows-App anytime soon. So, here&#39;s what I did: </p><p>remove the links in /etc/rc*.d/*vmware </p><p>as the first command in /usr/bin/vmplayer add: </p><p>gksu -D &quot;Need root priviledges to start vmware services.&quot; /etc/init.d/vmware restart</p><p>Works like a charm. </p>]]></description>
        </item>
                
        <item>
            <title><![CDATA[Generic Minimum Edit Distance of lists in Perl]]></title>
            <link>http://coco-lab.org/Elgg/timo/weblog/77.html</link>
            <guid isPermaLink="true">http://coco-lab.org/Elgg/timo/weblog/77.html</guid>
            <pubDate>Tue, 20 Jan 2009 15:26:23 GMT</pubDate>
		<dc:subject><![CDATA[perl]]></dc:subject>
		<dc:subject><![CDATA[helpme]]></dc:subject>
		<dc:subject><![CDATA[fixme]]></dc:subject>
            <description><![CDATA[<p>The title says it all: I am looking for a generic implementation that tells me the edit distance of two lists. The implementations on CPAN all seem to work on string-data. Which is OK for finding typos but makes WER calculation tedious. </p><p>So, I want a generic implementation that takes a comparator-function (as in sort {$a &lt;=&gt; $b} @list) and two lists and outputs the edit distance. Nice to have would be distance-weights and really nifty if the value of the comparator function (not only !=0 but how much lower or higher) was taken into account.</p><p>Luckily I don&#39;t need it now, so I don&#39;t have to write it. But it would be a great finger exercise for a Perl-in-NLP class.</p><p>EDIT: The obvious module Text::Levenshtein on CPAN actually *<a href="http://rt.cpan.org/Public/Bug/Display.html?id=42459">miscalculates</a>* Levenshtein-distance for some input. Luckily I wondered what the 3 bugs in the module were about before I just happily used that code... So I ended up slightly modifying an <a href="http://www.merriampark.com/ldperl.htm">implementation by Eli Bendersky</a>, which already uses lists internally. So I left out the part about the comparator interface for now and just calculate standard WER, which is all I need right now.</p>]]></description>
        </item>
                
        <item>
            <title><![CDATA[no sound in praat with ubuntu (and USB sound)]]></title>
            <link>http://coco-lab.org/Elgg/timo/weblog/76.html</link>
            <guid isPermaLink="true">http://coco-lab.org/Elgg/timo/weblog/76.html</guid>
            <pubDate>Thu, 18 Dec 2008 11:11:04 GMT</pubDate>
		<dc:subject><![CDATA[howto]]></dc:subject>
		<dc:subject><![CDATA[ubuntu]]></dc:subject>
		<dc:subject><![CDATA[sound]]></dc:subject>
		<dc:subject><![CDATA[praat]]></dc:subject>
            <description><![CDATA[<p>If praat (on ubuntu) doesn&#39;t want to play any audio, it tells you to consult some Sound-HOWTO (which at least doesn&#39;t exist on ubuntu). Unfortunately, <a href="http://ubuntuforums.org/archive/index.php/t-64383.html">http://ubuntuforums.org/archive/index.php/t-64383.html</a> is of little help (and doesn&#39;t allow posting anymore as its active phase has expired).</p><p>The solution for us was easy: We use an external USB sound card and have deactivated the mainboard sound. For some reason, the sound device is now called &quot;/dev/dsp1&quot; (and &quot;/dev/audio1&quot;) and there is no &quot;/dev/dsp&quot; (nor &quot;/dev/audio&quot;). Adding symlinks from &quot;/dev/dsp&quot; to &quot;/dev/dsp1&quot; (and from &quot;/dev/audio&quot; to &quot;/dev/audio1&quot;) fixed the problem. Hope this helps. </p>]]></description>
        </item>
                
        <item>
            <title><![CDATA[steps towards German ASR]]></title>
            <link>http://coco-lab.org/Elgg/timo/weblog/74.html</link>
            <guid isPermaLink="true">http://coco-lab.org/Elgg/timo/weblog/74.html</guid>
            <pubDate>Thu, 15 May 2008 17:48:19 GMT</pubDate>
		<dc:subject><![CDATA[Sphinx]]></dc:subject>
		<dc:subject><![CDATA[ASR]]></dc:subject>
            <description><![CDATA[<p><em>Klotz hin Gnubbel</em></p><p>This is what I get with the current acoustic model and a LM that was even trained including the correct sentence (<em>und f&uuml;ge es ein in den Bauch des Elephanten</em>).</p><p>Even using only just the correct sentence as a grammar returns <em>und f&uuml;ge es</em>,&nbsp; instead of the complete sentence. The alignment shows, that <em>es</em> is supposedly spans the complete <em>ein in den Bauch des</em>. </p><p>I read that the current models are severely overtrained on one speaker, so I tried one of his utterances (de43-01, <em>die Anwendung wird entwickelt</em>) which is correctly understood if I use it as a grammar (effectively resulting in forced alignment) and which results in the beautiful <em>phantasie wird entwickelt</em> if I include this one sentence in the statistical LM as above.&nbsp;</p><p>Thus, the bad results are probably due to the bad acoustic model. I&#39;ve already uploaded the PentoNamingCorpus to Voxforge, thus hopefully, acoustic models will improve eventually. But if bad comes to worse, we&#39;ll have to train based on KCoRS and Verbmobil... </p>]]></description>
        </item>
                
        <item>
            <title><![CDATA[sox in Ubuntu Hardy (8.04)]]></title>
            <link>http://coco-lab.org/Elgg/timo/weblog/70.html</link>
            <guid isPermaLink="true">http://coco-lab.org/Elgg/timo/weblog/70.html</guid>
            <pubDate>Fri, 25 Apr 2008 20:17:28 GMT</pubDate>
		<dc:subject><![CDATA[howto]]></dc:subject>
		<dc:subject><![CDATA[ubuntu]]></dc:subject>
		<dc:subject><![CDATA[sox]]></dc:subject>
            <description><![CDATA[<p>If you just upgraded your Ubuntu to 8.04 and use sox, then you may get the error &quot;sox soxio: Failed reading `some.file&#39;: unknown file type `auto&#39;&quot;</p><p>In Hardy, all audio formats for sox have been refactored in separate packages libsox-fmt-XYZ. So, either install just the base formats from libsox-fmt-base or get all possible formats with libsox-fmt-all.</p>]]></description>
        </item>
        
        <item>
            <title><![CDATA[SRILM-Makefile for Ubuntu]]></title>
            <link>http://coco-lab.org/Elgg/timo/files/-1/11/Makefile.machine.i686</link>
            <enclosure url="http://coco-lab.org/Elgg/timo/files/-1/11/Makefile.machine.i686" length="1758" type="document/unknown" />
            <pubDate>Tue, 13 Nov 2007 13:42:16 GMT</pubDate>
		<dc:subject><![CDATA[patch]]></dc:subject>
		<dc:subject><![CDATA[srilm]]></dc:subject>
		<dc:subject><![CDATA[ubuntu]]></dc:subject>
            <description><![CDATA[copy this to common/]]></description>
        </item>
        <item>
            <title><![CDATA[schaffe, schaffe]]></title>
            <link>http://coco-lab.org/Elgg/timo/files/-1/10/construct3_e0.gif</link>
            <enclosure url="http://coco-lab.org/Elgg/timo/files/-1/10/construct3_e0.gif" length="557" type="image/gif" />
            <pubDate>Wed, 10 Oct 2007 09:29:19 GMT</pubDate>
            <description><![CDATA[häusle baue]]></description>
        </item>
        <item>
            <title><![CDATA[WekaTest]]></title>
            <link>http://coco-lab.org/Elgg/timo/files/-1/9/WekaTest.java</link>
            <enclosure url="http://coco-lab.org/Elgg/timo/files/-1/9/WekaTest.java" length="5262" type="text/plain" />
            <pubDate>Tue, 11 Sep 2007 10:03:58 GMT</pubDate>
		<dc:subject><![CDATA[example]]></dc:subject>
		<dc:subject><![CDATA[weka]]></dc:subject>
            <description><![CDATA[Beispiel zur Weka API]]></description>
        </item>
    </channel>
</rss>
