<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Jonas-Kress.de</title>
	<atom:link href="http://Jonas-Kress.de/feed/" rel="self" type="application/rss+xml" />
	<link>http://Jonas-Kress.de</link>
	<description>Thoughts, projects and publications from Jonas M. Kress</description>
	<lastBuildDate>Thu, 26 Apr 2012 13:22:13 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
		<item>
		<title>Embed Images in Latex</title>
		<link>http://Jonas-Kress.de/157/2012/04/office/embed-images-in-latex/</link>
		<comments>http://Jonas-Kress.de/157/2012/04/office/embed-images-in-latex/#comments</comments>
		<pubDate>Thu, 26 Apr 2012 13:13:47 +0000</pubDate>
		<dc:creator>Jonas M. Kress</dc:creator>
				<category><![CDATA[Latex]]></category>
		<category><![CDATA[Office]]></category>

		<guid isPermaLink="false">http://Jonas-Kress.de/?p=157</guid>
		<description><![CDATA[<p>When writing papers Latex is the most common tool. Usually it&#8217;s very hard to use, but with Lyx a graphical interface it&#8217;s pretty easy. Nevertheless there are some things to keep in mind especially when you work with images. If &#8230; <a href="http://Jonas-Kress.de/157/2012/04/office/embed-images-in-latex/">Continue reading <span class="meta-nav">&#8594;</span></a></p>
 ]]></description>
			<content:encoded><![CDATA[<p>When writing papers <a href="http://www.latex-project.org/">Latex</a> is the most common tool. Usually it&#8217;s very hard to use, but with <a href="http://www.lyx.org/">Lyx</a> a graphical interface it&#8217;s pretty easy.<br />
Nevertheless there are some things to keep in mind especially when you work with images.<br />
If you insert an image in a common format (png, gif, jpeg) directly to your document the quality will be extremely bad, because its downsized and compressed while generating the PDF from the Latex document.<br />
So here are my best practices:</p>
<h2>Images created in Open Office Draw</h2>
<ol>
<li>File -&gt; Export as PDF</li>
<li>Remove the white space by cutting with <a href="http://www.pdfscissors.com/">PDF Scissors</a></li>
<li>Use Latex tag &#8220;\includegraphics[page=1]{fileName}&#8221; with the parameter page you can set the page to be displayed</li>
</ol>
<p>Voilà your images are inserted to your document without quality loss, because they are inserted as curves.</p>
<h2>Images in common formats (png, gif, jpeg)</h2>
<ol>
<li>Create a new <a href="http://inkscape.org/">Inkscape </a>document</li>
<li>Paste your image</li>
<li>File -&gt; Document Settings -&gt; Fit Page to Selection</li>
<li>File -&gt; Save As -&gt; PDF</li>
<li>Use Latex tag &#8220;\includegraphics{fileName}&#8221;</li>
</ol>
<p>Your image quality will not be reduced during Latex to PDF compilation.</p>
<p>That&#8217;s the way how you should embed your graphics in Latex to achieve the best image quality.</p>
]]></content:encoded>
			<wfw:commentRss>http://Jonas-Kress.de/157/2012/04/office/embed-images-in-latex/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Evaluating Hive HBase Table Mapping Performance</title>
		<link>http://Jonas-Kress.de/117/2012/03/publication/evaluating-hive-hbase-table-mapping-performance/</link>
		<comments>http://Jonas-Kress.de/117/2012/03/publication/evaluating-hive-hbase-table-mapping-performance/#comments</comments>
		<pubDate>Tue, 20 Mar 2012 12:15:12 +0000</pubDate>
		<dc:creator>Jonas M. Kress</dc:creator>
				<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Hive]]></category>
		<category><![CDATA[Papers]]></category>

		<guid isPermaLink="false">http://Jonas-Kress.de/?p=117</guid>
		<description><![CDATA[<p>As part of my master thesis I am evaluating the performance of Hive queries that use tables stored in HBase. Although the HBase table mappings in Hive are very experimental and features like server side filtering and partitions have still &#8230; <a href="http://Jonas-Kress.de/117/2012/03/publication/evaluating-hive-hbase-table-mapping-performance/">Continue reading <span class="meta-nav">&#8594;</span></a></p>
 ]]></description>
			<content:encoded><![CDATA[<p>As part of my master thesis I am evaluating the performance of Hive queries that use tables stored in HBase.<br />
Although the HBase table mappings in Hive are very experimental and features like server side filtering and partitions have still not been implemented, the performance and scaling seems to be very good.</p>
<p>In my evaluation I am measuring time queries take and compare that to the data set size.<br />
Furthermore the Hadoop cluster is monitored by a <a href="http://ganglia.info/">Ganglia</a>  to evaluate the used resources.</p>
<p>The data set is still very small but growing&#8230;</p>
<p><a href="http://Jonas-Kress.de/blog/wp-content/uploads/2012/03/chart.png"><img class="alignnone size-full wp-image-137" title="chart" src="http://Jonas-Kress.de/blog/wp-content/uploads/2012/03/chart.png" alt="" width="900" height="900" /></a></p>
<p>Network:<br/><br />
<a href="http://Jonas-Kress.de/blog/wp-content/uploads/2012/03/03_19_1211-54-03_20_127-24__network_report.png"><img class="alignnone size-full wp-image-136" title="03_19_1211-54-03_20_127-24__network_report" src="http://Jonas-Kress.de/blog/wp-content/uploads/2012/03/03_19_1211-54-03_20_127-24__network_report.png" alt="" width="1247" height="793" /></a><br />
RAM:<br/><br />
<a href="http://Jonas-Kress.de/blog/wp-content/uploads/2012/03/03_19_1211-54-03_20_127-24__mem_report.png"><img class="alignnone size-full wp-image-135" title="03_19_1211-54-03_20_127-24__mem_report" src="http://Jonas-Kress.de/blog/wp-content/uploads/2012/03/03_19_1211-54-03_20_127-24__mem_report.png" alt="" width="1247" height="837" /></a><br />
Load:<br/><br />
<a href="http://Jonas-Kress.de/blog/wp-content/uploads/2012/03/03_19_1211-54-03_20_127-24__load_report.png"><img class="alignnone size-full wp-image-134" title="03_19_1211-54-03_20_127-24__load_report" src="http://Jonas-Kress.de/blog/wp-content/uploads/2012/03/03_19_1211-54-03_20_127-24__load_report.png" alt="" width="1247" height="821" /></a><br />
HBase Requests:<br/><br />
<a href="http://Jonas-Kress.de/blog/wp-content/uploads/2012/03/03_19_1211-54-03_20_127-24__hbase.regionserver.requests.png"><img class="alignnone size-full wp-image-133" title="03_19_1211-54-03_20_127-24__hbase.regionserver.requests" src="http://Jonas-Kress.de/blog/wp-content/uploads/2012/03/03_19_1211-54-03_20_127-24__hbase.regionserver.requests.png" alt="" width="1247" height="779" /></a><br />
Disk writes:<br />
<a href="http://Jonas-Kress.de/blog/wp-content/uploads/2012/03/03_19_1211-54-03_20_127-24__disk_dm_writes_kb.png"><img class="alignnone size-full wp-image-132" title="03_19_1211-54-03_20_127-24__disk_dm_writes_kb" src="http://Jonas-Kress.de/blog/wp-content/uploads/2012/03/03_19_1211-54-03_20_127-24__disk_dm_writes_kb.png" alt="" width="1247" height="779" /></a><br />
Disk reads:<br />
<a href="http://Jonas-Kress.de/blog/wp-content/uploads/2012/03/03_19_1211-54-03_20_127-24__disk_dm_reads_kb.png"><img class="alignnone size-full wp-image-129" title="03_19_1211-54-03_20_127-24__disk_dm_reads_kb" src="http://Jonas-Kress.de/blog/wp-content/uploads/2012/03/03_19_1211-54-03_20_127-24__disk_dm_reads_kb.png" alt="" width="1247" height="779" /></a><br />
CPU:<br />
<a href="http://Jonas-Kress.de/blog/wp-content/uploads/2012/03/03_19_1211-54-03_20_127-24__cpu_report.png"><img class="alignnone size-full wp-image-127" title="03_19_1211-54-03_20_127-24__cpu_report" src="http://Jonas-Kress.de/blog/wp-content/uploads/2012/03/03_19_1211-54-03_20_127-24__cpu_report.png" alt="" width="1247" height="825" /></a></p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://Jonas-Kress.de/117/2012/03/publication/evaluating-hive-hbase-table-mapping-performance/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Hive Table Column Mapping</title>
		<link>http://Jonas-Kress.de/106/2011/12/hadoop/hive/hive-table-column-mapping/</link>
		<comments>http://Jonas-Kress.de/106/2011/12/hadoop/hive/hive-table-column-mapping/#comments</comments>
		<pubDate>Fri, 16 Dec 2011 11:56:13 +0000</pubDate>
		<dc:creator>Jonas M. Kress</dc:creator>
				<category><![CDATA[Hive]]></category>

		<guid isPermaLink="false">http://Jonas-Kress.de/?p=106</guid>
		<description><![CDATA[<p>When creating a column mapping in hive all the data is being saved to the metastore. Example of a hive hbase column mapping: CREATE EXTERNAL TABLE hivetableName(Col String, Col1 String) STORED BY 'org.apache.hadoop.hive.hbase.HBaseStorageHandler' WITH SERDEPROPERTIES ("hbase.columns.mapping" = "colFamily:Col, colFamily:Col") TBLPROPERTIES("hbase.table.name" &#8230; <a href="http://Jonas-Kress.de/106/2011/12/hadoop/hive/hive-table-column-mapping/">Continue reading <span class="meta-nav">&#8594;</span></a></p>
 ]]></description>
			<content:encoded><![CDATA[<p>When creating a column mapping in hive all the data is being saved to the metastore.</p>
<p>Example of a hive hbase column mapping:<br />
<code>CREATE EXTERNAL TABLE hivetableName(Col String, Col1 String)<br />
STORED BY 'org.apache.hadoop.hive.hbase.HBaseStorageHandler'<br />
WITH SERDEPROPERTIES ("hbase.columns.mapping" = "colFamily:Col, colFamily:Col")<br />
        TBLPROPERTIES("hbase.table.name" = "hbasetableName")</code></p>
<p>If you use a vast amount of columns that are being mapped the table column of the serde params will reach its limit. </p>
<p>You can fix this problem with the following statement if your metastore uses a mysql database:<br />
<code>ALTER TABLE hive.<code>SERDE_PARAMS</code> MODIFY COLUMN <code>PARAM_VALUE</code> text;</code> </p>
]]></content:encoded>
			<wfw:commentRss>http://Jonas-Kress.de/106/2011/12/hadoop/hive/hive-table-column-mapping/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Virtualbox Image Resize Disk (VDI, VMDK)</title>
		<link>http://Jonas-Kress.de/101/2011/12/virtualization/virtualbox-image-resize-disk-vdi-vmdk/</link>
		<comments>http://Jonas-Kress.de/101/2011/12/virtualization/virtualbox-image-resize-disk-vdi-vmdk/#comments</comments>
		<pubDate>Fri, 16 Dec 2011 11:29:08 +0000</pubDate>
		<dc:creator>Jonas M. Kress</dc:creator>
				<category><![CDATA[Virtualization]]></category>
		<category><![CDATA[VDI]]></category>
		<category><![CDATA[VirtualBox]]></category>
		<category><![CDATA[VMDK]]></category>

		<guid isPermaLink="false">http://Jonas-Kress.de/?p=101</guid>
		<description><![CDATA[<p>When resizing a VirtualBox image you may face the error &#8220;Resize hard disk operation for this format is not implemented yet!&#8221;. You can fix this by doing a convert to the VDI disk format before trying to resize it. It &#8230; <a href="http://Jonas-Kress.de/101/2011/12/virtualization/virtualbox-image-resize-disk-vdi-vmdk/">Continue reading <span class="meta-nav">&#8594;</span></a></p>
 ]]></description>
			<content:encoded><![CDATA[<p>When resizing a VirtualBox image you may face the error &#8220;Resize hard disk operation for this format is not implemented yet!&#8221;.<br />
You can fix this by doing a convert to the VDI disk format before trying to resize it.<br />
It could be necessary to boot the converted image before doing a resize!</p>
<p>Convert format of virtual hard disk:<br />
<code>VBoxManage clonehd disk.vmdk disk.vdi --format VDI</code></p>
<p>Resize virtual hard disk:<br />
<code>VBoxManage modifyhd disk.vdi --resize [SIZE_IN_MB]</code></p>
<p>You may also want to resize your partitions.<br />
This can be easily done by using the <a href="http://gparted.sourceforge.net/livecd.php" target="_blank">GParted live cd</a></p>
]]></content:encoded>
			<wfw:commentRss>http://Jonas-Kress.de/101/2011/12/virtualization/virtualbox-image-resize-disk-vdi-vmdk/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Master Thesis in Progress</title>
		<link>http://Jonas-Kress.de/46/2011/11/publication/master-thesis/</link>
		<comments>http://Jonas-Kress.de/46/2011/11/publication/master-thesis/#comments</comments>
		<pubDate>Mon, 28 Nov 2011 15:53:25 +0000</pubDate>
		<dc:creator>Jonas M. Kress</dc:creator>
				<category><![CDATA[Papers]]></category>
		<category><![CDATA[Thesis]]></category>

		<guid isPermaLink="false">http://dev.jonas-kress.de/?p=46</guid>
		<description><![CDATA[<p>Started working on my master thesis with the title: Analysis of migrating a MySQL based data warehouse and monitoring system to hadoop Most relevant books and papers: Hadoop The Definitive Guide 2nd Edition and HBase The Definitive Guide Hive – &#8230; <a href="http://Jonas-Kress.de/46/2011/11/publication/master-thesis/">Continue reading <span class="meta-nav">&#8594;</span></a></p>
 ]]></description>
			<content:encoded><![CDATA[<p>Started working on my master thesis with the title:</p>
<blockquote><p>Analysis of migrating a MySQL based data warehouse and monitoring system to hadoop</p></blockquote>
<p>Most relevant books and papers:</p>
<ul>
<li>Hadoop The Definitive Guide 2nd Edition and</li>
<li>HBase The Definitive Guide</li>
<li>Hive – A Petabyte Scale Data Warehouse</li>
<li>High Performance MySQL</li>
</ul>
<p>Further links:<a href="http://www.cloudera.com/"><br />
http://www.cloudera.com/</a> Hadoop distribution and installation package<br />
<a href="http://hadoop.apache.org/">http://hadoop.apache.org/</a>  Hadoop is an open source Apache project</p>
]]></content:encoded>
			<wfw:commentRss>http://Jonas-Kress.de/46/2011/11/publication/master-thesis/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

