<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Bioinfo Blog!</title>
	<atom:link href="http://bioinfoblog.it/feed/" rel="self" type="application/rss+xml" />
	<link>http://bioinfoblog.it</link>
	<description>Blog di Bioinformatica e di esperienze di studio all'estero</description>
	<lastBuildDate>Sun, 14 Jun 2009 12:35:04 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=abc</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>How to stay focused on your work (programming)</title>
		<link>http://bioinfoblog.it/2009/06/how-to-stay-focused-on-your-work-programming/</link>
		<comments>http://bioinfoblog.it/2009/06/how-to-stay-focused-on-your-work-programming/#comments</comments>
		<pubDate>Sun, 14 Jun 2009 12:35:04 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Bioinformatica]]></category>

		<guid isPermaLink="false">http://bioinfoblog.it/?p=794</guid>
		<description><![CDATA[The authors of programming4scientists have published a nice blog post with several tips on how to avoid distractions while programming:

programming4scientitst &#8211; Stay on target!

I think this is a major issue for all people working in bioinformatics, because compared to the classical computer scientist, we bioinformaticians have less culture on this topic, and generally there is [...]]]></description>
			<content:encoded><![CDATA[<p>The authors of programming4scientists have published a nice blog post with several tips on how to avoid distractions while programming:</p>
<ul>
<li><a href="    *  http://www.programming4scientists.com/2009/06/stay-on-target-ways-to-help-yourself-work">programming4scientitst &#8211; Stay on target!</a></li>
</ul>
<p>I think this is a major issue for all people working in bioinformatics, because compared to the classical computer scientist, we bioinformaticians have less culture on this topic, and generally there is less perception on which are the good practices on programming.</p>
<p>One classical example is the position of the monitors and how to sit in front of them:</p>
<div class="wp-caption aligncenter" style="width: 252px"><a href="http://www.codinghorror.com/blog/images/computing-ergonomics-monitor-position.png"><img title="right monitor position" src="http://www.codinghorror.com/blog/images/computing-ergonomics-monitor-position.png" alt="most of the bioinformaticians programs in a no-ergonomic position" width="242" height="265" /></a><p class="wp-caption-text">most of the bioinformaticians programs in a no-ergonomic position - you can program better  just by sitting more comfortably in front of the computer.</p></div>
<p>When I was still a master student and was looking for a place to start a phd, I was literally paranoic by this picture, because of every laboratory I visited as candidate, I found that the majority of the monitors where positioned wrongly. It&#8217;s a kind of non-sense: is the &#8216;publish or perish&#8217; mentality so strong that we don&#8217;t even have the time to care about how we sit in front of the computer?</p>
<p>Well, going back to the argument of this post, I think that the tools linked by <a href="http://www.programming4scientists.com/2009/06/stay-on-target-ways-to-help-yourself-work/">programming4scientists</a> can be very useful; however, the most effective way to avoid distractions is to work in a good team, and to interact with your team mates.</p>
<p>If you work on your project alone and never speak with your supervisor, you are more likely to waste a lot of time with distraction; but if you have clear objectives, and you have a clear focus on what you have to do every day, you will be a lot less likely to distract.</p>
<p>I think it is responsability of the group leader to help his students to avoid distractions, and to keep them always focused on a task. If I got distracted and go slower on my work, it is because I don&#8217;t have a clear idea of how or what to do next, and I don&#8217;t have nobody to ask it. At this point many people will say you that if you are a true scientist you must be able to carry your own project alone, but for me this sounds like an excuse or a lie.</p>
<p>There is a whole science who studies how to coordinate a group of programmers, and it is called &#8216;<a class="zem_slink" title="Software engineering" rel="wikipedia" href="http://en.wikipedia.org/wiki/Software_engineering">software engineering</a>&#8216;.</p>
<p>A good methodology which can give good results is <a href="http://extremeprogramming.org/rules/pair.html">pair programming</a>. I like more or less all the rules from the <a href="http://extremeprogramming.org/">extreme programming</a> phylosophy, also the &#8216;<a href="http://extremeprogramming.org/rules/standupmeeting.html">5 minutes stand-up meeting every day</a>&#8216; could be effective in the short term.</p>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><a class="zemanta-pixie-a" title="Reblog this post [with Zemanta]" href="http://reblog.zemanta.com/zemified/c32d8935-d0e6-439a-9012-3719e0833b3f/"><img class="zemanta-pixie-img" style="border: medium none; float: right;" src="http://img.zemanta.com/reblog_e.png?x-id=c32d8935-d0e6-439a-9012-3719e0833b3f" alt="Reblog this post [with Zemanta]" /></a><span class="zem-script more-related pretty-attribution"><script src="http://static.zemanta.com/readside/loader.js" type="text/javascript"></script></span></div>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fbioinfoblog.it%2F2009%2F06%2Fhow-to-stay-focused-on-your-work-programming%2F&amp;linkname=How%20to%20stay%20focused%20on%20your%20work%20%28programming%29"><img src="http://bioinfoblog.it/wp-content/plugins/add-to-any/share_save_120_16.png" width="120" height="16" alt="Share/Save/Bookmark"/></a>]]></content:encoded>
			<wfw:commentRss>http://bioinfoblog.it/2009/06/how-to-stay-focused-on-your-work-programming/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>ruffus, a tool to write scientific pipelines in python</title>
		<link>http://bioinfoblog.it/2009/06/ruffus-a-tool-to-write-scientific-pipelines-in-python/</link>
		<comments>http://bioinfoblog.it/2009/06/ruffus-a-tool-to-write-scientific-pipelines-in-python/#comments</comments>
		<pubDate>Wed, 03 Jun 2009 16:56:24 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Bioinformatica]]></category>

		<guid isPermaLink="false">http://bioinfoblog.it/?p=779</guid>
		<description><![CDATA[I would like to point to you ruffus, an alternative to Make as a tool to define pipelines, with a python syntax and many good things.
Here it is a link to the project:

ruffus &#8211; a tool to define scientific pipelines with a python syntax


and here it is also its home page at google/code:

ruffus google/code page

I [...]]]></description>
			<content:encoded><![CDATA[<p>I would like to point to you ruffus, an alternative to Make as a tool to define pipelines, with a python syntax and many good things.</p>
<p>Here it is a link to the project:</p>
<ul>
<li><a href="    *  http://ruffus.googlecode.com/svn/trunk/doc/html/index.html">ruffus &#8211; a tool to define scientific pipelines with a python syntax<br />
</a></li>
</ul>
<p>and here it is also its home page at google/code:</p>
<ul>
<li><a href="http://code.google.com/p/ruffus/">ruffus google/code page</a></li>
</ul>
<p>I have been looking for a python alternative to Makefiles for ages, and this project seems very interesting to me. With respect with scons, paver, waf, and other alternatives, this ruffus should be better because it has been designed specifically for pipelines in bioinformatics, while the others are to compile programs.</p>
<p>It can also plot a picture representing the workflow:</p>
<div class="wp-caption aligncenter" style="width: 586px"><a href="http://ruffus.googlecode.com/svn/trunk/doc/html/_images/maximal_mode.jpg"><img title="a pipeline with ruffus" src="http://ruffus.googlecode.com/svn/trunk/doc/html/_images/maximal_mode.jpg" alt="this tool can also draw representations of a pipeline in an image. However, I suppose this can be enhanced a bit :)" width="576" height="170" /></a><p class="wp-caption-text">A pipeline defined with the ruffus syntax and drawn as an image with a ruffus utility. It is nice to have this feature and it is easy to use it <img src='http://bioinfoblog.it/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> .</p></div>
<p>I have tried a few examples and it seems to be working fine. The only doubt I have is if it can work with Grid Engine systems (qsub/qmake like), but I have filed a bug report and hope that the developers will answer me.</p>
<p><a href="http://www.nodalpoint.org/2007/03/18/a_pipeline_is_a_makefile">Make</a> is a great tool, because it allows you to save shell statements with ease and write simple pipelines; however, its&#8217; syntax is a bit complicated and you can&#8217;t have more than one wild type character per rule. I have also tried <a href="http://taverna.sf.net/">taverna</a>: but it is too big and slow for me, I just need a way to run command line tools locally, while it seems to force you to use web services and so on. Finally, the <a href="http://skam.sourceforge.net/skam-intro.html">BioMake</a> syntax seemed fantastic, but I didn&#8217;t succeed in installing skam in my computer.</p>
<p>So, it is great to have another alternative to write pipelines, and this ruffus seems to work well. Maybe, if I&#8217;ll have time, I&#8217;ll write a short subjective comparison between all the pipeline-defining tools I know.</p>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fbioinfoblog.it%2F2009%2F06%2Fruffus-a-tool-to-write-scientific-pipelines-in-python%2F&amp;linkname=ruffus%2C%20a%20tool%20to%20write%20scientific%20pipelines%20in%20python"><img src="http://bioinfoblog.it/wp-content/plugins/add-to-any/share_save_120_16.png" width="120" height="16" alt="Share/Save/Bookmark"/></a>]]></content:encoded>
			<wfw:commentRss>http://bioinfoblog.it/2009/06/ruffus-a-tool-to-write-scientific-pipelines-in-python/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>a parser for KEGG pathways in python</title>
		<link>http://bioinfoblog.it/2009/06/a-parser-for-kegg-pathways-in-python/</link>
		<comments>http://bioinfoblog.it/2009/06/a-parser-for-kegg-pathways-in-python/#comments</comments>
		<pubDate>Tue, 02 Jun 2009 15:21:28 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Bioinformatica]]></category>

		<guid isPermaLink="false">http://bioinfoblog.it/?p=775</guid>
		<description><![CDATA[I am writing a parser for the kegg KGML format, the dialect of XML used to store kegg pathways.

I couldn&#8217;t find any existing module to do this in python. There is a similar library for R (KEGGgraph), maybe something for perl (I didn&#8217;t find it), but apparently nothing for python.
Here it is the code, hosted [...]]]></description>
			<content:encoded><![CDATA[<p>I am writing a parser for the kegg KGML format, the dialect of XML used to store kegg pathways.</p>
<p style="text-align: center;">
<div id="attachment_790" class="wp-caption aligncenter" style="width: 310px"><a href="http://bioinfoblog.it/wp-content/2009/06/sampleplot.png"><img class="size-medium wp-image-790" title="purine metabolism" src="http://bioinfoblog.it/wp-content/2009/06/sampleplot-300x225.png" alt="update: A KEGG pathway file, parsed with python and plotted with networkx" width="300" height="225" /></a><p class="wp-caption-text">update: A KEGG pathway file, parsed with python and plotted with networkx</p></div>
<p>I couldn&#8217;t find any existing module to do this in python. There is a similar library for R (<a href="http://www.bioconductor.org/packages/devel/bioc/html/KEGGgraph.html">KEGGgraph</a>), maybe something for perl (I didn&#8217;t find it), but apparently nothing for python.</p>
<p>Here it is the code, hosted on github:</p>
<ul>
<li><a href="http://github.com/dalloliogm/kegg-kgml-parser--python-/tree"> http://github.com/dalloliogm/kegg-kgml-parser&#8211;python-/tree</a></li>
</ul>
<p>It&#8217;s is really a simple and incomplete script, but now I understood which libraries to use and how to read the format, so I will be able to improve it as soon as I have free time.</p>
<p>It requires the <a href="http://networkx.lanl.gov/">networkx</a> library which is the standard for working with graphs and networks in python, ElementTree (already included in the standard lib) to parse the XML, and pylab/matplotlib for plotting.</p>
<p>Any suggestion is really appreciated. For me, this is the first time that I work with XML files.</p>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fbioinfoblog.it%2F2009%2F06%2Fa-parser-for-kegg-pathways-in-python%2F&amp;linkname=a%20parser%20for%20KEGG%20pathways%20in%20python"><img src="http://bioinfoblog.it/wp-content/plugins/add-to-any/share_save_120_16.png" width="120" height="16" alt="Share/Save/Bookmark"/></a>]]></content:encoded>
			<wfw:commentRss>http://bioinfoblog.it/2009/06/a-parser-for-kegg-pathways-in-python/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>slides and questions on bioinformatics from the Italian Pycon</title>
		<link>http://bioinfoblog.it/2009/05/slides-and-questions-on-bioinformatics-from-the-italian-pycon/</link>
		<comments>http://bioinfoblog.it/2009/05/slides-and-questions-on-bioinformatics-from-the-italian-pycon/#comments</comments>
		<pubDate>Tue, 12 May 2009 20:44:24 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Bioinformatica]]></category>

		<guid isPermaLink="false">http://bioinfoblog.it/?p=761</guid>
		<description><![CDATA[I am back in Barcelona after the italian Pycon.it, and I can finally publish the slides of my talk on &#8216;Python and Bioinformatics&#8217;:

I am sorry that the slides are in Italian. I could have written them in English, but I was too tired  
The talk went well, and I have been asked many questions. [...]]]></description>
			<content:encoded><![CDATA[<p>I am back in Barcelona after the italian Pycon.it, and I can finally publish the slides of my talk on &#8216;Python and Bioinformatics&#8217;:</p>
<p style="text-align: center;"><object width="425" height="355" data="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=pycon-090511033133-phpapp01&amp;rel=0&amp;stripped_title=pycon" type="application/x-shockwave-flash"><param name="allowFullScreen" value="true" /><param name="allowScriptAccess" value="always" /><param name="src" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=pycon-090511033133-phpapp01&amp;rel=0&amp;stripped_title=pycon" /><param name="allowfullscreen" value="true" /></object></p>
<p>I am sorry that the slides are in Italian. I could have written them in English, but I was too tired <img src='http://bioinfoblog.it/wp-includes/images/smilies/icon_sad.gif' alt=':(' class='wp-smiley' /> </p>
<p>The talk went well, and I have been asked many questions. Maybe I should have put some more practical example, and it was too generic, but the fact is that I wanted to discuss a bit about methods. I hope I didn&#8217;t bore anyone <img src='http://bioinfoblog.it/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p>I hope the people who asked me won&#8217;t mind if I post some of their questions here, but maybe it will make it easier to get better answers than mine.</p>
<ul>
<li>
<h3>how can I do business on <a class="zem_slink" title="Bioinformatics" rel="wikipedia" href="http://en.wikipedia.org/wiki/Bioinformatics">bioinformatics</a>? Would a framework or some support will be successfull?</h3>
</li>
</ul>
<p>Most of the people who attended the talk were informaticians, and they have a better mentality than our scientists, which only think of getting grants and doing our research.</p>
<p>I think that making business in bioinformatics is feasable, maybe even in Italy.</p>
<p>Creating a company to provide support services will have more chances to be successfull than the approach of writing a new software. There are a lot of laboratories which are in their &#8216;<em>earlier phases of adopting bioinformatics</em>&#8216;: many groups are realizing that it is cheaper to use the data produced by the big American centers instead of producing their own, so they are looking for somebody to guide them and teach them the fundamentals of programming and data management.</p>
<p>The problem is that it difficult to train as bioinformatician: it requires a lot of efforts, there are not so many courses, and maybe it is more worth to work in more remunerable fields like web programming and others.</p>
<ul>
<li>
<h3>I am a computer scientist interested in bioinformatics. Where can I study biology?</h3>
</li>
</ul>
<p>I couldn&#8217;t answer well to this question. I think that at least a master degree in bioinformatics is a good starting point, but I know of people working in this field that have just studied computer science.</p>
<p>It depends on which group you are going to work; but in general, it is only important to work in contact with a biologist to understand if what you are doing is good (what about pair programming?)</p>
<ul>
<li>
<h3>Is the speed of a language a limiting factor in bioinformatics? What about these solutions to speed up python?</h3>
</li>
</ul>
<p>Honestly I don&#8217;t think speed is much important in bioinformatics, because most of the bottleneck points are: the access to data on the disk, the network speed in some cases, and moreover the time you have to write the program.</p>
<p>A good idea to speed up python is to use psyco. I have tried it once, but then I forgot about it <img src='http://bioinfoblog.it/wp-includes/images/smilies/icon_sad.gif' alt=':(' class='wp-smiley' /> . Moreover, a good news is that they have solved the GIL problem in the most recent versions.</p>
<ul>
<li>
<h3>How can I search for overlapping regular expressions on a sequence, with python?</h3>
</li>
</ul>
<p>The python library for regular expression, re, lacks the option to look for overlapping motifs. For instance, if your regular expression is &#8216;C.&#8217;, and you parse the string &#8216;CCA&#8221;, you will only get a match (&#8217;CC&#8217;), and there is no way to look for two.</p>
<p>Some time ago I solved the problem by using a good library called <a href="http://fraenkel.mit.edu/TAMO/">TAMO</a>, created for working with motifs and advanced regular expressions. You can use it to do overlapping searches, look at the tutorial: <a href="http://fraenkel.mit.edu/TAMO/TAMO_tutorial.html">http://fraenkel.mit.edu/TAMO/TAMO_tutorial.html</a></p>
<ul>
<li>
<h3>How much is difficult to contribute a module to biopython?</h3>
</li>
</ul>
<p>Well, I have submitted a few patches to biopython, but the best people to ask this question are the developers theirselves.</p>
<p>In general, it is easier if your proposal doesn&#8217;t require to add new dependencies and if you provide some good use cases and tests already.</p>
<ul>
<li>
<h3>What about Make/taverna/other similar tools?</h3>
</li>
</ul>
<p>Somebody asked me why I don&#8217;t use taverna to create pipelines with my script.</p>
<p>I have tried taverna extensively, submitted some bug reports, and participated to a lot of their surveys; and I also  think that is a very good software.</p>
<p>The problems with taverna are: no testing support, no python support, not clear if you can work with local scripts only, can&#8217;t be launched from the shell, and make is more or less standard and installed everywhere.</p>
<p>Some alternatives to make, written in python: scons, paver, waf, and biomake (which I didn&#8217;t succeed to compile yet and is not in python)</p>
<ul>
<li>
<h3>Will the new unicode behaviour in python 3 slow down programs which read sequence files?</h3>
</li>
</ul>
<p>This is a question I have asked to another speaker. In python 3, all the strings will be unicode by default. But if I need to read sequences which only contains A, C, T, G, N (5 characters), why do I need to store them as unicode? Won&#8217;t it slow down all my parsers?</p>
<ul>
<li>
<h3>Any library to work with protein spectrometry?</h3>
</li>
</ul>
<p>I only know of this: <a href="http://www.massxpert.org/wiki">http://www.massxpert.org/wiki</a>/ but I don&#8217;t know if there is something better.</p>
<p>It seems it is difficult to work with spectrometry data, because every company has its own proprietary format and there is not an universal parser.</p>
<ul>
<li>
<h3>Other questions?</h3>
</li>
</ul>
<p>mmm maybe I forgot about some of the questions, and I will put them here later.</p>
<p>I am not a python/bioinformatics guru, but I thought it was a good opportunity to discuss over it with other people. Later I will write a post on the other talks of the conference&#8230;.</p>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><a class="zemanta-pixie-a" title="Reblog this post [with Zemanta]" href="http://reblog.zemanta.com/zemified/dbbb64e7-397d-42e4-a263-74acc7c00896/"><img class="zemanta-pixie-img" style="border: medium none; float: right;" src="http://img.zemanta.com/reblog_e.png?x-id=dbbb64e7-397d-42e4-a263-74acc7c00896" alt="Reblog this post [with Zemanta]" /></a><span class="zem-script more-related pretty-attribution"><script src="http://static.zemanta.com/readside/loader.js" type="text/javascript"></script></span></div>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fbioinfoblog.it%2F2009%2F05%2Fslides-and-questions-on-bioinformatics-from-the-italian-pycon%2F&amp;linkname=slides%20and%20questions%20on%20bioinformatics%20from%20the%20Italian%20Pycon"><img src="http://bioinfoblog.it/wp-content/plugins/add-to-any/share_save_120_16.png" width="120" height="16" alt="Share/Save/Bookmark"/></a>]]></content:encoded>
			<wfw:commentRss>http://bioinfoblog.it/2009/05/slides-and-questions-on-bioinformatics-from-the-italian-pycon/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Bioinformatics and databases &#8211; some links</title>
		<link>http://bioinfoblog.it/2009/05/bioinformatics-and-databases-some-links/</link>
		<comments>http://bioinfoblog.it/2009/05/bioinformatics-and-databases-some-links/#comments</comments>
		<pubDate>Mon, 04 May 2009 16:59:39 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Bioinformatica]]></category>
		<category><![CDATA[Programmazione]]></category>

		<guid isPermaLink="false">http://bioinfoblog.it/?p=752</guid>
		<description><![CDATA[I have found some useful links on databases and bioinformatics.
First, you must be introduced to the debate &#8216;Flat files against databases&#8217;:

nodalpoint &#8211; databases or flat files?
bioinformaticszen post on databases
what is a database, a flat file, etc..


Usually, people in bioinformatics start with using flat files for doing most of the things. Later, they learn relational databases [...]]]></description>
			<content:encoded><![CDATA[<p>I have found some useful links on databases and bioinformatics.</p>
<p>First, you must be introduced to the debate &#8216;Flat files against databases&#8217;:</p>
<ul>
<li><a href="http://www.nodalpoint.org/2007/05/02/database_or_flat_text_file">nodalpoint &#8211; databases or flat files?</a></li>
<li><a href="http://www.bioinformaticszen.com/software/using_a_database">bioinformaticszen post on databases</a></li>
<li><a href="http://websiteowner.info/articles/cgi/databasetypes.asp">what is a database, a flat file, etc..<br />
</a></li>
</ul>
<p>Usually, people in bioinformatics start with using flat files for doing most of the things. Later, they learn relational databases to handle bigger collections of data.</p>
<p>If you want to know in general about mysql, this post is full of very good links and slides:</p>
<ul>
<li><a href="http://amix.dk/blog/viewEntry/19439">blog post full of good links on mysql</a></li>
</ul>
<p>This is BioPostgres, a project to build some extensions to the <a class="zem_slink" title="PostgreSQL" rel="homepage" href="http://www.postgresql.org/">Postgres</a> database for bioinformatics (e.g. blast, go databases..)</p>
<ul>
<li><a href="http://phenomics.cs.ucla.edu/">BioPostgres, home<br />
</a></li>
<li><a href="http://www.bioinformatics.org/project/?group_id=793">a type for biological data for postgres</a></li>
<li><a href="http://archives.postgresql.org/pgsql-sql/2002-04/msg00318.php">postgres performance tips</a></li>
</ul>
<p>And this is on Postgres indexes, to speed up queries:</p>
<ul>
<li><a href="http://www.postgresql.org/docs/current/interactive/gin.html" target="_blank">http://www.postgresql.org/docs/current/interactive/gin.html</a></li>
<li> <a href="http://www.postgresql.org/docs/current/interactive/gist.html" target="_blank">http://www.postgresql.org/docs/current/interactive/gist.html</a></li>
</ul>
<p>If you program in python, I strongly recommend you this <a class="zem_slink" title="Object-relational mapping" rel="wikipedia" href="http://en.wikipedia.org/wiki/Object-relational_mapping">ORM</a> module called elixir (sqlalchemy):</p>
<ul>
<li><a href="http://elixir.ematia.de/trac/wiki/TutorialDivingIn">elixir, an extension to sqlalchemy</a></li>
</ul>
<p>See if you can follow the tutorial, even if you don&#8217;t know what an ORM is.</p>
<p>Sometimes a relational database is not the best option, especially if you have hierarchical or multidimensional data (e.g. a gene and a list of transcripts, etc..)</p>
<p><a class="zem_slink" title="Zope Object Database" rel="homepage" href="http://www.zope.org/">ZODB</a> is an object oriented database: it allows you to store python objects via the pickle modules:</p>
<ul>
<li><a href="http://faassen.n--tree.net/blog/view/weblog/2008/06/20/0">introduction to ZODB</a> (the most up to date)</li>
<li><a href="http://wiki.zope.org/ZODB/guide/zodb.html">Guide to ZODB and ZEO</a></li>
</ul>
<p>I am actually using zodb, and I think I will write a blog post on it later.</p>
<p>Another option for storing big data, in an hierarchial structure (a column can be a table containing other columns), is <a class="zem_slink" title="Hierarchical Data Format" rel="homepage" href="http://www.hdfgroup.org">HDF5</a>, a binary format used in physics and astronomy, which has a very good python API module called PyTables:</p>
<ul>
<li><a href="http://www.pytables.org/moin">PyTables</a>, API for HDF5</li>
</ul>
<p>&#8211;</p>
<p>Thanks to:</p>
<ul>
<li><a href="http://www.linkedin.com/pub/8/3ba/555">Brian Sutherland</a></li>
<li><a href="http://www.mailund.dk/index.php/2009/05/04/last-week-in-the-blogs-8/">mailund.dk</a></li>
<li><a href="http://bioinformaticszen.com/">bioinformaticszen</a></li>
</ul>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><a class="zemanta-pixie-a" title="Reblog this post [with Zemanta]" href="http://reblog.zemanta.com/zemified/6772bb6d-86e7-4912-8941-5a8755fb19db/"><img class="zemanta-pixie-img" style="border: medium none; float: right;" src="http://img.zemanta.com/reblog_e.png?x-id=6772bb6d-86e7-4912-8941-5a8755fb19db" alt="Reblog this post [with Zemanta]" /></a><span class="zem-script more-related pretty-attribution"><script src="http://static.zemanta.com/readside/loader.js" type="text/javascript"></script></span></div>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fbioinfoblog.it%2F2009%2F05%2Fbioinformatics-and-databases-some-links%2F&amp;linkname=Bioinformatics%20and%20databases%20%26%238211%3B%20some%20links"><img src="http://bioinfoblog.it/wp-content/plugins/add-to-any/share_save_120_16.png" width="120" height="16" alt="Share/Save/Bookmark"/></a>]]></content:encoded>
			<wfw:commentRss>http://bioinfoblog.it/2009/05/bioinformatics-and-databases-some-links/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Happy San Jordi (with delay)</title>
		<link>http://bioinfoblog.it/2009/04/happy-san-jordi-with-a-delay/</link>
		<comments>http://bioinfoblog.it/2009/04/happy-san-jordi-with-a-delay/#comments</comments>
		<pubDate>Sun, 26 Apr 2009 14:20:53 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Bioinformatica]]></category>

		<guid isPermaLink="false">http://bioinfoblog.it/?p=734</guid>
		<description><![CDATA[I would like to wish you all an Happy San Jordi, which is one of the most important National Feasts here in Catalunya.
Today (well, it was the 23th of April), boys are used to give a rose to their girlfriends, while girls use to give a book to their boyfriends.

What does this have to do [...]]]></description>
			<content:encoded><![CDATA[<p>I would like to wish you all an Happy<a href="http://en.wikipedia.org/wiki/St_George%27s_Day"> San Jordi</a>, which is one of the most important National Feasts here in Catalunya.</p>
<p>Today (well, it was the 23th of April), boys are used to give a rose to their girlfriends, while girls use to give a book to their boyfriends.</p>
<p><a href="http://www.hudin.com/img/photos/845.jpg"><img class="aligncenter" title="rose san jordi" src="http://www.hudin.com/img/photos/845.jpg" alt="" width="303" height="202" /></a></p>
<p>What does this have to do with bioinformatics?</p>
<p>Absolutely nothing, but somebody of which I won&#8217;t tell the name, has un-explicitely let me know that this blog is becoming too technical and that I should put some distraction every now and then. Happy San Jordi!! <img src='http://bioinfoblog.it/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fbioinfoblog.it%2F2009%2F04%2Fhappy-san-jordi-with-a-delay%2F&amp;linkname=Happy%20San%20Jordi%20%28with%20delay%29"><img src="http://bioinfoblog.it/wp-content/plugins/add-to-any/share_save_120_16.png" width="120" height="16" alt="Share/Save/Bookmark"/></a>]]></content:encoded>
			<wfw:commentRss>http://bioinfoblog.it/2009/04/happy-san-jordi-with-a-delay/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Thoughts on preparing slides</title>
		<link>http://bioinfoblog.it/2009/04/thoughts-on-preparing-slides/</link>
		<comments>http://bioinfoblog.it/2009/04/thoughts-on-preparing-slides/#comments</comments>
		<pubDate>Wed, 22 Apr 2009 23:19:52 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Bioinformatica]]></category>

		<guid isPermaLink="false">http://bioinfoblog.it/?p=730</guid>
		<description><![CDATA[In the last post, I forgot to put some thoughts I had when preparing the slides.

it is true that it is better to have a lot of slides, at least one or two per minute. When you show the same slide for too much time, like two or three minutes, you start seeing people in [...]]]></description>
			<content:encoded><![CDATA[<p>In the last post, I forgot to put some thoughts I had when preparing the slides.</p>
<ul>
<li>it is true that <span style="color: #ff0000;">it is better to have a lot of slides</span>, at least one or two per minute. When you show the same slide for too much time, like two or three minutes, you start seeing people in the audience loosing the point.</li>
<li><span style="color: #ff0000;">Putting code examples in a slideshow is a pain</span>. I have been told that you must keep it at least 20 pt of height and use a monospace font, and that an empiric rule to know whether the slide is readable is to watch it on the monitor from at least 2 meters of distance.<br />
There it seems to be no simple way to highlight code in an open office presentation. There is a nice extension to do it in writer, but unfortunately it doesn&#8217;t work with impress.<br />
I wonder whether one of the &#8216;most nerdish&#8217; way to prepare slides would be easier. I have tried with LaTeX once, but onestly, I have had a lot of troubles in writing python code and maintaining the indentation with LaTeX, and I don&#8217;t know whether there is some a simpler way to do it.</li>
<li><span style="color: #ff0000;">Using a different color</span> (like, for example, red) to highlight a word or a phrase<span style="color: #ff0000;"> is a lot better rather than using the bold effect</span>.</li>
<li>Maybe I am not very good at presenting talks (hope I am also not very bad), but<span style="color: #ff0000;"> I am better at preparing slides that can be published on Internet</span> and maybe used by other people as reference. Uff <img src='http://bioinfoblog.it/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </li>
</ul>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fbioinfoblog.it%2F2009%2F04%2Fthoughts-on-preparing-slides%2F&amp;linkname=Thoughts%20on%20preparing%20slides"><img src="http://bioinfoblog.it/wp-content/plugins/add-to-any/share_save_120_16.png" width="120" height="16" alt="Share/Save/Bookmark"/></a>]]></content:encoded>
			<wfw:commentRss>http://bioinfoblog.it/2009/04/thoughts-on-preparing-slides/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>An Introduction to Python and Bioinformatics</title>
		<link>http://bioinfoblog.it/2009/04/an-introduction-to-python-and-bioinformatics/</link>
		<comments>http://bioinfoblog.it/2009/04/an-introduction-to-python-and-bioinformatics/#comments</comments>
		<pubDate>Tue, 21 Apr 2009 09:59:53 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Bioinformatica]]></category>

		<guid isPermaLink="false">http://bioinfoblog.it/?p=718</guid>
		<description><![CDATA[Here in the prbb, we organize a series of talks called &#8216;PRBB technical seminars&#8216;.
This time, it was my turn to present something, so I have prepared an introduction to python for bioinformatics in general.
Have a look at it here:


I have talked about a bit of everything, insisting on three things:

python has a very good syntax [...]]]></description>
			<content:encoded><![CDATA[<p>Here in the prbb, we organize a series of talks called &#8216;<a href="http://bg.imim.es/technical-seminars/">PRBB technical seminars</a>&#8216;.</p>
<p style="text-align: left;">This time, it was my turn to present something, so I have prepared an <a href="http://www.slideshare.net/giovanni/introduction-to-python-for-bioinformatics">introduction to python for bioinformatics</a> in general.<br />
Have a look at it here:</p>
<p style="text-align: center;">
<p style="text-align: center;"><object width="425" height="355" data="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=introduction-090421044444-phpapp02&amp;rel=0&amp;stripped_title=introduction-to-python-for-bioinformatics" type="application/x-shockwave-flash"><param name="allowFullScreen" value="true" /><param name="allowScriptAccess" value="always" /><param name="src" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=introduction-090421044444-phpapp02&amp;rel=0&amp;stripped_title=introduction-to-python-for-bioinformatics" /><param name="allowfullscreen" value="true" /></object></p>
<p>I have talked about a bit of everything, insisting on three things:</p>
<ul>
<li>python has a very good syntax and it is easier to learn than perl</li>
<li>it has good tools to do testing (which are very important for bioinformatics)</li>
<li>there are many good libraries for bioinformatics, even if maybe perl and R are better supported (unfortunately)</li>
</ul>
<p>Well, I prepared to talk during the Easter Holydays, so I have decided to do it in a quick and easy way: and I had the genial idea of putting a lot of happy/sad faces <img src='http://bioinfoblog.it/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> , even if in the end, I decided to not put too many sad faces, because I thought it would have been unpopular for a talk.</p>
<p>If you can give me some feedback, I will really appreciate it <img src='http://bioinfoblog.it/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fbioinfoblog.it%2F2009%2F04%2Fan-introduction-to-python-and-bioinformatics%2F&amp;linkname=An%20Introduction%20to%20Python%20and%20Bioinformatics"><img src="http://bioinfoblog.it/wp-content/plugins/add-to-any/share_save_120_16.png" width="120" height="16" alt="Share/Save/Bookmark"/></a>]]></content:encoded>
			<wfw:commentRss>http://bioinfoblog.it/2009/04/an-introduction-to-python-and-bioinformatics/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Going to PyCon3.it!!</title>
		<link>http://bioinfoblog.it/2009/04/going-to-pycon3it/</link>
		<comments>http://bioinfoblog.it/2009/04/going-to-pycon3it/#comments</comments>
		<pubDate>Thu, 02 Apr 2009 14:53:17 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Bioinformatica]]></category>

		<guid isPermaLink="false">http://bioinfoblog.it/?p=702</guid>
		<description><![CDATA[PyCon.it is the yearly conference on python in Italy, usually yeld in the beatiful city of Firenze (Florence). This year, the keynote talk will be given by Guido Van Rossum himself.
Earlier I proposed a talk on python and bioinformatics for the pycon3, and I have just been told that it has been accepted.
So, now I [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.pycon.it/">PyCon.it</a> is the yearly conference on <a class="zem_slink" title="Python (programming language)" rel="homepage" href="http://www.python.org/">python</a> in Italy, usually yeld in the beatiful city of Firenze (<a class="zem_slink" title="Florence" rel="geolocation" href="http://maps.google.com/maps?ll=43.7716666667,11.2536111111&amp;spn=0.1,0.1&amp;q=43.7716666667,11.2536111111%20%28Florence%29&amp;t=h">Florence</a>). This year, the keynote talk will be given by Guido Van Rossum himself.</p>
<div class="wp-caption aligncenter" style="width: 349px"><a href="http://www.pycon.it/"><img title="pycon3" src="http://www.pycon.it/static/p3/i/bg-guido.jpg" alt="Guido Van Rossum will be at Pycon.it this year" width="339" height="186" /></a><p class="wp-caption-text">Guido Van Rossum will be at Pycon.it this year</p></div>
<p>Earlier I proposed a talk on <a href="http://www.pycon.it/conference/talks/python-and-bioinformatics">python and bioinformatics</a> for the pycon3, and I have just been told that it has been accepted.</p>
<p>So, now I have to prepare a seminar on &#8216;<a href="http://www.pycon.it/conference/talks/python-and-bioinformatics">Python and Bioinformatics</a>&#8216; before May. Do you have any suggestions?</p>
<p>This is the <a href="http://www.pycon.it/pycon3/programma/">schedule</a> of the workshop. All the talk will be live-translated in English.</p>
<p>See you there!!</p>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><a class="zemanta-pixie-a" title="Zemified by Zemanta" href="http://reblog.zemanta.com/zemified/56a222b1-a4af-4577-94f5-7d70873802d0/"><img class="zemanta-pixie-img" style="border: medium none; float: right;" src="http://img.zemanta.com/reblog_e.png?x-id=56a222b1-a4af-4577-94f5-7d70873802d0" alt="Reblog this post [with Zemanta]" /></a><span class="zem-script more-related"><script src="http://static.zemanta.com/readside/loader.js" type="text/javascript"></script></span></div>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fbioinfoblog.it%2F2009%2F04%2Fgoing-to-pycon3it%2F&amp;linkname=Going%20to%20PyCon3.it%21%21"><img src="http://bioinfoblog.it/wp-content/plugins/add-to-any/share_save_120_16.png" width="120" height="16" alt="Share/Save/Bookmark"/></a>]]></content:encoded>
			<wfw:commentRss>http://bioinfoblog.it/2009/04/going-to-pycon3it/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>molecularlab sold to a commercial company!</title>
		<link>http://bioinfoblog.it/2009/04/molecularlab-sold-to-a-commercial-company/</link>
		<comments>http://bioinfoblog.it/2009/04/molecularlab-sold-to-a-commercial-company/#comments</comments>
		<pubDate>Tue, 31 Mar 2009 23:25:13 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Bioinformatica]]></category>

		<guid isPermaLink="false">http://bioinfoblog.it/?p=672</guid>
		<description><![CDATA[I am really horryfied by a mail that I have just received from Riccardo Fallini, Molecularlab.it&#8217;s founder and former owner.

Molecularlab is a open web italian scientific community, and this is a pretty reductive description to say that it is the community where I have learned everything I know about e-science and the importance of communication [...]]]></description>
			<content:encoded><![CDATA[<p>I am really <strong>horryfied</strong> by a mail that I have just received from Riccardo Fallini, Molecularlab.it&#8217;s founder and former owner.</p>
<p style="text-align: center;"><a href="http://loopsflair.tk/"><img class="aligncenter" title="loops flair logo" src="http://www.molecularlab.it/images/header/loops-flair_bought_animated.gif" alt="" width="468" height="60" /></a></p>
<p><a href="http://www.molecularlab.it/forum">Molecularlab</a> is a open web italian scientific community, and this is a pretty reductive description to say that it is the community where I have learned everything I know about e-science and the importance of communication between scientists throught Internet.</p>
<p>I have been a member of the <a href="http://www.molecularlab.it/forum">Molecularlab</a> community since at least 2004, when I was still a 2nd year bachelor student. Over the years, I became one of the most active users, and eventually a member of the Staff and moderator of the forum.</p>
<p>It has been so nice, a great school where I have learned most of what I know about bioinformatics, and also molecular biology and other fields. Imagine being in a place where you answer two random questions about bioinformatics every day (how do I do e-pcr? which database for promoters of Macaca mulatta? how does blast works? what is an HMM? and many more): this is what I have been doing for years in molecularlab.</p>
<p>So today, April the first 2009, <a href="http://www.molecularlab.it/about/riccardo-fallini.asp">Riccardo</a> must have gone crazy or something like this.</p>
<p>He sold the whole web site to a most unknown company called &#8216;<a href="http://loopsflair.tk/">Loops Flair</a>&#8216;: I have never heard of them before, but it looks like they are some kind of weird evangelists doing research on <em>Danio. rerio</em>.</p>
<p>Look for example at the innovative technologies they offer: there is this <a href="http://lnx.molecularlab.it/loops-flair/index.php?option=com_content&amp;view=category&amp;layout=blog&amp;id=37&amp;Itemid=57">DRMPEK technology</a> to produce fishes with multiple colors at a very low cost:</p>
<p style="text-align: center;"><a href="http://lnx.molecularlab.it/loops-flair/index.php?option=com_content&amp;view=category&amp;layout=blog&amp;id=37&amp;Itemid=57"><img class="aligncenter" title="GloGish loops flair" src="http://upload.wikimedia.org/wikipedia/commons/f/f2/GloFish.jpg" alt="" width="273" height="181" /></a></p>
<p style="text-align: left;">and there is this <a href="http://lnx.molecularlab.it/loops-flair/index.php?option=com_content&amp;view=article&amp;id=57:protein-unfolder&amp;catid=38:in-silico&amp;Itemid=67">Protein Unfolder</a>:</p>
<p style="text-align: center;"><a href="http://lnx.molecularlab.it/loops-flair/index.php?option=com_content&amp;view=article&amp;id=57:protein-unfolder&amp;catid=38:in-silico&amp;Itemid=67"><img class="aligncenter" title="protein unfolder" src="http://python.rice.edu/~kolomeisky/protein-DNA.jpg" alt="" width="194" height="148" /></a></p>
<p style="text-align: left;">To automatically get a vector with a DNA plasmide from the structure of one unsequenced protein.</p>
<p style="text-align: left;">But what is worst, is the collection of gadgets they offer for free to any new user:</p>
<p style="text-align: left;"><a href="http://loopsflair.tk/"><img class="aligncenter" title="usb fish" src="http://www.sz-wholesale.com/uploadFiles/upimg8%5CFish-Shaped-USB-Disk_164548.jpg" alt="" width="294" height="198" /></a></p>
<p style="text-align: left;">I can&#8217;t believe <a href="http://www.molecularlab.it/about/riccardo-fallini.asp">Riccardo</a> sold all the whole <a href="http://www.molecularlab.it/forum">Molecularlab</a> for just a couple of fish-like USB pens.</p>
<p style="text-align: left;">Now the whole Italian scientific community remains without a good support online, and I would like to know what Riccardo and <a href="http://www.molecularlab.it/forum/profilo.asp?id=26">Nicola</a> will explain to the poor users and students.</p>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fbioinfoblog.it%2F2009%2F04%2Fmolecularlab-sold-to-a-commercial-company%2F&amp;linkname=molecularlab%20sold%20to%20a%20commercial%20company%21"><img src="http://bioinfoblog.it/wp-content/plugins/add-to-any/share_save_120_16.png" width="120" height="16" alt="Share/Save/Bookmark"/></a>]]></content:encoded>
			<wfw:commentRss>http://bioinfoblog.it/2009/04/molecularlab-sold-to-a-commercial-company/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>
