Thesis deposited!! Here is my preface

I have just deposited my PhD thesis! If everything goes well, I will defend it within a few months. It took me a while, but I am there at last!

I cannot post the thesis online yet, but in the meantime I would like to at least post the preface I wrote for it.

My thesis is about the detection and characterization of signatures of selection in the human genome. Thus, the preface is about the ethical problems faced in human population genetics, and tells a little story about a mistake made by an early anthropologist. I hope that you will enjoy it. As for me, I am going to take a short break to celebrate.

 

Preface

Toward the end of the 18th century, the German anthropologist Johann Friedrich Blumenbach wrote a book on the origins of mankind, with the aim of demonstrating that all humans belong to the same species. The society of the 18th century was much more segregated than our modern society, and some people, including renowned scientists, believed that blacks and American Indians did not belong to the same species as the white man. Blumenbach, who was a strong opponent of racist theories, decided to write a book to demonstrate that all humans have a common origin, and that there is no scientific basis for any discrimination.

In the end, Blumenbach succeeded in his noble intentions, but a small design mistake in his book led to a misinterpretation that he would not have wanted. In his book, Blumenbach listed the human populations in the following order: American, Mongolian, Caucasian, Malaysian and African. Since he believed that the human species originated in the Caucasian region, he deliberately put the Caucasian population in the middle, as a way to remind his readers that all human beings have a common origin. Unfortunately, this tiny detail was interpreted as proof that the Caucasian was the purest of all human races. People believed that if even he, the most egalitarian scientist of the time, positioned white people at the center of the Geometry of races, it was because they had a special importance.

This error shows how delicate it is to work in the field of Human Population Genetics. If Blumenbach had decided to list the populations in a different order, for example, by placing Caucasians in the second position, events like the Jim Crow laws in the United States and even the Nuremberg laws in Germany would not have had the same scientific justification they had. A whole life spent demonstrating that all people are equal was forgotten because of a bad decision in listing the names of some populations. Blumenbach was a strong champion of equality, but his mistake affected the lives of innocent people.

This thesis is dedicated to Johann Friedrich Blumenbach, with the hope that learning from his mistake will protect me from making similar errors. The work presented here describes new methods to analyze human population genetics data, and specifically, to detect genes and alleles that have given a selective advantage to a human population. Nevertheless, these “selective advantages” are only relative to events to which our ancestors were exposed in the past. The only reason why we study them is to understand how our genome works, with the aim of designing better medicines and improving our health.

The field of Human Population Genetics is in a delicate position at this moment. We live in times of cheap genome sequencing, and we can expect that, in the near future, genome sequencing will become a component of our daily lives. Moreover, the appearance of new communication media has made science more accessible to everybody, with good and bad implications. This means that the research that is being written right now by population geneticists will soon be read not only by scientists, but also by people moved by other interests. It is difficult to predict how our work will be interpreted, just as it was difficult, in the 18th century, to predict that a mistake in listing populations would have such a negative impact.

I hope that those who read this thesis will do so with a positive mind. I have done my best to avoid any concept that may be misinterpreted, but my lack of experience may not have allowed me to find all the potential flaws. I hope that the people who read this thesis will be savvy when they encounter mistakes, and that they will be stimulated to learn more about this subject. In the end, they will discover that despite the errors that scientists can make, Blumenbach was right in his intentions: all human beings belong to the same species, and there is no scientific basis for any form of discrimination.

 

(This preface is inspired by the chapter “The Geometer of Race” in the book “I Have Landed” by S. J. Gould, 2003, and by the book “Fatal Invention” by Dorothy Roberts, 2011)

Two short “Agile Bioinformatics” talks

I have just come back from the Programming for Evolutionary Biology course in Leipzig, version 2013!! The course is still going on, but unfortunately this year I could not stay for the whole three weeks, as I have things to do here in Barcelona.

This year, apart from the “Introduction to Linux” module, I also taught a short module on “Best Practices for programming in bioinformatics”. It was pure fun; I think I have never enjoyed giving a talk so much. I explained one part about Version Control and another about Scrum, and people were really excited about it. To give you an idea of how much people liked this talk, consider that three people bought me a beer afterwards, which for me is the highest compliment for a talk.

I have uploaded the two slideshows to slideshare. Unfortunately, the best part of the talk was a live demonstration of how I use these practices in my daily work, but at the moment I cannot make these examples publicly available. However, you should be able to follow the slideshows anyway.

 

Notes from a “Write it clearly” course

I recently took a course on improving English writing skills for researchers. These are my notes, organized as a series of “Do and Do not” lists, plus a separate list for each section of a research paper.

Feel free to have a look at them and make use of them. If you have any comments, you can add them here or to the table. Have a happy paper writing day!

click to access the notes.

I wrote a videogame for the Wii

I wrote a small web game for the “Week of Science 2012” (Semana de la Ciencia), a science outreach initiative organized in Spain. I took part in it as a member of the Institut de Biologia Evolutiva of Barcelona, the institution to which I belong. The game is in Spanish, but I think anybody can understand it without translation. Click on the image to play with it:

The “Phylogenetic Tree” for the “Semana de la Ciencia” in Barcelona.

If the game is not shown correctly, click on this link: IBE phylogenetic game sc2012

In short, we had 15 minutes to explain to a class of school students (from 12 to 18 years old) how to make phylogenetic trees. This is how we organized the time:

  • In the first five minutes, we gave a short presentation explaining that we all come from a common ancestor, and that our work as evolutionary biologists is to reconstruct the tree of life. We also explained what a phylogenetic tree is, and how we reconstruct it.
  • In the next three minutes, we played the first game. This game was quite easy, and was meant to check whether the students understood how phylogenetic trees are constructed. During this first game, one volunteer student had to decide where to put a mammal, a bird and a jellyfish in a phylogenetic tree.
  • In the remaining minutes, we played the second game, which was a bit of a trick. Students had to reconstruct the phylogenetic tree of four protists. Have a look at the “Juego 2: protists” to see it. This game was tricky because there is no way to come up with the correct solution just by looking at the organisms. In fact, after letting the students play for a while, we showed them that the only way to know the real phylogenetic tree was to use the DNA sequences. Then we had a few more slides explaining how mutations in DNA sequences can be used to reconstruct the history of changes in evolution.
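The idea behind the second game, that differences between DNA sequences tell us which species are most closely related, can be sketched in a few lines of Python. This is only a minimal illustration, not the code of the game: the sequences and the names `protist_A` to `protist_D` are made up, and counting mismatches between aligned sequences is a deliberately simplified stand-in for real phylogenetic methods.

```python
from itertools import combinations


def hamming_distance(seq_a, seq_b):
    """Count the positions at which two aligned sequences differ."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(1 for a, b in zip(seq_a, seq_b) if a != b)


def closest_pair(sequences):
    """Return the pair of names whose sequences differ the least."""
    return min(
        combinations(sorted(sequences), 2),
        key=lambda pair: hamming_distance(sequences[pair[0]], sequences[pair[1]]),
    )


# Hypothetical toy data: four short, already aligned sequences.
toy_sequences = {
    "protist_A": "ACGTACGTAC",
    "protist_B": "ACGTACGTTC",
    "protist_C": "ACTTACCTTG",
    "protist_D": "GCTTACCTAG",
}

if __name__ == "__main__":
    first, second = closest_pair(toy_sequences)
    print(first, second, hamming_distance(toy_sequences[first], toy_sequences[second]))
```

The pair with the fewest differences would be joined first in the tree; repeating the same step on the remaining groups is the intuition behind distance-based tree reconstruction.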

To make things a bit more entertaining, we also connected a Wii remote to the computer, so the student who played the game had to use it as a mouse. This was fun to set up, and I think I will use a Wii remote in my next talk :-).

Fitting the activity into 15 minutes was a bit tight, but I think that more or less all the students understood the basic concept. At least, some asked questions, and in general, they seemed to like the game. I hope they will at least remember that DNA can be used to study how species have evolved :-).

If you want to customize the page, the code is available on bitbucket:

This was the first time I programmed something in JavaScript, so the code is a terrible mess. There is a lot of code duplication, and a lot of patchy fixes. But as Agile Programmers say, “Code first and Refactor later”. I think I will work on cleaning up this code for next year, so if you have any suggestions on how to make it better, please join the repository on bitbucket.

Gamestorming for bioinformatics

Most meetings in academic research groups are awful. I have attended meetings that lasted hours and hours, and didn’t produce any useful output. I know researchers who try to avoid meetings as much as they can, and prefer to work by themselves, because of too many bad experiences. In the end, the problem is that scientists are not very good communicators, and most PIs are not trained to be group leaders, so meetings end up being very boring and time wasting, more harmful than useful.

Fortunately, this year, thanks to a meetup group here in Barcelona, I discovered that there are many ways to improve meetings and make them more interesting. The most interesting is the concept of “Gamestorming”, which is based on transforming group meetings into “games”. If instead of inviting people to attend a meeting you ask them to take part in a short game, they are more likely to participate actively and make good contributions.

Most gamestorming techniques involve blackboards and post-its, and ask people to use them to explain their own opinions. A simple example of a gamestorming meeting would be a planning meeting where the group leader splits a blackboard into three sections, one for listing different “Project Proposals”, and the other two for the “Pros” and “Cons” of each project proposal, and asks the participants to fill the blackboard using post-its. If you want a good overview of techniques for brainstorming in general, I can recommend the book “Gamestorming”, by Gray, Brown, Macanufo, from O’Reilly, which I am reading these days.

In any case, I have been thinking about which planning “games” can be adopted in bioinformatics, or by researchers in general. Here is a list of what I have introduced or plan to introduce to my group:

Continue reading

N-Glycosylation – one pathway, two distinct selective constraints

Our group just published a new paper in BMC Systems Biology. The title is Distribution of events of positive selection and population differentiation in a metabolic pathway: the case of asparagine N-glycosylation. It is already on the journal’s web page.

The pathway of N-Glycosylation can be ideally split into two separate parts, one upstream and one downstream of a process known as the Calnexin/Calreticulin Cycle, in which an intermediate product of the pathway is involved. In theory, given their function, we can hypothesize that the two parts of the pathway are exposed to different selective constraints, and evolve at different paces among human populations.

The biology and function of the two parts of the pathway are explained in detail in the article, but I will try to summarize them here. The upstream part of this pathway is required for the Calnexin/Calreticulin Cycle, a mechanism of folding quality control, so we can expect that all of its genes are conserved among populations. On the other hand, the downstream part of the pathway is involved in host-pathogen interactions, and can be expected to be more variable when comparing populations that adapted to different environments. In the article we have shown that, in fact, signatures of population differentiation are more abundant in the downstream part of the pathway.

Unfortunately I don’t have much time to prepare a good presentation to illustrate the paper, but I have uploaded a short summary to slideshare. Have a look at it if you are interested:

You can also check this previous post, where I explained briefly that the main theme of work done in our lab is to study how selective constraints are distributed along the genes of a pathway.

Introduction to Unix systems for Evolutionary Biologists – slides online

Here are the slides of the “Introduction to Unix-like systems” lecture I gave last Saturday at the “Programming for Evolutionary Biology” workshop in Leipzig.

In these slides, I did my best to communicate to the students what the philosophy behind Unix systems is and why they have been so important in the past. The Unix philosophy is in reality an approach to data analysis and programming: I am happy if I have been able to convince the students that, by studying how the first programmers approached the problem of data analysis, they will be able to learn good programming practices, and avoid mistakes that were already overcome many years ago.
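One way to picture this approach, outside the slides, is a chain of small tools that each do one thing and pass their output to the next, as in a shell pipeline. The sketch below imitates that style with Python generators; the function names are loosely modelled on `grep` and `cut` but are hypothetical helpers written for this example, and the tab-separated table is made up.

```python
def read_lines(text):
    """Yield the lines of a text, one at a time."""
    for line in text.splitlines():
        yield line


def grep(lines, word):
    """Keep only the lines that contain the given word."""
    for line in lines:
        if word in line:
            yield line


def cut(lines, field, sep="\t"):
    """Extract one column (0-based) from each separated line."""
    for line in lines:
        yield line.split(sep)[field]


def count_unique(items):
    """Count how many times each item occurs."""
    counts = {}
    for item in items:
        counts[item] = counts.get(item, 0) + 1
    return counts


# Hypothetical toy table: gene name and chromosome, tab-separated.
toy_table = "geneA\tchr1\ngeneB\tchr2\ngeneC\tchr1\nnote\tignored\n"

# Each step does one small job; together they answer
# "how many genes are listed on each chromosome?"
genes_per_chromosome = count_unique(cut(grep(read_lines(toy_table), "gene"), 1))

if __name__ == "__main__":
    print(genes_per_chromosome)
```

Each function can be tested and reused on its own, which is the main point of composing small tools instead of writing one large script.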

I would like to thank my colleague Brandon Invergo and my supervisor Hafid Laayouni for suggestions on how to improve the slides. Enjoy!

Continue reading

Planning an 8-hour “Introduction to Linux” course with trello

Next week I am going to give an 8-hour “Introduction to Linux” course at the “Programming for Evolutionary Biology” workshop in Leipzig. In this post, I will describe how I have used a nice planning tool called “trello” to make the schedule of the course.

You should know that I am a big fan of using small paper cards to organize things. I started by using CRC cards from the Extreme Programming techniques, and now the way I organize my time is similar to the Kanban technique, although I more or less arrived at it independently. In simpler words, I have the habit of cutting A4 papers into 8 smaller A6 papers, the size of a post-it, and using them to take notes and to plan my projects. If you visit my office, it is full of collections of “A6” papers everywhere 🙂

One day I may prepare a blog post about how I organize my projects with A6 papers. For now, just consider that trello basically allows me to do on a web page what I usually do on paper. Also, trello allows me to share workflows with other people on the Internet. For example, I can show you the schedule of the Linux course that I have made:

my trello board for the "Introduction to Linux" course. Click to see it!

So, I used trello to make 5 distinct lists of cards, one for each of the 5 parts that compose the course. In each of these lists, I filled in some cards to describe the most important topics that I wanted to talk about in that part of the course. I have used a red label to highlight the most important message to transmit in each part of the course, the “Take-Home” message.

Continue reading