Write a short Perl script which prints out strings containing common words associated with statistics.
Follow these steps to get your first Perl script up and running:
You will notice the use of \n at the end of the print command lines. This symbol stands for newline. Try deleting these from the ends of this test program and see what happens.
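The effect of \n can be seen in a minimal sketch like the one below. The strings are placeholders chosen for illustration, not the test program from the steps above.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Placeholder strings, for illustration only.
my $first  = "mean median mode";
my $second = "range variance";

# With \n: each print ends its own line.
print "$first\n";
print "$second\n";

# Without \n: the two strings run together on a single line.
print $first;
print $second;
print "\n";   # finish the line so the shell prompt starts on a new one
```

The first two print statements produce two separate lines of output. The last three produce one line, `mean median moderange variance`, because nothing separates the strings.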
Bioinformatics and Biostatistics
You've probably heard terms such as average, mean, standard deviation, median, range, probability, odds, interval, ratio, sensitivity, correlation, and prevalence. These are all terms which are part of the vocabulary of statistics. In this unit we will introduce you to statistics and show you a little about how statistics can be applied to biological data (and also how Perl can help you with this endeavor).
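As a small illustration of how Perl can help, the sketch below computes three of the terms just mentioned (mean, median, and range) for a short list of numbers. The data values and the subroutine names `mean`, `median`, and `value_range` are made up for this example; `sum`, `min`, and `max` come from the core List::Util module.

```perl
#!/usr/bin/perl
use strict;
use warnings;
use List::Util qw(sum min max);

# Made-up measurements, for illustration only.
my @data = (4, 8, 15, 16, 23, 42);

# Mean: the sum of the values divided by how many there are.
sub mean {
    my @values = @_;
    return sum(@values) / scalar(@values);
}

# Median: the middle value of the sorted list, or the average
# of the two middle values when the count is even.
sub median {
    my @sorted = sort { $a <=> $b } @_;
    my $count  = scalar(@sorted);
    my $mid    = int($count / 2);
    return $count % 2
        ? $sorted[$mid]
        : ($sorted[$mid - 1] + $sorted[$mid]) / 2;
}

# Range: the largest value minus the smallest.
sub value_range {
    return max(@_) - min(@_);
}

printf "mean:   %.2f\n", mean(@data);          # 18.00
printf "median: %.2f\n", median(@data);        # 15.50
printf "range:  %d\n",   value_range(@data);   # 38
```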
You've probably heard of the field of bioinformatics. Bioinformatics is all about making sense of biological data. Probably the most important tool used in bioinformatics is the computer. The two computer languages used most in the field of bioinformatics are Perl and Java (with C/C++ and Python also being very important). In this unit we will limit our discussion to biostatistics. Biostatistics is all about making sense of the statistics generated through the analysis of biological data. Biostatistics is a sub-discipline of bioinformatics. Computers and computer languages are also very important for performing biostatistical analysis.
One thing is certain about biological data: there is a lot of it. In fact, a HUGE amount of data is being generated on a daily basis. The sheer fact that one human genome contains 3.1 billion base pairs (something we will discuss at great length in unit two) and that there are billions of humans (all with slightly different genomes) should help you get some perspective on the enormous amount of information that could be gathered just on the contents of the human genome. Add to that the enormous number of species of living things (all of which have genomes of their own) that could be studied, and you get even more data that could potentially be analyzed.
Biology isn't just about the study of genomes, however. Biologists conduct experiments dealing with the effects of potential pharmaceuticals on living organisms, how proteins are constructed, the structure of various microorganisms, biological pathways in the body, the function of various systems within the body, and many other interesting topics. All of these studies yield information, usually lots of it. So, again, we see the importance of bioinformatics and its sub-field, biostatistics.
ASSIGNMENT:
You will write a simple Perl script which produces four lines of output. Each line will contain at least four common words associated with the field of statistics.