tags:
- "#r-programming"
- "#biotech"
Evaluating Clustering algorithms using xgenes
* Chromosome == Set of Genes
START
Generate the initial population
Compute fitness
REPEAT
Selection // Select fittest individuals
Crossover // Threshold for mixing of genes. e.g. 3 means, it stops after 3 genes mix between parents.
// The remaining unmixed genes are all could be from single Male or Female.
Mutation // Happens with low probability
Compute fitness
UNTIL population has converged // New population not much different from old.
STOP
* Before emergence of bioinformatics, there were only two ways to conduct biological experiments :
- Within a living organism (in vivo, meaning in living in Latin)
- In an artificial environment (in vitro, meaning in glass in Latin)
The field of bioinformatics is considered as in silico (meaning in silicon in Latin)
First of all you will have to learn a bit about biology; genetics and genomics to be specific.
This will include studying about genes, DNA, RNA, protein structures, etc.
Then you will have to study about biological sequences (for example, sequences found in DNA, RNA and proteins) and techniques to discover and analyze various patterns in them.
For creating drugs, we can understand the disease using computational tools, identify the disease cause and treat with suitable drugs accordingly, rather than merely treating the symptoms. ???
Algorithms, genomics, proteomics
Current research in bioinformatics can be classified into:
(iii) Computer Aided Drug Designing : computational methods to simulate drug-receptor interactions. CADD methods are heavily dependent on bioinformatics tools, applications and databases. (See: http://www.mpi-inf.mpg.de/departments/d3/areas/docking.html )
(iv) Biological database: collected from experiments, literature, and computational analyses. Information contained in biological databases includes gene function, structure, localization (both cellular and chromosomal), clinical effects of mutations as well as similarities of biological sequences and structures.
(See: http://www.mrc-lmb.cam.ac.uk/genomes/madanm/pres/biodb.htm )
(v) Biological Data Mining: Biological Data mining is the discovery of useful knowledge from biological databases. Some of the most popular tasks are classification, clustering, association and sequence analysis, and regression. (See: http://cs.salemstate.edu/hatfield/teaching/courses/DataMining/M.htm )
(vi) Microarray informatics: Microarray Technology is a powerful tool to monitor gene expression or gene expression changes of hundreds or thousands of genes in a single experiment.
The services provided by https://med.nyu.edu/chibi/services/microarray-informatics :
Microarray-centered informatics is currently applied to primarily two high-capacity profiling areas:
We provide assistance with these following modules of typical experimental workflows: