Saturday, July 23, 2011

Computational BioHanziology [1]

The basic elements in computational biology are strings or sequences describing DNA or proteins. A number of questions thus require the analysis of sets of strings and the extraction of (grammatical) rules and patterns.


所以如果我們把漢字轉換成生物序列,生物資訊與漢字資訊就可以變成交流學門。這也可以是 BioNLP 的一個新領域。


We are dealing with strings, trees and graphs. When dealing with strings, we typically manipulate a 4-letter alphabet {A,T,G,C} or a 20-letter alphabet if we intend to assemble the nucleotides 3 * 3 into amino-acids in order to define proteins. There are trees involved in the patterns or in the secondary structure, and even graphs in the tertiary structure. 




No comments:

Post a Comment