M
michaelzhao
Hey,
I am making a program to tally up the nucleic acid bases of E. Coli.
This data will be used in bioinformatics research to generate a Markov
Matrix.
However, I am just beginning PERL and ran into a slight problem.
Basically, I need the frequencies of the 4 bases, Adenosine (A),
Thymine (T), Cytosine (C), and Guanine (G). I made a PERL script to
tally up the total counts of the bases. In my case, there are 6254
Adenosine, 4957 Thymine, 4245 Cytosine, and 3534 Guanine.
In order to find the frequencies, all I would do is divide each base
count by the total.
However, here is my problem. Instead of doing it globally. I need to
be able to specify an arbitrary start and stop position to start
tallying the occurences of the bases and also to find the frequencies
of the bases in that particularly defined area.
Herein lies my problem. I have not a clue how to go about doing this
task. I've been looking online for a solution but haven't really found
one. If anyone can suggest an idea or function(s) I could use to go
about doing this task. I would be much obliged. Thanks!
~Michael
I am making a program to tally up the nucleic acid bases of E. Coli.
This data will be used in bioinformatics research to generate a Markov
Matrix.
However, I am just beginning PERL and ran into a slight problem.
Basically, I need the frequencies of the 4 bases, Adenosine (A),
Thymine (T), Cytosine (C), and Guanine (G). I made a PERL script to
tally up the total counts of the bases. In my case, there are 6254
Adenosine, 4957 Thymine, 4245 Cytosine, and 3534 Guanine.
In order to find the frequencies, all I would do is divide each base
count by the total.
However, here is my problem. Instead of doing it globally. I need to
be able to specify an arbitrary start and stop position to start
tallying the occurences of the bases and also to find the frequencies
of the bases in that particularly defined area.
Herein lies my problem. I have not a clue how to go about doing this
task. I've been looking online for a solution but haven't really found
one. If anyone can suggest an idea or function(s) I could use to go
about doing this task. I would be much obliged. Thanks!
~Michael