R
Ross
Dear All,
For a file with many records like follows, I would like to remove each
record, if any, containing "XXXXX and the ATCCAAT... follows". If the
sequence is a single line, i can just simply use
if (line =~ ^>.*)
if (line =~ (.*)X(.*) )
newline = $1;
anybody has good idea to solve the problem? thanks in advance
GTCACATATGATGATATCTGAGCTTATTTTTAACTTCCGAACCACTATAC
TGTTAAAACTCATTACAAGACACCGCCAAGGGTGGTAATGGTACTGGGTG
CACCATAGTACCTAGGGTAGATACCATATCTAGATGGCACGTTAAAAGCC
AATAGAGCTTGAGCTTGAGCCAGATTCCGATCAAAGTAGAGATCACCAAA
CTGCTGGAGTTGTAGCTGCTGCGCTATGGCCTGAACAATGTTAATGTCCT
GATAGTGAGATTGTTGCGCCACCAGCGCGAGATGTTGCCAGACTTGGTTG
TTTCTCAGTTGAAACGCAGCTGATTGCAAGAAGGGGCTTGCCGCTATGCC
ATACTGCTGCCTTACGAACTCATTATATGGGCTAAGCACCTGTTGCTGTA
GCAGGACAGGCGACTGCAGCTGATATTGCCTATAACTTTGACCTAAAACA
TCAAACTGCGCAGAGGCGCTGCATGCAGCAATAGCAAGGAGAGCAAAGAC
GAAAATGATCTTCATTGCTGCGGGACACTANATCTTTCTATTTTTCTGTA
TAATGCTTGAACTGTGTGAACGATCXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCTCTTCAAT
CTCGGGAANNNNNTGTNGGGGTGTTGGGAAATCCCCCCCTTGTTGGGGTT
TTTCTTGGTTAAACACAAGTGTCCCTTCTCTTTAAAAAAAACCCCTTTTC
CTGTTGGGGGGGTNNTTTTTTTTTTTTTCTTTTTTTTTTTTTTNTTTTTT
TCCTTTTTTTTTTTTTTTTTTTTTTCTTTTTGTTCCTTTCTTGTTTCTGT
TTCTCTTTTTTTTTTTTTTTTTTTTTTTGTTTTCTTTTTTTTTTTTTCTG
For a file with many records like follows, I would like to remove each
record, if any, containing "XXXXX and the ATCCAAT... follows". If the
sequence is a single line, i can just simply use
if (line =~ ^>.*)
if (line =~ (.*)X(.*) )
newline = $1;
anybody has good idea to solve the problem? thanks in advance
GTTGCTATGAAAGCACTTTATTTCTATTTATATCACCCAAAGTTTCACAT9P01P10A.y putative DD1A protein [Oryza sativa (japonica
cultivar-group)] HSP:949 GAACAATTAGTATAAACTTTAGTTGAATCTCGTTACTATATTAGCTTCGG
AGCTCAATTACAAACAGCTAGCAAAAAATGCCAGGTCCCCCATAAAAGAA
ACCATCATGTTCATAATCAGACACACGGTAGCAATTTGATATATATCCGA
GAGCAGAATTGATTTGATGGGTGTTGCCGCCTGCATCAAAAAACTTGACG
CCACTAAATGATGCAGCGTTTTTGATTGGAGCATTCCCACTGCCCATCGG
AGGACTTGGTTTATGTCCCTTTTTCAAGGCATAGCCACCAAACATTATTG
TCACTGGTTTATTTGACAAGCTTGTAAACAGAGATCTTGGATAGTAACCT
GTAAGTCTGGCTTCACCATTAAACCCATAGTAGACTTGCCAATCACCAGA
AATTTGATCCTTGGATACTCTGACTGTAGTGTATCGTTTGTCGCTAGAGG
TGGTGGAAACAGGGTTAATCACCATTCCTGGAACGATTTCTGAGCTAAAT
ACACTTTCGAATCCAGGACAACGCATATCAGGACAGGCATTAGATCCTTG
AGTAAACCAAGTACTGAAGTGCGTCTGTGAATCATTGTATGATTCAGGCT
CAATATTCCATCCAGCTATAACATTATTTATGGCGGATGCTTCATCCTTA
TTATAAATCGAAATGAAACCTCCTGTTTGTTGTCCATGCTCTAGATTAAA
GCATAAACATCCATGGTGGCCTCTACTCCATAATACGTTATAGCATTATC
TGAAGGACCCCATCCATATACTGCAAGATACAACGTGCCAGCTTGATTTG
ATTCATGACCCGACGAAGATAAATTCACATCAAGTATAAGGGGCATCAAT
GTTTGCCATTTCTTTTGGACCTCTTCTTCCATAGGAGGGAAGCAACTCCT
ACTGTTTGACTACCAAGGAACACACACAGAGCAGTGCAGATTGATTAAAA
ATTTCTCCATATTATATTTGGGGATGGAGAGGGTATATGTTTGAGTTCCC
CGGCGTTAGGCCGATTTCCGGGTACACAAAATGCGGGCTTCCGAGAAAAA
AAATTCCCCCAACCTTGGATTTGTTTTTTTTTTTCTCTTCTTCTTCTACT
CTATTTTTATTTCTTGTGTTTGTTTCTGTACTTTTCTTGTTGTTTTTTGT
GTGTTCTTTTTGTTGTGTTTGTTTTTTTTCTTTTCTTTTTGTTTTTATGT
ATCTATCCTTTCTTATTGTTTGTATTTTTTTTTTGTTATTTTTGTATGTT
TTCTTTGTTGTGTTATTTTTTTGGTTTTCTTTTTTTGTTTTTATCACTTT
CTCTTTGTATTGAGTGCTTTTCTTGTTTTTATTTTGTTGATTCTTTTGTC
TTGTCTCTGTCTTTTTTTTCCGTATATGCTTTGTTTGTTTCTTATCCTTT
GCTTG
9P01P10B.y prolamin precursor (clone pX24) - rice emb|CAA37850.1|
prolamin [Oryza sativa (japonica cultivar-group)] HSP:418
GTCACATATGATGATATCTGAGCTTATTTTTAACTTCCGAACCACTATAC
TGTTAAAACTCATTACAAGACACCGCCAAGGGTGGTAATGGTACTGGGTG
CACCATAGTACCTAGGGTAGATACCATATCTAGATGGCACGTTAAAAGCC
AATAGAGCTTGAGCTTGAGCCAGATTCCGATCAAAGTAGAGATCACCAAA
CTGCTGGAGTTGTAGCTGCTGCGCTATGGCCTGAACAATGTTAATGTCCT
GATAGTGAGATTGTTGCGCCACCAGCGCGAGATGTTGCCAGACTTGGTTG
TTTCTCAGTTGAAACGCAGCTGATTGCAAGAAGGGGCTTGCCGCTATGCC
ATACTGCTGCCTTACGAACTCATTATATGGGCTAAGCACCTGTTGCTGTA
GCAGGACAGGCGACTGCAGCTGATATTGCCTATAACTTTGACCTAAAACA
TCAAACTGCGCAGAGGCGCTGCATGCAGCAATAGCAAGGAGAGCAAAGAC
GAAAATGATCTTCATTGCTGCGGGACACTANATCTTTCTATTTTTCTGTA
TAATGCTTGAACTGTGTGAACGATCXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCTCTTCAAT
CTCGGGAANNNNNTGTNGGGGTGTTGGGAAATCCCCCCCTTGTTGGGGTT
TTTCTTGGTTAAACACAAGTGTCCCTTCTCTTTAAAAAAAACCCCTTTTC
CTGTTGGGGGGGTNNTTTTTTTTTTTTTCTTTTTTTTTTTTTTNTTTTTT
TCCTTTTTTTTTTTTTTTTTTTTTTCTTTTTGTTCCTTTCTTGTTTCTGT
TTCTCTTTTTTTTTTTTTTTTTTTTTTTGTTTTCTTTTTTTTTTTTTCTG