data mining

vmishra85 · Jun 21, 2006

I have to find the common similar subsequence among a large
number (>21,000) of input sequences; it is not necessarily present in
all of the input sequences. The subsequence need not be an exact match:
it can vary by up to 20%.

Can anybody help design an algorithm for the above problem?
Take it as a challenge.

Vladimir Oka · Jun 21, 2006

This is a question more suited to comp.programming; here you're only
likely to get advice once you try to implement it in C, and experience
C language problems.

PS
I feel something biotechy about it (looking for a gene in different DNA
strands). Maybe there lies an alternative path to the answer. Are there
any Usenet groups dedicated to it?

pemo · Jun 21, 2006

Vladimir said:
This is a question more suited to comp.programming; here you're only
likely to get advice once you try to implement it in C, and experience
C language problems.

PS
I feel something biotechy about it (looking for a gene in different
DNA strands). Maybe there lies an alternative path to the answer. Are
there any Usenet groups dedicated to it?

My missus is a population geneticist, and yes, it sounds like the sort of
thing she does with DNA sequences in C++. When casually asked about this,
she replied 'Needleman-Wunsch'. I didn't seek clarification; it usually
does my head in if I do.

osmium · Jun 21, 2006

There is a newsgroup devoted to the specialty of genetic algorithms:
comp.ai.genetic. But I think your chances of getting any detailed help,
even there, are well under 1%.

Juuso Hukkanen · Jun 23, 2006

Many C data-mining code examples are shown in
http://www.cosc.canterbury.ac.nz/tad.takaoka/manuscript3.pdf

Juuso Hukkanen
(to reply by e-mail set addresses month and year to correct)

Your Uncle · Jun 29, 2006

Juuso Hukkanen said:
Many C data-mining code examples are shown in
http://www.cosc.canterbury.ac.nz/tad.takaoka/manuscript3.pdf

I'd love to hear anything you have to say about this topic. I believe it
has forensic dimensions. bfx
