search source code project

C

cmk128

Hi
Here is my project:

1) Apache + tomcat + mysql (have to use JSP, no php)
2) 3 servers : Linux FC6 , 2GB ram , 2.8Ghz
3) Goal : to build a website to let people search the source code from
over 1 million files.

It is unable to use mysql to build the index from the source code
because people can search "int x=0" rather than english word, so all
the indexing server doesn't word. I guess every time the user submit a
search, i use grep command to search all the 1 million files. But it
is too slow, how can i improve it?

thanks
from Peter ([email protected])
 
C

cmk128

Forgot to say, in my xeon 3Ghz server, i need 10 seconds to grep 20000
files. If i need to grep 1 million files, then i need at least 500
seconds, it is too slow.
thanks
from Peter
 
C

Chris Uppal

3) Goal : to build a website to let people search the source code from
over 1 million files.

Have you looked at OpenGrok ?

(It uses Lucene as its text search engine internally, but adds stuff for
specialising to searching and displaying code.)

-- chris
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Members online

No members online now.

Forum statistics

Threads
473,755
Messages
2,569,536
Members
45,011
Latest member
AjaUqq1950

Latest Threads

Top