need advice... accessing a huge collection


GrelEns

hello,

I have almost 1,000 tar.gz files in different directories (I cannot change
that), and these archives contain over 1,000,000 text files altogether. I
would like to build a tool that can access any of these text files, or any
sub-collection of them, as quickly as possible, in order to serve them over
HTTP on user request.

Does anyone have ideas on a good way to do this?

(I was thinking of a dictionary whose keys are the filenames and whose
values are the paths to the archives containing them, and of extracting all
the requested files from the same archive in one pass; see the sketch
below.)
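
Roughly like this (untested; ARCHIVE_ROOT is just a placeholder for
wherever the directories actually live):

import os
import tarfile

ARCHIVE_ROOT = "/data/archives"  # placeholder; the real layout differs

def build_index(root):
    """Map each member filename to the tar.gz archive containing it."""
    index = {}
    for dirpath, dirnames, filenames in os.walk(root):
        for name in filenames:
            if name.endswith(".tar.gz"):
                archive = os.path.join(dirpath, name)
                with tarfile.open(archive, "r:gz") as tf:
                    for member in tf.getnames():
                        # note: a filename present in two archives
                        # keeps only the last archive seen
                        index[member] = archive
    return index

def fetch(index, wanted):
    """Group requested filenames by archive so each tar.gz is opened once."""
    by_archive = {}
    for fname in wanted:
        by_archive.setdefault(index[fname], []).append(fname)
    contents = {}
    for archive, names in by_archive.items():
        with tarfile.open(archive, "r:gz") as tf:
            for fname in names:
                contents[fname] = tf.extractfile(fname).read()
    return contents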

I was also wondering which would be fastest:
- on each user request, rebuilding the dictionary by reading key/value
pairs from a file,
- or, on the first request, generating a hard-coded Python dictionary as a
module and then importing it,
- or maybe something else (storing the mapping in a database...)? (a rough
sketch of one build-once-then-reload variant follows this list)
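
For concreteness, a rough sketch of the build-once-and-reload idea using
pickle (pickle and the INDEX_FILE location are my own assumptions, not a
benchmark):

import os
import pickle

INDEX_FILE = "index.pkl"  # assumed cache location

def load_index(root):
    """Load a previously saved index, or build and save it once."""
    if os.path.exists(INDEX_FILE):
        with open(INDEX_FILE, "rb") as f:
            return pickle.load(f)
    index = build_index(root)  # from the sketch above
    with open(INDEX_FILE, "wb") as f:
        pickle.dump(index, f)
    return index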

thanx
 

Paul Rubin

GrelEns said:
I was also wondering which would be fastest:
- on each user request, rebuilding the dictionary by reading key/value
pairs from a file,
- or, on the first request, generating a hard-coded Python dictionary as a
module and then importing it,
- or maybe something else (storing the mapping in a database...)?

If the tar files are static (not being updated), the simplest thing is to
use dbm to store the dictionary.
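
Something along these lines (the entry shown is made up; dbm stores keys
and values as byte strings, hence the decode on the way out):

import dbm

# One-time build; `index` is the filename -> archive-path mapping
# described above (a single made-up entry stands in for it here).
index = {"doc0001.txt": "/data/archives/a/batch01.tar.gz"}

with dbm.open("fileindex", "c") as db:
    for fname, archive in index.items():
        db[fname] = archive

# Per-request lookup: opening the dbm file is cheap, and only the
# requested key is read from disk; the full dictionary is never
# rebuilt in memory.
with dbm.open("fileindex", "r") as db:
    archive = db["doc0001.txt"].decode()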
 
