Efficiently Read File Headers

Thread starter ts8807385
Start date Mar 17, 2009

ts8807385

Mar 17, 2009

Hey guys,

I have a process that discards files based on their file header.
This works fine, but when I have a lot of files (millions) it's slow.
What I do now is open each file and push the first ten bytes into a
vector I call 'header_bytes'. I basically call fd.get() ten times,
incrementing an int counter and calling push_back on the vector each
time.
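
For illustration, the read step described above could be done with a single read() call into a fixed-size buffer instead of ten get() calls and ten push_back calls. This is only a minimal sketch: the names read_header and kHeaderSize are made up for the example, and it assumes fd is a std::ifstream opened in binary mode.

```cpp
#include <array>
#include <cstddef>
#include <fstream>
#include <string>

// Number of leading bytes to inspect (ten, as in the post).
constexpr std::size_t kHeaderSize = 10;

// Hypothetical helper: reads up to kHeaderSize bytes from the start of
// `path` into `out` with one read() call.
// Returns the number of bytes actually read (0 if the file cannot be opened).
std::size_t read_header(const std::string& path,
                        std::array<unsigned char, kHeaderSize>& out) {
    std::ifstream fd(path, std::ios::binary);
    if (!fd) {
        return 0;
    }
    fd.read(reinterpret_cast<char*>(out.data()),
            static_cast<std::streamsize>(out.size()));
    // gcount() reports how many bytes the last unformatted read extracted,
    // even when the file is shorter than kHeaderSize.
    return static_cast<std::size_t>(fd.gcount());
}
```

The fixed-size std::array avoids a heap allocation per file. With millions of files, the cost of opening each file probably dominates, so this change alone may not help much.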

I then have a bunch of if statements that look similar to the code
below, covering about 12 common file headers (JPEG, PNG, WAV, RIFF,
etc.) that I want to exclude from further processing:

if (byte1 == 10 and byte2 == 14 and byte3 == 12)
    return false;
else if ()
    return false;
else if ()
    return false;
else
    // process the file further
    return true;
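
The same kind of check can also be written as a loop over a table of signatures rather than a chain of if statements. The sketch below is an assumption-laden example, not the code from this post: the names Signature, kExcluded and should_process are invented, and the table lists only three well-known magic numbers (JPEG, PNG, RIFF) rather than the full set of about 12.

```cpp
#include <cstddef>
#include <cstring>

// One excluded file signature: a pointer to its leading bytes and their count.
struct Signature {
    const unsigned char* bytes;
    std::size_t length;
};

// Well-known magic numbers. WAV files are RIFF containers, so the RIFF
// entry covers them as well.
constexpr unsigned char kJpeg[] = {0xFF, 0xD8, 0xFF};
constexpr unsigned char kPng[]  = {0x89, 0x50, 0x4E, 0x47, 0x0D, 0x0A, 0x1A, 0x0A};
constexpr unsigned char kRiff[] = {'R', 'I', 'F', 'F'};

// Hypothetical exclusion table; extend it with the remaining formats.
constexpr Signature kExcluded[] = {
    {kJpeg, sizeof kJpeg},
    {kPng,  sizeof kPng},
    {kRiff, sizeof kRiff},
};

// Returns false if `header` starts with any excluded signature,
// true if the file should be processed further.
bool should_process(const unsigned char* header, std::size_t header_len) {
    for (const Signature& sig : kExcluded) {
        if (header_len >= sig.length &&
            std::memcmp(header, sig.bytes, sig.length) == 0) {
            return false;
        }
    }
    return true;
}
```

A table makes adding a format a one-line change. Comparing a handful of bytes is cheap either way, so any speedup from this alone is likely to be small compared with the per-file I/O.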

As I said, this works fine. When I only have to process a few thousand
files, I'm done quickly. How can I speed it up for millions of files?

Thanks,
Tom
