Hi to Simon and Datamystic Crew,
I am testing this new pattern function in Split File filter. It seems to work OK with text files, but on binaries it fails, making thousands of 0 byte files, and I can't find logical pattern why this happens.
I am testing it for extraction of PNG file fragments from one big binary file. When I simulate PNG start and end strings in plain textual file it splits perfectly, but in binary results as I said above.
I can send you my test files to see by yourself...
Beta Feedback - Split File by Pattern
Moderators: DataMystic Support, Moderators, DataMystic Support, Moderators, DataMystic Support, Moderators
-
- Posts: 5
- Joined: Fri Jun 16, 2006 12:13 am
- Location: Varazdin, Croatia
- Contact:
Beta Feedback - Split File by Pattern
Last edited by Bernard Toplak on Fri Jun 16, 2006 4:52 pm, edited 1 time in total.
- DataMystic Support
- Site Admin
- Posts: 2227
- Joined: Mon Jun 30, 2003 12:32 pm
- Location: Melbourne, Australia
- Contact:
Hi Bernard,
Ahem! We actually assumed that no-one would use it one binary files, hence we coded this in two parts
- a search replace for the pattern, which inserts a null character
- a normal split on nulls.
You can workaround this by removing the nulls first, which sounds like it would be useful anyway.
Ahem! We actually assumed that no-one would use it one binary files, hence we coded this in two parts
- a search replace for the pattern, which inserts a null character
- a normal split on nulls.
You can workaround this by removing the nulls first, which sounds like it would be useful anyway.
-
- Posts: 5
- Joined: Fri Jun 16, 2006 12:13 am
- Location: Varazdin, Croatia
- Contact:
Well, with it's extreme pattern matching power TextPipe is ideal candidate for such jobs of splitting all kinds of files at pattern-principle.Simon Carter wrote:... we coded this in two parts
- a search replace for the pattern, which inserts a null character
- a normal split on nulls.
I'm afraid that none of above methods is good for bitmap data as those files use full range of hex chars, so no split-on-char could be performed.
Hmm, you mean removing \x00 chars? I'm not sure that is a good solution when bitmap data is on the table. If the split on pattern would work with bins just like on text ones, extracting pattern match and ignoring any other unmatching data, without any additional character insertion it would work perfectly!Simon Carter wrote:You can workaround this by removing the nulls first, which sounds like it would be useful anyway.
-
- Posts: 5
- Joined: Fri Jun 16, 2006 12:13 am
- Location: Varazdin, Croatia
- Contact:
Hi Simon & Crew!
I just wanted to ask if there will be any moves in this direction?
I have solved my task with one freeware tool that can identify more than 110 different file formats (and much more) in hex files. This can be very usefull for data forensic tasks like mine. If it is permited I can post info for other users that may need it because I see this is out of the TextPipe's main domain, right?
Best regards
Bernard
I just wanted to ask if there will be any moves in this direction?
I have solved my task with one freeware tool that can identify more than 110 different file formats (and much more) in hex files. This can be very usefull for data forensic tasks like mine. If it is permited I can post info for other users that may need it because I see this is out of the TextPipe's main domain, right?
Best regards
Bernard
- DataMystic Support
- Site Admin
- Posts: 2227
- Joined: Mon Jun 30, 2003 12:32 pm
- Location: Melbourne, Australia
- Contact:
-
- Posts: 5
- Joined: Fri Jun 16, 2006 12:13 am
- Location: Varazdin, Croatia
- Contact: