Acmlm's Board - I2 Archive - Hardware/Software - Can anything else open .mht files?
User | Post |
HyperLamer
Posts: 3314/8210 |
...How did I miss that? |
Xkeeper
Posts: -2518/-863 |
Originally posted by FreeDOS Well, if it's a base64 encoding: http://www.fourmilab.ch/webtools/base64/
Look up, Hyperhacker
Just open it in Notepad, remove all the excess junk (save the HTML only; I don't know what the inside looks like) and open it again. |
HyperLamer
Posts: 3299/8210 |
Sounds fun. Got a base64 decoder handy, or do I have to go find/make one?
Originally posted by BMF54123 Whoever backed up the memory locations thread in the old SMW Hacking forum (before the big KERSPLAT) used .MHT, and I can only watch in horror as IE eats up all my available RAM every time I open the files...
That's precisely the problem. Having to view it in IE is teh suck for many reasons (no instant search, can't put it in a Mozilla tab, unnecessary resource usage etc), and having it split into 3 files is even worse, especially when IE opens whatever I type in the URL bar in a new window. |
FreeDOS
Posts: 1121/1657 |
I got a chance now to save the board index in IE as a .mht file. Looking at it in Notepad, it is saved in the same format that email would be in (which would explain Outlook (Express)'s ability to view them). Yeah, all the binary (maybe non-binary, too?) files are in base64. You should be able to remove the headers and change the page source minimally to get a "real" HTML file. Then you copy and paste the base64 things into their own files and run the base64 program on them.
One interesting note is that even IE6 writes this line: From: <Saved by Microsoft Internet Explorer 5> (If you can't tell, "From:" in email is the person who sent it. The less-than and greater-than signs are commonly used as a non-standard way to get the person's name). |
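The manual steps described above (strip the mail headers, split out the parts, decode the base64 ones) can be sketched with Python's standard `email` module. This is only a rough sketch, assuming the .mht is an ordinary MIME message as the post suggests; the function name `extract_mht` and the numbered output file names are made up for illustration.

```python
import email
import re
from pathlib import Path


def extract_mht(mht_bytes, out_dir):
    """Write every leaf MIME part of an .mht file into out_dir.

    Returns the list of file names written, in order.
    """
    msg = email.message_from_bytes(mht_bytes)
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    written = []
    for index, part in enumerate(msg.walk()):
        if part.is_multipart():
            continue  # containers only hold other parts
        # decode=True undoes base64 / quoted-printable transfer encoding.
        data = part.get_payload(decode=True) or b""
        # Content-Location normally holds the original URL; use its last
        # path segment as the file name (hypothetical naming scheme).
        location = part.get("Content-Location", "")
        name = location.rstrip("/").rsplit("/", 1)[-1].split("?", 1)[0]
        name = re.sub(r"[^A-Za-z0-9._-]", "_", name) or "part"
        target = out / f"{index:03d}_{name}"
        target.write_bytes(data)
        written.append(target.name)
    return written
```

Something like `extract_mht(Path("page.mht").read_bytes(), "page_files")` would then dump the HTML and images side by side (both names are placeholders). The image links inside the saved HTML still point at the original URLs, so the page source needs the minimal editing mentioned above before it displays offline.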
BMF98567
Posts: 653/1261 |
Originally posted by Xkeeper Anyway, what do you want with .mht files? Wouldn't it be easier just to hit HTML Only (*.htm, *.html), or just save the page with images instead of screwing with this format?
My guess is someone else saved the file in .MHT format, not him. Whoever backed up the memory locations thread in the old SMW Hacking forum (before the big KERSPLAT) used .MHT, and I can only watch in horror as IE eats up all my available RAM every time I open the files... |
FreeDOS
Posts: 1119/1657 |
Well, if it's a base64 encoding: http://www.fourmilab.ch/webtools/base64/
It works. I've used it a couple of times when someone sends me a raw email file and wants me to check the attachment (which is encoded in base64) for bad things. I can't guarantee that it will work on non-*nix. I've never tried nor had the need. |
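For anyone without the fourmilab tool, decoding a pasted base64 block takes only a few lines of Python's standard `base64` module. A minimal sketch, assuming the block was copied out of the file as text; the helper name `decode_base64_text` is invented for this example.

```python
import base64


def decode_base64_text(text):
    """Decode a pasted base64 block, ignoring line breaks and spaces."""
    # Mail-style base64 is wrapped over many lines; join it back together.
    compact = "".join(text.split())
    # validate=True raises binascii.Error if stray non-base64 characters
    # (for example a leftover header line) were pasted along with the data.
    return base64.b64decode(compact, validate=True)
```

Writing the returned bytes to a file opened in binary mode gives back the original image or attachment.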
Xkeeper
Posts: -2532/-863 |
Originally posted by Acmlm "HTML only" is like doing "view source" and saving from there ...
MHT isn't the same thing at all, it's simply the HTML page formatted like an email message, with all images as base64 encoded attachments, included in the .mht (load a .mht in a text editor, you'll see) ... if you load it in Internet Explorer and view the source, you'll get the HTML back (although slightly different) and even the links to the images
"Full web page" saves the HTML and all images (in a folder), and changes the image links to point to the saved copy ...
Originally posted by dan From looking at an MHT file in Notepad, it seems to be similar in format to a saved email. Testing this out, I renamed the MHT to .EML, and found that Outlook Express could open the page as an email. Perhaps you could look for some kind of EML extractor utility, if MHT extractor searches turn up nothing. (Which they did for me)
Failure to read entire thread... Go figure.
Anyway, what do you want with .mht files? Wouldn't it be easier just to hit HTML Only (*.htm, *.html), or just save the page with images instead of screwing with this format?
|
dan
Posts: 427/782 |
From looking at an MHT file in Notepad, it seems to be similar in format to a saved email. Testing this out, I renamed the MHT to .EML, and found that Outlook Express could open the page as an email. Perhaps you could look for some kind of EML extractor utility, if MHT extractor searches turn up nothing. (Which they did for me) |
HyperLamer
Posts: 3289/8210 |
As I understand it, all the images, CSS etc. are saved in the file itself. A decent idea, but a pain in the back end when IE is the only thing that can open it and it refuses to actually do anything with it. |
Acmlm
Posts: 1054/1173 |
"HTML only" is like doing "view source" and saving from there ...
MHT isn't the same thing at all, it's simply the HTML page formatted like an email message, with all images as base64 encoded attachments, included in the .mht (load a .mht in a text editor, you'll see) ... if you load it in Internet Explorer and view the source, you'll get the HTML back (although slightly different) and even the links to the images
"Full web page" saves the HTML and all images (in a folder), and changes the image links to point to the saved copy ... |
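The email-like layout described above can be inspected without a text editor. The sketch below uses Python's standard `email` module to list each part's content type, transfer encoding and original location; it assumes the file parses as a normal MIME message, and `list_mht_parts` is a name invented for this example.

```python
import email


def list_mht_parts(mht_bytes):
    """Return (content type, transfer encoding, location) per leaf part."""
    msg = email.message_from_bytes(mht_bytes)
    rows = []
    for part in msg.walk():
        if part.is_multipart():
            continue  # skip the multipart container itself
        rows.append((
            part.get_content_type(),
            # A missing header is treated as the MIME default, 7bit.
            part.get("Content-Transfer-Encoding", "7bit").strip().lower(),
            part.get("Content-Location", "").strip(),
        ))
    return rows
```

On a page saved from IE, one would expect a text/html row for the page itself followed by base64 rows for the images, which matches what shows up when the file is loaded in a text editor.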
neotransotaku
Posts: 2205/4016 |
An MHT HTML file is one where external links still point to the server you got them from. Unless you save the complete web page, you technically do not get the images downloaded ... |
Prier
Posts: 5283/8392 |
IE has the option of saving it in HTML only format, which is what an MHT is.
Not sure if renaming the extension to HTML would do any good either, I never saved in MHT. |
Karadur
Posts: 779/1192 |
This is a long shot, but I did a search for 'open mht files' on Google and came across this post on another board, which says Outlook Express apparently needs to be installed for .mht files to open properly.
However, by the sound of your post, it seems like you're looking more for a .mht to .html converter, and if that's the case, I don't know where one could be found. |
FreeDOS
Posts: 1116/1657 |
So... what is a .mht file? From what you're saying, it sounds like it's the way IE saves pages, but it's not. IE inserts some random character encoding line to break it on other OSes, but that's all. |
HyperLamer
Posts: 3282/8210 |
Since after about 20 minutes of trying to save them as a normal page instead of this weird format, it's become painfully obvious that IE is the worst piece of crap software ever invented, is there something else that can save these as a normal HTML file and extract all the images and such? (You know, like the way web pages should be saved.)
|