After my article on saving web pages to a hard drive, Richard Fuller wrote to tell why he wanted to do it.
Me again! Just a note about saving web pages to the hard drive. I do this habitually with financial sites where, for instance, I wish to have a "virtual hard copy" of the confirmation page as proof of a transaction, just in case there’s a query at a later date. In that event I can simply e-mail the proof as an attachment.
By far the best way to do save pages, in my view, is by means of the excellent Firefox add-on Fireshot, which offers the option to save only the visible area or the entire page (excellent for those of us still struggling along with little old CRT monitors). The files can be saved in jpeg format, which I believe are much smaller files than html and can be conveniently viewed using Windows Picture and Fax Viewer.
I responded to Richard that documentation is exactly why I use ClickBook. Unlike other "print to PDF" pdf creators, ClickBook stacks up as many "prints" as you want in its queue, allows you to unselect ones you don’t really want, allows you to change the order (drag & drop) in the resulting PDF file " and ultimately puts them all into ONE pdf file. ClickBook does so much more that they barely mention its PDF functions.
Most pdf creators create a separate PDF with every print.
Another longtime subcriber Rohn gave some excellent comments via my web site:
Why save web pages?
Because the web is dynamic. Content comes and goes. Favorites are good, I’ve got 10,000+, but they are not permanent. So if it is really good, I like to keep a copy locally to make it easier to find in future.
However, I don’t like to do the simple “File/Save”. That saves HTML and graphics in separate locations (html file and images folder). If you are not careful, you can break the link between them. For that reason I prefer to use the option to save them in a "single file" (which maintains the hyperlinks):
– in FF File / Save As / File type: Web Page Complete
– in IE File / Save As / File type: Web Archive, single file (MHT)
But doing that saves a lot of extra "carp", mostly advertising that I don’t care about but also trivial graphics like twiter icons and graphical lines and bullets. Instead I copy the article text and paste it into a new blank file (usually in Word) which I save as HTML (Filtered) format (to maintain the hyperlinks). The resulting file is a fraction of the size of the file generated by the browser. If there are graphics in the article (most are "pretty" stuff that can be deleted) then I save the file as a (compressed) DOCX or simple DOC file format.
If it is part of a larger topic, say Office application tips or Windows tips, I prefer to paste them into "Tips Collection" documents that bring similar content together. This way all tips about a topic: Word Menu Word Ribbon Windows 7 etc, are in logical order in a single searchable file, with graphics as required.
Rohn also added, as another web site comment:
How dynamic is the web. Every couple of months I use a program called "AM Deadlinks" to review my favorites collection. Every time I find dozens of broken links that I end up deleting.
PS: I also use a FF addon called "Plain Old Favorites". It allows FF to tie into IE Favorites. That way I don’t have to worry about merging FF bookmarks and IE Favorites. As well, I have relocated my Favorites folder to a "Data" drive so that all of my userids (daily user, limited user and admin accounts) can access the same set of Favorites.
I like Favorites because each one is save as a Windows Shortcut FILE. This allows me to include my Favorites when I do file name searches. I use a tool called "Everything" because it provides MUCH faster name searches than windows.