Slashdot Journal Grabber

From LeoWiki

This utility will scrape the journal of any slashdotter and save the entries as text files in the current working directory. The graphical version also gives you a convenient HTML summary of the journal that can be used as an index.

It uses wxPython (the wxWidgets bindings for Python) and runs as-is on Windows, Linux, and Mac OS X.

[Screenshot: 0.96 on Mac]
[Screenshot: 0.95 on Windows]
[Screenshot: 0.91 on Linux]

Downloads

Windows Executable

Source Code

You'll need the wxPython modules to run these. Check your package manager or wxPython.org.

Instructions

Usage

WinGrabber presents a standard graphical interface.

Grabber is a console program invoked with:

Grabber [username]

Important Windows Note

The program saves the text files to the active folder. If you create Start Menu shortcuts, be sure to set the Start In path to something other than blank. If the Start In path is blank, Windows will use the shortcut's location, and you don't want your journal entries to be saved to the Start Menu.
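As a rough illustration of why the Start In path matters: a Python program that writes files under relative names puts them in whatever the current working directory is at launch. The sketch below is not taken from Grabber; `save_entry` and its `out_dir` parameter are hypothetical names used only to show the behaviour.

```python
import os
import tempfile


def save_entry(filename, text, out_dir=None):
    """Write text to filename; hypothetical helper, not Grabber's actual code.

    With out_dir=None the file lands in the current working directory,
    which is what the Start In path of a Windows shortcut controls.
    """
    target_dir = out_dir if out_dir is not None else os.getcwd()
    path = os.path.join(target_dir, filename)
    with open(path, "w", encoding="utf-8") as handle:
        handle.write(text)
    return path


if __name__ == "__main__":
    # A throwaway directory stands in for the shortcut's Start In path.
    with tempfile.TemporaryDirectory() as start_in:
        previous = os.getcwd()
        os.chdir(start_in)
        try:
            saved = save_entry("entry.txt", "Hello, journal.")
            print("saved to:", saved)
        finally:
            # Leave the directory before it is cleaned up.
            os.chdir(previous)
```

Passing an explicit `out_dir` is one way a program could sidestep the problem entirely, but with the behaviour described above the simplest fix remains setting Start In on the shortcut.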

Macintosh Setup and Usage

To run Grabber on Mac OS X Tiger (note: some of these steps may not be necessary; this is what I did, and it works):

  1. Install a copy of Python. Tiger includes Python, but I installed the newest version from undefined.org. The only tested version is MacPython-OSX-2.4.1-1.dmg.
  2. Install a copy of TigerPython24Fix. Click on the link from the undefined.org page or go to the pythonmac.org pages directly.
  3. Install a copy of the wxPython runtime libraries from their page. The only tested version is osx-unicode-py2.4.
  4. Download Grabber.py and (if desired) WinGrabber.py, and place them in the same folder, wherever you want them to live.
  5. From the shell of your choice, run: /usr/local/bin/pythonw "pathtowingrabber/WinGrabber.py" or right-click the WinGrabber.py icon and select "Open With..." and then "PythonLauncher".

Changelog

1.1.0

  • updated to work with Slashdot's new layout
  • ignores some more ugly and pointless stuff (NOBR, WBR)
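
The NOBR and WBR handling mentioned above could be approached in several ways. The sketch below is an assumption about one possible approach, not the code Grabber uses: `strip_layout_tags` is a hypothetical name, and the regular expression removes only the tags themselves while keeping the text between them.

```python
import re

# Matches opening or closing NOBR/WBR tags in any letter case,
# including any attributes or a trailing slash.
_LAYOUT_TAG = re.compile(r"</?\s*(?:nobr|wbr)\b[^>]*>", re.IGNORECASE)


def strip_layout_tags(html):
    """Remove <nobr>/<wbr> tags but keep the text they wrap (hypothetical helper)."""
    return _LAYOUT_TAG.sub("", html)


if __name__ == "__main__":
    print(strip_layout_tags("<NOBR>very<WBR>long</NOBR> word"))
```

A regular expression is adequate here only because these tags carry no content of their own; anything more structural would be better served by a real HTML parser.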

1.0.0

  • substantial re-engineering of internals
  • replies to journal entries are now grabbed
  • better handling of line breaks

0.9.6

  • filenames now include the journal entry's title
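
Putting a title into a filename needs some care, since titles can contain characters that Windows does not allow in filenames. The following is a minimal sketch under assumptions: `entry_filename`, the entry-id prefix, and the underscore replacement are all hypothetical and are not taken from Grabber's source.

```python
import re

# Characters that Windows does not allow in filenames, plus control characters.
_UNSAFE = re.compile(r'[<>:"/\\|?*\x00-\x1f]')


def entry_filename(entry_id, title, max_title_len=60):
    """Return a filesystem-friendly name built from an id and a title.

    Hypothetical helper: the naming pattern is an illustration only.
    """
    cleaned = _UNSAFE.sub("_", title)
    # Collapse runs of whitespace and trim spaces/dots from both ends.
    cleaned = re.sub(r"\s+", " ", cleaned).strip(" .")
    # Truncate long titles; fall back to a placeholder if nothing is left.
    cleaned = cleaned[:max_title_len].rstrip(" .") or "untitled"
    return f"{entry_id} - {cleaned}.txt"


if __name__ == "__main__":
    print(entry_filename(12345, 'Why "foo/bar" matters?'))
```

Keeping a numeric prefix in the name is a design choice of this sketch: it keeps files unique even when two entries share a title.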

0.9.5

  • toolbar for saving and copying the index summary
  • formal GPL license
  • code clean-up

0.9.1

  • much larger summary limit (2 097 152 characters)

0.9.0

  • first public release

Legalese

This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; version 2 of the License.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

You should have received a copy of the GNU General Public License along with this program; if not, write to the Free Software Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA
