Slashdot Journal Grabber
From LeoWiki
This utility will scrape the journal of any slashdotter and save the entries as text files in the current working directory. The graphical version also gives you a convenient HTML summary of the journal that can be used as an index.
It uses the wxWidgets for Python and runs as is on Windows, Linux and Mac OS X.
Downloads
Windows Executable
- Grabber.zip - 1.1.0
Source Code
- Grabber.py - console app
- WinGrabber.py - cross-platform GUI
- COPYING - GPL
- Setup.py - py2exe compilation script
You'll need the wx Python modules to run these. Check your package manager or wxPython.org.
Instructions
Usage
WinGrabber presents a standard graphical interface.
Grabber is a console program invoked with:
Grabber [username]
Important Windows Note
The program saves the text files to the active folder. If you create Start Menu shortcuts, be sure to set the Start In path to something other than blank. If the Start In path is blank, Windows will use the shortcut's location, and you don't want your journal entries to be saved to the Start Menu.
Macintosh Setup and Usage
To run grabber on Mac OSX Tiger (note: some of this may not be necessary. This is what I did, and it works):
- Install a copy of Python. Tiger includes Python, but I installed the newest version from undefined.org'. The only tested version is MacPython-OSX-2.4.1-1.dmg.
- Install a copy of TigerPython24Fix. Click on the link from the undefined.org page or go to the pythonmac.org pages directly.
- Install a copy of the wxPython runtime libraries from their page. Only tested version is osx-unicode-py2.4
- Download the Grabber.py and (if desired) WinGrabber.py code, and place them in the same folder, wherever you want them to live.
- From the shell of your choice, run:
/usr/local/bin/pythonw "pathtowingrabber/WinGrabber.py"or right click on WinGrabber.py icon and select "Open With..." and then "PythonLauncher".
Changelog
1.1.0
- updated to work with Slashdot's new layout
- ignores some more ugly and pointless stuff (NOBR, WBR)
1.0.0
- substantial re-engineering of internals
- replies to journal entries are now grabbed
- better handling of line breaks
0.9.6
- filenames now include the journal entry's title
0.9.5
- toolbar for saving and copying the index summary
- formal GPL license
- code clean-up
0.9.1
- much larger summary limit (2 097 152 characters)
0.9.0
- first public release
Interesting Links
- Python Tutorial
- Python Module Index
- Python 2.4 QuickRef
- wxPython
- wxPython wiki
- py2exe
- jimmac GPLed icons
Legalese
This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; version 2 of the License.
This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.
You should have received a copy of the GNU General Public License along with this program; if not, write to the Free Software Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA
