HTM2TXT v 1.0, Sep.21,1996 by Otto R„der Description: HTM2TXT.CMD is a REXX script which extracts HTML-tags from .HTML-files used in Internet www-communication and stores the remaining text into an ASCII-file. Group: HTM2TXT belongs to group: os2 - textutil Freeware: HTM2TXT may be distributed freely under the following conditions. Copyright notices must NOT be removed, all files contained in the file inventory below must be distributed together (you may not remove any files), and you may not charge for the program. If you find the program useful then send a post-card (picture of the location where you live) to: Otto R„der Hauptstrasse 61B/13 A3001 Mauerbach --------------- Austria Prerequisites: HTM2TXT requires OS/2 and REXX. It has been developed and tested under OS/2 Warp, there is no intention to move it to other platforms. Distribution: The following files are contained in HTM2TXT1.ZIP: HTM2TXT.CMD the REXX command-file 1996-09-11 HTM2TXT.ICO an icon file contributed by Gerard Pinkas, pinkas@en.com MAKEOBJ.CMD a command to create a desktop program object README.TXT documentation, this file FILE_ID.DIZ Id-file Installation: To install HTM2TXT just UNZIP the HTM2TXT1.ZIP file and place the command into a directory contained in your CONFIG.SYS PATH= statement. You may use the MAKEOBJ.CMD to create a desktop oject for HTM2TXT.CMD. You should run MAKEOBJ.CMD from that directory where HTM2TXT.CMD and HTM2TXT.ICO are installed. Usage: From an OS/2 command line start HTM2TXT: htm2txt filename.htm Make sure filename.htm is in the current directory. or Drag and drop a .HTML object to the HTM2TXT object if you have created one with makeobj.cmd. HTM2TXT will create an output file 'filename.txt' and it will start an editor to view this file. Note: HTM2TXT will follow then the column width in tables is determined by: nnn/pixlbyt. line 39: editor='e' the name of an ASCII editor to display result file. It may be changed to the installations favoured editor. editor='' causes no editor to be called line 43: consts= this is a table of variables to substitute special characters. I could not find an official table in any web-site, so this table is a collection of variables i have seen up to today. You may temporarily overwrite these parameters by adding options when starting htm2txt from an os/2 command line: htm2txt filename.html l 80 to set linemax to 80 characters htm2txt filename.html p 8 to set pixlbyt to 8 pixels/char htm2txt filename.html e te to set editor to tinyedit htm2txt filename.html u to include url-references in .txt These options may appear in any order after the filename: htm2txt filename.html e te p 8 l 80 u Warranty: The program is distributed on an as-is basis. It tries to extract as much text as possible, however, i am sure, there are some special forms of tags which i missed. Normally such tags are simply ignored. There is no guarantee to get certain results nor is any guarantee to avoid damages of existing files. Note: In the current directory the program will overwrite any file with filename of input-file and a file extension of .TXT, eg.: filename.TXT ! Comments: Comments and recommendations pls to: oraeder@ibm.net