|STAT 5.4 DATA MANIPULATION & ANALYSIS PROGRAMS FOR UNIX and MSDOS |STAT is a set of about 30 data manipulation and analysis programs developed by Gary Perlman at the University of California, San Diego and at the Wang Institute. The manipulation programs are general utilities that work with other standard programs like sort. The analysis programs compute most widely used statistics. |STAT programs are designed with the philosophy that individual programs should be designed as tools that do one task well and produce output suitable for input via pipes to |STAT and other programs. Interactive use is supported in the command line interpreter/editor while batch files or shell scripts provide a programming language for complex analyses. Typical usage involves a pipeline of transformations of data followed by input to an analysis program, summarized schematically by: INPUT DATA | TRANSFORM | ANALYSIS | OUTPUT RESULTS Package Features: o simple input formats (free-format field-oriented) o flexible data manipulation o several simple lineprinter plotting options o data validation (range and type checking) o consistent option conventions with online help o compiles and runs on any UNIX System (V6, V7, 2.8 BSD, 4.x BSD, System V, ANSI, etc.) o runs on MSDOS/PCDOS 2.x, 3.x, 4.x with 96K (IBM, AT&T, Epson, and all compatibles) o usually less than a few seconds per analysis o liberal copyright (but can't be distributed for gain) o in use at hundreds of university, industry, government and research sites for over ten years Data Manipulation Programs: abut join data files beside each other colex column extraction/formatting dm conditional data extraction/transformation dsort multiple key data sorting filter linex line extraction maketrix make matrix format from free-format input perm random/numerical/alphabetical permutation probdist probability distribution functions ranksort convert data to ranks repeat repeat strings or lines in files reverse reverse lines, columns, or characters series generate an additive series of numbers transpose transpose matrix format input validata verify data file consistency Data Manipulation Program Highlights: o conditional extraction of rows or columns o data sorting based on multiple keys o line permutation (random and sorted) o matrix formation and transposition functions o conversion of data to ranks o additive series generation + 6 distributions: uniform, normal, t, chi-square, F, binomial o input validation for format and data types o text pagination and formatting with headers o ASCII file archiver for file combination and transfer Data Analysis Programs: anova multi-factor analysis of variance calc interactive algebraic modeling calculator contab contingency tables and chi-square desc descriptions, histograms, frequency tables dprime signal detection d' and beta calculations features tabulate features of items oneway one-way anova/t-test, error-bar plots pair paired data statistics, regression, plots rankind independent conditions rank order analysis rankrel related conditions rank order analysis regress multiple linear regression and correlation stats simple summary statistics ts time series analysis, plots Data Analysis Program Highlights: o 20 descriptive statistics on a single distribution o between group and within group (paired) t-tests o weighted/unweighted means and multifactor anova o multiple linear regression, partial correlation analysis o simple time-series analysis with auto-correlation o ranked-order analyses including: Friedman, Wilcoxin, Spearman, Fisher Exact Test, Median Test, Mann-Whitney, Kruskal-Wallis o multifactor contingency tables with chi-square o programmable calculator with over 30 functions o lineprinter histograms, scatter plots, error-bar plots o computed probabilities for significance tests Distribution Conditions: CAREFULLY READ THE FOLLOWING CONDITIONS. IF YOU DO NOT FIND THEM ACCEPTABLE, YOU SHOULD NOT USE |STAT. |STAT IS PROVIDED "AS IS," WITHOUT ANY EXPRESS OR IMPLIED WARRANTY. THE USER ASSUMES ALL RISKS OF USING |STAT. THERE IS NO CLAIM OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. |STAT MAY NOT BE SUITED TO YOUR NEEDS. |STAT MAY NOT RUN ON YOUR PARTICULAR HARDWARE OR SOFTWARE CONFIGURATION. THE AVAILABILITY OF AND PROGRAMS IN |STAT MAY CHANGE WITHOUT NOTICE. NEITHER MANUFACTURER NOR DISTRIBUTOR BEAR RESPONSIBILITY FOR ANY MISHAP OR ECONOMIC LOSS RESULTING THEREFROM OF THE USE OF |STAT EVEN IF THE PROGRAMS PROVE TO BE DEFECTIVE. |STAT IS NOT INTENDED FOR CONSUMER USE. CASUAL USE BY USERS NOT TRAINED IN STATISTICS, OR BY USERS NOT SUPERVISED BY PERSONS TRAINED IN STATISTICS, MUST BE AVOIDED. USERS MUST BE TRAINED AT THEIR OWN EXPENSE TO LEARN TO USE THE PROGRAMS. DATA ANALYSIS PROGRAMS MAKE MANY ASSUMPTIONS ABOUT DATA, THESE ASSUMPTIONS AFFECT THE VALIDITY OF CONCLUSIONS MADE BASED ON THE PROGRAMS. REFERENCES TO SOME APPROPRIATE STATISTICAL SOURCES ARE MADE IN THE |STAT HANDBOOK AND IN THE MANUAL ENTRIES FOR SPECIFIC PROGRAMS. |STAT PROGRAMS HAVE NOT BEEN VALIDATED FOR LARGE DATASETS, HIGHLY VARIABLE DATA, NOR VERY LARGE NUMBERS. You may make copies of any tangible forms of |STAT programs, provided that there is no material gain involved, and provided that the information in this notice accompanies every copy. You may not copy printed documentation unless such duplication is for non- profit educational purposes. You may not provide |STAT as an inducement to buy your software or hardware or any products or services. You may distribute copies of |STAT, provided that mass distribution (such as electronic bulletin boards or anonymous ftp) is not used. You may not modify the source code for any purposes other than getting the programs to work on your system. Any costs in compiling or porting |STAT to your system are your's alone, and not any other parties. You may not distribute any modified source code or documentation to users at any sites other than your own. Ordering Information February 1990: To expedite your order, please follow the instructions below. [] Please indicate the items that you would like to order. [] Orders must be prepaid. Purchase orders are not acceptable. [] Make your check or (postal) money order payable to G. Perlman. [] Checks must be in US funds drawn on a US bank. [] Please include a delivery address label to speed service. International orders: please indicate your country name. |STAT Prices, Subject to Change without Notice: UNIX C Source Version of |STAT: $20/30 C Language Source Code & Online Manual Entries [] 1/2 inch 9 track mag tape, 1600 bpi tar format ($20) [] 1/4 inch cartridge tape, tar format ($30) DOS Executable Version of |STAT: $15 Preformatted Manuals & Executables (without Source Code) ($15) [] 2S/2D (360K) DOS 5.25 inch floppy diskettes [] HD (1.2M) DOS 5.25 inch or 3.5 inch floppy diskettes (by special request) DOS Turbo C Source Code Version for |STAT: $10 Turbo C Language Source Code, Project Files, Online Manual [] HD (1.2M) DOS 5.25 inch or 3.5 floppy diskette Handbook (highly recommended for new users): $10 Examples, Reference Materials, CALC & DM Manuals, Manuals [] Typeset Manual (over 100 8.5 x 11 inch pages) These prices cover the cost of media and delivery worldwide. Send your order, check to G. Perlman, and label to: Gary Perlman The |STAT Software Project Department of Computer and Information Science Room 228, Bolz Hall, The Ohio State University 2036 Neil Avenue, Columbus, OH 43210-1277 USA Notes: UNIX is a trademark of AT&T. MSDOS is a trademark of Microsoft.