PCjs Machines

Home of the original IBM PC emulator for browsers.

Logo

PC-SIG Diskette Library (Disk #639)

[PCjs Machine "ibm5170"]

Waiting for machine "ibm5170" to load....

Information about “SUPERSTAT 2 OF 2 (ALSO 638)”

SuperSTat Share is a shareware Statistical Package for analyzing the
results of Market surveys and poles. The statistical analysis is very
complete. The program is operated by choosing items from menus and
filling in forms on the screen. This version allows 25 variables
and 50 data columns with up to nine values to the variable.

This is a two disk set that requires both to function.

System Requirements:  128K, two disk drives, Hercules graphics display,
Epson-compatible printer.

How to Start:  After consulting the NOTES638.TXT and SST.DOC for
information, initiate the program by entering SSTMENU at the DOS
prompt.

Suggested Registration:  $35.00; $20.00 for students.

File Descriptions:
The First Disk Contains:
SCATTER  EXE  Scatter plot program
SSTMENU  EXE  Main menu and system management program
FREQ     EXE  Frequency distribution analysis program
CROSS    EXE  Cross tabulate program
SST      DEF  System configuration file
TEST     TTL  Analysis Module
TEST     DTA  Analysis Module
TEST     APD  Analysis Module
TEST     VRN  Analysis Module
INT      APD  Analysis Module
INT      DTA  Analysis Module
INT      TTL  Analysis Module
INT      VRN  Analysis Module

The Second Disk Contains:
SSTMENU  EXE  The menu and system management program
SST      DEF  Configuration file
DATAMAN  EXE  The data management module
FILEMAN  EXE  The file management module
CODEMAN  EXE  The code entry module
SST      DOC  The short form of the SST Manual

FILES639.TXT

----------------------------------------------------------------------------
Disk No  639  SST Version 1.00  (Disk 2 of 2)                      V1   DS2
----------------------------------------------------------------------------
SuperSTat Share is a two disk, shareware Statistical Package for analyzing
the results of Market surveys and poles. The statistical analysis is very
complete. The program is operated by choosing items from menus and filling in
forms on the screen. This version allows 25 variables and 50 data columns
with up to nine values to the variable.
 
SSTMENU  EXE  The menu and system management program
SST      DEF  Configuration file
DATAMAN  EXE  The data management module
CODEMAN  EXE  The code entry module
FILEMAN  EXE  The file management module
SST      DOC  The short form of the SST Manual
 
PC Software Interest Group (PC-SIG)
1030D E. Duane Ave.
Sunnyvale, CA  94086
(408) 730-9291
(c) Copyright 1986

GO.TXT

This disk contains the program: SUPERSTAT Share
 
To find out about this program: "PRINT" or "TYPE" these files:
 
SST.DOC         The user's Manual
 
to print the Manual type:  PRINT SST.DOC and press ENTER  or
                    type:  COPY SST.DOC PRT1 and press ENTER
 
To print the other documents, use the same commands with the appropriate
file name.
 
PC Software Interest Group (PC-SIG)
1030D E. Duane Ave.
Sunnyvale, CA  94086
(408) 730-9291
(c) Copyright 1986

NOTES639.TXT

Program name: SST v. 1.00
 
Author name:(C) 1986 Derby Micro-Computer Services, Inc.
 
Address:             P.O. Box 220566
                     Charlotte, NC  28222
 
Telephone Number:   (704) 536-7721
 
Program Classification: Statistics for Market Surveys
 
Registration Fee: $35.00;  students $20.00
                  A full commercial version is available.
 
Program Description:  This is a two disk set.
 
Superstat Share is a shareware Statistical Package for analyzing the results
of Market surveys and other questioneers (poles). It allows weighting the
data set, encryption of your data and automating analysis by using special
batch file commands. The statistical analysis is very complete. The program
is operated by choosing items from menus and filling in forms on the screen.
 
This version allows 25 variables and 50 data columns with up to nine values
to the variable.  The commercial version allows 250 variables and 250 data
columns with up to 9 values each. Both versions handle an unlimited number
of records.
 
This version of SST runs under MS/PC/DOS 2.0 or higher on any IBM PC/XT/AT or
compatible with a color monitor, a monochrome monitor, or a Hercules
monochrome graphics card. It needs a hard disk or two floppies and an EPSON
compatible printer.
 
There are three analysis modules; FREQ, CROSS, and SCATTER. Graphics is
provided in the form of bar and scatter charts.
 
There are measures for nominal to nominal association and significance,
ordinal to ordinal association and significance, and nominal to ordinal
association and significance. Cramer's V is provided as well, along with the
degrees of freedom, N, and missing values.
 
You are also provided row, column, and total percentages and the cells counts
as well as complete information on row and column totals and missing values.
 
You are provide stats for bi-variate correlation, regression, and one-way
ANOVA, as well as descriptive stats on the individual variables.

SST.DOC

















                         SuperStat Short Documentation

                  (C) 1986 Derby Micro-Computer Services, Inc.

                                P.O. Box 220566
                             Charlotte, NC  28222
                                (704) 536-7721

         

          


                                ACKNOWLEDGEMENTS


         Dr. Gary R. Rassel,  University of North Carolina at 
         Charlotte political science department.  His comprehensive 
         and insightful knowledge of research and statistics have 
         guided the development of this software from its conception 
         to its documentation.  It is with heartfelt appreciation that 
         I thank you for all you have done.

         Dr. Theodore S. Arrington, University of North Carolina at 
         Charlotte political science department.  My senior research 
         professor, Dr. Arrington's advice has been consistently 
         excellent.  Without his guidance I am sure I would never have 
         become proficient with research or stats.

         Ray Twery, University of North Carolina at Charlotte 
         mathmatics department.  If the others have inspired me to 
         write theoretically correct research code, Mr. Twery has made 
         sure the calculations were correct.

         Sherry Hicks, Charlotte Market Research Group.  Sherry has 
         used SuperStat in all its various forms for the past two 
         years.  She has done everything from small focus group 
         tabulations to massive public opinion survey analysis with 
         the package.  She has supported a growing and deservedly very 
         busy research business with SuperStat alone when other 
         packages would have, in comparison to SuperStat's earlier 
         versions, been easier on her and done a better job.  At the 
         same time she has regularly let me know what the problems 
         were and how it SHOULD as well as could be improved.  For 
         this I am eternally grateful.  I could never have duplicated 
         the extensive real world testing she has done for me.  Due to 
         her work, the package is easier to use, more straightforward 
         in approach, and fast enough to meet tight deadlines.

         Paula Schneiderman, Charlotte Market Research Group.  For 
         trusting and believing in the product.  Paula gave me the 
         chance to put this package to work and never backed away.  

         Mike Derby    

         
                                  DISCLAIMERS

         Epson is a trademark of Epson America, Inc.

         IBM is a trademark of International Business Machines Corp.

         

                                  Introduction

         Welcome to SuperStat Share, the shareware version of 
         SuperStat 1.00.  We hope you will find this program useful.  
         If you do, we request a $35 contribution, students are asked 
         to contribute $20 and send a copy of their student ID.  This 
         version of SuperStat doubles as a demo for the larger and 
         more powerful commercial version.  When you register, you are 
         entitled to a 35$ discount on the purchase of commercial 
         SuperStat.  

         The essential difference between the two versions is this: 
         the shareware version allows 25 variables and 50 data columns 
         with up to nine values to the variable.  The commercial 
         version allows 250 variables and 250 data columns with up to 
         9 values each. Both versions handle an unlimited number of 
         records.  

         As with all shareware, you are allowed to copy and share the 
         program (in unmodified form) with anyone.  You may not charge 
         a fee for the software.  You should provide the entire 
         package of eight modules and the documentation.

         



         CONFIGURING YOUR SYSTEM 

         To start SuperStat, place your Utilities disk in the default 
         drive, type SSTMENU and press <ENTER> (or return as it is 
         also known).  The menu will allow you to configure your 
         system and call the other modules of SuperStat.  

         If you are using a color monitor, you may set colors for use 
         with all modules.  If you are using a monochrome monitor, you 
         can tell the computer it is a color monitor and use reverse 
         video or bright white as your foreground color.  If you are 
         using a Hercules monochrome graphics card, set it to DIAG and 
         use it as if it were a regular monochrome monitor, or set the 
         system to Hercules mode and all color statements will be 
         skipped.  If you are using a hard disk, you may specify drive 
         A: as the default DATA drive and use drive C: etc. as your 
         PROGRAM drive.  

         If you are using a serial printer you may specify it as 
         the default printer along with the baud rate and etc.  
         SuperStat will use any of the five print/com ports you 
         specify for output.

         SuperStat uses EPSON compatible printer commands to compress 
         print in large cross-tabulations (over 5 x 5) and also to 
         squeeze 100 x 100 scatter plots (squeezing is vertical 
         compression).  Check your printer manual for the appropriate 
         printer codes and use SSTMENU to change them if necessary.

         If you wish to change the defaults to the "factory setting," 
         a menu choice is there to reset the original defaults.  

         You may use SSTMENU to call the other modules or you may type 
         the name of the module at the DOS prompt and press <ENTER>.  
         The names are self-explanatory.  
                                
              ex.  A>SCATTER  <ENTER>

              This command will load and execute the scatter plot 
              module.

         SYSTEM REQUIREMENTS

         You need two disk drives, 256 K main memory, and an ASCII 
         compatible printer.  SuperStat uses the Epson standard 
         command set for compressing print.
         
         SSTMENU creates a file called SST.DEF.  If it is not present 
         when you run other modules, the "factory" defaults will be in 
         effect.
         



         COMPATIBLE COMPUTERS

         SuperStat has been tested successfully on a wide range of 
         compatible computers, including some piecemeal systems with 
         very obscure BIOS origins.  However, the TANDY line of 
         compatible computers produce strange results when the 
         <NUMLOCK> or function keys are pressed.  To avoid these 
         problems on TANDY computers, load DOS, press the <NUMLOCK> 
         key, then load SuperStat, and DO NOT press the <NUMLOCK> or 
         function keys again until you return to DOS. 
         

         CODEBOOK ENTRY

         Start with your codebook.  This way, you can enter the 
         codebook filename in the data entry module and it will set 
         the number of columns you need and keep track of your maximum 
         and minimum values (if you have interval level variables in 
         your data set).  

         CODEMAN is the name of the module.  It stands for codebook 
         manager.  

         FILENAME ENTRY

         It first asks you for your codebook filename.  Follow 
         standard file naming conventions (explained in your DOS 
         manual) but do not enter an extension.  Actually, SuperStat 
         won't let you use a period (.) here.  You may use directory 
         naming conventions (paths) if you need them but remember, the 
         default drive is always your filename prefix UNLESS you enter 
         one explicitly.  To do this, enter b:\dir\filename.  When the 
         second character of the filename string is a colon (:), the 
         default drive is over ridden.  This is true of all modules.  
         
         Now the menu appears.  You may now choose to enter a new 
         codebook file.  
         
         NUMBER OF VARIABLES

         You are then asked to enter the number of variables.  This is 
         the number of questions if you are analyzing a questionnaire.  
         Be careful, researchers often number the questions and then 
         alphabetize the other sub-questions.  The sub-questions count 
         as variables in SuperStat.  If you skip a variable, create a 
         one variable codebook and either merge it to the end of your 
         larger codebook, or split the larger codebook, merge the 
         single variable, and finally, merge the latter half of the 
         original codebook.  Merging and splitting is accomplished 
         with FILEMAN, which will be explained later.
         



         TITLE OF THE VARIABLE

         Now you will begin to enter the codebook.  You are first 
         prompted for the title of the variable (up to 32 characters), 
         it can be any combination of characters except the carat (^),  
         the ampersand (&), the underline (_), the tilde (~) and the 
         at sign (@).  
         
         INSERT AND DELETE

         The carat and the ampersand are reserved as additional delete 
         and insert keys respectively.  This is so because SuperStat 
         is set up to allow the use of the ten-key pad for ten-key 
         punch.  When <NUMLOCK> is not active you may use the <INS> 
         and <DEL> keys on the ten-key pad.
         
         CURSOR MOVEMENT

         The backspace key will move the cursor from the right to the 
         left one space at a time but will not erase what is there.  
         The <TAB> key moves the cursor from the left to the right one 
         space at a time.  When the <NUMLOCK> key is not active you 
         may use the <RIGHT> and <LEFT> cursor keys on the ten-key 
         pad.

         CAUTION: When you backspace over a character it is no 
         longer a part of the input string unless otherwise noted.  
         Every character from the cursor right is ignored.  The 
         character remains on the screen so you can tab back over and 
         re-include it.  


         TYPE OF VARIABLE
         
         After you enter the name of the variable you are prompted for 
         the type of variable, either nominal/(O)rdinal, (I)nterval, 
         or (M)ultiple response.  The capital letters in parentheses 
         indicate the key you press to specify the type of variable.  

         INTERVAL LEVEL VARIABLES

         If you specify an interval level variable, you are then asked 
         for the number of data columns this variable will require.  
         You may use from one to twelve columns for the variable.  
         SuperStat Share does calculations to the 32nd decimal place 
         in floating point but if you enter too many large numbers you 
         CAN cause an overflow.  
         



         MULTIPLE RESPONSE VARIABLES

         There are two types of multiple response variables (MRV), a 
         stand alone multiple response variable and a multiple 
         tabulation of a single value of other variables.  A MRV may 
         have up to nine values and use up to ten data columns with 
         its required missing value switch.  The number of columns 
         required by a MRV may be computed by adding the number of 
         values to the variable (1-9) plus one.

         If you specify a MRV you will then be prompted for the number 
         of columns the variable will require.  This is the number of 
         values to the variable plus one.

         THE MULTIPLE RESPONSE MISSING VALUE SWITCH

         Multiple response variables use the first columns of their 
         field in the data set as a missing value switch.  When the 
         first column of a MRV is a one, then the case is present.  If 
         the first column contains a zero, the case is not present and 
         is a missing value.  Missing values may be included or not 
         included in the table calculations depending upon your needs.
         However, if the first column is zero, the case will always be 
         considered a missing value, even if other columns of the 
         field contain responses.

         STAND ALONE MULTIPLE RESPONSE VARIABLES

         Stand Alone MRV's are coded one or zero only.  A one 
         indicates a response, a zero represents a non-response.  

         TAB-OF-OTHER MULTIPLE RESPONSE VARIABLES

         Tab-of-Other MRV's have a single column missing value switch 
         but may tabulate up to nine other variables and will allow 
         the tabulation of any value of the other variables in the 
         range of 1-9.  As the missing value switch is always in the 
         first column of the MRV field, and all fields in the data set 
         must be contiguous, the other variables must immediately 
         follow the switch.  

         



         MULTIPLE RESPONSE VALUE LABELS

         After you specify the number of columns, you are then 
         prompted for the value labels of the variable, in turn, one 
         at a time.  If the MRV stands alone, this is usually some 
         abbreviation of the response category.  If this MRV is a tab 
         of other variables, the labels are usually some abbreviation 
         of the other variable's titles.  
         
         As you enter the labels, you are told which column each label 
         is associated with.  Notice that the first label is 
         associated with the second column of the MRV field.

         NOMINAL-ORDINAL VARIABLES

         Nominal and ordinal (NO) variables are treated the same by 
         the analysis modules so it saves time and programming to 
         group them in the type of variable prompt.  

         If you specify a NO variable you will then be prompted for 
         the number of values to the variable.  Then you will be 
         prompted to enter the value labels, in turn, one at a time.  
         If this is an ordinal variable, be sure to enter the codes in 
         their proper order. 

         The labels may contain any combination of characters except 
         the carat and ampersand.

         LINE ENTRY

         "Any combination of characters" means any ASCII character 
         with a code between 32 and 126 except the ^, the &, the ~, 
         the _, and the @.  Periods are sometimes dis-allowed, as are 
         commas, depending upon the error prevention needs of the 
         program at a given place.  

         All data entry in SuperStat is line entry.  One piece of 
         information at a time, as it is needed, only if it is needed, 
         with no backing up.  All prompts have error checks to ensure 
         valid entries, although the program cannot, and does not 
         check for every possible error, just the most common, or the 
         only ones possible at a given place.

         Full screen editing is nice and we are sure you have a 
         favorite word processor.  Just learn the format of a 
         SuperStat codebook file and then use an ASCII word processor 
         to enter and edit the codes.  We fully expect experienced 
         users to eventually by-pass our "hand holding" entry modules.  
         Go ahead, we've made it easy for you.

         



         FORMAT OF CODEBOOK FILES

         Two files are created by CODEMAN, ".TTL" files and ".VRN" 
         files.
         
         The VRN file contains:

         Ex.  5,22,5
              | |  |
              | |  -> total variables       
              |  -> first column of next variable
              -> next variable number

         When the first and third number are equal, the codebook has 
         been completed. 

         The TTL file contains the codes.  The first line contains the 
         specifications for the variable to follow.

           ----------> Type of variable (must be in double quotes)
          |    ------> Number of first column in data field
          |   |  ----> Number of columns used by this variable
          |   | |  --> Number of labels for the variable
          |   | | |
          |   | | |
         "NO",1,1,4
         Title of the variable
         label-1
         label-2
         label-3
         label-4
         
         The format for MRV's is functionally equal.

         "MSA",2,5,4               ("MSA means stand alone)
         Title of the variable     ("MTO" means a Tab-of-Others)
         label-1
         label-2
         label-3
         label-4
         
         The format for interval variables is slightly different.

                 |-> DIFFERENCE = values always equal 2  
         "I",7,3,2
         Title of the variable
         Maximum value         (always initialized to zero)
         Minimum value         (always initialized to nines, if three
                                columns, then 3 nines)
         Ex.   "I",7,3,2
               Age of Respondent
               0
               999
         



         TAKING A BREAK

         If you want to take a break from code entry, just press 
         <ESC>ape at the title of the variable prompt.  This will 
         close the file and return you to the menu.  Information is 
         always saved after each variable is entered so you don't need 
         to worry about explicitly saving files.

         ADDING TO INCOMPLETE CODEBOOKS

         When you come back to your data entry, choose the second menu 
         item, "add to existing codebook."  The routine is the same as 
         above.

         PRINTING A CODEBOOK

         You may print your codebook to a disk or printer.  It comes 
         up on screen as you do so.  Press <CTRL> and <NUMLOCK> 
         together to slow the process down.  Press any key to resume 
         printing.  
         
         If you print to disk, you will be prompted for an output 
         filename.  Follow the same conventions as always.  Remember 
         your default drive specification is still in effect.
         
         If you print to a printer other than LPT1:, be sure to change 
         the system defaults with SSTMENU.  Any ASCII compatible 
         printer should work fine.

         EDITING A CODEBOOK 

         Press (4) when you see the CODEMAN menu.  First the variables 
         that have been entered so far are displayed.  You are then 
         prompted for the NUMBER of the VARIABLE you wish to edit.  

         After you have chosen a variable the screen clears and the 
         variable title and labels are displayed.

         A prompt line allows you to edit the titles, labels, level, 
         columns, maximum and minimum, save the changes or <ESC> to 
         the menu.  

         If you do not change anything, the program will not save the 
         codebook to disk, even if you tell it to.  You must 
         explicitly save the changes to disk at the prompt.

         If you change something and then change your mind, press 
         <ESC>ape.  This will take you back to the menu without saving 
         the changes.  All changes are lost.

         



         DATA ENTRY

         Data entry is accomplished with the DATAMAN module which,
         you might have guessed, stands for "data manager."  You may 
         call the module from the main menu or you may type the module 
         name at the DOS prompt.

         First you will be asked for a data filename.  Use standard 
         file naming conventions and remember that your default DATA 
         drive is in effect.  Next you are prompted to enter your 
         codebook filename.  The data manager will look at your VRN 
         file to determine the number of columns required, keep track 
         of maximum and minimum values of interval level variables and 
         save the changes (if any) when you leave the input routine.  
         
         The menu now appears.  Press (1) to begin the entry of a new 
         data set.  If you have not entered the codebook filename you 
         will first be prompted to specify the number of data columns 
         required.  This is the middle number of the VRN file minus 
         one.

         Next you will see a data entry record line.  It specifies the 
         record number and provides a marked field to ease data entry.
         Above it is a column counter and below it lies several help 
         statements.

         Ex.

         Column 1
         Record #1  ____+____|__





         Fill the field completely and press <ENTER>.
         Missing Values (blanks) are zero (0) for Nominal/Ordinal
         and asterisks (*) for Interval/Ratio variables.
         Press <ESC> when finished or to return to menu.
         Use (^) for delete and (&) for insert.

         INSERT AND DELETE

         The carat (^) and the ampersand (&) keys have been changed 
         respectively to delete and insert to allow full use of the 
         ten-key pad.    
         



         MISSING VALUES

         Missing Values (blanks) are zero (0) for Nominal/Ordinal and 
         asterisks (*) for Interval/Ratio variables.  No blank spaces 
         are allowed.  Missing values are always zero for 
         nominal/ordinal variables.  If you forget and enter 
         asterisks, or just prefer them, don't worry.  They are 
         functionally equivalent to zeroes in SuperStat WHERE IT 
         MATTERS.  Namely, in FREQ and CROSS.  In SCATTER, where 
         interval level variables are processed, zeroes are important 
         numbers.  Therefore, asterisks are considered missing values 
         and are automatically not included.  

         
         DATA FILES

         DATAMAN creates two files, the ".DTA" file, which holds your 
         raw data, and the ".APD" file.

         Example APD file:
                           
         250,12 
         |   |
         |    -> number of columns
          -----> number of records

         Example record in DTA file:

         67864236181298367432

         
         SAVING RECORDS

         Each record is automatically saved each time you press 
         <ENTER> and the APD file is updated.  This is done to prevent 
         any loss of data.  It will be a bit slow for you if you never 
         have records over 10 to 15 columns long.  But losing data is 
         no fun and we prefer the extra second it takes to save one 
         record and move on the next.

         Each record is saved in ASCII format so you can transfer it 
         to different programs, or from your favorite word processor 
         to SuperStat.  Just use a fixed format in the manner of 
         DATAMAN, and the analysis modules should have no trouble 
         picking it up.  
         
         CAUTION: Some word processors do not allow non-document 
         processing and inbed spaces in front of your records.  Some 
         word processors imbed characters invisible to you but 
         apparent to the computer.  These problems can cause 
         unpredictable results.  
         



         SuperStat uses fixed field format to save space on the disks 
         for large data sets.  Commas take up a lot of room.

         SuperStat writes data to the disk SEQUENTIALLY.  That is, one 
         record at a time, one after the other, from the first to the 
         last.  It reads data the same way.  Except when editing, this 
         is the faster way to manipulate the data.  

         TAKING A BREAK

         Just press <ESC>ape at the beginning of the next line and you 
         will return to the menu.  When you come back to your data 
         entry, choose menu item (2) to add more data.  DATAMAN will 
         pick up where you left off and work just as it is explained 
         above.

         PRINTING DATA

         Choose menu item (3) to print a copy of your data set.  You 
         will print to the default printer.


         EDITING DATA

         Choose menu item (4) to edit the data.  It is best to keep a 
         list of records with errors so that you can go through the 
         data set sequentially.  You can't back up so if you skip a 
         record you need to edit, you'll have to begin again.  This is 
         when sequential file manipulation gets slow.  You have to 
         organize your editing to overcome it.

         The edit routine will ask you for the record you wish to edit 
         and then load it.  It will put the old record on that 
         familiar data entry line and position the cursor at column 
         one.  You need only correct the mistake and press enter.  
         Unlike other line entry in SuperStat, this one routine allows 
         you to press <ENTER> from anywhere on the line (except column 
         one) without ignoring all characters from the cursor, right, 
         to the end of the line prompt. 
        
         



         MANAGING FILES
         
         After you have completed your codebook and data entry, there 
         may be some manipulations you would like to do to your files 
         as a whole.  The FILEMAN module provides a number of options 
         for working with your data to whittle it down or build it up.

         You may, for instance, wish to have more than one person 
         enter your data to facilitate processing.  FILEMAN will join 
         the various files together into a single whole for further 
         processing and analysis.  

         In general, you are first asked for a first source filename 
         and if needed, a second source filename.  You are then 
         prompted for a new output filename.  It is assumed that you 
         do NOT want your original data altered (except when 
         encrypting).  
         
         
         ENCRYPTING DATA

         The first menu option allows you to encrypt your data based 
         on a single letter encryption key in the range of 1-9, a-z, 
         and A-Z.  It is the ASCII code that is operative.  Therefore, 
         you have 62 possible keys.  You are also prompted for a 
         password.  The password is also encrypted so that snoops 
         cannot decipher it.  

         If you forget either your key or your password, your data is 
         lost.  The analysis modules cannot work with the encrypted 
         data and we cannot decrypt your data for you.  If you keep an 
         un-encrypted version of the data, you defeat the purpose of 
         encryption.  The source file is destroyed but could probably 
         be un-deleted with one of the utilities available on the 
         market.  Therefore, if you really wish to hide the data, 
         encryt the data, then back up the encrypted file, and 
         finally, reformat the original data disk and any backup data 
         disks.  Before you reformat, be sure any other important 
         files on the disk are backed up.  Consult your DOS disk for 
         more information on formatting diskettes.  

         APPENDING DATA FILES

         The second menu option allows you to append data files (add 
         records from one file to another).  This is useful when more 
         than two people are entering data on different machines.  The 
         records must be the same length.  The files may have 
         different numbers of records in them.  If you are working 
         with two large files,  read the data from one disk and write 
         to another.  SuperStat will write the data wherever you wish 
         so if you have three disks, you can read one file from A:, 
         another from B:, and write to C:.  If your data fills both 
         source disks, you will have no choice in this matter.
         



         Ex.  How data files are appended.

         123123123123123       1st source data file
              /|\
               |
         987987987987987       2nd source data file

         MERGING DATA AND CODEBOOK FILES

         The third menu option allows you to merge data and codebook 
         files (add variables).  If you only wish to do one or the 
         other (presumably because one set of files, either data or 
         codebook, are not yet entered), a prompt gives you that 
         option.  If you have both sets of files entered, merge them 
         at the same time.  SuperStat does most of the gritty work of 
         keeping track of specifications for you and prevents 
         mistakes if it can.  Your data files must contain the same 
         number of records to use this feature.

         Ex. How data files are merged. 

         1st source data record          2nd source data record
         123123123123123123   <------    987987987987987987

         Since you are adding variables to the data set, your codebook 
         (if it exists) must be updated to allow the analysis modules 
         to use the added data.  The second source codebook is merged 
         to the end of the first source codebook.  Don't let this 
         confuse you.  The action of the codebook merge resembles the 
         action of the data append.  This is due to the diference in 
         the structure of the data and codebook files.

         Ex. How codebook files are merged.

         "NO",1,1,2
         Variable Title                  1st source codebook
         Label 1
         Label 2
            /|\
             |
             |
         "NO",2,1,3
         Variable title
         Label 1
         Label 2
         Label 3

         FILEMAN keeps track of the number of columns you are using 
         and prevents you from creating a codebook (or data file) 
         larger than SuperStat can handle and re-numbers the merged 
         variables to reflect their new position in the new file.
         



         SPLITTING DATA AND CODEBOOK FILES

         The fourth menu option allows you to separate variables away 
         from each other in the data set.  It has an action opposite 
         to that of merging with regard to both data and codebook 
         files.  Your original file is not destroyed, two files are 
         simply created where there was only one.  Again, the program 
         keeps track of the position of the variables in the new data 
         set.


         PERMANENT RECODES

         The fifth menu option allows you to permanently recode your 
         data.  If you will be using a variable in its recoded form on 
         a regular basis throughout your analysis, it may be 
         advantageous to recode it permanently and put it, along with 
         all of the other variables, into a separate data file.  This 
         will save time and keystrokes later.  You must recode both 
         the data file and its codebook for the analysis modules to 
         operate correctly.  

         You are prompted for the number of variables you wish to 
         recode (up to ten at a time), the number of the variable 
         (each in turn), and for each variable, the number of 
         nominal/ordinal categories you wish to create.  Then you are 
         prompted for the high and low values of the variable that are 
         to fit into the new categories and in turn, the value label 
         for each new value.  
         
         You are free to recode any variable except MRV's.  You are 
         free to recode zeroes to nines and fives to threes if that is 
         what you need to do.  

         When you complete the recode specifications the new recoded 
         data set is saved to disk under a the new filename you are 
         prompted for.

         RECODES IN THE ANALYSIS MODULES 

         The operation of all other recode routines is the same with 
         one exception.  When recoding a variable in the analysis 
         modules, the recode is transient, that is, it is NOT saved.  

         PERMANENT SELECTIONS

         The sixth menu option allows you to permanently select a 
         group of records based on one value of up to nine different 
         variables.  A new data file is created which uses the 
         original codebook.  No data is changed and no variables or 
         records are lost.  You simply state your criteria for the new 
         data set and the records of the source file are searched for 
         cases that match the criteria.  The matching records are then 
         written to a new file under a filename you are prompted for.  
         



         You are allowed to make up to 9 selections.  The actual 
         selections you make are of the values to variables.  You may 
         not select values of Interval or Multiple Response variables.  
         You may, however, recode an interval level variable and then 
         select one of the new categories. 
         
         You are prompted for the operator on which to base the 
         selection.  The operators are AND, OR, and NOT.  If you 
         choose AND, all of the selections must match the 
         corresponding values of the case for the case to be selected.  
         
         Ex.  If selected value 1 = corresponding case value AND
                 selected value 2 = corresponding case value AND
                 .
                 .
                 selected value 10 = corresponding case value THEN

              select the case and write it to the new file.
            
         If you choose OR, at least one of the selections must match 
         the corresponding value of the case for the case to be 
         selected. 
         
         Ex.  If selected value 1 = corresponding case value OR 
                 selected value 2 = corresponding case value OR 
                 .
                 .
                 selected value 10 = corresponding case value THEN

              select the case and write it to the new file.
                  
          If you choose NOT, if any of the selections of 
         match the corresponding values of the case, the case will not 
         be selected.  The action for NOT is actually AND NOT.

         Ex. 
          If selected value 1 does NOT = corresponding case value AND 
             selected value 2 does NOT = corresponding case value AND
             .
             .
             selected value 10 does NOT = corresponding case value THEN

          select the case and write it to the new file.
         

         SELECTIONS IN THE ANALYSIS MODULES
         
         The operation of all other selection routines is the same with 
         one exception.  When selecting a value of a variable in the 
         analysis modules, the selection is transient, that is, it is 
         NOT saved.  
         


                  
         WEIGHTING THE DATA SET

         The seventh menu option allows you to weight the data set 
         based on the selection of three weighting criteria.  The 
         selection operator is always AND.  In other words:

         If selected value 1 = corresponding case value AND
            selected value 2 = corresponding case value AND
            selected value 3 = corresponding case value THEN

         add another identical case to the data set the number times 
         specified as the weighting factor.

         You are prompted for the weighting factor.  It can be in the 
         range of 0.1 to 9.9.  As cases are found that match the 
         weighting criteria, the case is duplicated the number of 
         times indicated by the integer part of the weighting factor.  
         The fractional parts are added together each time a match is 
         found until they are greater than or equal to 1, at which 
         time 1 is added to the weighting factor, and the fractional 
         part is reduced by one.  The addition of 1 to the weighting 
         factor is a transient addition.  You will not end up with a 
         weighting factor of 45 through repeated addition.  
         
         The weighted data set is written to a new file under a new 
         filename which you are prompted for. 
        
         



         ANALYSIS

         There are three analysis modules supplied with SuperStat 
         Share; FREQ, CROSS, and SCATTER.  You may run the programs 
         from SSTMENU or you may type the filename at the DOS prompt.

         Ex. A>FREQ

         
         FREQUENCY DISTRIBUTIONS

         The frequency distribution module is called FREQ.  It will 
         analyze from 1 to 250 variables, display a bar chart, and 
         provide:

         Value counts
         Value percentages
         Mean Ordinal Rank
         Mode
         Median
         Standard Error at 95% confidence level

         Some of you will wonder why there are no interval level 
         statistics in the module.  It is simply because it is set up 
         to handle CODES, not numbers (although the codes themselves 
         are numbers).  It is basically a nominal and ordinal level 
         variable analyzer, and based on that fact, higher level 
         statistics are simply inappropriate.  If you insist on them 
         in spite of this, use SCATTER, it is the real number 
         analyzer.

         First you are prompted for the data filename and then the 
         codebook filename.  The reason you have to enter both is 
         simple; if you ever repeat a study, you can use your old 
         codebook.  The filenames can be the same because the data 
         entry modules give them different filename extentions.  FREQ, 
         CROSS, and SCATTER can tell them apart.

         After your filenames are entered the menu appears.  If you 
         wish to make selections or recodes, do so first.  The 
         routines are identical to those in FILEMAN except that the 
         changes you make are not saved.  

         Next you may choose to run all of the frequencies or just 
         one.  Either way, all of the variables are tabulated, you 
         just have some options about which variables are to be 
         printed and where if you choose to print only one.  
         
         If you choose to tabulate a single variable, you are first 
         given a display of your variables to choose from.  Press 
         <ENTER> to skip the display or the <SPACEBAR> to see 
         subsequent pages.  You are then prompted for the NUMBER of 
         the first variable you wish to see.
         



         Either way, you are then asked for the value you wish to be 
         used to tabulate multiple responses.  If you only have 
         MSA's, this is always one (1).  If you have MTO's, your 
         choices are in the range of values common to the variables 
         you specified to be included in each MTO, ie., if the 
         variables each have 6 values, you may choose to tabulate 
         value 4.
         
         NOTE: You are only prompted for the multiple response value
         if you have one in the data set.   
                  
         Either way, you are then asked whether you want missing 
         values included in the calculations.  If you do, every 
         calculation will be affected  by the higher N.  If you do 
         not, every calculation will be affected by the lower N.  You 
         are reminded on the output what your choice was.  

         The variables are now tabulated.  As FREQ works, it checks 
         for out of range values and if it finds any, it warns you on 
         screen and sets the value to zero (0).  If you wish to get a 
         listing of the error messages, use DOS piping to redirect 
         output to a disk file and press <ESC> at the next prompt 
         (consult your DOS manual for more information on DOS piping).

         If you chose to tabulate a single variable, the variable you 
         chose is displayed first.  You are then asked where you want 
         the out put to go; the printer, a disk file, both, or 
         neither.  You may also skip a page from the prompt.  The 
         program then lets you choose again.

         If you choose the printer, the output will go to the default 
         printer.  If you choose the disk or both, you will be asked 
         for an output filename and the file is then written.  If you 
         choose both, it is also printed to the default printer.  If 
         you choose neither, you are then asked which variable you 
         wish to see next.  If none, press <ESC>.  Otherwise, type the 
         number of the variable and press <ENTER>.  

         If you chose to print all the variables, the first variable 
         is displayed and then the print prompt described above.  Its 
         operation is the same.  You are NOT asked which variable you 
         wish to next.  Each variable is displayed, in turn, as it is 
         printed.  If you wish to see all the variables but don't want 
         them printed, press (N) for neither and be ready with the 
         <CTRL><NUMLOCK> combination to slow things down.  It works so 
         fast you will need to stop it to see anything.
         



         PIPING
         
         When you have learned the prompts it will be easier for you 
         to get your work done if you use piping to redirect input.  
         Piping fools the modules such that when they look for 
         keystrokes, DOS gives them characters you have typed in a 
         file on disk.  Using this method you can fully automate the 
         analysis process.  By combining batch files and piped input 
         files, you can set your machine and it will do the work while 
         you do something else.  The modules have all been set to 
         allow this to take place and streamline the process.  You can 
         literally run thousands of frequencies, crosstabs, and 
         scatter plots on any number of data sets WHILE YOU SLEEP.  
         Just make sure you have the keystrokes correct, and enough 
         paper and disk space to handle the job.

         For purposes of piping, the following keys have functional 
         equivalents:

         <ENTER>    = tilde (~)
         <SPACEBAR> = underline (_)
         <ESC>ape   = at sign (@)

         CAUTION:  ALWAYS end your input file with the appropriate 
         "return to DOS" character.  Otherwise, the "big red button" 
         will be required to get control of the computer back (also 
         known as the "on/off" switch).


         CROSS TABULATIONS

         The crosstabs module is named CROSS.  Just as in FREQ, you 
         are first asked for your data and codebook filenames.  Then 
         you see the menu.  

         Select and recode works just as in the other modules.  You 
         can recode all the values of one variable into a single value 
         and then select that value as an independent variable.  

         You are then shown the menu again and you may then choose to 
         run a crosstab.

         You are first shown a list of all your variables.  You may 
         skip the display by pressing <ENTER> or see other pages by 
         pressing <SPACEBAR>.  You are then asked for and independent 
         and dependent variable.

         Next you must tell the system whether you want missing values 
         included in the calculations.  
         



         Now CROSS runs through the data, does the calculations and 
         displays a bar chart.  You may skip the display of the charts 
         by pressing <ENTER> or you may see the others by pressing the 
         <SPACEBAR>.  There are as many charts as there are values of
         the independent variable.

         Next you are asked if you want to print the bar charts to the 
         line printer, a file on disk, or not at all.  Press the 
         letter in parentheses that corresponds to your choice.  If 
         you choose to print to disk you will be prompted for a 
         filename.  

         Next you are asked if you want the table printed to the line 
         printer, a file on disk, or not at all.  Press the letter of 
         your choice.  You will be asked for a filename if you choose 
         to print to disk.  

         After printing you will return to the menu.

         You are provided stats for bi-variate association, 
         significance, and chi-square appropriate for nominal and 
         ordinal level variables.  

         Goodman and Kruskals Tau is provided in place of Lambda 
         because it is a more sensitive measure.  Where you can SEE 
         some association and Lambda shows none, G & K's Tau will pick 
         it up.

         There are measures for nominal to nominal association and 
         significance (G & K's Tau and Chi-Square), ordinal to ordinal 
         association and significance (Gamma and Z-Score Gamma), and 
         nominal to ordinal association and significance (Wilcoxons 
         Theta and contingency coefficient).  Cramer's V is provided 
         as well, along with the degrees of freedom, N, and missing 
         values.

         You are also provided row, column, and total percentages and 
         the cells counts as well as complete information on row and 
         column totals and missing values.  


         SCATTERPLOTS
                 
         The scatterplot module is named SCATTER.  It is structured to 
         work very much like FREQ and CROSS.  You are asked for your 
         filenames and then you see a menu.  You can select and recode 
         just as in the other modules.  

         When you choose to run a scatterplot you are shown your 
         variables and prompted for the independent and dependent 
         variables.  
         



         Next you choose to consider zero as an absolute minimum or 
         not.  If you choose to do so, the minimum value of both 
         variables becomes zeroe (0), thus making your interval level 
         variables ratio variables.

         Missing values are automatically not included because they 
         are not numbers and SCATTER only works with numbers.  
         Remember that asterisks (*) are missing values in SCATTER.  

         Next SCATTER runs through the data, does the calculations, 
         and displays a 20x50 scatterplot on the screen.  Press a key 
         to see the statistics.  
         
         After viewing the stats you are asked if you want to print a 
         30x50 scatterplot or a 100x100 scatterplot to the line 
         printer, disk, or not at all.
         
         The 30x50 scatterplot is appropriate for daisy wheel printers 
         or disk files.  The 100x100 scatterplot is appropriate for 
         dot matrix printers compatible with the Epson standard 
         command set for sqeezing and compressing print.  Both formats 
         come very close to being a square plot on their appropriate 
         mediums.
         
         You are provide stats for bi-variate correlation, regression, 
         and one-way ANOVA, as well as descriptive stats on the 
         individual variables. 

         


         
                                   REFERENCES


         Evans, James D., Invitation to Psychological Research. Holt, 
              Rinehart, and Winston.  New York.  1985

         Manheim, Jarol B. and Rich, Richard C. Empirical Political 
              Analysis, Research Methods in Political Science. 
              Prentice Hall, Inc., Englewood Cliffs, NJ. 1981.

         Watson, George and McGaw, Dickinson.  Statistical Inquiry. 
              John Wiley and Sons.  New York.  1980.

         McClave, James T. and Dietrich, Frank H., II. Statistics 
              Dellen Publishing Co.  San Francisco.  1982. 

         Welch, Susan and Comer, John C. Quantitative Methods for 
              Public Administration, Techniques and Applications.  The 
              Dorsey Press.  Homewood, IL.  1983

         Babbie, Earl R. The Practice of Social Research.  Wadsworth 
              Publishing Co.  Belmont, CA.  1979.

         Phillips, John L.  Statistical Thinking.  W.H. Freeman and 
              Co.  San Francisco.  1973.

         Freeman, Linton C.  Elementary Applied Statistics. John 
              Wiley and Sons.  New York.  1965.

         Jones, E. Terrence.  Conducting Political Research. Harper 
              and Row.  New York.  1984.  pp. 166 - 167.
             

Directory of PC-SIG Library Disk #0639

 Volume in drive A has no label
 Directory of A:\

CODEMAN  EXE     65946   8-23-86   5:56p
DATAMAN  EXE     52234   8-23-86   3:30p
FILEMAN  EXE     73898   8-23-86   3:33p
FILES639 TXT       959   1-04-80   5:01p
GO       BAT        28  11-27-86  12:31p
GO       TXT       490   1-04-80   5:00p
NOTES639 TXT      2008   1-04-80   5:04p
SST      DEF       158   8-23-86   3:14p
SST      DOC     49589  12-08-86  11:03a
SSTMENU  EXE     52106   8-23-86   3:05p
       10 file(s)     297416 bytes
                       19456 bytes free