Saturday, 26 May 2018

VARIABLES AND ORGANIZATION OF THE DATA [WEISS (1999), ANDERSON & SCLOVE (1974) AND FREUND (2001)]


VARIABLES AND ORGANIZATION OF THE DATA
[WEISS (1999), ANDERSON & SCLOVE (1974) AND FREUND (2001)]


 VARIABLES

A characteristic that varies from one person or thing to another is called a Variable  i.e, a variable is any characteristic that varies from one individual member of the population to another. Examples of variables for humans are height, weight, number of siblings, sex, marital status, and eye color. The first three of these variables yield numerical information (yield numerical measurements) and are examples of quantitative (or numerical) variables, last three yield non-numerical information (yield non-numerical  measurements) and are examples of qualitative (or categorical) variables. Quantitative variables can be classified as either discrete or continuous.
Discrete  variable:  Some variables, such as the numbers of children in family, the numbers of car accident on the certain road on different days, or the numbers of students taking basics of statistics course are the results of counting and thus these are discrete variables. Typically, a discrete variable is a variable whose possible values are some or all of the ordinary counting numbers like 0, 1, 2, 3, . . . . As a definition, we can say that a variable is discrete if it has only a countable number of distinct possible values. That is, a variable is is discrete if it can assume only a finite numbers of values or as many values as there are integers.
Continuous variables: Quantities such as length, weight, or temperature can in principle be measured arbitrarily accurately. There is no invisible unit. Weight may be measured to the nearest gram, but it could be measured more accurately, say to the tenth of a gram. Such a variable, called continuous, is intrinsically different from a discrete variable.

SCALES
Scales for Qualitative Variables Besides being classified as either qualitative
or quantitative, variables can be described according to the scale on which
they are defined. The scale of the variable gives certain structure to the variable and also defines the meaning of the variable.

The categories into which a qualitative variable falls may or may not have
a natural ordering. For example, occupational categories have no natural ordering. If the categories of a qualitative variable are unordered, then the qualitative variable is said to be defined on a nominal scale, the word nominal referring to the fact that the categories are merely names. If the categories can be put in order, the scale is called an ordinal scale. Based on what scale a qualitative variable is defined, the variable can be called as a nominal variable or an ordinal variable. Examples of ordinal variables are education (classified e.g. as low, high) and "strength of opinion" on some proposal (classified according to whether the individual favors the proposal, is indifferent towards it, or opposites it), and position at the end of race (first, second, etc.).
Scales for Quantitative Variables: Quantitative variables, whether discrete
or continuous, are defined either on an interval scale or on a ratio scale.
If one can compare the differences between measurements of the variable meaningfully, but not the ratio of the measurements, then the quantitative variable is defined on interval scale. If, on the other hand, one can compare both the differences between measurements of the variable and the ratio of  the measurements meaningfully, then the quantitative variable is defined on ratio scale. In order to the ratio of the measurements being meaningful, the variable must have natural meaningful absolute zero point, i.e, a ratio scale is an interval scale with a meaningful absolute zero point. For example, temperature measured on the Certigrade system is a interval variable and the height of person is a ratio variable.
ORGANIZATION OF THE DATA
Observing the values of the variables for one or more people or things yield
data. Each individual piece of data is called an observation and the collection
of all observations for particular variables is called a data set or data matrix. Data set are the values of variables recorded for a set of sampling units.
For ease in manipulating (recording and sorting) the values of the qualitative
variable, they are often coded by assigning numbers to the different categories, and thus converting the categorical data to numerical data in a trivial sense. For example, marital status might be coded by letting 1,2,3, and 4 denote a person’s being single, married, widowed, or divorced but still coded data still continues to be nominal data. Coded numerical data do not share any of the properties of the numbers we deal with ordinary arithmetic. With records to the codes for marital status,
 we cannot write 3 > 1 or 2 < 4, and
we cannot write 2 − 1 = 4 − 3 or 1 + 3 = 4.
 This illustrates how important  it is always check whether the mathematical treatment of statistical data is really pegmatite.
Data is presented in a matrix form (data matrix). All the values of particular
Variable is organized to the same column; the values of variable forms the
column in a data matrix, Observation, i.e. measurements collected from
Sampling unit, forms a row in a data matrix. Consider the situation where
there are k numbers of variables and n numbers of observations (sample size
is n). Then the data set should look like


Variables
x11   x12      x13 . . . x1k
x21    x22      x23 . . . x2k
x31    x32     x33 . . . x3k
.        .         .         .
.        .         .         .
.        .         .         .
xn1     xn2       xn3 . . . xnk

where xij is a value of the j :th variable collected from i:th observation, i=1, 2, . . . , n and

j = 1, 2, . . . , k.

WE WILL CONTINUE IN OUR NEXT POST SUBSCRIBE TO GET ALL OUR UPDATES.

0 comments:

Post a Comment